yacy_search_server

mirror of https://github.com/yacy/yacy_search_server.git synced 2024-09-21 00:00:13 +02:00

History

Michael Peter Christen 4d5da75814 fix for parser problem if a <a>-tag is 'within' html tags with unclosed tags. That prevented the <a> tags from beeing recognized. This is a fix for http://forum.yacy-websuche.de/viewtopic.php?p=25516#p25516		2012-04-18 10:30:04 +02:00
..
html	fix for parser problem if a <a>-tag is 'within' html tags with unclosed	2012-04-18 10:30:04 +02:00
images	smaller bug fixes for search behavior; should produce less unnecessary removals and an exact number of results as shown in counter	2011-11-18 13:09:07 +00:00
xml	some last-minute performance hacks	2011-11-25 11:23:52 +00:00
bzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
csvParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
docParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
genericParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
gzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
htmlParser.java	bugfixes	2012-02-02 23:38:23 +01:00
mmParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
odtParser.java	set a limit to CharBuffer object size to fight against bad/too large	2012-01-10 03:02:17 +01:00
ooxmlParser.java	set a limit to CharBuffer object size to fight against bad/too large	2012-01-10 03:02:17 +01:00
pdfParser.java	memory hacks	2012-02-02 07:37:00 +01:00
pptParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
psParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
rssParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
rtfParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
sevenzipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
sidAudioParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
sitemapParser.java	better abstraction of http client identification	2011-04-26 13:35:29 +00:00
swfParser.java	some last-minute performance hacks	2011-11-25 11:23:52 +00:00
tarParser.java	check if httpc has decompressed the release file and rename the file	2012-04-16 09:50:55 +02:00
torrentParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
vcfParser.java	some last-minute performance hacks	2011-11-25 11:23:52 +00:00
vsdParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
xlsParser.java	- enhanced html parser: recognized much more details in the content	2011-04-21 13:58:49 +00:00
zipParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00