.. |
html
|
set a limit to CharBuffer object size to fight against bad/too large
|
2012-01-10 03:02:17 +01:00 |
images
|
smaller bug fixes for search behavior; should produce less unnecessary removals and an exact number of results as shown in counter
|
2011-11-18 13:09:07 +00:00 |
xml
|
some last-minute performance hacks
|
2011-11-25 11:23:52 +00:00 |
bzipParser.java
|
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
|
2011-09-07 10:08:57 +00:00 |
csvParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
docParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
genericParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
gzipParser.java
|
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
|
2011-09-07 10:08:57 +00:00 |
htmlParser.java
|
performance hacks
|
2011-12-15 15:15:53 +01:00 |
mmParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
odtParser.java
|
set a limit to CharBuffer object size to fight against bad/too large
|
2012-01-10 03:02:17 +01:00 |
ooxmlParser.java
|
set a limit to CharBuffer object size to fight against bad/too large
|
2012-01-10 03:02:17 +01:00 |
pdfParser.java
|
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
|
2012-01-24 20:42:30 +01:00 |
pptParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
psParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
rssParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
rtfParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
sevenzipParser.java
|
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
|
2011-09-07 10:08:57 +00:00 |
sidAudioParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
sitemapParser.java
|
better abstraction of http client identification
|
2011-04-26 13:35:29 +00:00 |
swfParser.java
|
some last-minute performance hacks
|
2011-11-25 11:23:52 +00:00 |
tarParser.java
|
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
|
2011-09-07 10:08:57 +00:00 |
torrentParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
vcfParser.java
|
some last-minute performance hacks
|
2011-11-25 11:23:52 +00:00 |
vsdParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
xlsParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
zipParser.java
|
added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
|
2011-09-07 10:08:57 +00:00 |