html
enhanced location search:
2012-05-31 22:39:53 +02:00
images
minor bug fixes for search behavior; should produce fewer unnecessary removals and the exact number of results shown in the counter
2011-11-18 13:09:07 +00:00
xml
some last-minute performance hacks
2011-11-25 11:23:52 +00:00
bzipParser.java
replaced old bzip2 library with the better-documented commons-compress
2012-05-28 23:53:48 +02:00
csvParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
docParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
dwgParser.java
removed some (not all) warnings
2012-05-16 13:42:32 +02:00
genericParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
gzipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
htmlParser.java
bugfixes
2012-02-02 23:38:23 +01:00
mmParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
odtParser.java
set a limit to CharBuffer object size to fight against bad/too large
2012-01-10 03:02:17 +01:00
ooxmlParser.java
set a limit to CharBuffer object size to fight against bad/too large
2012-01-10 03:02:17 +01:00
pdfParser.java
memory hacks
2012-02-02 07:37:00 +01:00
pptParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
psParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
rssParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
rtfParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
sevenzipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
sidAudioParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
sitemapParser.java
better abstraction of http client identification
2011-04-26 13:35:29 +00:00
swfParser.java
removed stack trace from swf parser since we can't do anything there
2012-05-21 02:27:06 +02:00
tarParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
torrentParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
vcfParser.java
some last-minute performance hacks
2011-11-25 11:23:52 +00:00
vsdParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
xlsParser.java
- enhanced html parser: recognized many more details in the content
2011-04-21 13:58:49 +00:00
zipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00