..
html
more fixes for double precision of coordinates
2012-06-04 23:37:41 +02:00
images
augmentedProxy, which forwards every proxy request to a
2012-06-10 10:15:34 +02:00
xml
added concurrency enhancement to xml parser
2012-06-04 23:35:56 +02:00
bzipParser.java
replaced old bzip2 library against better documented commons-compress
2012-05-28 23:53:48 +02:00
csvParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
docParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
dwgParser.java
removed (not all) warnings
2012-05-16 13:42:32 +02:00
genericParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
gzipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
htmlParser.java
bugfixes
2012-02-02 23:38:23 +01:00
mmParser.java
added concurrency enhancement to xml parser
2012-06-04 23:35:56 +02:00
odtParser.java
added concurrency enhancement to xml parser
2012-06-04 23:35:56 +02:00
ooxmlParser.java
added concurrency enhancement to xml parser
2012-06-04 23:35:56 +02:00
pdfParser.java
memory hacks
2012-02-02 07:37:00 +01:00
pptParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
psParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
rssParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
rtfParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
sevenzipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
sidAudioParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
sitemapParser.java
better abstraction of http client identification
2011-04-26 13:35:29 +00:00
swfParser.java
removed stack trace from swf parser since we cant do anything there
2012-05-21 02:27:06 +02:00
tarParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00
torrentParser.java
- performance hacks
2012-06-04 15:37:39 +02:00
vcfParser.java
some last-minute performance hacks
2011-11-25 11:23:52 +00:00
vsdParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
xlsParser.java
- enhanced html parser: recognized much more details in the content
2011-04-21 13:58:49 +00:00
zipParser.java
- Redesigned crawler and parser to accept embedded links from the NOLOAD
2012-04-24 16:07:03 +02:00