.. |
html
|
add-on to latest commit
|
2012-05-21 17:52:30 +02:00 |
images
|
smaller bug fixes for search behavior; should produce less unnecessary removals and an exact number of results as shown in counter
|
2011-11-18 13:09:07 +00:00 |
xml
|
some last-minute performance hacks
|
2011-11-25 11:23:52 +00:00 |
bzipParser.java
|
replaced old bzip2 library against better documented commons-compress
|
2012-05-28 23:53:48 +02:00 |
csvParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
docParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
dwgParser.java
|
removed (not all) warnings
|
2012-05-16 13:42:32 +02:00 |
genericParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
gzipParser.java
|
- Redesigned crawler and parser to accept embedded links from the NOLOAD
|
2012-04-24 16:07:03 +02:00 |
htmlParser.java
|
bugfixes
|
2012-02-02 23:38:23 +01:00 |
mmParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
odtParser.java
|
set a limit to CharBuffer object size to fight against bad/too large
|
2012-01-10 03:02:17 +01:00 |
ooxmlParser.java
|
set a limit to CharBuffer object size to fight against bad/too large
|
2012-01-10 03:02:17 +01:00 |
pdfParser.java
|
memory hacks
|
2012-02-02 07:37:00 +01:00 |
pptParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
psParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
rssParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
rtfParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
sevenzipParser.java
|
- Redesigned crawler and parser to accept embedded links from the NOLOAD
|
2012-04-24 16:07:03 +02:00 |
sidAudioParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
sitemapParser.java
|
better abstraction of http client identification
|
2011-04-26 13:35:29 +00:00 |
swfParser.java
|
removed stack trace from swf parser since we cant do anything there
|
2012-05-21 02:27:06 +02:00 |
tarParser.java
|
- Redesigned crawler and parser to accept embedded links from the NOLOAD
|
2012-04-24 16:07:03 +02:00 |
torrentParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
vcfParser.java
|
some last-minute performance hacks
|
2011-11-25 11:23:52 +00:00 |
vsdParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
xlsParser.java
|
- enhanced html parser: recognized much more details in the content
|
2011-04-21 13:58:49 +00:00 |
zipParser.java
|
- Redesigned crawler and parser to accept embedded links from the NOLOAD
|
2012-04-24 16:07:03 +02:00 |