.. |
augment
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
html
|
support scraping start-/enddate from html tag with property "datetime"
|
2016-01-26 21:27:44 +01:00 |
images
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
rdfa
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
xml
|
extract lastmodified from openoffice doc
|
2015-09-06 00:04:54 +02:00 |
apkParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
audioTagParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
bzipParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
csvParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
docParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
dwgParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
genericParser.java
|
eleminate dependency on file-extension in storeDocument but use supported mime-type
|
2016-08-14 03:53:16 +02:00 |
gzipParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
htmlParser.java
|
eleminate dependency on file-extension in storeDocument but use supported mime-type
|
2016-08-14 03:53:16 +02:00 |
linkScraperParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
mmParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
odtParser.java
|
fix delete of temp file after odt % ooxml parser
|
2016-03-04 23:05:55 +01:00 |
ooxmlParser.java
|
fix delete of temp file after odt % ooxml parser
|
2016-03-04 23:05:55 +01:00 |
pdfParser.java
|
upd to PDFBox 2.0.1
|
2016-05-20 23:12:16 +02:00 |
pptParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
psParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
rdfParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
rssParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
rtfParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
sevenzipParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
sidAudioParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
sitemapParser.java
|
removed unused imports
|
2016-09-06 18:46:24 +02:00 |
tarParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
torrentParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
vcfParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
vsdParser.java
|
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
|
2016-02-16 02:05:58 +01:00 |
xlsParser.java
|
refactor xlsParser to include Excel file attribute (like author) in parser result doc.
|
2016-08-13 23:46:36 +02:00 |
zipParser.java
|
removed unused imports
|
2016-09-06 18:46:24 +02:00 |