Commit Graph

342 Commits

Author SHA1 Message Date
luccioman
baa7154486 Upgraded Apache PDFBox dependency from 2.0.9 to 2.0.11
Release notes at
https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310760&version=12343466
and https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12310760&version=12342889
2018-08-18 12:39:58 +02:00
luccioman
685122363d Added a parser for XZ compressed archives.
As suggested by LA_FORGE on mantis 781
(http://mantis.tokeek.de/view.php?id=781)
2018-08-15 10:07:39 +02:00
luccioman
1a91e87b05 Upgraded commons-compress dependency from version 1.16.1 to 1.17 2018-08-08 08:06:24 +02:00
luccioman
f2c479fe88 Cleaned up unused old jar files not removed on previous Solr upgrade 2018-08-08 08:04:33 +02:00
luccioman
2bbf070f57 Upgraded icu4j dependency from 61.1 to 62.1 2018-07-26 09:39:54 +02:00
luccioman
8811700e2e Upgraded Jetty dependency from 9.4.9 to 9.4.11 2018-06-20 09:33:26 +02:00
reger
d5af160e60 upd to slf4j-1.7.25 2018-05-20 21:51:41 +02:00
reger
7525594315 upd to jwat-warc-1.1.1 2018-05-06 00:49:30 +02:00
reger
b81debca2e upd to jsoup-1.11.3 2018-04-28 23:24:24 +02:00
reger
508050f79c upd to icu4j-61.1 2018-04-14 16:16:35 +02:00
reger
e7971fb888 upd to pdfbox-2.0.9 2018-04-08 20:13:53 +02:00
reger
e2b2c89feb upd to jetty-9.4.9.v20180320 2018-04-07 23:39:03 +02:00
luccioman
c867a52d96 Upgraded Solr dependencies from 6.6.2 to 6.6.3 2018-04-05 18:15:45 +02:00
reger
a57a04a003 upd to commons-codec-1.11 2018-03-19 02:02:35 +01:00
luccioman
5753ce0ac5 Upgraded Jaudiotagger dependency from 2.0.3 to 2.2.5 2018-02-26 09:17:26 +01:00
reger
aaa0ec6613 upd to commons-compress-1.16.1 2018-02-23 19:17:09 +01:00
reger
73c6ce7ae5 upd to httpclient-4.5.5 2018-02-10 20:01:35 +01:00
reger
5aa4fb1144 upd to metadata-extractor-2.11.0.jar 2018-01-27 18:32:45 +01:00
reger
cedb53be4e upd to commons-io-2.6 2017-12-28 03:13:42 +01:00
reger
270b77074e upd to httpclient-4.5.4 and httpmime-4.5.4 2017-12-24 01:34:23 +01:00
reger
6db7f5525b upd to icu4j-60.2 2017-12-24 01:02:18 +01:00
reger
c94bc82f6a upd to commons-compress-1.15 2017-12-16 00:49:48 +01:00
reger
e5b4799838 upd to Jetty-9.4.8.v20171121 2017-12-07 00:24:33 +01:00
reger
0704b1d644 upd to httpcore-4.4.8 2017-12-04 01:12:50 +01:00
reger
a1879115dc upd to Jsoup-1.11.2 2017-11-26 22:01:42 +01:00
luccioman
01dca12d05 Upgraded apache POI dependency from 3.16 to 3.17 2017-11-22 09:07:36 +01:00
luccioman
8f07df5f85 Upgraded com.twelvemonkeys.imageio dependencies from 3.3.1 to 3.3.2 2017-11-09 09:30:20 +01:00
luccioman
f61260c4c7 Upgraded icu4j dependency from 59_1 to 60.1 2017-11-06 09:37:44 +01:00
reger
d14c47d4d3 upd to pdfbox-2.0.8.jar 2017-11-05 00:52:14 +01:00
reger
b98acb33c3 upd to Solr 6.6.2 2017-10-22 20:00:00 +02:00
reger
cbaa492054 upd to Jetty-9.4.7.v20170914 2017-10-02 00:50:30 +02:00
reger
c4a7ad2865 update jars for upd solr 6.6. commit for ant 2017-09-17 08:25:14 +02:00
luccioman
366ceae35a Fixed missing transitive dependency to commons-collections4-4.1
Dependency required by poi-3.16. 

Dependency was not provided in YaCy but already defined on previous poi
versions. This only became problematic since upgrade from poi-3.15 to
poi-3.16 (commit dedc6552d3). Indeed in
this new poi release, a poi component used in some YaCy parsers code
paths now explicitely needs a class from the commons-collections4
library : org.apache.poi.hpsf.Section uses now
org.apache.commons.collections4.bidimap.TreeBidiMap.

Impacted YaCy parsers : xlsParser, pptParser, docParser.

Issue detected by the folowing JUnit tests failing :
ParserTest.testpptParsers(), ParserTest.testdocParsers(),
xlsParserTest.testParse()
2017-08-11 20:50:36 +02:00
reger
119b65389d upde to icu4j-59_1.jar 2017-08-10 23:57:37 +02:00
reger
fb71994342 Harmonizing use of xml reader / sax parser in XMLBlacklistImporter
eliminating the need for lib/xercesImpl.jar
2017-08-05 23:47:27 +02:00
reger
dedc6552d3 upd to poi-3.16.jar 2017-07-31 23:38:10 +02:00
reger
37f44941fb upd to pdfbox-2.0.7.jar 2017-07-30 20:09:06 +02:00
reger
44d455dfed upd to jwat-warc-1.1.0.jar 2017-07-16 23:37:28 +02:00
reger
af32d291c2 upd to commons-fileupload-1.3.3.jar 2017-07-08 23:46:10 +02:00
reger
e6e20dab52 upd to Jetty 9.4.6.v20170531
Modify loginservice to the changes in Jetty, partially based on pull 
request #101 https://github.com/yacy/yacy_search_server/pull/101 bu @automenta
2017-07-01 23:58:28 +02:00
reger
aeeb8a7dd5 upd to jwat-warc-1.0.6.jar 2017-06-25 20:05:37 +02:00
reger
f0ba828627 remove unused Solr optional extra handler lib solr-dataimporthandler-6.6.0.jar 2017-06-24 23:15:25 +02:00
reger
1773b61b3e upd to jsoup-1.10.3.jar 2017-06-24 22:54:43 +02:00
Michael Peter Christen
6fe735945d migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8
Also: now Version 1.921
2017-06-09 12:25:23 +02:00
reger
c42d17f607 upd to commons-compress-1.14.jar 2017-06-03 21:58:04 +02:00
reger
b65a04087b upd to pdfbox-2.0.6.jar 2017-05-24 22:13:42 +02:00
reger
2b03e40134 upd to jwat-1.0.5 2017-04-22 23:32:40 +02:00
reger
46a4aaf09c upd to Solr-5.5.4 2017-04-06 21:18:01 +02:00
reger
eddb7a9804 upd to pdfbox-2.0.5.jar and transient dependency xmpcore-5.1.3.jar
required by metadata-extractor-2.10.1 (fix build.xml compiler warning)
2017-04-04 00:59:26 +02:00
reger
510f11d374 Implement surrogate import from Warc archives (as first option handle
warc = Web ARChive File Format.
Warc files with extension .warc or compressed warc.gz can be placed in the
DATA/surrogate/in and contained responses are imported to the index.
The used library is stream based so we can easily extend it later to use
and load warc's from the net.
2017-03-31 00:58:11 +02:00