Commit Graph

379 Commits

Author SHA1 Message Date
reger
5445f38070 upd to jetty 9.2.13.v20150730 2015-10-19 00:53:10 +02:00
reger
6ca02ad577 upd httpclient-4.5.1, httpmime-4.5.1, httpcore-4.4.3, commons-compress-1.10 2015-10-18 19:53:39 +02:00
luc
d6522fa4a2 Integrated haraldk/TwelveMonkeys library to first add TIF image format
support.
2015-10-15 10:06:51 +02:00
reger
43c27aa550 upd to solr/lucene 5.3.1 2015-10-03 23:20:33 +02:00
reger
fd5a1dc297 upd to poi-3.13 2015-10-03 21:43:41 +02:00
reger
c7c5e2dff9 fix old/obsolete solr dependency to stax
delete obsolete jar
2015-09-27 00:17:42 +02:00
reger
0f80bc8309 upd to jsoup-1.8.3 2015-08-19 22:46:48 +02:00
reger
92e5b217b6 upd to pdfbox-1.8.10 2015-08-01 00:25:40 +02:00
reger
165561706d upd to Solr-5.2.1 2015-07-30 00:16:09 +02:00
reger
7f49dbfbd1 upd to SLF4J-1.7.12 2015-07-27 00:57:19 +02:00
reger
807e3dc78a upd to httpclient-4.5 and httpmime-4.5 2015-07-26 00:53:40 +02:00
reger
202620b4a2 upd to icu4j-55.1.jar 2015-07-25 00:50:41 +02:00
reger
149e41f25b upd to jsch-0.1.53.jar 2015-07-21 22:31:34 +02:00
reger
30135d8964 upd to lib/weupnp-0.1.3.jar 2015-07-20 03:45:23 +02:00
Ryszard Goń
59096935d0 Use language-detection library for increased accuracy 2015-07-02 18:41:13 +02:00
Michael Peter Christen
694b22f165 migration to Solr 5.2: huge benefits - this is a lot faster!
This is a very complex migration: many classes had been renamed or
removed, dependencies changed and the solr index type is now aligned to
be a solr cloud repository.
Together with the Solr 5.2 library update, one other dependent library
had been updated as well: httpclient 4.4->4.4.1

Older indexes are migrated from 4_10 to 5_2. However, the new index
structure is more efficient and we recommend to re-index everything.
Please use the index export before you do the update to a large
surrogate xml file. After the update, start with an empty index and then
initialize this with your dump.
2015-06-24 01:55:51 +02:00
sixcooler
480e4a6a5c Update to Jetty-9.2.11 - a bugfix-release that did not solve my
Problems, but does not harm anything
2015-06-07 20:09:27 +02:00
reger
5183ad718d upd to poi-3.12.jar 2015-06-05 03:36:57 +02:00
reger
31346e873b upd library reference of missing jsch-0.1.21 in seeduploadscp.xml
upd to jsch-0.1.52.jar
2015-05-11 01:35:12 +02:00
reger
a6daddbeaa upd to commons-io-2.4.jar 2015-05-10 03:00:05 +02:00
reger
4d73e9de06 upd to metadata-extractor-2.8.1 2015-04-30 00:01:11 +02:00
reger
0904a041a6 upd to poi-3.11.jar 2015-04-29 01:53:04 +02:00
reger
b161473cd0 upd to jsoup-1.8.2 2015-04-26 17:41:05 +02:00
reger
642daad528 upd to httpcore 4.4.1 2015-04-08 22:42:30 +02:00
reger
01759e9af9 upd to PDFBox 1.8.9 2015-04-05 23:38:14 +02:00
reger
7cf28c4f94 upd to Jetty 9.2.10 2015-03-22 02:47:12 +01:00
Michael Peter Christen
c3aadcf899 Fix for Jetty "JetLeak" bug: update to jetty 9.2.9
The bug was inside the jetty library, for details see:
http://blog.gdssecurity.com/labs/2015/2/25/jetleak-vulnerability-remote-leakage-of-shared-buffers-in-je.html
We recommend to update your YaCy peer with this bugfix.
2015-02-28 15:46:46 +01:00
reger
49281617d2 upd to commons-codec-1.10.jar, commons-compress-1.9.jar 2015-02-14 23:04:05 +01:00
reger
6dbc976d8b upd to httpclient-4.4 2015-02-13 00:50:32 +01:00
reger
8fa542a8e1 upd to Jetty 9.2.7 2015-02-07 00:44:09 +01:00
sixcooler
5594c43d2e bump to Solr-/Lucene-4.10.3 2015-01-04 18:47:47 +01:00
reger
17808898c6 update to SLF4J 1.7.9 2014-12-23 19:11:21 +01:00
reger
5332c9df21 update to commons-fileupload-1.3.1.jar
(includes a security fix)
2014-12-22 20:34:13 +01:00
reger
fe5d4e6c7b update to Jetty 9.2.6 2014-12-19 21:54:17 +01:00
reger
eca578a5fa update to PDFBox 1.8.8 2014-12-19 02:54:38 +01:00
reger
0f7d4c42e9 include xmpcore.jar in classpath
used by metadata-extractor
2014-12-16 21:12:37 +01:00
reger
9edc7308aa update to metadata-extractor-2.7.0.jar
add 2 simple JUnit test cases for jpeg and tif parsing
2014-12-15 20:45:05 +01:00
Michael Peter Christen
413eeefed4 added character set detection library from
http://www-archive.mozilla.org/projects/intl/chardet.html
2014-12-10 13:08:29 +01:00
reger
8055ed5b2a update to commons-logging-1.2 2014-12-06 22:32:24 +01:00
reger
aa7122f079 update to guava.18.0.jar and jsch.0.1.51.jar 2014-11-30 19:43:53 +01:00
reger
4c14a8b44d update to poi-3.10.1.jar 2014-11-29 22:36:02 +01:00
reger
ef5c5b4489 update to Jetty 9.2.4 2014-11-28 20:24:39 +01:00
sixcooler
bad5dd7836 update to httpclient-4.3.6 2014-11-09 23:06:36 +01:00
sixcooler
725b206fb4 update to solr-/lucene-4.10.2 2014-11-07 18:51:31 +01:00
sixcooler
4a35570c90 bump to httpcore-4.3.3 2014-10-24 12:32:44 +02:00
reger
0773c73a19 upd to jsoup-1.8.1.jar 2014-10-03 20:54:45 +02:00
reger
cd0a16cf2e update to PDFBox 1.8.7 2014-09-28 03:18:18 +02:00
reger
0ecbf32134 update to Jetty 9.2.3 2014-09-27 23:27:05 +02:00
sixcooler
aa3e5f2046 bump to httpclient-4.3.5 2014-08-19 14:51:07 +02:00
reger
516c8c02b1 update to commons-compress-1.8.1
(bugfix release)
2014-08-18 00:04:55 +02:00
reger
8d1b764458 update to Jetty 9.2.2 2014-08-17 00:59:16 +02:00
Marc Nause
477be17c51 Replaced old UPNP library with Weupnp. UPNP should
work now, at least it does on my network. UPNP code in YaCy can still
be improved though (see TODO comment: make port on gateway configurable
or find free one).

*) removed old code
*) added new lib
*) changed code to work with new lib
2014-07-29 21:26:27 +02:00
reger
b4b937a046 update to pdfbox 1.8.6 2014-07-25 23:55:10 +02:00
Michael Peter Christen
09dcdb9b19 update to solr 4.9.0 2014-07-01 16:39:00 +02:00
Michael Peter Christen
282b53db42 update of commons-io and slf4j-api (as preparation for Solr 4.9.0) 2014-07-01 16:18:12 +02:00
sixcooler
66c784c552 bump to httpclient-4.3.4 2014-06-22 16:24:45 +02:00
reger
b9f6acee23 update to Jetty 9.2.1 2014-06-22 00:21:47 +02:00
orbiter
ee7416816b upgraded poi library (office document format parser) from 3.9 to 3.10 2014-05-22 15:53:07 +02:00
orbiter
a12701ddf6 upgraded bouncy caste libraries (needed for encrypted pdfs, dependency
in pdfbox) to 1.46
removed the activation.jar library; I don't know which other library
depends on it.
2014-05-22 12:09:21 +02:00
Michael Peter Christen
2520590b45 migrated from pdfbox 1.8.4 to 1.8.5. They have a very long bugfix list
for that update:
http://www.apache.org/dist/pdfbox/1.8.5/RELEASE-NOTES.txt
2014-05-21 22:48:41 +02:00
Michael Peter Christen
d4157184ec migration to Solr 4.8.1
This includes also an update to zookeeper 3.4.6 and a new library that
Solr initializes by default: org.restlet from
http://restlet.com/download/current#release=stable&edition=jse&distribution=zip
which is included in version 2.2.1 from may 6th 2014
2014-05-21 11:48:08 +02:00
reger
c947ee06bf remove redundant servlet-api 2014-05-11 02:37:00 +02:00
reger
e9060d31bd update to Jetty 9
besides adjustments in code it makes the servlet settings in web.xml significant.
This applies to solr, gsa and proxy servlet. There is no longer a default setup in code during init (as jetty 9 checks for double definition).
2014-05-11 01:53:11 +02:00
Michael Peter Christen
ebd44a7080 replaced solr 4.6.1 with solr 4.7.1 and added index migration to
lucene_47
2014-04-06 10:45:03 +02:00
reger
1a6e0354db update commons-compress.jar to 1.8 2014-04-06 03:59:11 +02:00
sixcooler
9afd7db043 bump to httpclient-4.3.3 2014-02-27 18:33:45 +01:00
sixcooler
fbfee8c583 add forgotten solrj-4.6.1 2014-02-10 14:48:07 +01:00
Michael Peter Christen
7eab0a2cb8 replaced guava-15.0 with guava-16.0.1 2014-02-10 11:55:40 +01:00
Michael Peter Christen
cb4fd27bbe replaced pdfbox 1.8.3 with 1.8.4 2014-02-10 11:49:48 +01:00
Michael Peter Christen
4fcddcf13b replaced solr 4.6.0. with 4.6.1 2014-02-10 11:42:11 +01:00
Michael Peter Christen
9f921c27f3 removed libraries which had been used by jena and were added together
with jena in commit 5d20cd324a, see:
5d20cd324a
2014-02-07 01:38:58 +01:00
Michael Peter Christen
6e59ca4ebf removed jena library and all code that depended on jena. When jena was
introduced, it was also used for search facets. The generic search
facets are now deduced from generic solr fields which makes jena as tool
for facet semantics superfluous.
2014-02-07 01:20:06 +01:00
sixcooler
0512e46c6a bump to httpclient-4.3.2 2014-01-22 01:31:22 +01:00
Michael Peter Christen
84cf7e8e9f backmigration from solrj 4.6.0 to 4.5.1. This is necessary because
solrj.4.6.0 has a bug which prevents the attachment of a remote solr (as
tested with a SolrCloud). See bug report
https://issues.apache.org/jira/browse/SOLR-5532
This bug shall be fixed in Solr 4.6.1.
Fortunately, solrj-4.5.1 works together with solr-4.6.0 thus the current
index does not need to be changed.
2014-01-15 17:18:32 +01:00
reger
37f2a82a5d making root context (htroot) a WebAppContext
- this allows additional features, like servlet configuration via web.xml and many more things.
- currently the standard servlets are still configured in the code (so the supplied defaults/web.xml is not realy needed, yet),
  but could be expanded
- lookup for web.xml - 1. in /DATA/SETTINGS then in /defaults
2014-01-10 10:42:47 +01:00
reger
1b6d173b14 update to Jetty 8.1.14 2014-01-06 08:48:43 +01:00
reger
2507722fbb updated poi-3.9 / poi-scratchpad-3.9 2013-12-25 22:31:41 +01:00
Michael Peter Christen
7603e879dc Merge branch 'master' into HEAD
Conflicts:
	.classpath
	source/net/yacy/cora/federate/solr/SolrServlet.java
2013-12-20 01:19:06 +01:00
Michael Peter Christen
8b97489ff2 updated guava to 15.0 2013-12-19 22:21:33 +01:00
Michael Peter Christen
34b4eda4a8 upgraded json-simple to 1.1.1 2013-12-19 22:16:01 +01:00
Michael Peter Christen
75ae36da9c upgraded jsch to 0.1.50 2013-12-19 22:13:34 +01:00
Michael Peter Christen
db793a2a5e removed mysql connector which was used only for testing in the past 2013-12-19 22:09:01 +01:00
Michael Peter Christen
7ebc74b76a migrated to pdfbox 1.8.3 2013-12-19 22:06:10 +01:00
Michael Peter Christen
2f16770681 migrated to solr 4.6.0 2013-12-19 21:51:05 +01:00
reger
06da6f517c add YaCyProxyServlet to handle /proxy.html?url=proxyurl
- based on Jetty ProxyServlet
- at this time use existing HTTPD ProxyHandler  for url rewrite
- add jetty-client jar (dependency in Jetty ProxyServlet)

reuse ProxyHandler.convertHeaderFromJetty in YaCyDefaultServlet
2013-10-27 05:04:24 +01:00
reger
1adb4b8741 merge rc1/master 2013-10-16 03:02:21 +02:00
sixcooler
dfb73c9519 bump to httpclient-4.3.1 - a bugfix release 2013-10-14 23:32:24 +02:00
reger
a44eede8b8 merge rc1/master 2013-10-11 01:50:25 +02:00
reger
71d2655c02 downgrade to Jetty 8 to assure support of JRE 1.6
- introduce a YaCyHttp interface to modulize/separate http server
- adjust the Jetty version specific implementation part (in package net.yacy.http)
     - putting the version specific code in classes starting with Jetty8xxxx
     - moved existing Jetty9xxx implementation into a test class (to keep the code)
- adjust build to the changed jars
- make use of the introduced YaCyHttpServer interface in related htroot servlets

- adjust other test cases/classes
2013-10-09 00:40:48 +02:00
Michael Peter Christen
21aa6a0321 migration to Solr 4.5.0 2013-10-07 17:09:40 +02:00
reger
c7c706fd9f merge with rc1/master 2013-09-30 03:46:39 +02:00
sixcooler
15b1bb2513 bump to httpClient-4.3 2013-09-25 14:48:37 +02:00
reger
f7f86d8a5d update to Jetty 9 jars
- include javax.servlet 3.0
2013-09-14 20:49:05 +02:00
reger
aafef72a8a merged current rc1/master into jetty branch to allow further development with latest version
ServerSideIncludes and servlet return values need further work (for working jetty integration)
- TODO: added nasty quickfix to allow SSI -  needs further work
- TODO: YaCy servlet return values/parameters are not handled
2013-09-09 02:36:06 +02:00
Michael Peter Christen
5b7c0d0745 update to pdfbox 1.8.2 2013-07-30 14:14:16 +02:00
Michael Peter Christen
f13df9dbb6 migration to solr 4.4.0 2013-07-30 14:01:16 +02:00
Roland Haeder
ebbb3bc5c1 Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet 2013-07-13 13:12:36 +02:00
Michael Peter Christen
c4538d8d91 added metadata-extractor-2.6.2.jar to eclipse classpath, removed old lib 2013-06-26 09:26:34 +02:00
reger
3760e2616b bump up lib/metadata-extractor-2.6.2.jar (used for image parser) with needed code adjustments 2013-06-25 23:24:02 +02:00
Michael Peter Christen
9bd2aee180 migrated to solr 4.3.0 2013-05-09 02:17:53 +02:00
Michael Peter Christen
ad050ec88d - upgraded httpclient, httpcore and httpmime
- removed httpclient 3.1 which has been used by solrj < 4.x.x and is now
not used any more
- fixed some parts in YaCy which used methods from httpclient 3.1
2013-05-09 00:22:45 +02:00
orbiter
48e9a54e80 updated pdf parser 2013-05-08 15:17:06 +02:00
Michael Peter Christen
27907c9739 added missing library after solr upgrade 2013-04-07 10:36:05 +02:00
Michael Peter Christen
cf0acd2cb4 upgrade to solr 4.2.1 2013-04-06 16:11:24 +02:00
Michael Peter Christen
461d46101d - Removed log4j from libraries. This can be removed because the package
log4j-over-slf4j is there. From slf4j all loggings are routed to the jdk
logger. Now all loggings are consistently done to the jdk logger.
- added some lines to the logging properties to suppress many solr
logging statements. The number of the logging entries had already become
a performance issue, therefore removing these from the log should
increase performance.
2013-02-23 16:45:05 +01:00
orbiter
36f9b0fc16 updated wstx-asl to 3.2.9 2013-02-23 14:33:17 +01:00
Michael Peter Christen
bc00097cbf arrr... forgot the new library 2013-02-04 12:02:37 +01:00
Michael Peter Christen
09a2b09c48 guava update 2013-02-04 11:21:05 +01:00
Michael Peter Christen
80fe3d7860 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
Conflicts:
	source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java
2013-02-04 10:57:54 +01:00
Michael Peter Christen
4323621a76 update to Solr 4.1.0 2013-02-04 10:55:49 +01:00
reger
160ce568b3 move testing SolrServlet.main to test, making include of jetty*.jar in distribution and classpath obsolete
- move jetty*.jar to test library 
- move SolrServlet.main as is to test, add also a junit test simulating main 
  - add build.xml cleanup for EmbeddedSolrConnectorTest created test/DATA
- adjust some test compile errors
2013-02-03 22:32:38 +01:00
sixcooler
f3e705c4fe bump to httpclient / httpcore 4.2.3 (bugfix-release) 2013-01-17 20:10:49 +01:00
Michael Peter Christen
9dfc9c95d8 updated slf4j and log4j 2012-12-27 04:37:21 +01:00
Michael Peter Christen
95712fdc8b update to pdf parser 2012-12-27 04:16:31 +01:00
Michael Peter Christen
e2c4c3c7d3 migration to solr 4.0.0 2012-11-02 12:29:48 +01:00
Michael Peter Christen
69aa39d664 update to libraries required by solr 4.0.0 2012-11-02 10:27:44 +01:00
sixcooler
02957d5982 missing license-files
(sorry I didn't commit theses files by mistake)
2012-10-31 23:47:08 +01:00
Michael Peter Christen
16216c2344 added missing libraries 2012-10-31 23:29:47 +01:00
sixcooler
9d062873d2 bump to httpclient-4.2.2 2012-10-31 19:09:48 +01:00
sof
5cb244b79b Merge remote branch 'origin/master' 2012-10-05 18:54:39 +02:00
apfelmaennchen
88b062210c Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based
on the jaudiotagger library. The parser is disabled by default as it
needs to store temporary files for non file:// protocols, which might be
disliked. For your local MP3-collection it loads nicely Artist,
Title, Album etc. from the audio files meta data.
2012-10-05 18:54:26 +02:00
sixcooler
9aa21506be bump to httpcore-4.2.2 (maintenance release) 2012-10-03 02:15:02 +02:00
Michael Peter Christen
d0015df61c added lucene memory library which is now necessary as solr has to
process more complex queries
2012-09-28 13:48:51 +02:00
Michael Peter Christen
e65cecc419 - updated lucene libraries to 3.6.1
- added lucene-grouping which enables faceted search; try this:
http://localhost:8090/solr/select?q=*:*&start=0&rows=3&facet=true&facet.field=host_s
2012-09-10 10:12:38 +02:00
Michael Peter Christen
2ccf1dba71 upgrade to solr 3.6.1 2012-08-17 15:11:21 +02:00
cominch
e74d66e28c augmented browsing: remove htmlparser library 2012-08-14 10:09:46 +02:00
cominch
e2119f4e76 augmented browsing: replace htmlparser by jsoup, which is more stable
and reliable
2012-08-14 10:06:12 +02:00
sixcooler
a99ef68422 bump to httpclient-4.2.1 2012-07-09 18:58:33 +02:00
Michael Peter Christen
65f56b1fd4 Merge branch 'master' of ssh://gitorious.org/yacy/rc1 into jetty
Conflicts:
	.classpath
	build.xml
	htroot/Status.java
	source/de/anomic/http/server/HTTPDProxyHandler.java
	source/net/yacy/yacy.java
2012-06-29 21:16:20 +02:00
Michael Peter Christen
7b53be141f upgraded to pdfbox 1.7.0
changes in http://www.apache.org/dist/pdfbox/1.7.0/RELEASE-NOTES.txt
with many bugfixes, including performance related
2012-06-22 16:49:58 +02:00
Michael Peter Christen
fad3b14813 added jetty libraries, needed for future use as web server and as
application server for the solr search interface
2012-06-22 15:31:17 +02:00
Michael Peter Christen
1be0025a9c - added test for EmbeddedSolrConnector
- added needed libraries for this test
this includes most (all) files needed for an embedded solr
2012-06-22 00:36:49 +02:00
Michael Peter Christen
3f55dc7c1e - added solr core and libraries that solr needs (lucene is missing, will
follow later)
- added embedded solr connector which can connect to solr
programmatically (without using a server in between)
2012-06-21 14:55:38 +02:00
cominch
5d20cd324a Add Triplestore and RDF query interface
Conflicts:
	build.xml
	defaults/yacy.init
	source/net/yacy/interaction/AugmentHtmlStream.java
2012-06-10 10:35:59 +02:00
cominch
b21048892b augmentedParser add features and integrate external html parser to
modify existing web pages

Conflicts:
	addon/YaCy.app/Contents/Info.plist
	build.xml
2012-06-10 10:23:35 +02:00
sixcooler
56087c1f23 bump to httpclient- httpcore-, httpmime- 4.2 2012-05-30 14:46:21 +02:00
Michael Peter Christen
4d3cc02168 replaced old bzip2 library against better documented commons-compress
package from http://commons.apache.org/compress/
2012-05-28 23:53:48 +02:00
Michael Peter Christen
62f2554a01 - fixed build problems (deprecated methods using httpclient 3.1)
- removed httpclient 3.1 lib which was used by solrj (solrj now uses
httpclient 4)
2012-04-27 17:46:08 +02:00
Michael Peter Christen
248299d10f updated solrj lib 2012-04-27 11:22:34 +02:00
Michael Peter Christen
f838997126 updated commons io from 2.0.1 to 2.1 2012-02-24 01:35:01 +01:00
Michael Peter Christen
eeb57ae824 updated http client libraries 2012-02-24 01:06:30 +01:00
Michael Peter Christen
a30b028cc0 updated libraries 2012-01-18 01:21:41 +01:00
orbiter
65ab067491 migration to solrj 3.4.0
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7952 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-09-14 20:08:59 +00:00
orbiter
dc25c48fc9 added more libraries that are needed by solrj
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7922 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-09-02 23:43:04 +00:00
sixcooler
52b477cf6f bump to httpclient-4.1.2, httpcore-4.1.3 - bugfixrelease
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7876 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-08-12 17:42:32 +00:00
sixcooler
48560a44a9 bump to httpcore-4.1.2: a bugfixrelease
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7853 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-25 00:48:29 +00:00
orbiter
d3c89b90ce temporary adding the old httpclient-3.1 again because the solrj classes need them. should be removed as soon solrj supports httpclient-4
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7831 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-04 17:04:49 +00:00
orbiter
768c59740c - replaced solrj 3.1 with solrj 3.3
- updated also slf4j
- added authentication for solrj


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7829 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-04 16:35:30 +00:00
orbiter
528b59e078 replaced xerces.jar library that was originally added 2005 with SVN 126 to the libx directory and that was moved to lib in SVN 5781
the new replacement is taken from http://xerces.apache.org and has the version 2.11.0 and was inside the file Xerces-J-bin.2.11.0.tar.gz
and consists of two files named xercesImpl.jar and xml-apis.jar
The original purpose of that library was to support:
- content parsers
- optional seed uploader
- SOAP API (which will be committed later)
Since the SOAP API does not exist any more the purpose is to support content parser and an optional seed uploader

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7819 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 22:33:35 +00:00
orbiter
e7e1a0f328 replaced commons-io v1.4 with v2.0.1
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7818 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 21:10:13 +00:00
orbiter
5092a14bcb replaced fontbox, jempbox, pdfbox v 1.5 with v1.6
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7817 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 20:52:33 +00:00
orbiter
77fe69395d added jempbox-1.5.0.jar which is required by pdfbox-1.5 as stated in http://pdfbox.apache.org/dependencies.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-05 20:04:41 +00:00
sixcooler
efcd21e0ed new httpclient, httcore (bugfixrelease)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7769 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-02 21:34:50 +00:00
orbiter
761b1c71dc added latest pdfbox
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7761 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-05-30 14:56:36 +00:00
apfelmaennchen
a0e4960a4d YMark:
- first attempt for a firefox json bookmark importer
- added JSON library json-simple-1.1.jar

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7658 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-15 20:58:58 +00:00
orbiter
e402622584 removed httpclient-3.1 (this was added with last commit which was a mistake)
the httpclient is required by solrj but no class from solrj is used which references to httpclient-3.1
Instead the YaCy http client library based on the apache http client 4.1 is used using a wrapper class
which is in net.yacy.cora.services.federated.solr.SolrHTTPClient

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7655 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-14 20:12:14 +00:00
orbiter
19fd13d3bc Added federated index storage to solr.
YaCy supports now the storage to remote solr indexes.
More federated storage (and search) methods may follow.

The remote index scheme is the same as produced by the SolrCell; see
http://wiki.apache.org/solr/ExtractingRequestHandler
Because this default scheme is used, the default example scheme can be used as solr configuration
This is also the same scheme that solr uses if documents are imported with apache tika.

federated solr storage is switched off by default.

To use this, do the following:
- set federated.service.solr.indexing.enabled = true
- download solr from http://www.apache.org/dyn/closer.cgi/lucene/solr/
- extract the solr (3.1) package, 'cd example' and start solr with 'java -jar start.jar'
- start yacy and then start a crawler. The crawler will fill both, YaCy and solr indexes.
- to check whats in solr after indexing, open http://localhost:8983/solr/admin/

Until now it is not possible to use the solr index to search with YaCy in that solr index.
This functionality is now available for two reasons:
1) to compare the functionality of Solr and YaCy and to compare the search speed
2) to use YaCy as a search appliance for people who need a crawler or other source harvesting methods
   that YaCy provides (like dublin core reading, wikimedia dump reading, rss feed reader etc) if people still
   want to use solr instead of YaCy.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7654 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-14 20:05:04 +00:00
Florian Richter
b6d67507db * implement proxy 2011-04-09 11:48:44 +02:00
Florian Richter
68ca0fbb2e * add copyright info
* implement basic authentication
* update jetty to 7.3.0
2011-03-15 00:33:36 +01:00
suessthomas
9956dc9dce Update jcifs-library to Version 1.3.15. Small Changes, read: http://jcifs.samba.org/ - "Minor adjustments have been applied to DcerpcHandle locking routines in the SID class to fix sporadic occurances of "All pipe instances are busy" errors under high load."
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-13 20:09:52 +00:00
Florian Richter
1989ba64c0 * jetty 2011-03-09 12:18:36 +01:00
sixcooler
45dcfa3460 update to httpclient-4.1
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-08 21:46:24 +00:00
sixcooler
f4357dff03 bump to httpclient-4.0.3 which fixes a number of bugs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7197 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-27 13:24:40 +00:00
f1ori
e670e1ef8e add charset auto-detection for htmlParser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7186 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-25 18:02:21 +00:00
orbiter
3552476fbe terminated migration from apache httpclient-3.1 to 4.1:
- remove the library
- added two classes from the httpclient-3.1 library as source code to YaCy because these classes were used by the YaCy HTTP Server
- modified the added classes ChunkedInputStream and ContentLengthInputStream in such a way that:
 * there are no more dependencies to httpclient-3.1
 * these classes had been simplified to serve only the purpose for the YaCy httpd

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7171 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-20 08:36:48 +00:00
orbiter
f9a27a05e5 migrated to log4j 1.2.16
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7153 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-15 09:18:35 +00:00
orbiter
5c67e6ca49 migrated to latest apache commons fileupload 1.2.2
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7152 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-15 08:54:41 +00:00
orbiter
848ef6937e licenses for pdfbox
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-07 17:17:08 +00:00
orbiter
5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
- added some clear statements that shall clear static cache size within the pdfbox library
- the pdfbox library contains a memory leak; it is unsafe to run a peer with pdf parser permanently on.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7120 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-07 17:13:47 +00:00
lotus
965aa97993 including sbbi upnplib as source again
http://www.sbbi.net/site/upnp/index.html

renamed package to yacy
all options are also named "yacy" instead of "sbbi"

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6986 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-07-22 18:02:16 +00:00
sixcooler
c5c67f0504 start migrating to HttpComponents-Client-4.x
see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2872

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6965 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-07-12 23:07:05 +00:00
orbiter
b5e190099d - updated pdfbox and fontbox to 1.1.0
- added license file to sbbi-upnplib

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6946 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-06-26 10:58:07 +00:00
orbiter
37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
- cleaned uo the code. The new eclipse helios provided new warnings for dead code. This change cleans up most of these warnings

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6945 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-06-26 10:32:47 +00:00
orbiter
24e5faee75 added exif parsing for jpg images
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6745 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-12 12:23:38 +00:00
orbiter
1bbe14d23f SVN 6716 unfortunately contained parts of the unfinished SMB integration. To fix compile errors the remaining parts of the SMB implementation stub is added with this commit.
This adds the jcifs smb library.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6717 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-05 21:46:22 +00:00
mikeworks
cc074c1a36 Renamed, removed and added license information for each jar archive in external lib folder
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6704 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-27 03:04:11 +00:00
orbiter
f5ec7ad077 replaced four old libraries with latest version
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6702 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-26 14:14:50 +00:00
mikeworks
475ffabfa1 Added License (Apache 1.1) information for Jakarta ORO library 2.0.7 (2.0.8 available)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6700 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-26 04:26:04 +00:00
mikeworks
50169759ca Replaced old pdfbox and fontbox LICENSE files with new ones (still Apache 2.0)
Testing delete and adding files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6689 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-22 07:09:57 +00:00
orbiter
1e2c011c98 updated the jsch lib from 0.1.21 to 0.1.42
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6688 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-21 23:43:50 +00:00
orbiter
c2b505ae87 updated bouncy castle libraries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6687 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-21 23:31:40 +00:00
orbiter
681f4d185f replaced microsoft office document parser POI 3.5 with latest version 3.6
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6686 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-21 23:18:52 +00:00
orbiter
e9cdddcd0f updated parser libraries fontbox and pdfbox with latest version of jar files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6685 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-21 23:05:38 +00:00
orbiter
9db928ce53 replaced fontbox 0.7.3 with fontbox 0.8.0
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6414 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-10-15 09:51:16 +00:00
lotus
0975b1b493 update for apache poi library
possible solves http://forum.yacy-websuche.de/viewtopic.php?p=17736#p17736

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6411 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-10-14 15:24:53 +00:00
f1ori
67da20647f * add new odf parser based on sax-xml-parser
* remove odf_utils-jar
* test metadata in ParserTest


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6231 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-18 15:04:34 +00:00
orbiter
eea4c17ef2 removed rpm parser
- no-one used that thing
- loading huge rpm files bay be causes for crashes


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6223 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-16 11:06:49 +00:00
lotus
477807e0e6 * updated jxpath to latest v1.3
* added upnplib as source
	without packages:
	jmx
	remote
	samples

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6218 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-15 16:13:24 +00:00
orbiter
49bbb9bd45 replaced tar library with integrated apache ant tar lib
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6212 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-14 11:31:40 +00:00
orbiter
3d26161dd1 removed unused libraries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6204 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-13 14:47:09 +00:00
orbiter
50cf80056f removed jmimemagic library
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6203 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-13 10:58:37 +00:00
orbiter
e3c7f61145 removed unused libraries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6202 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-13 10:21:22 +00:00
low012
b118bdd994 *) Deleted obsolete license file.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6199 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-12 16:38:13 +00:00
low012
f242e7d7bc *) using Apache POI library to parse Word documents now
*) removed tm-extractors library (can be found at http://www.textmining.org/ if necessary again)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6193 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-10 22:49:16 +00:00
orbiter
6fc18b9d51 removed unused ant targets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 15:01:42 +00:00
orbiter
302a02cec8 moved all libraries from libx to lib
removed libx directory
all libraries are now in lib, instead the test libraries in libt which are not part of releases

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6157 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 14:55:23 +00:00
orbiter
d793bb0d76 the mysql lib was not in releases included; moved library from libx to lib
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5987 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 20:23:28 +00:00
orbiter
43bcd192cd ups
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5786 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-08 15:56:38 +00:00
orbiter
4905a17f6a moved xerces.jar from libx to lib
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5781 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-06 14:45:33 +00:00
lotus
4aad461100 added UPnP support
YaCy can now automatically forward ports on home routers
off by default

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5609 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-02-14 13:12:08 +00:00