24 Commits

Author SHA1 Message Date
orbiter
768c59740c - replaced solrj 3.1 with solrj 3.3
- updated also slf4j
- added authentication for solrj


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7829 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-04 16:35:30 +00:00
orbiter
60ee245486 setting startup options:
-Xss256k
and
-XX:ReservedCodeCacheSize=1024m 
after a malloc error appeared together with a crash of the JVM, which stated at the end of the log:

# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (malloc) failed to allocate 32756 bytes for ChunkPool::allocate
# Possible reasons:
#   The system is out of physical RAM or swap space
#   In 32 bit mode, the process size limit was hit
# Possible solutions:
#   Reduce memory load on the system
#   Increase physical memory or swap space
#   Check if swap backing store is full
#   Use 64 bit Java on a 64 bit OS
#   Decrease Java heap size (-Xmx/-Xms)
#   Decrease number of Java threads
#   Decrease Java thread stack sizes (-Xss)
#   Set larger code cache with -XX:ReservedCodeCacheSize=

This follows the last two points in the list of recommendations. To choose appropriate values, the default values from
http://www.oracle.com/technetwork/java/hotspotfaq-138619.html
and
http://www.oracle.com/technetwork/java/javase/tech/vmoptions-jsp-140102.html
were considered.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7823 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-03 21:33:24 +00:00
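
Note on 60ee245486: the following is a minimal Java sketch, not part of the commit, for checking which of the two startup options a running JVM actually received. The class name `JvmOptionCheck` and the helper `findOption` are hypothetical; only `-Xss256k` and `-XX:ReservedCodeCacheSize=1024m` come from the commit message.

```java
import java.lang.management.ManagementFactory;
import java.util.List;

public class JvmOptionCheck {

    /**
     * Returns the text that follows the given prefix in the first matching
     * JVM argument, or null if no argument starts with that prefix.
     * (Hypothetical helper, for illustration only.)
     */
    static String findOption(List<String> jvmArgs, String prefix) {
        for (String arg : jvmArgs) {
            if (arg.startsWith(prefix)) {
                return arg.substring(prefix.length());
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // Arguments passed to the JVM itself, not to the application.
        List<String> jvmArgs = ManagementFactory.getRuntimeMXBean().getInputArguments();
        System.out.println("-Xss: " + findOption(jvmArgs, "-Xss"));
        System.out.println("-XX:ReservedCodeCacheSize: "
                + findOption(jvmArgs, "-XX:ReservedCodeCacheSize="));
    }
}
```

Started with `java -Xss256k -XX:ReservedCodeCacheSize=1024m JvmOptionCheck`, the program should list both values; an option that was not passed is reported as null.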
orbiter
528b59e078 replaced the xerces.jar library, which was originally added in 2005 with SVN 126 to the libx directory and was moved to lib in SVN 5781.
The replacement is version 2.11.0 from http://xerces.apache.org, taken from the file Xerces-J-bin.2.11.0.tar.gz;
it consists of two files named xercesImpl.jar and xml-apis.jar.
The original purpose of that library was to support:
- content parsers
- optional seed uploader
- SOAP API (which will be committed later)
Since the SOAP API no longer exists, the library now only supports the content parsers and the optional seed uploader.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7819 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 22:33:35 +00:00
orbiter
e7e1a0f328 replaced commons-io v1.4 with v2.0.1
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7818 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 21:10:13 +00:00
orbiter
5092a14bcb replaced fontbox, jempbox, pdfbox v1.5 with v1.6
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7817 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-02 20:52:33 +00:00
orbiter
7bf39c8bcf added -XX:MaxPermSize to debian and mac start scripts
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7813 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-07-01 22:50:46 +00:00
orbiter
77fe69395d added jempbox-1.5.0.jar which is required by pdfbox-1.5 as stated in http://pdfbox.apache.org/dependencies.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-05 20:04:41 +00:00
sixcooler
efcd21e0ed new httpclient, httpcore (bugfix release)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7769 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-02 21:34:50 +00:00
orbiter
761b1c71dc added latest pdfbox
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7761 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-05-30 14:56:36 +00:00
sixcooler
0abd99621c corrected a slip of the mouse click in the classpath from the last commit - I wonder whether there are 7658s around
apflemaenchen, please don't take this amiss

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7659 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-16 03:08:25 +00:00
orbiter
19fd13d3bc Added federated index storage to solr.
YaCy now supports storage to remote Solr indexes.
More federated storage (and search) methods may follow.

The remote index schema is the same as the one produced by SolrCell; see
http://wiki.apache.org/solr/ExtractingRequestHandler
Because this default schema is used, the default example schema can be used as the Solr configuration.
This is also the same schema that Solr uses if documents are imported with Apache Tika.

federated solr storage is switched off by default.

To use this, do the following:
- set federated.service.solr.indexing.enabled = true
- download solr from http://www.apache.org/dyn/closer.cgi/lucene/solr/
- extract the solr (3.1) package, 'cd example' and start solr with 'java -jar start.jar'
- start YaCy and then start a crawler. The crawler will fill both the YaCy and the Solr indexes.
- to check what's in Solr after indexing, open http://localhost:8983/solr/admin/

So far it is not possible to search in that Solr index with YaCy.
The storage functionality is made available now for two reasons:
1) to compare the functionality and the search speed of Solr and YaCy
2) to use YaCy as a search appliance for people who need a crawler or the other source harvesting methods
   that YaCy provides (such as Dublin Core reading, Wikimedia dump reading, RSS feed reading etc.) but who still
   want to use Solr instead of YaCy.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7654 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-14 20:05:04 +00:00
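
Note on 19fd13d3bc: the following is a minimal Java sketch, not taken from YaCy, of how the switch named in the first step could be read. It assumes the setting is stored in Java properties format, which the commit message does not state; the class name `SolrFlagCheck` and its method are hypothetical. Only the key `federated.service.solr.indexing.enabled` and the "off by default" behaviour come from the commit message.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

public class SolrFlagCheck {

    // Key as written in the commit message.
    static final String KEY = "federated.service.solr.indexing.enabled";

    /**
     * Parses configuration text in properties format (an assumption) and
     * reports whether federated Solr indexing is switched on.
     * A missing key counts as "off", matching the stated default.
     */
    static boolean solrIndexingEnabled(String configText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(configText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return Boolean.parseBoolean(props.getProperty(KEY, "false").trim());
    }

    public static void main(String[] args) {
        System.out.println(solrIndexingEnabled(KEY + " = true"));
        System.out.println(solrIndexingEnabled("# no solr setting present"));
    }
}
```

`Properties.load` accepts whitespace around the `=` sign, so the line from the commit message can be used exactly as written there.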
suessthomas
9956dc9dce Update jcifs-library to Version 1.3.15. Small Changes, read: http://jcifs.samba.org/ - "Minor adjustments have been applied to DcerpcHandle locking routines in the SID class to fix sporadic occurances of "All pipe instances are busy" errors under high load."
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-13 20:09:52 +00:00
sixcooler
45dcfa3460 update to httpclient-4.1
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-08 21:46:24 +00:00
orbiter
75964909aa added missing path to htroot (may only be necessary for cross-linking of servlet classes)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7209 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-29 22:19:41 +00:00
orbiter
fc3ee9d8a1 changed paths for new libs also in Info.plist
when changing paths in build.xml, please also update the paths in /addon/YaCy.app/Contents/Info.plist

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7198 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-27 14:49:51 +00:00
orbiter
0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts
- added missing entry for new icu4j path in Mac App

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7188 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-25 21:10:50 +00:00
orbiter
3552476fbe completed the migration from Apache httpclient-3.1 to 4.1:
- removed the library
- added two classes from the httpclient-3.1 library as source code to YaCy because these classes were used by the YaCy HTTP Server
- modified the added classes ChunkedInputStream and ContentLengthInputStream in such a way that:
 * there are no more dependencies on httpclient-3.1
 * these classes are simplified to serve only the purpose of the YaCy httpd

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7171 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-20 08:36:48 +00:00
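
Note on 3552476fbe: to illustrate what a length-limited body stream for an HTTP server does, here is a minimal Java sketch that uses only the JDK. It is not the code taken from httpclient-3.1 and not YaCy's ContentLengthInputStream; the class name `BoundedBodyStream` and the helper `readBody` are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

/** Exposes at most contentLength bytes of the wrapped stream, then reports end of stream. */
public class BoundedBodyStream extends FilterInputStream {

    private long remaining;

    public BoundedBodyStream(InputStream in, long contentLength) {
        super(in);
        this.remaining = contentLength;
    }

    @Override
    public int read() throws IOException {
        if (remaining <= 0) return -1;
        int b = super.read();
        if (b >= 0) remaining--;
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        if (remaining <= 0) return -1;
        // Never ask the underlying stream for more than the body still holds.
        int n = super.read(buf, off, (int) Math.min(len, remaining));
        if (n > 0) remaining -= n;
        return n;
    }

    @Override
    public long skip(long n) throws IOException {
        long skipped = super.skip(Math.min(n, remaining));
        remaining -= skipped;
        return skipped;
    }

    @Override
    public int available() throws IOException {
        return (int) Math.min(super.available(), remaining);
    }

    @Override
    public boolean markSupported() {
        return false; // mark/reset would bypass the byte counter
    }

    /** Demo helper: reads the first contentLength bytes of raw as ASCII text. */
    static String readBody(byte[] raw, long contentLength) {
        try (InputStream body = new BoundedBodyStream(new ByteArrayInputStream(raw), contentLength)) {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[4];
            int n;
            while ((n = body.read(buf)) != -1) out.write(buf, 0, n);
            return new String(out.toByteArray(), StandardCharsets.US_ASCII);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] raw = "helloNEXT-REQUEST".getBytes(StandardCharsets.US_ASCII);
        System.out.println(readBody(raw, 5));
    }
}
```

The limit matters on persistent connections, where the bytes after the declared body length belong to the next request and must not be consumed by the body reader.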
orbiter
f9a27a05e5 migrated to log4j 1.2.16
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7153 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-15 09:18:35 +00:00
orbiter
5c67e6ca49 migrated to latest apache commons fileupload 1.2.2
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7152 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-15 08:54:41 +00:00
orbiter
5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
- added some clear statements that are meant to empty the static caches within the pdfbox library
- the pdfbox library contains a memory leak; it is unsafe to run a peer with the pdf parser permanently switched on.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7120 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-07 17:13:47 +00:00
orbiter
2cb8cf5b41 added overall utf-8 default support for mac releases
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7099 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-03 22:34:08 +00:00
orbiter
9d080f387e changed the handling of the all-visible home path for storage in YaCy:
the home path is now split into
- data home; the path where the DATA directory is created
- application home; everything else
This will make it possible to store application data on Mac releases within the
~/Library/YaCy
directory; a place where Mac applications write their data.
Similar techniques will be possible for debian and windows.
To use the new data path, YaCy can be started with
-start <data path>
or
-gui <data path>


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7092 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-02 19:24:22 +00:00
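
Note on 9d080f387e: the following is a minimal Java sketch, not YaCy's actual startup code, of the split between data home and application home. The class `HomePaths` and its methods are hypothetical, and falling back to the application home when no data path is given is an assumption; the options `-start <data path>` and `-gui <data path>` and the DATA directory name come from the commit message.

```java
import java.io.File;

public class HomePaths {

    /**
     * Returns the data home: the path given after -start or -gui, or
     * (as an assumption) the application home if no path was given.
     */
    static File resolveDataHome(String[] args, File applicationHome) {
        boolean hasMode = args.length >= 1
                && (args[0].equals("-start") || args[0].equals("-gui"));
        if (hasMode && args.length >= 2) {
            return new File(args[1]);
        }
        return applicationHome;
    }

    /** The DATA directory is created inside the data home. */
    static File dataDirectory(File dataHome) {
        return new File(dataHome, "DATA");
    }

    public static void main(String[] args) {
        File applicationHome = new File(System.getProperty("user.dir"));
        File dataHome = resolveDataHome(args, applicationHome);
        System.out.println("application home: " + applicationHome);
        System.out.println("data directory:   " + dataDirectory(dataHome));
    }
}
```

On a Mac release the data path argument could, for example, point to the `Library/YaCy` directory below the user's home directory, as described in the commit message.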
lotus
091281c9f2 Mac app ant task building a ready-to-distribute zip file
extending r7080

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7089 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-02 08:01:01 +00:00
orbiter
838e8a1cb7 added a Macintosh OSX application wrapper.
You can now create a YaCy Mac application like this:
- build a new version of YaCy
- take everything that is in the yacy home directory (from the fresh build) and move it into a copy of the YaCy.app directory under YaCy.app/Contents/Resources/Java/
- ready: double-click the YaCy.app icon and the new YaCy gui starts; the browser is then opened as well.
- if the Mac application is terminated, the YaCy daemon should also be stopped automatically, and vice versa. This does not work reliably yet and needs some more care.
A build script for this is missing. Does anyone know how to do that in ant?

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7023 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-08-05 11:11:15 +00:00