yacy_search_server/source/de/anomic/search
orbiter 19fd13d3bc Added federated index storage to solr.
YaCy supports now the storage to remote solr indexes.
More federated storage (and search) methods may follow.

The remote index scheme is the same as produced by the SolrCell; see
http://wiki.apache.org/solr/ExtractingRequestHandler
Because this default scheme is used, the default example scheme can be used as solr configuration
This is also the same scheme that solr uses if documents are imported with apache tika.

federated solr storage is switched off by default.

To use this, do the following:
- set federated.service.solr.indexing.enabled = true
- download solr from http://www.apache.org/dyn/closer.cgi/lucene/solr/
- extract the solr (3.1) package, 'cd example' and start solr with 'java -jar start.jar'
- start yacy and then start a crawler. The crawler will fill both, YaCy and solr indexes.
- to check whats in solr after indexing, open http://localhost:8983/solr/admin/

Until now it is not possible to use the solr index to search with YaCy in that solr index.
This functionality is now available for two reasons:
1) to compare the functionality of Solr and YaCy and to compare the search speed
2) to use YaCy as a search appliance for people who need a crawler or other source harvesting methods
   that YaCy provides (like dublin core reading, wikimedia dump reading, rss feed reader etc) if people still
   want to use solr instead of YaCy.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7654 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-14 20:05:04 +00:00
..
AccessTracker.java - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion 2011-03-10 23:25:07 +00:00
ContentDomain.java *) set SVN properties 2011-03-08 01:51:51 +00:00
DocumentIndex.java moved getBytes() to UTF8.getBytes() to use a default String encoding 2011-03-10 12:35:32 +00:00
IndexAbstracts.java
MediaSnippet.java replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'. 2011-03-07 20:36:40 +00:00
MetadataRepository.java enhanced speed for OrderedScoreMap inc method and size comparisment in concurrent environments 2011-04-13 22:04:23 +00:00
QueryParams.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
RankingProcess.java enhanced speed for OrderedScoreMap inc method and size comparisment in concurrent environments 2011-04-13 22:04:23 +00:00
RankingProfile.java always try to guess the size of a StringBuilder to prevent too many memory re-allocations 2011-03-09 09:29:05 +00:00
ReferenceOrder.java redesigned the ScoreMap classes: 2011-03-13 01:41:44 +00:00
ResultEntry.java the location search shows now not re-evaluated locations but only such locations that are attached as metadata to web pages 2011-03-30 23:26:36 +00:00
ResultFetcher.java - enhancements in surrogate reading 2011-04-02 00:11:37 +00:00
SearchEvent.java *) fixed NPE which occured when empty String was entered as search word 2011-03-23 00:44:38 +00:00
SearchEventCache.java added more configuration options for search: 2011-03-21 07:50:34 +00:00
Segment.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
Segments.java moved getBytes() to UTF8.getBytes() to use a default String encoding 2011-03-10 12:35:32 +00:00
Switchboard.java Added federated index storage to solr. 2011-04-14 20:05:04 +00:00
SwitchboardConstants.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
TextSnippet.java added more configuration options for search: 2011-03-21 07:50:34 +00:00