yacy_search_server

mirror of https://github.com/yacy/yacy_search_server.git synced 2024-09-21 00:00:13 +02:00

History

orbiter 85a5487d6d YaCy can now use the solr index to compute text snippets. This makes search result preparation MUCH faster because no document fetching and parsing is necessary any more. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7943 6c8d7289-2bf4-0310-a012-ef5d649a1542		2011-09-13 14:39:41 +00:00
..
content	abstraction of surrogate main element (xmlns:geo was missing for wiki extracts)	2011-05-17 08:57:49 +00:00
geolocalization	replaced more appearance of double values by float values	2011-02-02 00:06:29 +00:00
importer	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
language	enhanced identificator: using AtomicInteger for counter	2011-06-19 13:31:10 +00:00
parser	protection against OOM cases in image parser. See also bugs.yacy.net/view.php?id=54	2011-09-09 23:00:45 +00:00
AbstractParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
Classification.java	- added a 'add every media object linked in a html document as a new document' to the html parser. This causes that all image, app, video or audio file that is linked in a html file is added as document. In fact that means that parsing a single html document may cause that a number of documents is inserted into the search index.	2011-09-01 16:05:00 +00:00
Condenser.java	hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources:	2011-05-27 08:24:54 +00:00
Document.java	YaCy can now use the solr index to compute text snippets. This makes search result preparation MUCH faster because no document fetching and parsing is necessary any more.	2011-09-13 14:39:41 +00:00
ImageParser.java	reduce teh effect of 'Bildersuche findet generierte HTML-Seiten als Bilder'	2011-05-07 07:37:46 +00:00
LargeNumberCache.java	more performance hacks	2010-10-09 08:55:57 +00:00
LibraryProvider.java	added new word recommendation library in DictionaryLoader_p.html	2011-09-01 10:14:17 +00:00
Parser.java	*) added SID file (Commodore 64) sound file parser	2010-12-28 12:06:04 +00:00
Phrase.java	more performance hacks	2010-10-09 08:55:57 +00:00
SentenceReader.java	*) set SVN properties	2011-03-08 01:51:51 +00:00
SnippetExtractor.java	finishing up my commits (7855-7858) which could be helpful for	2011-08-01 23:35:24 +00:00
TextParser.java	added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.	2011-09-07 10:08:57 +00:00
WordCache.java	added new word recommendation library in DictionaryLoader_p.html	2011-09-01 10:14:17 +00:00
WordTokenizer.java	encapsulate potential expensive objects in TextSnippet to allow GC them asap	2011-08-11 21:07:52 +00:00