yacy_search_server/source/net/yacy/document
orbiter ebd840ebf6 - enhanced description on search front page
- fixed language and heuristic modifier
- added hint to crawl start that we can also do ftp and smb crawls
- added a protocol extension to remote crawls to transport all search modifiers to remote peers

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8108 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-11-26 13:40:33 +00:00
content abstraction of surrogate main element (xmlns:geo was missing for wiki extracts) 2011-05-17 08:57:49 +00:00
geolocalization replaced String with StringBuilder in suggestion process 2011-11-09 14:42:55 +00:00
importer - tested the ARC methods 2011-11-25 14:09:25 +00:00
language enhanced identifier: using AtomicInteger for counter 2011-06-19 13:31:10 +00:00
parser some last-minute performance hacks 2011-11-25 11:23:52 +00:00
AbstractParser.java added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is the default now), then all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored in a solr index, if that is also enabled. 2011-09-07 10:08:57 +00:00
Classification.java - added an 'add every media object linked in an html document as a new document' feature to the html parser. As a result, every image, app, video or audio file that is linked in an html file is added as a document. This means that parsing a single html document may insert a number of documents into the search index. 2011-09-01 16:05:00 +00:00
Condenser.java replaced String with StringBuilder in suggestion process 2011-11-09 14:42:55 +00:00
Document.java some last-minute performance hacks 2011-11-25 11:23:52 +00:00
ImageParser.java - enhanced description on search front page 2011-11-26 13:40:33 +00:00
LargeNumberCache.java more performance hacks 2010-10-09 08:55:57 +00:00
LibraryProvider.java some last-minute performance hacks 2011-11-25 11:23:52 +00:00
Parser.java *) added SID file (Commodore 64) sound file parser 2010-12-28 12:06:04 +00:00
Phrase.java more performance hacks 2010-10-09 08:55:57 +00:00
SentenceReader.java *) set SVN properties 2011-03-08 01:51:51 +00:00
SnippetExtractor.java finishing up my commits (7855-7858) which could be helpful for 2011-08-01 23:35:24 +00:00
StringBuilderComparator.java replaced String with StringBuilder in suggestion process 2011-11-09 14:42:55 +00:00
TextParser.java fixed urls to media content during indexing 2011-11-09 15:40:14 +00:00
WordCache.java replaced String with StringBuilder in suggestion process 2011-11-09 14:42:55 +00:00
WordTokenizer.java replaced String with StringBuilder in suggestion process 2011-11-09 14:42:55 +00:00