yacy_search_server/source/net/yacy/document
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not
only links where the content can be parsed. All non-parseable links are
placed into the noload queue. The search process must therefore be able
to filter out non-text search results.
- This fixes the problem that image search results appeared in the text
search.
- The interactive search can retrieve now ALL types of links
- The p2p interface is now extended to retrieve only certain types of
links (text, image, video, apps)
- The search process has an extension to filter the right document type
according to the search query
2012-04-22 02:05:17 +02:00
..
content added changes from copperdust (submitted by email): 2012-02-22 12:21:27 +01:00
geolocalization added autotaggig stub .. only reading and parsing of vocabularies at 2012-01-07 17:34:38 +01:00
importer !Important: move from Hashtable to HashMap 2012-01-09 01:29:18 +01:00
language added changes from copperdust (submitted by email): 2012-02-22 12:21:27 +01:00
parser fix for parser problem if a <a>-tag is 'within' html tags with unclosed 2012-04-18 10:30:04 +02:00
AbstractParser.java
Autotagging.java fix for single-word vocabulary lines 2012-01-26 16:44:30 +01:00
Condenser.java new indexing strategy: ALL links that appear anywhere are indexed, not 2012-04-22 02:05:17 +02:00
Document.java new indexing strategy: ALL links that appear anywhere are indexed, not 2012-04-22 02:05:17 +02:00
ImageParser.java
LargeNumberCache.java
LibraryProvider.java added autotagging to document condenser: 2012-01-15 22:17:57 +01:00
Parser.java
Phrase.java
SentenceReader.java
SnippetExtractor.java performance hack 2012-01-25 12:48:48 +01:00
StringBuilderComparator.java
TextParser.java refactoring: moved document Classification to cora package 2012-04-21 21:31:13 +02:00
WordCache.java vocabularies are now also used as source for a did-you-mean computation 2012-01-08 02:13:52 +01:00
WordTokenizer.java performance hack 2012-01-25 12:48:48 +01:00