yacy_search_server/source/net/yacy/document
2015-12-18 02:35:44 +01:00
content fix dc:date in mediawikiimporter/document.writexml to use lastmodified 2015-12-17 02:53:10 +01:00
importer fix dc:date in mediawikiimporter/document.writexml to use lastmodified 2015-12-17 02:53:10 +01:00
language less logging in new language detection 2015-12-03 00:39:15 +01:00
parser let html scraper accept html5 <link rel="icon"> for favicon links 2015-12-17 00:36:08 +01:00
AbstractParser.java improve TextParser.mimeOf( fileextension ) by returning 1st defined in supported list. 2015-01-02 04:20:02 +01:00
Condenser.java refactoring: separated condenser and tokenizer 2015-07-01 18:28:18 +02:00
DateDetection.java add Portuguese month names to date recognition 2015-09-20 23:28:42 +02:00
Document.java fix (todo) remove redundant holding of email link nameproperty in parser document 2015-12-18 02:35:44 +01:00
ImageParser.java BMP and ICO image formats support: integrated /haraldk/TwelveMonkeys 2015-11-20 09:38:16 +01:00
LargeNumberCache.java more performance hacks 2010-10-09 08:55:57 +00:00
LibraryProvider.java fix link to DeReWo download file 2015-03-11 20:02:23 +01:00
Parser.java enhanced timezone management for indexed data: 2015-04-15 13:17:23 +02:00
Phrase.java more performance hacks 2010-10-09 08:55:57 +00:00
ProbabilisticClassifier.java added / corrected charset to be 1.7 compatible. 2015-08-10 20:53:20 +02:00
SentenceReader.java hacks to prevent storage of data longer than necessary during search and 2013-10-25 15:05:30 +02:00
SnippetExtractor.java skip unused call parameter for hashSentence() 2014-11-30 19:42:33 +01:00
TextParser.java optimize order of parsers to try 2015-11-04 21:52:02 +01:00
Tokenizer.java added enrichment of synonyms and vocabularies for imported documents 2015-07-02 00:23:50 +02:00
VocabularyScraper.java added enrichment of synonyms and vocabularies for imported documents 2015-07-02 00:23:50 +02:00
WordTokenizer.java added enrichment of synonyms and vocabularies for imported documents 2015-07-02 00:23:50 +02:00