yacy_search_server

mirror of https://github.com/yacy/yacy_search_server.git synced 2024-09-19 00:01:41 +02:00

History

luccioman e357ade47d Reduced memory footprint of text snippet extraction By not parsing and storing at first all sentences of a document, but only on the fly the ones necessary to compute the snippet.		2018-05-13 10:29:52 +02:00
..
content	Ensure lower case conversion consistency with any default locale.	2017-06-27 06:42:33 +02:00
importer	added a crawl filter based on <div> tag class names	2017-12-09 22:29:35 +01:00
language	Fixed language detector initialization and NullPointerException cases.	2016-12-05 18:12:21 +01:00
parser	Added RSS reader support for `enclosure` feed item sub element.	2018-03-20 07:38:29 +01:00
AbstractParser.java	added a crawl filter based on <div> tag class names	2017-12-09 22:29:35 +01:00
Condenser.java	Added basic support for autotagging microdata annotated item types.	2018-02-06 10:25:38 +01:00
DateDetection.java	Remove old hard-coded holiday dates from DateDection class.	2017-11-07 19:02:09 +01:00
Document.java	Added a crawl filtering possibility on documents Media Type (MIME)	2018-03-23 10:28:19 +01:00
ImageParser.java	BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys	2015-11-20 09:38:16 +01:00
LargeNumberCache.java	Cleaned up some Javadoc warnings.	2017-01-09 16:44:47 +01:00
LibraryProvider.java	Cleaned up some Javadoc warnings.	2017-01-09 16:44:47 +01:00
Parser.java	added a crawl filter based on <div> tag class names	2017-12-09 22:29:35 +01:00
Phrase.java	more performance hacks	2010-10-09 08:55:57 +00:00
ProbabilisticClassifier.java	Fixed a NullPointerException case.	2016-12-02 13:45:45 +01:00
SentenceReader.java	Reduced memory footprint of text snippet extraction	2018-05-13 10:29:52 +02:00
SnippetExtractor.java	Reduced memory footprint of text snippet extraction	2018-05-13 10:29:52 +02:00
TextParser.java	added a crawl filter based on <div> tag class names	2017-12-09 22:29:35 +01:00
Tokenizer.java	Refactoring : documented and extracted autotagging processing functions.	2018-02-02 10:27:36 +01:00
VocabularyScraper.java	added enrichment of synonyms and vocabularies for imported documents	2015-07-02 00:23:50 +02:00
WordTokenizer.java	Reduced text snippet extraction processing time.	2018-05-11 15:42:53 +02:00