yacy_search_server/source/net/yacy/document
2020-04-24 11:45:25 +02:00
content | Small perf improvement : initialize threads names early when possible | 2018-05-23 14:45:35 +02:00
importer | remove warnings: 'Statement unnecessarily nested within else clause' | 2019-01-10 20:02:57 +01:00
language | Fixed language detector initialization and NullPointerException cases. | 2016-12-05 18:12:21 +01:00
parser | Support parsing audio URLs without file extension | 2019-04-09 11:40:21 +02:00
AbstractParser.java | added a crawl filter based on <div> tag class names | 2017-12-09 22:29:35 +01:00
Condenser.java | Added basic support for autotagging microdata annotated item types. | 2018-02-06 10:25:38 +01:00
DateDetection.java | Removed remaining unsafe accesses to SimpleDateFormat instances. | 2018-07-02 10:00:40 +02:00
Document.java | Added a crawl filtering possibility on documents Media Type (MIME) | 2018-03-23 10:28:19 +01:00
ImageParser.java | BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys | 2015-11-20 09:38:16 +01:00
LargeNumberCache.java | Cleaned up some Javadoc warnings. | 2017-01-09 16:44:47 +01:00
LibraryProvider.java | Upgraded the OpenGeoDB dump URL | 2018-08-03 18:39:41 +02:00
Parser.java | added a crawl filter based on <div> tag class names | 2017-12-09 22:29:35 +01:00
Phrase.java | more performance hacks | 2010-10-09 08:55:57 +00:00
ProbabilisticClassifier.java | Fixed a NullPointerException case. | 2016-12-02 13:45:45 +01:00
SentenceReader.java | Reduced memory footprint of text snippet extraction | 2018-05-13 10:29:52 +02:00
SnippetExtractor.java | Reduced memory footprint of text snippet extraction | 2018-05-13 10:29:52 +02:00
TextParser.java | Added new crawler attribute for finer control over Media Type detection | 2018-10-25 10:42:12 +02:00
Tokenizer.java | Refactoring : documented and extracted autotagging processing functions. | 2018-02-02 10:27:36 +01:00
VocabularyScraper.java | replaced json library from JSON.org with libandroid-json-java | 2020-04-24 11:45:25 +02:00
WordTokenizer.java | Reduced text snippet extraction processing time. | 2018-05-11 15:42:53 +02:00