.. |
content
|
Small perf improvement : initialize threads names early when possible
|
2018-05-23 14:45:35 +02:00 |
importer
|
remove warnings: 'Statement unnecessarily nested within else clause'
|
2019-01-10 20:02:57 +01:00 |
language
|
Fixed language detector initialization and NullPointerException cases.
|
2016-12-05 18:12:21 +01:00 |
parser
|
Support parsing audio URLs without file extension
|
2019-04-09 11:40:21 +02:00 |
AbstractParser.java
|
added a crawl filter based on <div> tag class names
|
2017-12-09 22:29:35 +01:00 |
Condenser.java
|
Added basic support for autotagging microdata annotated item types.
|
2018-02-06 10:25:38 +01:00 |
DateDetection.java
|
Removed remaining unsafe accesses to SimpleDateFormat instances.
|
2018-07-02 10:00:40 +02:00 |
Document.java
|
Added a crawl filtering possibility on documents Media Type (MIME)
|
2018-03-23 10:28:19 +01:00 |
ImageParser.java
|
BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys
|
2015-11-20 09:38:16 +01:00 |
LargeNumberCache.java
|
Cleaned up some Javadoc warnings.
|
2017-01-09 16:44:47 +01:00 |
LibraryProvider.java
|
Upgraded the OpenGeoDB dump URL
|
2018-08-03 18:39:41 +02:00 |
Parser.java
|
added a crawl filter based on <div> tag class names
|
2017-12-09 22:29:35 +01:00 |
Phrase.java
|
more performance hacks
|
2010-10-09 08:55:57 +00:00 |
ProbabilisticClassifier.java
|
Fixed a NullPointerException case.
|
2016-12-02 13:45:45 +01:00 |
SentenceReader.java
|
Reduced memory footprint of text snippet extraction
|
2018-05-13 10:29:52 +02:00 |
SnippetExtractor.java
|
Reduced memory footprint of text snippet extraction
|
2018-05-13 10:29:52 +02:00 |
TextParser.java
|
Added new crawler attribute for finer control over Media Type detection
|
2018-10-25 10:42:12 +02:00 |
Tokenizer.java
|
Refactoring : documented and extracted autotagging processing functions.
|
2018-02-02 10:27:36 +01:00 |
VocabularyScraper.java
|
replaced json library from JSON.org with libandroid-json-java
|
2020-04-24 11:45:25 +02:00 |
WordTokenizer.java
|
Reduced text snippet extraction processing time.
|
2018-05-11 15:42:53 +02:00 |