yacy_search_server

mirror of https://github.com/yacy/yacy_search_server.git synced 2024-09-21 00:00:13 +02:00

History

Michael Peter Christen 67beef657f strong redesign of html parser: object recursion is now made using a stack on html tag objects, not using a recursive parse-again method which may cause bad performance and huge memory allocation. The new method also produced better parsed image objects with exact anchor text references.		2014-04-10 18:58:03 +02:00
..
content	added missing @Override annotation	2014-03-28 13:48:37 +01:00
importer	added missing @Override annotation	2014-03-28 13:48:37 +01:00
language	added missing @Override annotation	2014-03-28 13:48:37 +01:00
parser	strong redesign of html parser: object recursion is now made using a	2014-04-10 18:58:03 +02:00
AbstractParser.java	- refactoring of log to ConcurrentLog:	2013-07-09 14:28:25 +02:00
Condenser.java	less word hash computations (removing some overhead because of MD5	2013-11-25 15:20:54 +01:00
Document.java	introduced new solr field crawldepth_i which records the crawl depth of	2014-04-02 23:37:01 +02:00
ImageParser.java	Added 'final' for all exception blocks as this helps the Java compiler	2013-07-17 18:31:30 +02:00
LargeNumberCache.java	more performance hacks	2010-10-09 08:55:57 +00:00
LibraryProvider.java	removed jena library and all code that depended on jena. When jena was	2014-02-07 01:20:06 +01:00
Parser.java	- replaced the properties object in AnchorURL with distinct variables	2013-09-15 23:27:04 +02:00
Phrase.java	more performance hacks	2010-10-09 08:55:57 +00:00
SentenceReader.java	hacks to prevent storage of data longer than necessary during search and	2013-10-25 15:05:30 +02:00
SnippetExtractor.java	Added 'final' for all exception blocks as this helps the Java compiler	2013-07-17 18:31:30 +02:00
TextParser.java	fix not needed getFileExtension().toLower (double)	2014-02-05 03:45:02 +01:00
WordTokenizer.java	hacks to prevent storage of data longer than necessary during search and	2013-10-25 15:05:30 +02:00