yacy_search_server/source/net/yacy/document
Michael Peter Christen 67beef657f strong redesign of html parser: object recursion is now made using a
stack on html tag objects, not using a recursive parse-again method
which may cause bad performance and huge memory allocation. The new
method also produced better parsed image objects with exact anchor text
references.
2014-04-10 18:58:03 +02:00
..
content added missing @Override annotation 2014-03-28 13:48:37 +01:00
importer added missing @Override annotation 2014-03-28 13:48:37 +01:00
language added missing @Override annotation 2014-03-28 13:48:37 +01:00
parser strong redesign of html parser: object recursion is now made using a 2014-04-10 18:58:03 +02:00
AbstractParser.java - refactoring of log to ConcurrentLog: 2013-07-09 14:28:25 +02:00
Condenser.java less word hash computations (removing some overhead because of MD5 2013-11-25 15:20:54 +01:00
Document.java introduced new solr field crawldepth_i which records the crawl depth of 2014-04-02 23:37:01 +02:00
ImageParser.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
LargeNumberCache.java more performance hacks 2010-10-09 08:55:57 +00:00
LibraryProvider.java removed jena library and all code that depended on jena. When jena was 2014-02-07 01:20:06 +01:00
Parser.java - replaced the properties object in AnchorURL with distinct variables 2013-09-15 23:27:04 +02:00
Phrase.java more performance hacks 2010-10-09 08:55:57 +00:00
SentenceReader.java hacks to prevent storage of data longer than necessary during search and 2013-10-25 15:05:30 +02:00
SnippetExtractor.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
TextParser.java fix not needed getFileExtension().toLower (double) 2014-02-05 03:45:02 +01:00
WordTokenizer.java hacks to prevent storage of data longer than necessary during search and 2013-10-25 15:05:30 +02:00