yacy_search_server/source/net/yacy/document
reger 8a7c68e4c7 content of surrogates/out never accessed (remove)
After import the conent is never accessed but may take up a lot of disk space,
also the getLoadedOAIServer (which lists the files in surrogate out) is not used.
Making the surrogate.out obsolete. Removed keeping of xmls after import.
2014-05-04 09:29:07 +02:00
..
content added missing @Override annotation 2014-03-28 13:48:37 +01:00
importer content of surrogates/out never accessed (remove) 2014-05-04 09:29:07 +02:00
language added missing @Override annotation 2014-03-28 13:48:37 +01:00
parser add html5 audio/video <source> tag to html content scraper 2014-04-29 00:41:29 +02:00
AbstractParser.java - refactoring of log to ConcurrentLog: 2013-07-09 14:28:25 +02:00
Condenser.java less word hash computations (removing some overhead because of MD5 2013-11-25 15:20:54 +01:00
Document.java add canonical links to the same crawldepth, not the next crawldepth 2014-04-18 06:51:46 +02:00
ImageParser.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
LargeNumberCache.java more performance hacks 2010-10-09 08:55:57 +00:00
LibraryProvider.java removed jena library and all code that depended on jena. When jena was 2014-02-07 01:20:06 +01:00
Parser.java - replaced the properties object in AnchorURL with distinct variables 2013-09-15 23:27:04 +02:00
Phrase.java more performance hacks 2010-10-09 08:55:57 +00:00
SentenceReader.java hacks to prevent storage of data longer than necessary during search and 2013-10-25 15:05:30 +02:00
SnippetExtractor.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
TextParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 2014-04-16 21:34:28 +02:00
WordTokenizer.java hacks to prevent storage of data longer than necessary during search and 2013-10-25 15:05:30 +02:00