yacy_search_server/source/net/yacy/document/parser/html
Michael Peter Christen 9ad1d8dde2 complete redesign of crawl queue monitoring: do not look at a
ready-prepared crawl list but at the stacks of the domains that are
stored for balanced crawling. This also affects the balancer, since it
does not need to prepare the pre-selected crawl list for monitoring. As
a result:
- it is no longer possible to see the correct order of next to-be-crawled
links, since that depends on the actual state of the balancer stack the
next time another url is requested for loading
- the balancer works better since the next url can be selected according
to the current situation and not according to a pre-selected order.
2012-02-02 21:33:42 +01:00
AbstractScraper.java do not allocate a StringBuilder object in case that there is not enough memory for that 2011-07-16 23:17:19 +00:00
AbstractTransformer.java removed finalize methods because of a hint in 2010-04-23 09:32:29 +00:00
CharacterCoding.java *) added comments 2011-09-25 13:16:52 +00:00
ContentScraper.java complete redesign of crawl queue monitoring: do not look at a 2012-02-02 21:33:42 +01:00
ContentTransformer.java memory hacks 2012-02-02 07:37:00 +01:00
Evaluation.java refactoring 2011-12-16 23:59:29 +01:00
ImageEntry.java added more attributes for html parser and enhanced data structures 2011-04-28 13:09:01 +00:00
Scraper.java free some memory after parsing html 2012-02-02 09:55:27 +01:00
ScraperInputStream.java memory hacks 2012-02-02 07:37:00 +01:00
ScraperListener.java
Transformer.java
TransformerWriter.java free some memory after parsing html 2012-02-02 09:55:27 +01:00