yacy_search_server/source/net/yacy/crawler
Michael Peter Christen 542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be
filled with the date, when the url is recognized as to be outdated. That
field was partly misinterpreted and the time interval was filled in. In
case that all the urls which are in the index shall be treated as
outdated, the field is filled now with Long.MAX_VALUE because then all
crawl dates are before that date and therefore outdated.
2014-07-22 00:23:17 +02:00
..
data changed handling of crawl profile field crawlingIfOlder: this should be 2014-07-22 00:23:17 +02:00
retrieval added an option to set 'obey nofollow' for links with rel="nofollow" 2014-07-18 12:43:01 +02:00
robots new Strategies in Balancer: 2014-04-17 12:52:54 +02:00
Balancer.java - added a new Crawler Balancer: HostBalancer and HostQueues: 2014-04-16 21:34:28 +02:00
CrawlStacker.java refactoring (change Metadata name of load time data structure to avoid 2014-07-21 23:54:23 +02:00
CrawlSwitchboard.java changed handling of crawl profile field crawlingIfOlder: this should be 2014-07-22 00:23:17 +02:00
HarvestProcess.java fix for wrong display of error urls in HostBrowser 2012-12-07 00:31:10 +01:00
HostBalancer.java enhanced crawler host round-robin strategy 2014-07-11 16:01:42 +02:00
HostQueue.java Crawler hostbalancer does not delete finished queue files, 2014-06-05 02:13:08 +02:00
LegacyBalancer.java special strategy for balancer: do not remove targets with zero wait time 2014-04-18 06:50:07 +02:00