yacy_search_server/source/net/yacy/crawler
orbiter 20bbde8665 fix for mustmatch regex computation: result had correct semantic, but
may have contained multiple same expressions within the disjunction of
domain-restrictions. This fix removes the redundant restrictions and
makes the regex shorter.
2013-10-18 13:55:37 +02:00
..
data fix for mustmatch regex computation: result had correct semantic, but 2013-10-18 13:55:37 +02:00
retrieval fix NPE on modified since check ( Response.requestHeader allowed to be null) 2013-09-30 02:50:53 +02:00
robots - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
Balancer.java self-healing of mistakenly deactivated crawl profiles. This fixes a bug 2013-09-25 18:27:54 +02:00
CrawlStacker.java Patch the citation index for links with canonical tags. 2013-10-07 11:15:58 +02:00
CrawlSwitchboard.java enhanced postprocessing: fixed bugs, enable proper postprocessing also 2013-10-16 11:27:06 +02:00
HarvestProcess.java fix for wrong display of error urls in HostBrowser 2012-12-07 00:31:10 +01:00
HostQueue.java Added new data structure to be used by the balancer (not used yet). 2013-09-24 21:08:40 +02:00
HostQueues.java Added new data structure to be used by the balancer (not used yet). 2013-09-24 21:08:40 +02:00