mirror of
https://github.com/yacy/yacy_search_server.git
synced 2024-09-19 00:01:41 +02:00
3f0446f14b
Previously, when checking for the first time the robots.txt policy on a unknown host (not cached in the robots table), result was always empty in the /getpageinfo_p.xml api and in the /CrawlCheck_p.html page. Next calls returned however the correct information. |
||
---|---|---|
.. | ||
data | ||
retrieval | ||
robots | ||
Balancer.java | ||
CrawlStacker.java | ||
CrawlStarterFromSraper.java | ||
CrawlSwitchboard.java | ||
FileCrawlStarterTask.java | ||
HarvestProcess.java | ||
HostBalancer.java | ||
HostQueue.java | ||
IllegalCrawlProfileException.java | ||
LegacyBalancer.java | ||
RecrawlBusyThread.java |