yacy_search_server/source/net/yacy/crawler/data
luccioman 6b45cd5799 New optional crawl filter on the URL a doc must match to crawl its links
For finer control over which parsed documents can trigger an addition of
their links to the crawl stack, complementary to the existing crawl
depth parameter.
2019-05-01 08:54:19 +02:00
..
Cache.java Small perf improvement : initialize threads names early when possible 2018-05-23 14:45:35 +02:00
CrawlProfile.java New optional crawl filter on the URL a doc must match to crawl its links 2019-05-01 08:54:19 +02:00
CrawlQueues.java Fixed exceeding max size of failreason_s Solr field on large link list 2018-07-11 08:13:29 +02:00
Latency.java use supplied url port to get robots.txt in crawlers hostqueue 2016-03-02 00:12:34 +01:00
NoticedURL.java Added new crawler attribute for finer control over Media Type detection 2018-10-25 10:42:12 +02:00
ResultImages.java fix for image alt attachment to AnchorURLs in html parser. 2014-08-01 12:04:15 +02:00
ResultURLs.java fix logger name 2016-04-17 03:20:14 +02:00
Snapshots.java Fixed raw IPV6 addresses snapshots read/write on FAT32 and NTFS fs 2018-09-12 17:34:40 +02:00
Transactions.java Added a configurable timeout to wkhtmltopdf calls for pdf snapshots 2018-12-11 22:31:31 +01:00