yacy_search_server

mirror of https://github.com/yacy/yacy_search_server.git synced 2024-09-21 00:00:13 +02:00

History

luccioman 6b45cd5799 New optional crawl filter on the URL a doc must match to crawl its links For finer control over which parsed documents can trigger an addition of their links to the crawl stack, complementary to the existing crawl depth parameter.		2019-05-01 08:54:19 +02:00
..
Cache.java	Small perf improvement : initialize threads names early when possible	2018-05-23 14:45:35 +02:00
CrawlProfile.java	New optional crawl filter on the URL a doc must match to crawl its links	2019-05-01 08:54:19 +02:00
CrawlQueues.java	Fixed exceeding max size of failreason_s Solr field on large link list	2018-07-11 08:13:29 +02:00
Latency.java	use supplied url port to get robots.txt in crawlers hostqueue	2016-03-02 00:12:34 +01:00
NoticedURL.java	Added new crawler attribute for finer control over Media Type detection	2018-10-25 10:42:12 +02:00
ResultImages.java	fix for image alt attachment to AnchorURLs in html parser.	2014-08-01 12:04:15 +02:00
ResultURLs.java	fix logger name	2016-04-17 03:20:14 +02:00
Snapshots.java	Fixed raw IPV6 addresses snapshots read/write on FAT32 and NTFS fs	2018-09-12 17:34:40 +02:00
Transactions.java	Added a configurable timeout to wkhtmltopdf calls for pdf snapshots	2018-12-11 22:31:31 +01:00