yacy_search_server/source/net/yacy/crawler
Michael Peter Christen 74c249288a added a push api to make it possible to upload files directly without
crawling to the YaCy indexer. Files are uploaded using POST multipart
requests; multiple file uploads are possible as well. Each file has
attached the file date and mime type which is used to get the right
parser for the submitted data. Also an url is submitted which is
assigned to the document.
The CrawlSwitchboard has a new option for default Crawl Profiles which
are assigned dynamically from the new push interface.
2014-06-12 18:10:07 +02:00
..
data made number of open files in crawler configurable and increased default 2014-05-31 09:29:55 +02:00
retrieval refactoring 2014-06-12 05:23:26 +02:00
robots new Strategies in Balancer: 2014-04-17 12:52:54 +02:00
Balancer.java - added a new Crawler Balancer: HostBalancer and HostQueues: 2014-04-16 21:34:28 +02:00
CrawlStacker.java added crawl depth for failed documents 2014-04-17 13:21:43 +02:00
CrawlSwitchboard.java added a push api to make it possible to upload files directly without 2014-06-12 18:10:07 +02:00
HarvestProcess.java fix for wrong display of error urls in HostBrowser 2012-12-07 00:31:10 +01:00
HostBalancer.java made number of open files in crawler configurable and increased default 2014-05-31 09:29:55 +02:00
HostQueue.java Crawler hostbalancer does not delete finished queue files, 2014-06-05 02:13:08 +02:00
LegacyBalancer.java special strategy for balancer: do not remove targets with zero wait time 2014-04-18 06:50:07 +02:00