yacy_search_server/source/net/yacy/crawler/data
Michael Peter Christen a88a62f7aa added a feature to set a collection for a crawl result based on a
regular expression on th url: the collection attribut for a crawl start
may be now either a token or a list of tokens, seperated by ',' where a
token is either a string or a pair <string,pattern> where the string is
separated to the pattern with a ':' and the string is assigned to the
document as collection only if the pattern matches with the url.
2013-08-25 00:13:48 +02:00
..
Cache.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
CrawlProfile.java added a feature to set a collection for a crawl result based on a 2013-08-25 00:13:48 +02:00
CrawlQueues.java fixed a crawler bug where a double-occurring url was not re-crawled 2013-08-22 15:56:09 +02:00
Latency.java Redesign of crawler identification and robots steering. A non-p2p user 2013-08-22 14:23:47 +02:00
NoticedURL.java Redesign of crawler identification and robots steering. A non-p2p user 2013-08-22 14:23:47 +02:00
ResultImages.java added url_file_name_s in default collection schema for the file name 2013-06-25 16:27:20 +02:00
ResultURLs.java migrated the index export methods from the old metadata to solr. Now 2013-01-24 12:39:19 +01:00
ZURL.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00