yacy_search_server/source/net/yacy/crawler/data
Michael Peter Christen 25499eead5 - added a new field for the regular expression in crawl start
- added the field in crawl profile
- adopted logging end error management
- adopted duplicate document detection
- added a new rule to the indexing process to reject non-matching
content
- full redesign of the expert crawl start servlet
The new filter field can now be seen in /CrawlStartExpert_p.html at
Section "Document Filter", subsection item "Filter on Content of
Document"
2013-04-26 10:49:55 +02:00
..
Cache.java Merge remote-tracking branch 'aleksejs/fixtrans' 2013-01-22 11:54:38 +01:00
CrawlProfile.java - added a new field for the regular expression in crawl start 2013-04-26 10:49:55 +02:00
CrawlQueues.java introduced a second core named 'webgraph'. This core will hold the link 2013-02-21 13:23:55 +01:00
Latency.java introduced a better place to update the lastacc time value in latency 2012-12-07 15:49:23 +01:00
NoticedURL.java update to HostBrowser: 2012-11-02 13:57:43 +01:00
ResultImages.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
ResultURLs.java migrated the index export methods from the old metadata to solr. Now 2013-01-24 12:39:19 +01:00
ZURL.java introduced a second core named 'webgraph'. This core will hold the link 2013-02-21 13:23:55 +01:00