yacy_search_server/source/de/anomic/crawler
orbiter 09badc697b - low-memory patch for crawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7304 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 13:26:27 +00:00
retrieval * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) 2010-10-26 16:10:20 +00:00
Balancer.java fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113 2010-11-03 20:58:50 +00:00
CrawlProfile.java custom + generic skins: 2010-10-11 00:00:10 +00:00
CrawlQueues.java - low-memory patch for crawler 2010-11-04 13:26:27 +00:00
CrawlStacker.java * add option to network definition to provide a domainlist (syntax like in blacklists) 2010-10-30 14:44:33 +00:00
CrawlSwitchboard.java fixed a number of small bugs: 2010-09-30 23:57:58 +00:00
ImporterException.java
Latency.java preparations to move the HTCache into cora: 2010-08-23 12:32:02 +00:00
NoticedURL.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
ResourceObserver.java change in handling of the all-visible home path for storage in YaCy: 2010-09-02 19:24:22 +00:00
ResultImages.java redesign of parser interface: 2010-06-29 19:20:45 +00:00
ResultURLs.java - added a tag cloud to search results (using the topics) 2010-10-15 22:01:39 +00:00
RobotsEntry.java - replaced pdfbox and fontbox version 1.1.0 with 1.2.1 2010-09-07 17:13:47 +00:00
robotsParser.java added a sitemap entry parser and loader for sitemaps 2010-11-03 19:48:33 +00:00
RobotsTxt.java - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed 2010-09-27 14:54:32 +00:00
RSSLoader.java fix for scheduling of rss feeds 2010-10-13 13:00:36 +00:00
SitemapImporter.java added a sitemap entry parser and loader for sitemaps 2010-11-03 19:48:33 +00:00
ZURL.java fixed crawler bug caused by NPE in logging 2010-08-12 01:29:56 +00:00