yacy_search_server/source/de/anomic/crawler
orbiter 727dd9b193 - fixed a bug in robots.txt parser
- moved storage of robots.txt entries to WorkTables, so it is now possible to browse the robots entries with the table browser

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6710 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-04 11:58:07 +00:00
| Name | Last commit | Date |
|---|---|---|
| retrieval | added an api recording function: it shall be possible to record | 2010-01-21 22:06:03 +00:00 |
| AbstractImporter.java | start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root | 2009-10-09 23:13:30 +00:00 |
| Balancer.java | better look-ahead into the crawl queue: show more on crawl monitor | 2010-02-24 23:11:58 +00:00 |
| CrawlProfile.java | - added BEncoder class | 2010-01-13 16:21:37 +00:00 |
| CrawlQueues.java | - increased thread priority for server threads | 2010-01-19 11:29:22 +00:00 |
| CrawlStacker.java | fixed network switch process - crawling did not work after a switch before this fix | 2010-01-17 23:33:15 +00:00 |
| CrawlSwitchboard.java | added some modifications recommended by PMD for better performance | 2010-01-10 01:40:26 +00:00 |
| Importer.java | refactoring: | 2008-05-06 13:44:38 +00:00 |
| ImporterException.java | added final where possible | 2008-08-02 12:12:04 +00:00 |
| ImporterManager.java | start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root | 2009-10-09 23:13:30 +00:00 |
| Latency.java | - clean-up of robots.txt parser | 2010-01-11 16:36:30 +00:00 |
| NoticedURL.java | added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function | 2009-12-09 23:27:26 +00:00 |
| ResourceObserver.java | allow global search if res. observer disabled index transmission | 2010-02-09 17:14:16 +00:00 |
| ResultImages.java | refactoring of yacy documents and parsers: they depend now only on the kelondro classes | 2009-10-18 00:53:43 +00:00 |
| ResultURLs.java | redesign of CrawlResult data structures because of OOM occurrences during URL deletion processes. | 2010-02-16 23:06:04 +00:00 |
| RobotsEntry.java | - fixed a bug in robots.txt parser | 2010-03-04 11:58:07 +00:00 |
| robotsParser.java | - fixed a bug in robots.txt parser | 2010-03-04 11:58:07 +00:00 |
| RobotsTxt.java | - fixed a bug in robots.txt parser | 2010-03-04 11:58:07 +00:00 |
| SitemapImporter.java | applied code changes that are recommended by PMD | 2010-01-10 23:09:48 +00:00 |
| ZURL.java | fix for http://forum.yacy-websuche.de/viewtopic.php?p=18563#p18563 | 2010-01-13 18:41:33 +00:00 |