yacy_search_server/source/de/anomic/crawler
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements
- removed old parser
- removed old importer framework (was only used by removed old parser)
- added a new sitemap parser in parser framework
- linked new parser with parser access in old sitemap processing routines

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-08 14:13:15 +00:00
..
retrieval redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
Balancer.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
CrawlProfile.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
CrawlQueues.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
CrawlStacker.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
CrawlSwitchboard.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
ImporterException.java
Latency.java preparations to move the HTCache into cora: 2010-08-23 12:32:02 +00:00
NoticedURL.java redesign of crawl profiles data structure. target will be: 2010-08-31 15:47:47 +00:00
ResourceObserver.java change in handling of the all-visible home path for storage in YaCy: 2010-09-02 19:24:22 +00:00
ResultImages.java redesign of parser interface: 2010-06-29 19:20:45 +00:00
ResultURLs.java - more abstraction (HashMap -> Map) 2010-06-01 13:02:11 +00:00
RobotsEntry.java - replaced pdfbox and fontbox version 1.1.0 with 1.2.1 2010-09-07 17:13:47 +00:00
robotsParser.java enhanced computation speed of many replaceAll string operations 2010-09-05 13:19:42 +00:00
RobotsTxt.java enhanced computation speed of many replaceAll string operations 2010-09-05 13:19:42 +00:00
RSSLoader.java - added nice colors to feed indexing state messages 2010-08-27 11:56:51 +00:00
SitemapImporter.java fixed old sitemap importer which was not able to parse urls containing post elements 2010-09-08 14:13:15 +00:00
ZURL.java fixed crawler bug caused by NPE in logging 2010-08-12 01:29:56 +00:00