..
retrieval
preparations to move the HTCache into cora:
2010-08-23 12:32:02 +00:00
AbstractImporter.java
- cleanup, removed unused imports
2010-04-27 21:47:41 +00:00
Balancer.java
redirect uncaught exceptions to logging + small other changes
2010-08-16 12:33:06 +00:00
CrawlProfile.java
- MapHeap now implements Map<byte[], Map<String, String>>
2010-08-24 12:36:56 +00:00
CrawlQueues.java
... migrating to HttpComponents-Client-4.x ...
2010-08-22 17:38:27 +00:00
CrawlStacker.java
preparations to move the HTCache into cora:
2010-08-23 12:32:02 +00:00
CrawlSwitchboard.java
added the new crawl scheduling function to the crawl start menu:
2010-08-19 23:52:38 +00:00
Importer.java
refactoring:
2008-05-06 13:44:38 +00:00
ImporterException.java
added final where possible
2008-08-02 12:12:04 +00:00
ImporterManager.java
*) some minor changes for better code readability
2010-04-05 12:37:33 +00:00
Latency.java
preparations to move the HTCache into cora:
2010-08-23 12:32:02 +00:00
NoticedURL.java
- better url double check in crawler
2010-08-11 09:54:18 +00:00
ResourceObserver.java
allow global search if res. observer disabled index transmission
2010-02-09 17:14:16 +00:00
ResultImages.java
redesign of parser interface:
2010-06-29 19:20:45 +00:00
ResultURLs.java
- more abstraction (HashMap -> Map)
2010-06-01 13:02:11 +00:00
RobotsEntry.java
redesign of remote proxy settings
2010-05-26 00:01:16 +00:00
robotsParser.java
- fixed a bug in robots.txt parser
2010-03-04 11:58:07 +00:00
RobotsTxt.java
more abstraction for tables stored in heaps:
2010-08-23 21:27:58 +00:00
RSSLoader.java
- added nice colors to feed indexing state messages
2010-08-27 11:56:51 +00:00
SitemapImporter.java
applied code changes that are recommended by PMD
2010-01-10 23:09:48 +00:00
ZURL.java
fixed crawler bug caused by NPE in logging
2010-08-12 01:29:56 +00:00