retrieval
- fixes for some problems with the new crawling/caching strategies
2009-07-25 21:38:57 +00:00
AbstractImporter.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
Balancer.java
- fixes for some problems with the new crawling/caching strategies
2009-07-25 21:38:57 +00:00
CrawlProfile.java
- added cache usage properties to crawl start
2009-07-24 11:54:04 +00:00
CrawlQueues.java
- fixes for some problems with the new crawling/caching strategies
2009-07-25 21:38:57 +00:00
CrawlStacker.java
redesign of access to the HTCache (now http.client.Cache):
2009-07-23 21:31:51 +00:00
CrawlSwitchboard.java
fixed bug that caused deletion of crawl profiles at every application startup
2009-07-23 22:09:02 +00:00
ExternalIndexImporter.java
refactoring:
2009-07-19 20:37:44 +00:00
Importer.java
refactoring:
2008-05-06 13:44:38 +00:00
ImporterException.java
added final where possible
2008-08-02 12:12:04 +00:00
ImporterManager.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
Latency.java
- fixes for some problems with the new crawling/caching strategies
2009-07-25 21:38:57 +00:00
LoaderMessage.java
-removed superfluous crawl cache
2009-07-15 21:07:46 +00:00
NoticedURL.java
-removed superfluous crawl cache
2009-07-15 21:07:46 +00:00
NoticeURLImporter.java
-removed superfluous crawl cache
2009-07-15 21:07:46 +00:00
ResourceObserver.java
refactoring:
2009-07-19 20:37:44 +00:00
ResultImages.java
refactoring of parsers and document processing
2009-07-08 21:48:08 +00:00
ResultURLs.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
robotsParser.java
* Robots.txt: don't interpret Crawl-Delays for other robots
2008-12-18 15:35:41 +00:00
RobotsTxt.java
- fixes for some problems with the new crawling/caching strategies
2009-07-25 21:38:57 +00:00
SitemapImporter.java
refactoring:
2009-07-19 20:37:44 +00:00
ZURL.java
-removed superfluous crawl cache
2009-07-15 21:07:46 +00:00