yacy_search_server/source/de/anomic/crawler
2009-07-11 21:01:27 +00:00
..
AbstractImporter.java serialized all logging using concurrency: 2009-06-15 21:19:54 +00:00
Balancer.java extending visibility of objects and methods to avoid synthetic accessor methods and increase performance 2009-06-30 13:25:46 +00:00
CrawlEntry.java - added writing of temporary file names and renaming to final file name when index dump/merge are done. Interrupted merges can be cleaned up. 2009-04-01 12:39:11 +00:00
CrawlProfile.java simplification of the code: removed unused classes, methods and variables 2009-06-30 09:27:46 +00:00
CrawlQueues.java refactoring of parsers and document processing 2009-07-08 21:48:08 +00:00
CrawlStacker.java serialized all logging using concurrency: 2009-06-15 21:19:54 +00:00
CrawlSwitchboard.java refactoring of parsers and document processing 2009-07-08 21:48:08 +00:00
FTPLoader.java redesign of parser mime type detection and parser steering 2009-07-10 14:22:17 +00:00
HTTPLoader.java redesign of parser mime type detection and parser steering 2009-07-10 14:22:17 +00:00
Importer.java
ImporterException.java added final where possible 2008-08-02 12:12:04 +00:00
ImporterManager.java serialized all logging using concurrency: 2009-06-15 21:19:54 +00:00
IndexingStack.java small refactoring to prepare for new queues 2009-07-04 12:17:10 +00:00
Latency.java - fixed a not working selection rule in balancer 2009-06-06 08:46:59 +00:00
LoaderMessage.java refactoring of parsers and document processing 2009-07-08 21:48:08 +00:00
NoticedURL.java serialized all logging using concurrency: 2009-06-15 21:19:54 +00:00
NoticeURLImporter.java enable warnings and fix most of it 2009-07-11 21:01:27 +00:00
ProtocolLoader.java refactoring of parsers and document processing 2009-07-08 21:48:08 +00:00
ResourceObserver.java all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path 2009-07-02 17:01:23 +00:00
ResultImages.java refactoring of parsers and document processing 2009-07-08 21:48:08 +00:00
ResultURLs.java serialized all logging using concurrency: 2009-06-15 21:19:54 +00:00
robotsParser.java * Robots.txt: don't interpret Crawl-Delays for other robots 2008-12-18 15:35:41 +00:00
RobotsTxt.java enable warnings and fix most of it 2009-07-11 21:01:27 +00:00
SitemapImporter.java more performance hacks 2008-12-04 12:54:16 +00:00
ZURL.java extending visibility of objects and methods to avoid synthetic accessor methods and increase performance 2009-06-30 13:25:46 +00:00