AbstractImporter.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
Balancer.java
extending visibility of objects and methods to avoid synthetic accessor methods and increase performance
2009-06-30 13:25:46 +00:00
CrawlEntry.java
- index dumps/merges are now written to temporary file names and renamed to the final file name when done, so interrupted merges can be cleaned up.
2009-04-01 12:39:11 +00:00
CrawlProfile.java
simplification of the code: removed unused classes, methods and variables
2009-06-30 09:27:46 +00:00
CrawlQueues.java
all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
2009-07-02 17:01:23 +00:00
CrawlStacker.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
CrawlSwitchboard.java
all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
2009-07-02 17:01:23 +00:00
FTPLoader.java
code cleanup: static methods are now called directly on the class
2009-06-30 13:01:35 +00:00
HTTPLoader.java
code cleanup: static methods are now called directly on the class
2009-06-30 13:01:35 +00:00
Importer.java
refactoring:
2008-05-06 13:44:38 +00:00
ImporterException.java
added final where possible
2008-08-02 12:12:04 +00:00
ImporterManager.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
IndexingStack.java
- renamed Stack to RecordStack to avoid name confusion with new classes
2009-07-03 16:35:34 +00:00
Latency.java
- fixed a non-working selection rule in the balancer
2009-06-06 08:46:59 +00:00
LoaderMessage.java
refactoring: better abstraction of reference and metadata prototypes.
2009-04-03 13:23:45 +00:00
NoticedURL.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
NoticeURLImporter.java
refactoring of plasmaWordIndex:
2009-05-28 14:26:05 +00:00
ProtocolLoader.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
ResourceObserver.java
all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
2009-07-02 17:01:23 +00:00
ResultImages.java
* refactoring
2008-08-02 13:57:00 +00:00
ResultURLs.java
serialized all logging using concurrency:
2009-06-15 21:19:54 +00:00
robotsParser.java
* Robots.txt: don't interpret Crawl-Delays for other robots
2008-12-18 15:35:41 +00:00
RobotsTxt.java
code cleanup: static methods are now called directly on the class
2009-06-30 13:01:35 +00:00
SitemapImporter.java
more performance hacks
2008-12-04 12:54:16 +00:00
ZURL.java
extending visibility of objects and methods to avoid synthetic accessor methods and increase performance
2009-06-30 13:25:46 +00:00