yacy_search_server/source/de/anomic/crawler
orbiter 63a0255166 - refactoring: added new content package, which will contain connector classes for different types of data sources to import texts into the YaCy index
- refactoring: migrated data objects for the new connector classes
- added a DAO interface class to specify an abstract interface for database retrieval connector methods

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5977 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-26 07:44:22 +00:00
AbstractImporter.java refactoring of logging 2009-01-30 23:33:47 +00:00
Balancer.java - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated 2009-04-22 22:39:12 +00:00
CrawlEntry.java - added writing of temporary file names and renaming to final file name when index dump/merge are done. Interrupted merges can be cleaned up. 2009-04-01 12:39:11 +00:00
CrawlProfile.java - added migration class to go from index collections to the index cell data structure. 2009-03-30 15:31:25 +00:00
CrawlQueues.java - refactoring: added new content package, which will contain connector classes for different types of data sources to import texts into the YaCy index 2009-05-26 07:44:22 +00:00
CrawlStacker.java code-split 2009-04-20 13:28:28 +00:00
FTPLoader.java refactoring: better abstraction of reference and metadata prototypes. 2009-04-03 13:23:45 +00:00
HTTPLoader.java fixed problem with not http client 2009-04-13 21:21:47 +00:00
Importer.java
ImporterException.java
ImporterManager.java more memory leak fixing hacks 2009-02-11 13:31:10 +00:00
IndexingStack.java - fixed possible bugs in Stack (may affect Crawler reset) and RandomAccess handling 2009-05-02 01:40:03 +00:00
Latency.java - removed deprecated threads 2009-04-01 20:13:57 +00:00
LoaderMessage.java refactoring: better abstraction of reference and metadata prototypes. 2009-04-03 13:23:45 +00:00
NoticedURL.java - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated 2009-04-22 22:39:12 +00:00
NoticeURLImporter.java - added migration class to go from index collections to the index cell data structure. 2009-03-30 15:31:25 +00:00
ProtocolLoader.java refactoring: better abstraction of reference and metadata prototypes. 2009-04-03 13:23:45 +00:00
ResourceObserver.java simplified code, removed one unused method in all implementing classes 2009-05-21 23:53:01 +00:00
ResultImages.java
ResultURLs.java refactoring: better abstraction of reference and metadata prototypes. 2009-04-03 13:23:45 +00:00
robotsParser.java * Robots.txt: don't interpret Crawl-Delays for other robots 2008-12-18 15:35:41 +00:00
RobotsTxt.java Robots.txt is now reused after one day. See forum-topic: 2009-04-02 15:29:36 +00:00
SitemapImporter.java more performance hacks 2008-12-04 12:54:16 +00:00
ZURL.java - added migration class to go from index collections to the index cell data structure. 2009-03-30 15:31:25 +00:00