yacy_search_server/source/de/anomic/plasma
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
- added test migration method to migrate the old LURL to a new LURL
the new LURL will be splitted into different tables for each month
this solves several problems:
- the biggest table in YaCy is splitted in different parts and can
  also be managed in filesystems that are limited to 2GB
- the oldest entries can easily be identified, used for re-crawl und
  deleted
- The complete database can be limited to a specific size (as wanted many times)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2755 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-12 23:14:41 +00:00
..
cache - code cleanup 2006-09-29 22:27:20 +00:00
crawler *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB 2006-10-03 12:16:25 +00:00
dbImport - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
parser First version of the MS Powerpoint parser based on Apache POI 2006-10-12 17:28:53 +00:00
urlPattern *) adding missing classes 2006-08-12 14:41:26 +00:00
plasmaCondenser.java lines inside tags without punctuation are extended by a single dot. 2006-10-08 01:24:00 +00:00
plasmaCrawlBalancer.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaCrawlEURL.java *) Better errorhandling for charset encoding problem during content parsing 2006-10-10 10:14:03 +00:00
plasmaCrawlLoader.java *) plasmaHTCache: 2006-10-03 11:05:48 +00:00
plasmaCrawlLoaderMessage.java added snippet-url re-indexing 2006-10-09 23:07:10 +00:00
plasmaCrawlLURL.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaCrawlNURL.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaCrawlProfile.java * simplified initialization of database objects 2006-08-24 02:19:25 +00:00
plasmaCrawlRobotsTxt.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaCrawlStacker.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaDHTChunk.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaDHTFlush.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaDHTTransfer.java - Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible 2006-08-07 11:42:00 +00:00
plasmaGrafics.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaHTCache.java reverse SVN 2744, it is not needed 2006-10-10 22:02:23 +00:00
plasmaParser.java *) Better errorhandling for charset encoding problem during content parsing 2006-10-10 10:14:03 +00:00
plasmaParserConfig.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaParserDocument.java removed lowercase of snippets (and other things): 2006-10-07 00:06:09 +00:00
plasmaRankingCRProcess.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaRankingDistribution.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaRankingRCIEvaluation.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaSearchEvent.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSearchImages.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSearchPreOrder.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSearchQuery.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaSearchRankingProfile.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSearchResult.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSearchTimingProfile.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaSnippetCache.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaStore.java code cleanup 2005-12-05 14:24:13 +00:00
plasmaSwitchboard.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaSwitchboardQueue.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaURLPattern.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaURLPool.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaWordConnotation.java * simplified initialization of database objects 2006-08-24 02:19:25 +00:00
plasmaWordIndex.java - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 2006-10-12 23:14:41 +00:00
plasmaWordIndexAssortment.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaWordIndexAssortmentCluster.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaWordIndexFile.java - code cleanup 2006-09-29 22:27:20 +00:00
plasmaWordIndexFileCluster.java bugfix for old WORDS storage method 2006-10-09 02:20:27 +00:00