..
cache
- code cleanup
2006-09-29 22:27:20 +00:00
crawler
*) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
2006-10-03 12:16:25 +00:00
dbImport
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
parser
First version of the MS Powerpoint parser based on Apache POI
2006-10-12 17:28:53 +00:00
urlPattern
*) adding missing classes
2006-08-12 14:41:26 +00:00
plasmaCondenser.java
lines inside tags without punctuation are extended by a single dot.
2006-10-08 01:24:00 +00:00
plasmaCrawlBalancer.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaCrawlEURL.java
*) Better errorhandling for charset encoding problem during content parsing
2006-10-10 10:14:03 +00:00
plasmaCrawlLoader.java
*) plasmaHTCache:
2006-10-03 11:05:48 +00:00
plasmaCrawlLoaderMessage.java
added snippet-url re-indexing
2006-10-09 23:07:10 +00:00
plasmaCrawlLURL.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaCrawlLURLEntry.java
- some bugfixing and code cleanup
2006-10-13 01:19:26 +00:00
plasmaCrawlLURLOldEntry.java
- some bugfixing and code cleanup
2006-10-13 01:19:26 +00:00
plasmaCrawlNURL.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaCrawlProfile.java
* simplified initialization of database objects
2006-08-24 02:19:25 +00:00
plasmaCrawlRobotsTxt.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaCrawlStacker.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaDHTChunk.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaDHTFlush.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaDHTTransfer.java
- Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
2006-08-07 11:42:00 +00:00
plasmaGrafics.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaHTCache.java
reverse SVN 2744, it is not needed
2006-10-10 22:02:23 +00:00
plasmaParser.java
*) Better errorhandling for charset encoding problem during content parsing
2006-10-10 10:14:03 +00:00
plasmaParserConfig.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaParserDocument.java
removed lowercase of snippets (and other things):
2006-10-07 00:06:09 +00:00
plasmaRankingCRProcess.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaRankingDistribution.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaRankingRCIEvaluation.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaSearchEvent.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSearchImages.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSearchPreOrder.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSearchQuery.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaSearchRankingProfile.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSearchResult.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSearchTimingProfile.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaSnippetCache.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaStore.java
plasmaSwitchboard.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaSwitchboardQueue.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaURLPattern.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaURLPool.java
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
2006-10-12 23:14:41 +00:00
plasmaWordConnotation.java
* simplified initialization of database objects
2006-08-24 02:19:25 +00:00
plasmaWordIndex.java
- some bugfixing and code cleanup
2006-10-13 01:19:26 +00:00
plasmaWordIndexAssortment.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaWordIndexAssortmentCluster.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaWordIndexFile.java
- code cleanup
2006-09-29 22:27:20 +00:00
plasmaWordIndexFileCluster.java
bugfix for old WORDS storage method
2006-10-09 02:20:27 +00:00