.. |
cache
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
crawler
|
*) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
|
2006-10-03 12:16:25 +00:00 |
dbImport
|
refactoring of indexing methods
|
2006-10-16 15:04:16 +00:00 |
parser
|
First version of the MS Powerpoint parser based on Apache POI
|
2006-10-12 17:28:53 +00:00 |
urlPattern
|
*) adding missing classes
|
2006-08-12 14:41:26 +00:00 |
plasmaCondenser.java
|
lines inside tags without punctuation are extended by a single dot.
|
2006-10-08 01:24:00 +00:00 |
plasmaCrawlBalancer.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaCrawlEURL.java
|
*) Better errorhandling for charset encoding problem during content parsing
|
2006-10-10 10:14:03 +00:00 |
plasmaCrawlLoader.java
|
*) plasmaHTCache:
|
2006-10-03 11:05:48 +00:00 |
plasmaCrawlLoaderMessage.java
|
added snippet-url re-indexing
|
2006-10-09 23:07:10 +00:00 |
plasmaCrawlLURL.java
|
refactoring of indexing methods
|
2006-10-16 15:04:16 +00:00 |
plasmaCrawlLURLEntry.java
|
- some bugfixing and code cleanup
|
2006-10-13 01:19:26 +00:00 |
plasmaCrawlLURLOldEntry.java
|
- some bugfixing and code cleanup
|
2006-10-13 01:19:26 +00:00 |
plasmaCrawlNURL.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaCrawlProfile.java
|
* simplified initialization of database objects
|
2006-08-24 02:19:25 +00:00 |
plasmaCrawlRobotsTxt.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaCrawlStacker.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaDHTChunk.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaDHTFlush.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaDHTTransfer.java
|
- Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
|
2006-08-07 11:42:00 +00:00 |
plasmaGrafics.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaHTCache.java
|
reverse SVN 2744, it is not needed
|
2006-10-10 22:02:23 +00:00 |
plasmaParser.java
|
*) Trying to be more tolerant against wrong charset names
|
2006-10-13 05:30:20 +00:00 |
plasmaParserConfig.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaParserDocument.java
|
removed lowercase of snippets (and other things):
|
2006-10-07 00:06:09 +00:00 |
plasmaRankingCRProcess.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaRankingDistribution.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaRankingRCIEvaluation.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaSearchEvent.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaSearchImages.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaSearchPreOrder.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaSearchQuery.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaSearchRankingProfile.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaSearchResult.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaSearchTimingProfile.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaSnippetCache.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaStore.java
|
code cleanup
|
2005-12-05 14:24:13 +00:00 |
plasmaSwitchboard.java
|
refactoring of indexing methods
|
2006-10-16 15:04:16 +00:00 |
plasmaSwitchboardQueue.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaURLPool.java
|
- refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
|
2006-10-12 23:14:41 +00:00 |
plasmaWordConnotation.java
|
* simplified initialization of database objects
|
2006-08-24 02:19:25 +00:00 |
plasmaWordIndex.java
|
null pointer bugfix
|
2006-10-13 08:03:11 +00:00 |
plasmaWordIndexAssortment.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaWordIndexAssortmentCluster.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaWordIndexFile.java
|
- code cleanup
|
2006-09-29 22:27:20 +00:00 |
plasmaWordIndexFileCluster.java
|
bugfix for old WORDS storage method
|
2006-10-09 02:20:27 +00:00 |