yacy_search_server/source/de/anomic/crawler
Michael Peter Christen 528d6763fa - added new solr fields:
title_count_i, title_chars_val, title_words_val
description_count_i, description_chars_val, description_words_val
- added many asserts to ensure data type correctness from YaCy to Solr
and vice versa
- made many fixes according to new findings from these asserts (!)
2012-08-31 10:30:43 +02:00
..
retrieval - added new solr fields: 2012-08-31 10:30:43 +02:00
Balancer.java Abstraction of HandleMap and HandleSet 2012-07-27 12:13:53 +02:00
Cache.java Abstraction of HandleMap and HandleSet 2012-07-27 12:13:53 +02:00
CrawlProfile.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
CrawlQueues.java small fixes 2012-08-24 21:44:22 +02:00
CrawlStacker.java content control: apply filter if enabled to crawls 2012-08-29 09:52:14 +02:00
CrawlSwitchboard.java Abstraction of HandleMap and HandleSet 2012-07-27 12:13:53 +02:00
ImporterException.java added final where possible 2008-08-02 12:12:04 +00:00
Latency.java enhanced crawler/balancer: better remaining waiting-time guessing 2012-05-15 12:24:54 +02:00
NoticedURL.java Abstraction of HandleMap and HandleSet 2012-07-27 12:13:53 +02:00
ResourceObserver.java more logging in resource observer 2012-02-23 01:20:42 +01:00
ResultImages.java collection of speed and memory saving hacks 2012-07-13 21:15:38 +02:00
ResultURLs.java - Implemented and integrated the URIMetadataNode object which is a 2012-08-10 13:26:51 +02:00
RobotsTxt.java Abstraction of HandleMap and HandleSet 2012-07-27 12:13:53 +02:00
RobotsTxtEntry.java - correct length computation for BStringObject (bugfix suggested by 2012-08-26 17:46:40 +02:00
RobotsTxtParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
RSSLoader.java snippet retrieval loading processes may use a smaller minimum load time 2012-07-30 10:38:23 +02:00
SitemapImporter.java refactoring 2012-08-17 15:52:33 +02:00
ZURL.java redesign of YaCySchema and SolrDoc handling 2012-08-23 09:51:45 +02:00