yacy_search_server/source/de/anomic/crawler
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
- added more properties to solr index
- refactoring
- more constants in switchboard
- fix for some NPEs
- recognition of more images
- removed synchronization in HandleMap (obviously not necessary?)
- added a nolocal configuration to remove excessive DNS lookups (works only on allip - default off). Indexes produced with this setting are all flagged with 'local' and are (on purpose) not usable for freeworld because they will be rejected as being local.



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7672 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-21 13:58:49 +00:00
retrieval - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00
Balancer.java Added federated index storage to solr. 2011-04-14 20:05:04 +00:00
CrawlProfile.java enhanced location search: 2011-04-15 15:54:19 +00:00
CrawlQueues.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
CrawlStacker.java - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00
CrawlSwitchboard.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
ImporterException.java
Latency.java fix for bug http://bugs.yacy.net/view.php?id=10 2011-04-04 12:20:20 +00:00
NoticedURL.java added a handling of appearances of yacy bot entries in robots.txt if this entry addresses the yacy peer 2011-04-03 23:39:45 +00:00
ResourceObserver.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
ResultImages.java - fixed a bug in crawl start with file name (npe in new url) 2011-04-18 16:11:16 +00:00
ResultURLs.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
RobotsEntry.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
robotsParser.java fixes to crawler and new user-agent crawl-delay handling 2011-04-04 09:47:18 +00:00
RobotsTxt.java - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00
RSSLoader.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
SitemapImporter.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
ZURL.java - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00