yacy_search_server/source/net/yacy/document
orbiter c288fcf634 redesigned CrawlStartScanner user interface and added more features:
- multiple hosts for environment scans can be given (comma-separated)
- each service (ftp, smb, http, https) for the scan can be selected
- the scan result can be accumulated or refreshed each time a network scan is made
- a scheduler was added to repeat a scan and add all found urls to the indexer automatically

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7378 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-16 02:15:20 +00:00
..
content * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) 2010-10-26 16:10:20 +00:00
geolocalization enhancements in did-you-mean guessing 2010-10-12 09:45:15 +00:00
importer * fix system update if urls are in blacklist (for example for very general blacklists like *.de) 2010-12-15 19:20:00 +00:00
language - more abstraction (HashMap -> Map) 2010-06-01 13:02:11 +00:00
parser redesigned CrawlStartScanner user interface and added more features: 2010-12-16 02:15:20 +00:00
AbstractParser.java redesign of parser interface: 2010-06-29 19:20:45 +00:00
Classification.java redesign of parser interface: 2010-06-29 19:20:45 +00:00
Condenser.java fixed bugs in parser and ftp client 2010-12-02 11:05:04 +00:00
Document.java fixed bugs in parser and ftp client 2010-12-02 11:05:04 +00:00
ImageParser.java redesign of parser interface: 2010-06-29 19:20:45 +00:00
LargeNumberCache.java more performance hacks 2010-10-09 08:55:57 +00:00
Parser.java - enhancements for search speed 2010-10-04 11:54:48 +00:00
Phrase.java more performance hacks 2010-10-09 08:55:57 +00:00
SnippetExtractor.java *) cleaning up the code a little bit 2010-11-28 02:57:31 +00:00
TextParser.java enhanced crawler: 2010-12-11 00:31:57 +00:00