yacy_search_server/source/net/yacy/document/parser
orbiter 3d5104d357 - fixed a bug in crawl start with file name (npe in new url)
- added deletion of solr index in IndexControlRWIs
- added asynchronous adding of large url lists (happens when crawls are started from a file)
- fixed npe in Image display
- replaced language warning with fine logging
- added a domain name cache in Domains that helps to speed up the isLocal property (less DNS lookups)
- added a new storage class for this new cache: KeyList. The domain key list is stored in DATA/WORK/globalhosts.list
- added concurrent solr updates and chunked transfers (50 documents until a commit is done) for high-speed feeding (> 40000 ppm)
- fixed a bug in content scraper that chopped off large parts of crawl lists (using crawl start from file)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7666 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-18 16:11:16 +00:00
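
The commit above mentions chunked transfers where a commit is done after every 50 documents. The following is only a minimal sketch of that batching idea, not the code from this commit: the class name ChunkedFeeder, its methods, and the committer callback are hypothetical and stand in for whatever the actual Solr connector does.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/**
 * Hypothetical illustration of chunked feeding: documents are buffered
 * and handed to a commit callback once a full chunk has been collected.
 * This is NOT the YaCy or Solr API; all names here are assumptions.
 */
public class ChunkedFeeder<T> {

    private final int chunkSize;                 // e.g. 50, as in the commit message
    private final Consumer<List<T>> committer;   // assumed callback that sends and commits one chunk
    private final List<T> buffer = new ArrayList<>();
    private int commits = 0;

    public ChunkedFeeder(int chunkSize, Consumer<List<T>> committer) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        this.chunkSize = chunkSize;
        this.committer = committer;
    }

    /** Buffers one document; triggers a commit when the chunk is full. */
    public synchronized void add(T doc) {
        buffer.add(doc);
        if (buffer.size() >= chunkSize) {
            flush();
        }
    }

    /** Commits whatever is buffered; a no-op on an empty buffer. */
    public synchronized void flush() {
        if (buffer.isEmpty()) {
            return;
        }
        // Hand over a copy so the callback cannot observe later buffer changes.
        committer.accept(new ArrayList<>(buffer));
        buffer.clear();
        commits++;
    }

    /** Number of commits performed so far. */
    public synchronized int commitCount() {
        return commits;
    }
}
```

Feeding 120 documents with a chunk size of 50 and then calling flush() would produce three commits (two full chunks and one remainder). Synchronizing add and flush is one simple way to allow several feeder threads; the commit may well solve concurrency differently.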
html - fixed a bug in crawl start with file name (npe in new url) 2011-04-18 16:11:16 +00:00
images - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
xml more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
bzipParser.java *) minor changes 2010-12-27 15:37:11 +00:00
csvParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
docParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
genericParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
gzipParser.java fixed bugs in parser and ftp client 2010-12-02 11:05:04 +00:00
htmlParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
mmParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
odtParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
ooxmlParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
pdfParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
pptParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
psParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
rssParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
rtfParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
sevenzipParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
sidAudioParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
sitemapParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
swfParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
tarParser.java *) minor changes 2010-12-27 15:53:41 +00:00
torrentParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
vcfParser.java more UTF8 getBytes() performance hacks 2011-04-12 05:02:36 +00:00
vsdParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
xlsParser.java - added geo information parsing to html parser 2011-03-30 00:49:47 +00:00
zipParser.java *) minor changes 2010-12-27 15:53:41 +00:00