yacy_search_server/source/net/yacy/document/parser/html
orbiter 56264dcc17 - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls
- integrated new parser into loader processes: enrich document parser
- fixed a concurrent modification exception in kelondro iterator
- hand-over of document size from crawler to indexer

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7374 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-15 00:03:19 +00:00
..
AbstractScraper.java performance hacks for better search performance 2010-10-08 23:50:28 +00:00
AbstractTransformer.java removed finalize methods because of a hint in 2010-04-23 09:32:29 +00:00
CharacterCoding.java performance hacks for better search performance 2010-10-08 23:50:28 +00:00
ContentScraper.java - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls 2010-12-15 00:03:19 +00:00
ContentTransformer.java applied code changes that are recommended by PMD 2010-01-10 23:09:48 +00:00
ImageEntry.java - added new protocol loader for 'file'-type URLs 2010-05-25 12:54:57 +00:00
Scraper.java refactoring of yacy documents and parsers: they depend now only on the kelondro classes 2009-10-18 00:53:43 +00:00
ScraperInputStream.java - added new protocol loader for 'file'-type URLs 2010-05-25 12:54:57 +00:00
ScraperListener.java refactoring of yacy documents and parsers: they depend now only on the kelondro classes 2009-10-18 00:53:43 +00:00
Transformer.java refactoring of yacy documents and parsers: they depend now only on the kelondro classes 2009-10-18 00:53:43 +00:00
TransformerWriter.java * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) 2010-10-26 16:10:20 +00:00