yacy_search_server/source/net/yacy/document/parser/html
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
- added more properties to solr index
- refactoring
- more constants in switchboard
- fix for some NPEs
- recognition of more images
- removed synchronization in HandleMap (obviously not necessary?)
- added a nolocal configuration to remove excessive dns lookup (works only on allip - default off). Indexes produced with this setting are all flagged with 'local' and are (on purpose) not usable for freeworld because they will be rejected as beeing local.



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7672 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-21 13:58:49 +00:00
..
AbstractScraper.java *) cleaning up the code a little bit 2010-12-27 17:07:21 +00:00
AbstractTransformer.java removed finalize methods because of a hint in 2010-04-23 09:32:29 +00:00
CharacterCoding.java performance hacks for better search performance 2010-10-08 23:50:28 +00:00
ContentScraper.java - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00
ContentTransformer.java performance enhancements using an alternative to a insensitive collator (a complex string compare): 2011-02-25 11:23:57 +00:00
ImageEntry.java - enhanced html parser: recognized much more details in the content 2011-04-21 13:58:49 +00:00
Scraper.java
ScraperInputStream.java - added new protocol loader for 'file'-type URLs 2010-05-25 12:54:57 +00:00
ScraperListener.java
Transformer.java
TransformerWriter.java - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion 2011-03-10 23:25:07 +00:00