yacy_search_server/source/net/yacy/document/parser/html
Michael Peter Christen 50421171c3 added new schema fields:
hreflang_url_sxt and hreflang_cc_sxt
for http://support.google.com/webmasters/bin/answer.py?hl=de&answer=189077

navigation_url_sxt and navigation_type_sxt
for http://googlewebmastercentral.blogspot.de/2011/09/pagination-with-relnext-and-relprev.html

publisher_url_s
for http://support.google.com/plus/answer/1713826?hl=de

all fields are disabled by default and not written to the index.
2013-04-18 17:21:17 +02:00
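
The commit message above names five schema fields tied to `<link>` annotations (hreflang alternates, rel=next/prev pagination, rel=publisher). As a rough illustration of how such values could be collected from a page, here is a minimal sketch. It is not the code in ContentScraper.java: the class name `LinkFieldSketch`, the regex-based tag matching, and the reading of `_sxt` as multi-valued and `_s` as single-valued are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch; not part of YaCy. Maps <link> tags to the field names from the commit message. */
public class LinkFieldSketch {

    // Matches a whole <link ...> tag; group 1 is the raw attribute text.
    private static final Pattern LINK_TAG =
            Pattern.compile("<link\\b([^>]*)>", Pattern.CASE_INSENSITIVE);

    // Matches quoted attributes: name="value" or name='value'.
    private static final Pattern ATTRIBUTE =
            Pattern.compile("([a-zA-Z-]+)\\s*=\\s*(?:\"([^\"]*)\"|'([^']*)')");

    /** Parses the attribute text of one tag into a map with lower-cased attribute names. */
    static Map<String, String> attributes(String tagBody) {
        Map<String, String> attrs = new LinkedHashMap<>();
        Matcher m = ATTRIBUTE.matcher(tagBody);
        while (m.find()) {
            String value = m.group(2) != null ? m.group(2) : m.group(3);
            attrs.put(m.group(1).toLowerCase(Locale.ROOT), value);
        }
        return attrs;
    }

    /** Appends a value to a (possibly new) multi-valued field. */
    private static void add(Map<String, List<String>> fields, String name, String value) {
        fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    /** Collects field values from every <link> tag that carries an href. */
    static Map<String, List<String>> extract(String html) {
        Map<String, List<String>> fields = new LinkedHashMap<>();
        Matcher m = LINK_TAG.matcher(html);
        while (m.find()) {
            Map<String, String> a = attributes(m.group(1));
            String rel = a.getOrDefault("rel", "").toLowerCase(Locale.ROOT);
            String href = a.get("href");
            if (href == null) {
                continue; // nothing to record without a target URL
            }
            if (rel.equals("alternate") && a.containsKey("hreflang")) {
                // URL and language code are kept in parallel lists.
                add(fields, "hreflang_url_sxt", href);
                add(fields, "hreflang_cc_sxt", a.get("hreflang"));
            } else if (rel.equals("next") || rel.equals("prev")) {
                add(fields, "navigation_url_sxt", href);
                add(fields, "navigation_type_sxt", rel);
            } else if (rel.equals("publisher") && !fields.containsKey("publisher_url_s")) {
                // Assumed single-valued: only the first publisher link is kept.
                add(fields, "publisher_url_s", href);
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        String html = "<link rel=\"alternate\" hreflang=\"de\" href=\"http://example.org/de/\">"
                + "<link rel=\"next\" href=\"http://example.org/page/3\">";
        System.out.println(extract(html));
    }
}
```

A real HTML scraper would work on parsed tag events rather than regular expressions, and the commit notes that the fields are disabled by default, so a sketch like this only shows the mapping from link attributes to field names.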
AbstractScraper.java redesign of the QueryParams class: introduced QueryGoal which holds the 2012-11-18 01:22:41 +01:00
AbstractTransformer.java Now used synchronization in all close() methods to make sure all objects 2012-05-14 07:41:55 +02:00
CharacterCoding.java *) added comments 2011-09-25 13:16:52 +00:00
ContentScraper.java added new schema fields: 2013-04-18 17:21:17 +02:00
ContentTransformer.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
EmbedEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Evaluation.java refactoring + parser bugfixes 2012-05-04 17:28:27 +02:00
ImageEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Scraper.java free some memory after parsing html 2012-02-02 09:55:27 +01:00
ScraperInputStream.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
ScraperListener.java refactoring of yacy documents and parsers: they depend now only on the kelondro classes 2009-10-18 00:53:43 +00:00
Transformer.java refactoring of yacy documents and parsers: they depend now only on the kelondro classes 2009-10-18 00:53:43 +00:00
TransformerWriter.java fix in html parser and bookmark generation 2013-02-11 13:28:08 +01:00