yacy_search_server/source/net/yacy/document/parser/html
Michael Peter Christen 50421171c3 added new schema fields:
hreflang_url_sxt and hreflang_cc_sxt
for
http://support.google.com/webmasters/bin/answer.py?hl=de&answer=189077

navigation_url_sxt and navigation_type_sxt
for
http://googlewebmastercentral.blogspot.de/2011/09/pagination-with-relnext-and-relprev.html

publisher_url_s
for http://support.google.com/plus/answer/1713826?hl=de

all fields are disabled by default and not written to the index.
2013-04-18 17:21:17 +02:00
..
AbstractScraper.java redesign of the QueryParams class: introduced QueryGoal which holds the 2012-11-18 01:22:41 +01:00
AbstractTransformer.java
CharacterCoding.java
ContentScraper.java added new schema fields: 2013-04-18 17:21:17 +02:00
ContentTransformer.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
EmbedEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Evaluation.java
ImageEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Scraper.java
ScraperInputStream.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
ScraperListener.java
Transformer.java
TransformerWriter.java fix in html parser and bookmark generation 2013-02-11 13:28:08 +01:00