yacy_search_server/source/net/yacy/document/parser/html
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
without the file extension. This part of the file path is removed from
the multi-field url_paths_sxt, which has now not the file name as last
part of the path list.

The same applies to the new fields source_file_name_s and
target_file_name_s in the webgraph schema.
2013-06-25 16:27:20 +02:00
..
AbstractScraper.java redesign of the QueryParams class: introduced QueryGoal which holds the 2012-11-18 01:22:41 +01:00
AbstractTransformer.java Now used synchronization in all close() methods to make sure all objects 2012-05-14 07:41:55 +02:00
CharacterCoding.java *) added comments 2011-09-25 13:16:52 +00:00
ContentScraper.java added url_file_name_s in default collection schema for the file name 2013-06-25 16:27:20 +02:00
ContentTransformer.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
EmbedEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Evaluation.java refactoring + parser bugfixes 2012-05-04 17:28:27 +02:00
ImageEntry.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
Scraper.java free some memory after parsing html 2012-02-02 09:55:27 +01:00
ScraperInputStream.java added the generation of 50 (!!) new solr field in the core 'webgraph'. 2013-02-22 15:45:15 +01:00
ScraperListener.java
Transformer.java
TransformerWriter.java fix in html parser and bookmark generation 2013-02-11 13:28:08 +01:00