yacy_search_server/source/net/yacy/crawler/retrieval
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
without the file extension. This part of the file path is removed from
the multi-field url_paths_sxt, which has now not the file name as last
part of the path list.

The same applies to the new fields source_file_name_s and
target_file_name_s in the webgraph schema.
2013-06-25 16:27:20 +02:00
..
FileLoader.java added url_file_name_s in default collection schema for the file name 2013-06-25 16:27:20 +02:00
FTPLoader.java introduced a better place to update the lastacc time value in latency 2012-12-07 15:49:23 +01:00
HTTPLoader.java reduced load on solr; no seed update in Status and no exists-check in 2013-06-12 00:14:55 +02:00
ImporterException.java refactoring 2012-09-21 15:48:16 +02:00
Request.java replaced the custom robots.txt loader by the standard http loader 2012-10-28 22:48:11 +01:00
Response.java added url_file_name_s in default collection schema for the file name 2013-06-25 16:27:20 +02:00
RSSLoader.java reduced locking situation in crawler: shifted synchronized location and 2013-05-20 22:05:28 +02:00
SitemapImporter.java redesign of index.exist-test: this shall now not be done using a single 2013-05-17 13:59:37 +02:00
SMBLoader.java added url_file_name_s in default collection schema for the file name 2013-06-25 16:27:20 +02:00
URLRewriterLibrary.java added an url rewriter which can be used to remove session ids from urls 2012-10-09 11:24:48 +02:00