yacy_search_server/source/net/yacy/crawler/retrieval
Michael Peter Christen 8c3e5b7b6d added experimental pdf splitting which enables YaCy to split pdfs during
parsing into individual pages and add them all using different URLs.
These constructed urls are generated from the source url with an
appended page=<pagenumber> attribute to the url get/post properties.
This will distinguish the different page entries. The search result list
will then replace the post parameter with a url anchor # mark which
causes that the original url is presented in the search result. These
URLs can be opened directly on the correct page using pdf.js which is
now built-in into firefox. That means: if you find a search hit on page
5 and click on the search result, firefox will open the pdf viewer and
shows page 5.
2014-12-21 18:10:15 +01:00
..
FileLoader.java fix for file urls 2014-07-23 17:54:31 +02:00
FTPLoader.java fixes and speed updates for search process 2014-09-10 14:24:03 +02:00
HTTPLoader.java moved snapshot generation out of the html handler to prevent that 2014-12-01 17:37:25 +01:00
ImporterException.java refactoring 2012-09-21 15:48:16 +02:00
Request.java remove the unused Request variable 2014-12-05 03:03:28 +01:00
Response.java fixes to usage of no-cache: use and recognize also the no-store 2014-12-19 17:37:58 +01:00
RSSLoader.java better error handling for remote solr queries and exists-checks 2014-08-01 11:00:10 +02:00
SitemapImporter.java added experimental pdf splitting which enables YaCy to split pdfs during 2014-12-21 18:10:15 +01:00
SMBLoader.java use configured admin-username for api calls 2014-01-07 21:26:50 +01:00
URLRewriterLibrary.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00