yacy_search_server/source/net/yacy
luccioman fcf6b16db4 Added new crawler attribute for finer control over Media Type detection
A new "Media Type detection" section in the advanced crawl start page
lets you choose between:
- not loading URLs with an unknown or unsupported file extension, without
checking the actual Media Type (which for now is taken from the
Content-Type header). This was the old default behavior: faster, but not
very accurate.
- always cross-checking the URL file extension against the actual Media
Type. This allows proper parsing of URLs that end with an apparently odd
file extension but actually have a supported Media Type such as
text/html.
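
The two options above can be illustrated with a minimal Java sketch. It is
not YaCy's implementation: the class, enum, and method names, the supported
extension and Media Type sets, and the handling of URLs without any
extension are all assumptions made for illustration.

```java
import java.util.Locale;
import java.util.Set;

/** Hypothetical sketch; none of these names come from the YaCy code base. */
final class MediaTypeCheckSketch {

    /** The two strategies from the commit message (illustrative names). */
    enum Strategy { EXTENSION_ONLY, CROSS_CHECK_MEDIA_TYPE }

    // Assumed, deliberately tiny sets of what a parser could handle.
    private static final Set<String> SUPPORTED_EXTENSIONS =
            Set.of("html", "htm", "txt", "pdf");
    private static final Set<String> SUPPORTED_MEDIA_TYPES =
            Set.of("text/html", "text/plain", "application/pdf");

    /** Returns the lower-cased extension of a URL path, or "" if there is none. */
    static String extensionOf(String urlPath) {
        int slash = urlPath.lastIndexOf('/');
        int dot = urlPath.lastIndexOf('.');
        if (dot <= slash || dot == urlPath.length() - 1) {
            return "";
        }
        return urlPath.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    /** Strips parameters such as "; charset=UTF-8" from a Content-Type value. */
    static String normalizeMediaType(String contentType) {
        if (contentType == null) {
            return "";
        }
        int semi = contentType.indexOf(';');
        String base = semi >= 0 ? contentType.substring(0, semi) : contentType;
        return base.trim().toLowerCase(Locale.ROOT);
    }

    /** Decides whether a document should be parsed under the given strategy. */
    static boolean shouldParse(String urlPath, String contentType, Strategy strategy) {
        String extension = extensionOf(urlPath);
        boolean mediaTypeOk =
                SUPPORTED_MEDIA_TYPES.contains(normalizeMediaType(contentType));
        if (strategy == Strategy.EXTENSION_ONLY && !extension.isEmpty()) {
            // Old behavior: trust the extension, never look at the Media Type.
            return SUPPORTED_EXTENSIONS.contains(extension);
        }
        // Cross-check mode (and, by assumption, URLs without any extension):
        // the Media Type reported by the server decides.
        return mediaTypeOk;
    }
}
```

With these assumed sets, a path such as `/article.aspx` served as `text/html`
is skipped under `EXTENSION_ONLY` but parsed under `CROSS_CHECK_MEDIA_TYPE`.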

Sample URLs with misleading file extensions added as documentation in
the crawl start page.

Fixes issue #244
2018-10-25 10:42:12 +02:00
contentcontrol Removed some unnecessary uses of the java.lang.reflect API. 2017-08-24 18:47:18 +02:00
cora Added support for PDF snapshots generation when running on MS Windows 2018-10-18 12:41:57 +02:00
crawler Added new crawler attribute for finer control over Media Type detection 2018-10-25 10:42:12 +02:00
data Removed remaining unsafe accesses to SimpleDateFormat instances. 2018-07-02 10:00:40 +02:00
document Added new crawler attribute for finer control over Media Type detection 2018-10-25 10:42:12 +02:00
gui Fixed small unnecessary lines of code 2018-07-08 08:15:26 +02:00
http Render a relevant HTTP status code on snapshot image rendering error 2018-10-14 10:30:30 +02:00
kelondro Added server-side column sorting on the Process Scheduler table 2018-07-04 10:28:32 +02:00
peers Use a constant default target file name for seed SCP upload method 2018-09-16 10:37:47 +02:00
repository Decode blacklist entries for easier editing of non-ASCII chars 2018-10-04 09:33:58 +02:00
search Added new crawler attribute for finer control over Media Type detection 2018-10-25 10:42:12 +02:00
server Fixed concurrency issue in custom classloader used for template classes 2018-10-11 18:34:39 +02:00
utils Added a utility to generate/update XLIFF master file from lng files. 2018-01-29 18:34:47 +01:00
visualization Enable full size images preview for users with extended search rights 2017-12-22 11:39:30 +01:00
migration.java Updated the list of audio file formats supported by the audioTagParser 2018-02-27 18:04:12 +01:00
yacy.java Improved support for Java logs file pattern options 2018-09-13 12:17:02 +02:00