yacy_search_server/source/de/anomic
orbiter dba7ef5144 extended crawling constraints:
- removed never-used secondary crawl depth
- added a must-not-match filter that can be used to exclude urls from a crawl
- added stub for crawl tags which will be used to identify search results that had been produced from specific crawls
please update the yacybar: replace property name 'crawlFilter' with 'mustmatch'.
Additionally, a new parameter named 'mustnotmatch' can be used, which should be by default the empty sring (match-never)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5342 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-11-14 09:58:56 +00:00
..
crawler extended crawling constraints: 2008-11-14 09:58:56 +00:00
data extended crawling constraints: 2008-11-14 09:58:56 +00:00
htmlFilter * improve encoding detection of http service 2008-11-12 21:06:32 +00:00
http * improve encoding detection of http service 2008-11-12 21:06:32 +00:00
icap - added some performance tweaks to the new BLOB buffer 2008-10-19 18:10:42 +00:00
index * added utf8-encoding to many getBytes-calls 2008-11-08 20:24:31 +00:00
kelondro - more space in error db to store larger error messages 2008-11-11 21:42:12 +00:00
language/identification integrated language detection classes into condenser environment 2008-09-18 13:12:33 +00:00
net refactoring and new architecture to store the files of the web cache: 2008-10-16 21:24:09 +00:00
plasma extended crawling constraints: 2008-11-14 09:58:56 +00:00
server added property index.storeCommons to switch commons storage on or off 2008-11-02 23:30:09 +00:00
tools performance hacks 2008-10-20 14:07:09 +00:00
urlRedirector extended crawling constraints: 2008-11-14 09:58:56 +00:00
xml * removed some warnings of findbugs (http://findbugs.sf.net) 2008-08-06 19:43:12 +00:00
yacy simple fix to get DHT working again (maybe something more has to be done ;) 2008-11-11 18:55:16 +00:00
ymage different handling of error cases that occur during loading files with http or ftp: 2008-11-11 21:33:40 +00:00