8443 Commits

Author SHA1 Message Date
Michael Peter Christen
8bee1472c9 there is no noindex, only nofollow in links 2012-01-31 23:46:35 +01:00
Michael Peter Christen
5e18f54a8c added shell script to get a servlet. this is the same as apicall.sh but it prints the result to stdout 2012-01-31 23:21:49 +01:00
Michael Peter Christen
3cd6dcd352 do not add new solr fields as activated fields 2012-01-31 22:21:48 +01:00
Michael Peter Christen
e3bb73c3d6 serialized some database access methods 2012-01-31 21:13:49 +01:00
Michael Peter Christen
9727015213 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2012-01-31 18:18:13 +01:00
Michael Peter Christen
7e728867e5 added a synchronization around iterations to prevent IO-deadlocking
during concurrent remote search requests
2012-01-31 18:17:25 +01:00
david
f077b11d38 Merge branch 'master' of git://git.gitorious.org/yacy/rc1.git 2012-01-30 20:02:11 +01:00
Lotus
29675d9766 more label on search options (usability) 2012-01-30 20:02:02 +01:00
Michael Peter Christen
355ecf330f reduced target file size to 64mb 2012-01-29 20:35:48 +01:00
reger
fa1f35b0c8 Merge rc1/master 2012-01-29 20:06:10 +01:00
Michael Peter Christen
b4bc1e2875 remote search does not do snippet generation 2012-01-29 19:25:09 +01:00
reger
46c986f8f7 Merge rc1/master 2012-01-29 00:48:54 +01:00
Lotus
335a776351 xss hardening on Status.html 2012-01-28 13:25:12 +01:00
reger
55518c600f Merge rc1/master 2012-01-26 22:43:34 +01:00
reger
943165c9a4 upd Netbeans IDE lib setting
to use currently updated jars (e.g. solrj3.5.0 etc.)
2012-01-26 22:25:42 +01:00
Michael Peter Christen
10ae6d94a1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2012-01-26 18:11:06 +01:00
Michael Peter Christen
2ea585d616 fix for host navigator 2012-01-26 18:10:34 +01:00
Michael Peter Christen
2f6dde92e2 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2012-01-26 16:45:33 +01:00
Michael Peter Christen
c560a582ac fix for single-word vocabulary lines 2012-01-26 16:44:30 +01:00
Michael Peter Christen
4c5edab1ec added option to have exception search result windows 2012-01-26 15:32:30 +01:00
Michael Peter Christen
329e3eebcf added example vocabularies and explanation how to use them 2012-01-26 11:20:14 +01:00
Michael Peter Christen
046d7de95b Merge remote branch 'reger/master' 2012-01-26 10:47:40 +01:00
reger
a95f645a61 Bugfix class repository.Loaddispatcher fixed download file limit of 10000
line 355: final Response response = this.load(request, cachePolicy, 10000, true);
2012-01-26 04:10:44 +01:00
Michael Peter Christen
32adad7dd5 show less navigation by default 2012-01-26 01:09:34 +01:00
Michael Peter Christen
ef78f22ee1 performance hack 2012-01-25 12:48:48 +01:00
Michael Peter Christen
41536eb4a2 performance hack 2012-01-25 12:28:56 +01:00
Michael Peter Christen
88b86afc89 no DoS protection for intranet mode 2012-01-25 12:13:03 +01:00
Michael Peter Christen
0f443ac755 automatic switching off of navigation that is not useful 2012-01-25 12:07:24 +01:00
Michael Peter Christen
852ce43d99 better rules for default open/close of navigation objects 2012-01-25 11:53:25 +01:00
Michael Peter Christen
f91487fc50 added delete-button for host navigation 2012-01-25 11:19:18 +01:00
Michael Peter Christen
e8d24fd802 author navigator can be switched off 2012-01-25 11:11:42 +01:00
Michael Peter Christen
558ab7bd4e made the protocol navigator reversible 2012-01-25 02:54:52 +01:00
Michael Peter Christen
96cb75f1d4 made the filetype navigator be able to deselect the search constraint 2012-01-25 02:50:06 +01:00
Michael Peter Christen
9ebcae2fbc enhanced url parser to understand urls with &amp; instead of & in post
urls
2012-01-25 02:42:24 +01:00
Michael Peter Christen
30891d026f added a remove-navigation for vocabularies 2012-01-25 00:22:51 +01:00
Michael Peter Christen
1f4f60654a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
Conflicts:
	source/net/yacy/document/parser/pdfParser.java
2012-01-24 20:42:30 +01:00
Michael Peter Christen
d5ead5314d changed navigation links: now using checkboxes.
This looks better and allows negative checkboxes (ones that remove
the navigation). These are not yet implemented (coming
next)
2012-01-24 19:03:47 +01:00
reger
32104360ce PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by
time out.
2012-01-23 20:58:36 +01:00
Michael Peter Christen
696ee5fc16 removed pdf from default parser deny list 2012-01-23 17:27:58 +01:00
Michael Peter Christen
ef5192f8c9 using the generic document parser for crawl starts instead of the html
parser. This makes it possible that every type of document can be a
crawl start point, not only text documents or html documents. Tested
this with a pdf document.
2012-01-23 17:27:29 +01:00
Michael Peter Christen
33a71a61fa Merge commit 'b60e2e952102c3eae40ab98c892a8c7d1b478345' 2012-01-23 11:20:32 +01:00
reger
b60e2e9521 PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-23 04:33:07 +01:00
Michael Peter Christen
a02fdf8625 better error messages 2012-01-23 00:47:25 +01:00
Michael Peter Christen
eadb58dd87 small enhancements in pdf parser 2012-01-23 00:46:02 +01:00
Michael Peter Christen
c6ba44468e timeout = 5000 instead of 3000 2012-01-23 00:45:32 +01:00
Michael Peter Christen
44491ec6dd Merge commit 'b616de59735d33b922b7d2fbccdcc9031b77fa6e' 2012-01-22 19:08:27 +01:00
Lotus
d2ca33ccd7 Java update 2012-01-21 18:33:21 +01:00
Lotus
59ebee5de2 highlight changed row on api table 2012-01-21 18:29:53 +01:00
reger
b616de5973 PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-21 03:15:12 +01:00
Michael Peter Christen
ce620be783 fix for crawl start with smb url 2012-01-19 23:07:15 +01:00