Michael Peter Christen
8bee1472c9
there is no noindex, only nofollow in links
2012-01-31 23:46:35 +01:00
Michael Peter Christen
5e18f54a8c
added shell script to get a servlet. this is the same as apicall.sh but it prints the result to stdout
2012-01-31 23:21:49 +01:00
Michael Peter Christen
3cd6dcd352
do not add new solr fields as activated fields
2012-01-31 22:21:48 +01:00
Michael Peter Christen
e3bb73c3d6
serialized some database access methods
2012-01-31 21:13:49 +01:00
Michael Peter Christen
9727015213
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-31 18:18:13 +01:00
Michael Peter Christen
7e728867e5
added a synchronization around iterations to prevent IO-deadlocking
...
during concurrent remote search requests
2012-01-31 18:17:25 +01:00
david
f077b11d38
Merge branch 'master' of git://git.gitorious.org/yacy/rc1.git
2012-01-30 20:02:11 +01:00
Lotus
29675d9766
more label on search options (usability)
2012-01-30 20:02:02 +01:00
Michael Peter Christen
355ecf330f
reduced target file site to 64mb
2012-01-29 20:35:48 +01:00
reger
fa1f35b0c8
Merge rc1/master
2012-01-29 20:06:10 +01:00
Michael Peter Christen
b4bc1e2875
remote search does not do snippet generation
2012-01-29 19:25:09 +01:00
reger
46c986f8f7
Merge rc1/master
2012-01-29 00:48:54 +01:00
Lotus
335a776351
xss hardening on Status.html
2012-01-28 13:25:12 +01:00
reger
55518c600f
Merge rc1/master
2012-01-26 22:43:34 +01:00
reger
943165c9a4
upd Netbeans IDE lib setting
...
to use currently updated jars (e.g. solrj3.5.0 etc.)
2012-01-26 22:25:42 +01:00
Michael Peter Christen
10ae6d94a1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-26 18:11:06 +01:00
Michael Peter Christen
2ea585d616
fix for host navigator
2012-01-26 18:10:34 +01:00
Michael Peter Christen
2f6dde92e2
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-26 16:45:33 +01:00
Michael Peter Christen
c560a582ac
fix for single-word vocabulary lines
2012-01-26 16:44:30 +01:00
Michael Peter Christen
4c5edab1ec
added option to have exception search result windows
2012-01-26 15:32:30 +01:00
Michael Peter Christen
329e3eebcf
added example vocabularies and explanation how to use them
2012-01-26 11:20:14 +01:00
Michael Peter Christen
046d7de95b
Merge remote branch 'reger/master'
2012-01-26 10:47:40 +01:00
reger
a95f645a61
Bugfix class repository.Loaddispatcher fixed download file limit of 10000
...
line 355: final Response response = this.load(request, cachePolicy, 10000, true);
2012-01-26 04:10:44 +01:00
Michael Peter Christen
32adad7dd5
show less navigation by default
2012-01-26 01:09:34 +01:00
Michael Peter Christen
ef78f22ee1
performance hack
2012-01-25 12:48:48 +01:00
Michael Peter Christen
41536eb4a2
performance hack
2012-01-25 12:28:56 +01:00
Michael Peter Christen
88b86afc89
no DoS protection for intranet mode
2012-01-25 12:13:03 +01:00
Michael Peter Christen
0f443ac755
automatic switching off of navigation that is not useful
2012-01-25 12:07:24 +01:00
Michael Peter Christen
852ce43d99
better rules for default open/close of navigation objetcs
2012-01-25 11:53:25 +01:00
Michael Peter Christen
f91487fc50
added delete-button for host navigation
2012-01-25 11:19:18 +01:00
Michael Peter Christen
e8d24fd802
author navigator can be switched off
2012-01-25 11:11:42 +01:00
Michael Peter Christen
558ab7bd4e
made the protocol navigator reversible
2012-01-25 02:54:52 +01:00
Michael Peter Christen
96cb75f1d4
made the filetype navigator be able to deselect the search constraint
2012-01-25 02:50:06 +01:00
Michael Peter Christen
9ebcae2fbc
enhanced url parser to understand urls with & instead of & in post
...
urls
2012-01-25 02:42:24 +01:00
Michael Peter Christen
30891d026f
added a remove-navigation for vocabularies
2012-01-25 00:22:51 +01:00
Michael Peter Christen
1f4f60654a
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
...
Conflicts:
source/net/yacy/document/parser/pdfParser.java
2012-01-24 20:42:30 +01:00
Michael Peter Christen
d5ead5314d
changed navigation links: now using checkboxes.
...
This looks better and allows that negative checkboxes (such that remove
the navigation) are possible. These are not yet implemented (comming
next)
2012-01-24 19:03:47 +01:00
reger
32104360ce
PDFParser - return at least first 3 pages of PDF
...
fix for pdf parsing without returning parsed text due to interruption by
time out.
2012-01-23 20:58:36 +01:00
Michael Peter Christen
696ee5fc16
removed pdf from default parser deny list
2012-01-23 17:27:58 +01:00
Michael Peter Christen
ef5192f8c9
using the generic document parser for crawl starts instead of the html
...
parser. This makes it possible that every type of document can be a
crawl start point, not only text documents or html documents. Testet
this with a pdf document.
2012-01-23 17:27:29 +01:00
Michael Peter Christen
33a71a61fa
Merge commit 'b60e2e952102c3eae40ab98c892a8c7d1b478345'
2012-01-23 11:20:32 +01:00
reger
b60e2e9521
PDFParser - return at least first 3 pages of PDF
...
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-23 04:33:07 +01:00
Michael Peter Christen
a02fdf8625
better error messages
2012-01-23 00:47:25 +01:00
Michael Peter Christen
eadb58dd87
small enhancements in pdf parser
2012-01-23 00:46:02 +01:00
Michael Peter Christen
c6ba44468e
timeout = 5000 instead 3000
2012-01-23 00:45:32 +01:00
Michael Peter Christen
44491ec6dd
Merge commit 'b616de59735d33b922b7d2fbccdcc9031b77fa6e'
2012-01-22 19:08:27 +01:00
Lotus
d2ca33ccd7
Java update
2012-01-21 18:33:21 +01:00
Lotus
59ebee5de2
highlight changed row on api table
2012-01-21 18:29:53 +01:00
reger
b616de5973
PDFParser - return at least first 3 pages of PDF
...
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-21 03:15:12 +01:00
Michael Peter Christen
ce620be783
for for crawl start with smb url
2012-01-19 23:07:15 +01:00