Michael Peter Christen
4540174fe0
memory hacks
2012-02-02 07:37:00 +01:00
Michael Peter Christen
b4409cc803
small redesign of blob column index and usage
2012-02-02 06:43:57 +01:00
Michael Peter Christen
d5c1f2746e
performance hack
2012-02-02 06:43:15 +01:00
Michael Peter Christen
803963aebd
performance hack: better space grow in CharBuffer (speeds up html
...
parser)
2012-02-01 23:27:59 +01:00
Michael Peter Christen
8b0920b0b5
tried to fix the ipv6 problem as reported in bug
...
but this did not solve all problems because a bug in the apache http
client prevented that it worked. Thread dump:
Caused by: java.lang.NumberFormatException: For input string:
"1450:400c:c01:0:0:0:69"
at
java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
at java.lang.Integer.parseInt(Integer.java:458)
at java.lang.Integer.parseInt(Integer.java:499)
at org.apache.http.client.utils.URIUtils.extractHost(URIUtils.java:310)
at
org.apache.http.impl.client.AbstractHttpClient.determineTarget(AbstractHttpClient.java:764)
at
org.apache.http.impl.client.AbstractHttpClient.execute(AbstractHttpClient.java:754)
at net.yacy.cora.protocol.http.HTTPClient.execute(HTTPClient.java:597)
at
net.yacy.cora.protocol.http.HTTPClient.getContentBytes(HTTPClient.java:558)
at net.yacy.cora.protocol.http.HTTPClient.GETbytes(HTTPClient.java:341)
at de.anomic.crawler.retrieval.HTTPLoader.load(HTTPLoader.java:131)
at de.anomic.crawler.retrieval.HTTPLoader.load(HTTPLoader.java:74)
at
net.yacy.repository.LoaderDispatcher.loadInternal(LoaderDispatcher.java:274)
at net.yacy.repository.LoaderDispatcher.load(LoaderDispatcher.java:164)
at net.yacy.repository.LoaderDispatcher.load(LoaderDispatcher.java:150)
at
net.yacy.repository.LoaderDispatcher.loadDocument(LoaderDispatcher.java:355)
at getpageinfo_p.respond(getpageinfo_p.java:97)
2012-02-01 22:26:19 +01:00
Michael Peter Christen
e2f8f263e8
changed storage of search words: keep order
2012-02-01 18:13:31 +01:00
Michael Peter Christen
ed39ef2890
changed generation of protocol information
2012-02-01 18:12:59 +01:00
Michael Peter Christen
0b67a0a5d8
added a column index for tables in blob files. This is heavily used
...
during receiving of DHT submissions and when answering remote search
requests. Both events together may have caused IO-deadlocking and this
commit shall fix that.
2012-02-01 15:11:21 +01:00
Michael Peter Christen
ffb72249ea
added missing apicat.sh
2012-02-01 00:49:40 +01:00
Michael Peter Christen
c166eb68b6
fixes in solr schema file
2012-02-01 00:22:43 +01:00
Michael Peter Christen
2e5cd6a1b2
fixed parser extension deny list generation and usage
2012-02-01 00:15:59 +01:00
Michael Peter Christen
8bee1472c9
there is no noindex, only nofollow in links
2012-01-31 23:46:35 +01:00
Michael Peter Christen
5e18f54a8c
added shell script to get a servlet. this is the same as apicall.sh but it prints the result to stdout
2012-01-31 23:21:49 +01:00
Michael Peter Christen
3cd6dcd352
do not add new solr fields as activated fields
2012-01-31 22:21:48 +01:00
Michael Peter Christen
e3bb73c3d6
serialized some database access methods
2012-01-31 21:13:49 +01:00
Michael Peter Christen
9727015213
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-31 18:18:13 +01:00
Michael Peter Christen
7e728867e5
added a synchronization around iterations to prevent IO-deadlocking
...
during concurrent remote search requests
2012-01-31 18:17:25 +01:00
david
f077b11d38
Merge branch 'master' of git://git.gitorious.org/yacy/rc1.git
2012-01-30 20:02:11 +01:00
Lotus
29675d9766
more label on search options (usability)
2012-01-30 20:02:02 +01:00
Michael Peter Christen
355ecf330f
reduced target file site to 64mb
2012-01-29 20:35:48 +01:00
reger
fa1f35b0c8
Merge rc1/master
2012-01-29 20:06:10 +01:00
Michael Peter Christen
b4bc1e2875
remote search does not do snippet generation
2012-01-29 19:25:09 +01:00
reger
46c986f8f7
Merge rc1/master
2012-01-29 00:48:54 +01:00
Lotus
335a776351
xss hardening on Status.html
2012-01-28 13:25:12 +01:00
reger
55518c600f
Merge rc1/master
2012-01-26 22:43:34 +01:00
reger
943165c9a4
upd Netbeans IDE lib setting
...
to use currently updated jars (e.g. solrj3.5.0 etc.)
2012-01-26 22:25:42 +01:00
Michael Peter Christen
10ae6d94a1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-26 18:11:06 +01:00
Michael Peter Christen
2ea585d616
fix for host navigator
2012-01-26 18:10:34 +01:00
Michael Peter Christen
2f6dde92e2
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-01-26 16:45:33 +01:00
Michael Peter Christen
c560a582ac
fix for single-word vocabulary lines
2012-01-26 16:44:30 +01:00
Michael Peter Christen
4c5edab1ec
added option to have exception search result windows
2012-01-26 15:32:30 +01:00
Michael Peter Christen
329e3eebcf
added example vocabularies and explanation how to use them
2012-01-26 11:20:14 +01:00
Michael Peter Christen
046d7de95b
Merge remote branch 'reger/master'
2012-01-26 10:47:40 +01:00
reger
a95f645a61
Bugfix class repository.Loaddispatcher fixed download file limit of 10000
...
line 355: final Response response = this.load(request, cachePolicy, 10000, true);
2012-01-26 04:10:44 +01:00
Michael Peter Christen
32adad7dd5
show less navigation by default
2012-01-26 01:09:34 +01:00
Michael Peter Christen
ef78f22ee1
performance hack
2012-01-25 12:48:48 +01:00
Michael Peter Christen
41536eb4a2
performance hack
2012-01-25 12:28:56 +01:00
Michael Peter Christen
88b86afc89
no DoS protection for intranet mode
2012-01-25 12:13:03 +01:00
Michael Peter Christen
0f443ac755
automatic switching off of navigation that is not useful
2012-01-25 12:07:24 +01:00
Michael Peter Christen
852ce43d99
better rules for default open/close of navigation objetcs
2012-01-25 11:53:25 +01:00
Michael Peter Christen
f91487fc50
added delete-button for host navigation
2012-01-25 11:19:18 +01:00
Michael Peter Christen
e8d24fd802
author navigator can be switched off
2012-01-25 11:11:42 +01:00
Michael Peter Christen
558ab7bd4e
made the protocol navigator reversible
2012-01-25 02:54:52 +01:00
Michael Peter Christen
96cb75f1d4
made the filetype navigator be able to deselect the search constraint
2012-01-25 02:50:06 +01:00
Michael Peter Christen
9ebcae2fbc
enhanced url parser to understand urls with & instead of & in post
...
urls
2012-01-25 02:42:24 +01:00
Michael Peter Christen
30891d026f
added a remove-navigation for vocabularies
2012-01-25 00:22:51 +01:00
Michael Peter Christen
1f4f60654a
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
...
Conflicts:
source/net/yacy/document/parser/pdfParser.java
2012-01-24 20:42:30 +01:00
Michael Peter Christen
d5ead5314d
changed navigation links: now using checkboxes.
...
This looks better and allows that negative checkboxes (such that remove
the navigation) are possible. These are not yet implemented (comming
next)
2012-01-24 19:03:47 +01:00
reger
32104360ce
PDFParser - return at least first 3 pages of PDF
...
fix for pdf parsing without returning parsed text due to interruption by
time out.
2012-01-23 20:58:36 +01:00
Michael Peter Christen
696ee5fc16
removed pdf from default parser deny list
2012-01-23 17:27:58 +01:00