Michael Peter Christen
7603e879dc
Merge branch 'master' into HEAD
...
Conflicts:
.classpath
source/net/yacy/cora/federate/solr/SolrServlet.java
2013-12-20 01:19:06 +01:00
Michael Peter Christen
25250405f1
solr servlet preparation for join with jetty branch
2013-12-20 00:45:58 +01:00
Michael Peter Christen
8b97489ff2
updated guava to 15.0
2013-12-19 22:21:33 +01:00
Michael Peter Christen
34b4eda4a8
upgraded json-simple to 1.1.1
2013-12-19 22:16:01 +01:00
Michael Peter Christen
75ae36da9c
upgraded jsch to 0.1.50
2013-12-19 22:13:34 +01:00
Michael Peter Christen
db793a2a5e
removed mysql connector which was used only for testing in the past
2013-12-19 22:09:01 +01:00
Michael Peter Christen
7ebc74b76a
migrated to pdfbox 1.8.3
2013-12-19 22:06:10 +01:00
Michael Peter Christen
2f16770681
migrated to solr 4.6.0
2013-12-19 21:51:05 +01:00
Michael Peter Christen
b5ce834059
next try to fix m4
2013-12-19 15:35:31 +01:00
Michael Peter Christen
439f8927fc
fix for deb m4
2013-12-19 14:40:11 +01:00
Michael Peter Christen
57f0f71ac6
added patch to allow binary response writer
2013-12-19 10:13:43 +01:00
Michael Peter Christen
31f4607f2e
update to Mac start parameters
2013-12-19 09:42:01 +01:00
Michael Peter Christen
8de3f55a7a
removed malicious java start arguments in debian start script which had
...
been removed from the other start scripts recently. The reason to do so
was a comparisment of a debian-installed YaCy with 20 million document
which crashed after 10 hours with the debian start script, but did not
crash with the startYACY.sh start script. Both scripts now use the same
java start arguments.
Added also the Solr MMapDirectoryFactory switch which was missing so far
in the debian start script.
2013-12-19 09:07:04 +01:00
Michael Peter Christen
a909a3dbc4
removed non-root restriction in startYACY.sh because that is also not
...
applied for the debian version. There is no reason to apply this
restriction.
Added also the -d64 option if the system is 64 bit.
2013-12-19 09:04:23 +01:00
orbiter
937273d4e3
added parsing of metadata to surrogate reading:
...
a dublin core record inside of surrogate input files may now contain
tokens within the namespace 'md' (short for: metadata). The token names
must be valid withing the namespace of the solr field names. All
md-tokens inside of surrogate files then overwrite values within solr
documents before they are written to the solr index. This makes it
possible to assign collection names to each surrogate entry and also
ranking information can be added. Please see the example file.
2013-12-17 14:02:27 +01:00
reger
18497f6475
remove unused init parameter from DefaultServlet
...
- remove "RelativeResourceBase" parameter
2013-12-15 23:39:19 +01:00
orbiter
4de3fefdb5
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2013-12-15 19:13:00 +01:00
orbiter
7e346e1d79
using stringbuilder in query construction
2013-12-15 19:12:49 +01:00
reger
c84c313fe1
Merge origin/master into jetty
2013-12-14 20:02:24 +01:00
reger
69829fc417
update Netbeans GUI build classpath
2013-12-14 19:50:41 +01:00
Michael Peter Christen
2702d9e56b
- added a SolrQueryResponse2SolrDocumentList method which is able to
...
work around the unfolding process in Solr's BinaryResponseWriter.
This was a huge performance bottleneck in the embedded solr connector
and the problem is actually on Solr side, but we have now a workaround.
- This made it possible to abstract a high-performance index access
method which is implemented as method getDocumentListByParams. That
method is also implemented in the SolrServerConnector and provides a
very efficient access to a solr index if the index is embedded.
- a popular use of the document list retrieval is a result count which
can now also make use of the new method, via getDocumentCountByParams.
- enhanced the Error cache which now does not store error documents
within the ram cache if the document is also written to solr. When
documents are retrieved from the cache, they are partly read from the
ram cache and if not existent there, from the Solr index.
2013-12-13 15:56:29 +01:00
Michael Peter Christen
74466d731a
use pre-compiled patterns in ymark
2013-12-12 11:50:48 +01:00
Michael Peter Christen
34633044b4
made pattern computation static
2013-12-12 10:55:36 +01:00
Michael Peter Christen
ef7ddbc933
added date parser caches to prevent re-calculation of costly date
...
parsing
2013-12-12 10:55:12 +01:00
Michael Peter Christen
552ef9f18e
fix for bad ErrorCache.exists test (bug from latest commit)
2013-12-12 10:38:32 +01:00
Michael Peter Christen
09412ea3a4
counting search requests in solr interface
2013-12-12 03:37:19 +01:00
Michael Peter Christen
303f5694ba
avoid usage of existsByQuery. If a document can be loaded by the ID
...
before testing other fields from the existsByQuery request, then a
document cache fills and queries after that one can be avoided.
2013-12-12 03:36:30 +01:00
reger
b43bbd3cc4
join DefaultServlet and Jetty8 implementation
...
- removing Jetty 8 specific dependencies
2013-12-09 23:45:57 +01:00
reger
8ac48aac27
update Maven pom to latest version number
...
- include newer dependency versions of several lib/jar for eval.
2013-12-09 23:43:58 +01:00
reger
089c5007ee
move conditionalHeader to DefaultServlet
...
- by removing Jetty specific implementation detail
2013-12-08 00:56:45 +01:00
Michael Peter Christen
67e7dc0cc6
added more properties to seedlist servlet
2013-12-06 14:30:47 +01:00
Michael Peter Christen
79771c60c0
IPv6 fixes
2013-12-06 14:30:08 +01:00
Michael Peter Christen
4e3375d983
next development version
2013-12-06 13:47:50 +01:00
reger
92d9c56f9f
Merge origin/master into jetty
2013-12-05 22:53:29 +01:00
Michael Peter Christen
f722e450b3
changed start parameters which caused deadlocks in mac and windows
...
versions
2013-12-05 00:55:35 +01:00
Michael Peter Christen
ddc7a24853
intermediate release 1.66
2013-12-04 23:16:06 +01:00
Michael Peter Christen
78eac85161
better calibration of caches and queue maximum sizes
2013-12-04 23:15:10 +01:00
Michael Peter Christen
da380343c2
perform greedy learning heuristic only if load < 1.0
2013-12-04 22:44:51 +01:00
Michael Peter Christen
81926c055d
fixed bug with image search in yacyinteractive
2013-12-04 18:44:23 +01:00
Michael Peter Christen
edda0699e4
changed default timeout for port scanner
2013-12-04 18:13:43 +01:00
Michael Peter Christen
c8af19bd37
removed unnecessary check which causes a NPE when searching with empty
...
search string
2013-12-04 17:58:36 +01:00
Michael Peter Christen
e3c2f09de9
- reduce computation in case that specific postprocessing fields are not
...
selected
- de-select citation rank computation
2013-12-04 17:48:12 +01:00
Michael Peter Christen
cfa08024c7
removed optimization bevore postprocessing because that may cause a
...
time-out which will cause that postprocessing fails.
2013-12-04 16:04:29 +01:00
Michael Peter Christen
6f3a923691
fixed urlmask which was not able to combine several constraints
2013-12-04 13:48:01 +01:00
Michael Peter Christen
9a27bf6e82
removed filter computation in Protocol class for remote searches because
...
that is already done in the QueryParams class
2013-12-04 13:09:15 +01:00
Michael Peter Christen
f1b5db2c45
- performance graph does not shop peer ping in memory monitor any more
...
- after a forced GC, the PerformanceMemory view switches to automatic
update by default
2013-12-04 12:59:30 +01:00
Michael Peter Christen
a125904a1c
fixed a NPE in surrogat processing
2013-12-04 01:56:38 +01:00
Michael Peter Christen
0db8e34625
enhanced webgraph processing
2013-12-04 01:54:45 +01:00
Michael Peter Christen
9d8b32c63a
fixed a division by zero
2013-12-04 01:54:14 +01:00
reger
ac067b5236
clean-up Jetty handler classes
2013-12-01 19:36:24 +01:00