Commit Graph

10178 Commits

Author SHA1 Message Date
Michael Peter Christen
7603e879dc Merge branch 'master' into HEAD
Conflicts:
	.classpath
	source/net/yacy/cora/federate/solr/SolrServlet.java
2013-12-20 01:19:06 +01:00
Michael Peter Christen
25250405f1 solr servlet preparation for join with jetty branch 2013-12-20 00:45:58 +01:00
Michael Peter Christen
8b97489ff2 updated guava to 15.0 2013-12-19 22:21:33 +01:00
Michael Peter Christen
34b4eda4a8 upgraded json-simple to 1.1.1 2013-12-19 22:16:01 +01:00
Michael Peter Christen
75ae36da9c upgraded jsch to 0.1.50 2013-12-19 22:13:34 +01:00
Michael Peter Christen
db793a2a5e removed mysql connector which was used only for testing in the past 2013-12-19 22:09:01 +01:00
Michael Peter Christen
7ebc74b76a migrated to pdfbox 1.8.3 2013-12-19 22:06:10 +01:00
Michael Peter Christen
2f16770681 migrated to solr 4.6.0 2013-12-19 21:51:05 +01:00
Michael Peter Christen
b5ce834059 next try to fix m4 2013-12-19 15:35:31 +01:00
Michael Peter Christen
439f8927fc fix for deb m4 2013-12-19 14:40:11 +01:00
Michael Peter Christen
57f0f71ac6 added patch to allow binary response writer 2013-12-19 10:13:43 +01:00
Michael Peter Christen
31f4607f2e update to Mac start parameters 2013-12-19 09:42:01 +01:00
Michael Peter Christen
8de3f55a7a removed malicious java start arguments in debian start script which had
been removed from the other start scripts recently. The reason to do so
was a comparisment of a debian-installed YaCy with 20 million document
which crashed after 10 hours with the debian start script, but did not
crash with the startYACY.sh start script. Both scripts now use the same
java start arguments.
Added also the Solr MMapDirectoryFactory switch which was missing so far
in the debian start script.
2013-12-19 09:07:04 +01:00
Michael Peter Christen
a909a3dbc4 removed non-root restriction in startYACY.sh because that is also not
applied for the debian version. There is no reason to apply this
restriction.
Added also the -d64 option if the system is 64 bit.
2013-12-19 09:04:23 +01:00
orbiter
937273d4e3 added parsing of metadata to surrogate reading:
a dublin core record inside of surrogate input files may now contain
tokens within the namespace 'md' (short for: metadata). The token names
must be valid withing the namespace of the solr field names. All
md-tokens inside of surrogate files then overwrite values within solr
documents before they are written to the solr index. This makes it
possible to assign collection names to each surrogate entry and also
ranking information can be added. Please see the example file.
2013-12-17 14:02:27 +01:00
reger
18497f6475 remove unused init parameter from DefaultServlet
- remove "RelativeResourceBase" parameter
2013-12-15 23:39:19 +01:00
orbiter
4de3fefdb5 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2013-12-15 19:13:00 +01:00
orbiter
7e346e1d79 using stringbuilder in query construction 2013-12-15 19:12:49 +01:00
reger
c84c313fe1 Merge origin/master into jetty 2013-12-14 20:02:24 +01:00
reger
69829fc417 update Netbeans GUI build classpath 2013-12-14 19:50:41 +01:00
Michael Peter Christen
2702d9e56b - added a SolrQueryResponse2SolrDocumentList method which is able to
work around the unfolding process in Solr's BinaryResponseWriter.
This was a huge performance bottleneck in the embedded solr connector
and the problem is actually on Solr side, but we have now a workaround.
- This made it possible to abstract a high-performance index access
method which is implemented as method getDocumentListByParams. That
method is also implemented in the SolrServerConnector and provides a
very efficient access to a solr index if the index is embedded.
- a popular use of the document list retrieval is a result count which
can now also make use of the new method, via getDocumentCountByParams.
- enhanced the Error cache which now does not store error documents
within the ram cache if the document is also written to solr. When
documents are retrieved from the cache, they are partly read from the
ram cache and if not existent there, from the Solr index.
2013-12-13 15:56:29 +01:00
Michael Peter Christen
74466d731a use pre-compiled patterns in ymark 2013-12-12 11:50:48 +01:00
Michael Peter Christen
34633044b4 made pattern computation static 2013-12-12 10:55:36 +01:00
Michael Peter Christen
ef7ddbc933 added date parser caches to prevent re-calculation of costly date
parsing
2013-12-12 10:55:12 +01:00
Michael Peter Christen
552ef9f18e fix for bad ErrorCache.exists test (bug from latest commit) 2013-12-12 10:38:32 +01:00
Michael Peter Christen
09412ea3a4 counting search requests in solr interface 2013-12-12 03:37:19 +01:00
Michael Peter Christen
303f5694ba avoid usage of existsByQuery. If a document can be loaded by the ID
before testing other fields from the existsByQuery request, then a
document cache fills and queries after that one can be avoided.
2013-12-12 03:36:30 +01:00
reger
b43bbd3cc4 join DefaultServlet and Jetty8 implementation
- removing Jetty 8 specific dependencies
2013-12-09 23:45:57 +01:00
reger
8ac48aac27 update Maven pom to latest version number
- include newer dependency versions of several lib/jar for eval.
2013-12-09 23:43:58 +01:00
reger
089c5007ee move conditionalHeader to DefaultServlet
- by removing Jetty specific implementation detail
2013-12-08 00:56:45 +01:00
Michael Peter Christen
67e7dc0cc6 added more properties to seedlist servlet 2013-12-06 14:30:47 +01:00
Michael Peter Christen
79771c60c0 IPv6 fixes 2013-12-06 14:30:08 +01:00
Michael Peter Christen
4e3375d983 next development version 2013-12-06 13:47:50 +01:00
reger
92d9c56f9f Merge origin/master into jetty 2013-12-05 22:53:29 +01:00
Michael Peter Christen
f722e450b3 changed start parameters which caused deadlocks in mac and windows
versions
2013-12-05 00:55:35 +01:00
Michael Peter Christen
ddc7a24853 intermediate release 1.66 2013-12-04 23:16:06 +01:00
Michael Peter Christen
78eac85161 better calibration of caches and queue maximum sizes 2013-12-04 23:15:10 +01:00
Michael Peter Christen
da380343c2 perform greedy learning heuristic only if load < 1.0 2013-12-04 22:44:51 +01:00
Michael Peter Christen
81926c055d fixed bug with image search in yacyinteractive 2013-12-04 18:44:23 +01:00
Michael Peter Christen
edda0699e4 changed default timeout for port scanner 2013-12-04 18:13:43 +01:00
Michael Peter Christen
c8af19bd37 removed unnecessary check which causes a NPE when searching with empty
search string
2013-12-04 17:58:36 +01:00
Michael Peter Christen
e3c2f09de9 - reduce computation in case that specific postprocessing fields are not
selected
- de-select citation rank computation
2013-12-04 17:48:12 +01:00
Michael Peter Christen
cfa08024c7 removed optimization bevore postprocessing because that may cause a
time-out which will cause that postprocessing fails.
2013-12-04 16:04:29 +01:00
Michael Peter Christen
6f3a923691 fixed urlmask which was not able to combine several constraints 2013-12-04 13:48:01 +01:00
Michael Peter Christen
9a27bf6e82 removed filter computation in Protocol class for remote searches because
that is already done in the QueryParams class
2013-12-04 13:09:15 +01:00
Michael Peter Christen
f1b5db2c45 - performance graph does not shop peer ping in memory monitor any more
- after a forced GC, the PerformanceMemory view switches to automatic
update by default
2013-12-04 12:59:30 +01:00
Michael Peter Christen
a125904a1c fixed a NPE in surrogat processing 2013-12-04 01:56:38 +01:00
Michael Peter Christen
0db8e34625 enhanced webgraph processing 2013-12-04 01:54:45 +01:00
Michael Peter Christen
9d8b32c63a fixed a division by zero 2013-12-04 01:54:14 +01:00
reger
ac067b5236 clean-up Jetty handler classes 2013-12-01 19:36:24 +01:00