Michael Peter Christen
de8cfbe1d7
added export option to export the fulltext of the search index text only
2015-07-30 03:21:40 +02:00
Michael Peter Christen
fbeae20b3a
try a healing of the cache if the index file is corrupted
2015-07-27 15:16:08 +02:00
Michael Peter Christen
7e158ae085
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-07-27 15:03:34 +02:00
Michael Peter Christen
03ea723889
added log lines for query performance profiling
2015-07-27 15:03:13 +02:00
reger
7f49dbfbd1
upd to SLF4J-1.7.12
2015-07-27 00:57:19 +02:00
reger
807e3dc78a
upd to httpclient-4.5 and httpmime-4.5
2015-07-26 00:53:40 +02:00
reger
202620b4a2
upd to icu4j-55.1.jar
2015-07-25 00:50:41 +02:00
reger
149e41f25b
upd to jsch-0.1.53.jar
2015-07-21 22:31:34 +02:00
reger
30135d8964
upd to lib/weupnp-0.1.3.jar
2015-07-20 03:45:23 +02:00
Michael Peter Christen
ec75959162
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-07-16 23:42:51 +02:00
Michael Peter Christen
785781253e
added jsonp to suggest servlet
2015-07-16 23:42:41 +02:00
reger
5cf988f224
upd NB classpath
2015-07-15 01:04:59 +02:00
Michael Peter Christen
32a804b10c
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-07-13 12:15:58 +02:00
Michael Peter Christen
0e87a99ab8
more fixes for special windows paths
2015-07-10 17:34:29 +02:00
Michael Peter Christen
e5b6424eed
patch for bad windows file paths
2015-07-10 17:14:14 +02:00
Michael Peter Christen
0aa6fcf259
remove old vocabularies and synonyms before adding new
2015-07-10 16:47:19 +02:00
Michael Peter Christen
e1cd9c0dba
added another default network / commented out
2015-07-09 16:25:11 +02:00
Michael Peter Christen
289018b559
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-07-08 17:37:03 +02:00
Michael Peter Christen
7b412e8c07
added msg (text emails) format; should be handled by html parser.
2015-07-08 17:36:37 +02:00
reger
f91298d3b6
fix one implicit Integer/Long type conversion
...
-> causes Java 1.8 compile error
2015-07-08 03:02:10 +02:00
reger
821262a179
add CommonPattern for multiple spaces
...
to eliminate empty split words on following spaces
2015-07-04 22:49:01 +02:00
Michael Peter Christen
90f75c8c3d
added enrichment of synonyms and vocabularies for imported documents
...
during surrogate reading: those attributes from the dump are removed
during the import process and replaced by new detected attributes
according to the setting of the YaCy peer.
This may cause that all such attributes are removed if the importing
peer has no synonyms and/or no vocabularies defined.
2015-07-02 00:23:50 +02:00
Michael Peter Christen
7829480b82
refactoring: separated condenser and tokenizer
2015-07-01 18:28:18 +02:00
reger
00d2062813
Rem depreciated AdminHandlers in solrconfig.xml
...
avoid warning log
W org.apache.solr.handler.admin.AdminHandlers <requestHandler name="/admin/" class="solr.admin.AdminHandlers" /> is deprecated . It is not required anymore
2015-07-01 00:58:23 +02:00
Michael Peter Christen
f901e7d3cf
fix for non-authorized view of IndexBrowser: show only the number of
...
non-failure documents
2015-06-30 11:12:36 +02:00
Michael Peter Christen
593de05922
enhanced surrogate import process speed (dramatically!)
2015-06-29 12:28:34 +02:00
Michael Peter Christen
3c4c69adea
fix for
...
- bad regex computation for crawl start from file (limitation on domain
did not work)
- servlet error when starting crawl from a large list of urls
2015-06-29 02:02:01 +02:00
Michael Peter Christen
1fec7fb3c1
suppress access to solr when doing search suggestions in case that the
...
index has more than two million documents. This protects the index from
beeing flooded with search requests that cannot be resolved before the
real search query has to be computet.
2015-06-24 13:02:12 +02:00
Michael Peter Christen
886fca2260
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-06-24 01:59:46 +02:00
Michael Peter Christen
694b22f165
migration to Solr 5.2: huge benefits - this is a lot faster!
...
This is a very complex migration: many classes had been renamed or
removed, dependencies changed and the solr index type is now aligned to
be a solr cloud repository.
Together with the Solr 5.2 library update, one other dependent library
had been updated as well: httpclient 4.4->4.4.1
Older indexes are migrated from 4_10 to 5_2. However, the new index
structure is more efficient and we recommend to re-index everything.
Please use the index export before you do the update to a large
surrogate xml file. After the update, start with an empty index and then
initialize this with your dump.
2015-06-24 01:55:51 +02:00
Michael Peter Christen
6c2e6f1f37
remove redundant code
2015-06-23 23:41:43 +02:00
sixcooler
e427efbe54
Next Try for a fix for upload-connection staying in blocked state.
...
This was caused by reading via GZIP from close-wait connection an caused
high cpu- and system-loads.
Instat of implementing handling of the RedListener now I found a
timelimeted 'get' "realy" solving this problem.
2015-06-14 22:56:26 +02:00
reger
0fab445b19
Resourceobserver log warning - deleting releases files - only on actual deletes
...
instead of entering routine
2015-06-10 02:35:37 +02:00
sixcooler
ef6a64b2a4
Fix for upload-connection staying in blocked state.
...
This was caused by reading via GZIP from close-wait connection an caused
high cpu- and system-loads.
Solved by implementing handling of the RedListener.
2015-06-09 21:26:10 +02:00
reger
c973f94936
add log entry on release file delete by ResourceObserver
2015-06-08 03:17:12 +02:00
reger
121972752c
implement deleteOldDownloads in RexourceObserver on low diskspace
...
- direct assign sb.observer (skip redundant InitThread)
2015-06-08 02:52:13 +02:00
Michael Peter Christen
0d5ac6e527
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-06-07 22:25:26 +02:00
Michael Peter Christen
9c12555be5
added link to Snapshots in search results if the snapshot exists and
...
option is set in ConfigSearchPage_p
(this is a stub: we also need a visualization of pdf files!)
2015-06-07 20:37:37 +02:00
sixcooler
480e4a6a5c
Update to Jetty-9.2.11 - a bugfix-release that did not solve my
...
Problems, but does not harm anything
2015-06-07 20:09:27 +02:00
reger
72f6a0b0b2
enhance recrawl job
...
- allow to modify the query to select documents to process (after job has started)
- allow to include failed urls (httpstatus <> 200)
2015-06-06 18:45:39 +02:00
Michael Peter Christen
e0a23c56c7
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-06-05 08:32:55 +02:00
Michael Peter Christen
fb9e1dd3f5
servlet for latest commit
2015-06-05 07:22:35 +02:00
reger
5183ad718d
upd to poi-3.12.jar
2015-06-05 03:36:57 +02:00
reger
7478338a40
remove augmented parsing activation from frontend
...
experimental implementation not used and based on error prone experimental rdfaparser
2015-06-05 00:51:00 +02:00
reger
11aa2edfe1
remove RDFa parser activation from frontend
...
reason: experimental implementatin of RDFa parser not executed (limited to special urls) but may cause error on normal html parsing due to a inputstream.reset
2015-06-05 00:15:16 +02:00
Michael Peter Christen
ff11ac89f7
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
2015-06-04 23:04:04 +02:00
Michael Peter Christen
5e2d23b7a0
removed the new index export method from the IndexControlURLs_p.html
...
servlet and moved it to a new /IndexExport_p.html servlet. This servlet
is now more prominent linked in the main menu under Production -> Index
Export/Import
2015-06-04 23:03:46 +02:00
reger
64a7b0b140
Merge origin/master
2015-06-04 22:44:46 +02:00
reger
49b79987c9
remove obsolete searchfl work table
...
was used to register urls with not complete words in snippet but is never accessed
2015-06-04 22:44:01 +02:00
sixcooler
4533f392b0
correct the dark themes to show also a dark navbar on searchresults
2015-06-04 22:15:38 +02:00