Commit Graph

13133 Commits

Author SHA1 Message Date
reger
a4498e17c0 fix edit current user form to required post mehtod
introduced with cde237b687
2017-04-08 22:54:57 +02:00
reger
46a4aaf09c upd to Solr-5.5.4 2017-04-06 21:18:01 +02:00
reger
b7417ac329 Introduce a Keyword search navigator using the index field keywords.
The keywords field string is split into words as navigator entries.

A keyword navigator facet is essential for search appliance usage were
documents and metadata use often specialized keyword vocabularies to 
filter search results. This navi can be used without custom index schema.

As we don't have defined a search query command to filter "keywords" yet,
the filtering is limited by adding the keyword to the search query.
2017-04-05 00:08:25 +02:00
reger
eddb7a9804 upd to pdfbox-2.0.5.jar and transient dependency xmpcore-5.1.3.jar
required by metadata-extractor-2.10.1 (fix build.xml compiler warning)
2017-04-04 00:59:26 +02:00
reger
27884da1ff add CookieTest_p.html text to master.lng 2017-04-03 22:53:07 +02:00
luccioman
665d087d76 Enforced access controls on a few more administration pages.
- ensure use of HTTP POST method when performing server side effect
operations
 - transaction token required to ensure the request has effectively been
requested by user interaction
2017-04-03 12:20:16 +02:00
luccioman
0feded21dd Escaped HTML eventually active content from recorded API call comments. 2017-04-03 11:40:37 +02:00
luccioman
09e72eb0a4 Set Config Portal as a private administration page.
Consistently with its required action from submission credentials, and
because external unauthenticated users do not need to access these
settings.
2017-04-03 11:34:49 +02:00
reger
c19d60f06b update master.lng with recent text changes
to IndexExport_p.html, IndexImportWarc_p.html
2017-04-02 22:30:23 +02:00
reger
9339a6a4c5 use css error class for error msg in IndexImportOAIPMH_p.html,
adjust to xhtml <p> usage rule
2017-04-02 20:36:22 +02:00
reger
777cb5b812 remove test case for Standard_MemoryControl which will always fail
see https://github.com/yacy/yacy_search_server/pull/114
2017-04-02 03:59:37 +02:00
reger
ba339a2a45 Add servlet to import warc file from filesystem IndexImportWarc_p.html.
Apply Importer interface to WarcImporter
2017-04-02 03:32:21 +02:00
Michael Peter Christen
1d81b8f102 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git 2017-04-01 01:04:27 +02:00
Michael Peter Christen
69081bce00 added export to elasticsearch. The export dump can easily be imported to
elasticsearch using the command
curl -XPOST localhost:9200/collection1/yacy/_bulk --data-binary
@yacy_dump_XXX.flatjson
2017-04-01 01:04:17 +02:00
reger
510f11d374 Implement surrogate import from Warc archives (as first option handle
warc = Web ARChive File Format.
Warc files with extension .warc or compressed warc.gz can be placed in the
DATA/surrogate/in and contained responses are imported to the index.
The used library is stream based so we can easily extend it later to use
and load warc's from the net.
2017-03-31 00:58:11 +02:00
luccioman
5b5b9d5d96 URL Viewer : only display the link to metadata when metadata exists 2017-03-30 16:14:22 +02:00
luccioman
4b649b0a11 Fixed NPE case and API URL link on Solr HTML output for webgraph core. 2017-03-30 15:41:14 +02:00
luccioman
39ffa42a3c Modified RWI settings page radio click event to use HTTP POST 2017-03-30 10:23:47 +02:00
luccioman
af28a07780 Updated API calls recording/replay with recent changes.
- enabled HTTP POST calls with Digest HTTP authentication
 - made API calls compatible with API newly restricted to HTTP POST only
with transaction token validation
 - ensured backward compatibility with older entries recorded as HTTP
GET
2017-03-30 09:22:28 +02:00
reger
1ccc44e681 fix default/httpd.mime Z file extension to lower case
+ test case
2017-03-26 23:52:31 +02:00
reger
44a9a580e3 remove seedlist bootstrap target (not working for some longer time) 2017-03-26 23:26:40 +02:00
reger
c16498305b Add label text for search word statistic (AccessTracker_p.html) to master
lng file
2017-03-26 23:13:12 +02:00
reger
81670c3484 One more use of SwitchboardConstants.SERVER_PORT constant,
apply standard servlet design pattern initialization of solrselectservlet
2017-03-26 20:05:48 +02:00
luccioman
cde237b687 Enforced access controls on some administrative actions.
- ensure use of HTTP POST method : HTTP GET should only be used for
information retrieval and not to perform server side effect operations
(see HTTP standard https://tools.ietf.org/html/rfc7231#section-4.2.1)
 - a transaction token is now required for these administrative form
submissions to ensure the request can not be included in an external
site and performed silently/by mistake by the user browser
2017-03-26 11:48:00 +02:00
luccioman
df5970df6d Extended Apache HTTP Digest Auth. for use of YaCy encoded password
When programmatically requesting the local peer with Apache http client,
authentication credentials must be passed as clear-text values. 
This extension to the apache org.apache.http.impl.auth.DigestScheme
permits use of the YaCy encoded password stored in the
adminAccountBase64MD5 configuration property.
2017-03-26 11:32:44 +02:00
luccioman
40403942db Updated dump/restore shell scripts : the API is now IndexExport_p.html 2017-03-26 10:59:04 +02:00
luccioman
29e5110627 Updated shell scripts to be compatible with HTTP Digest authentication
Because curl and wget do not let use a hashed password as parameter,
YaCy shell scripts which require authentication are now interactive by
default when HTTP Digest is the only available authentication method.
Batch mode can still be available trough the use of an environment
variable : YACY_ADMIN_PASSWORD.  

Other improvements :
 - added backward compatibility for Basic Authentication
 - fixed curl/wget presence detection 
 - do not return with exit code 0 when an API call failed, and print an
error message when the case occurs
 - documented available authentication options for API calls
2017-03-21 17:15:01 +01:00
reger
bdadbda5fa Update master lng file with added text in Settings_ServerAccess
remove outdated file entry in fr.lng & sk.lng
2017-03-21 01:16:16 +01:00
reger
c55d526cb8 Add hint how to build with maven (for the first time) to readme 2017-03-20 02:33:21 +01:00
reger
cbf58d5f0a Add hint text to default ServerAcess Port Settings page 2017-03-19 21:45:33 +01:00
reger
f05976c017 Display the local search word statistic in alphabetic order 2017-03-19 07:12:35 +01:00
reger
3dd23c178b Introduce the option to configure a shutdown port.
A port value of -1 will disable this option.

If set to a value greater 0, YaCy listens on this of on the local loopback 
address (127.0.0.1) for a shutdown or restart signal.
E.g. connect to http://localhost:8005/shutdown will stop the YaCy server.
http://localhost:8005/restart will restart it.
This option allows to stop YaCy locally independant from the web web 
frontend (which might be configured for password protected remote access).
2017-03-19 02:30:08 +01:00
reger
c4d5f1fc54 upd to slf4j-1.7.24.jar 2017-03-18 20:32:53 +01:00
reger
c4b90eae98 upd to icu4j-58_2.jar 2017-03-18 20:06:58 +01:00
reger
a2afb4bae0 add switchboardconstants for server ports config keys 2017-03-18 20:02:26 +01:00
reger
e0c5b28331 update to jsoup-1.10.2.jar 2017-03-17 02:19:33 +01:00
reger
5b5ada38c3 update to jsch-0.1.54.jar 2017-03-17 02:07:02 +01:00
reger
038b9cd98e update translation for ConfigNetwork_p.html 2017-03-15 22:36:53 +01:00
reger
f7fce1baad make digest default authentication in defaults/web.xml 2017-03-15 01:39:15 +01:00
reger
56d0a87a83 remove double occuance of geo:lat in rss tokens 2017-03-13 03:08:44 +01:00
reger
882d99dae4 upd to metadata-extractor-2.10.1.jar 2017-03-13 00:34:40 +01:00
reger
b4fa1141b8 implement RequestHeader getRequestURI, getRequestURL for legacy request 2017-03-12 01:54:56 +01:00
reger
209a7374bd remove unused import pdfParser 2017-03-09 22:57:51 +01:00
reger
de1c1c16db Improve pdf text extraction resource handling.
For sort pdf <= 3 pages use already extracted content,
only for long pdf > 3 pages reassign content and close internal writer (to direct free buffers)
2017-03-09 22:56:33 +01:00
reger
52c9d0c858 upd to pdfbox-2.0.4.jar 2017-03-09 22:50:19 +01:00
reger
9b6d1abd9e eliminate some compiler unchecked and deprecation warnings
in nav plugins by explicite type declaration and replacing date.getYear
with Calendar.get
2017-03-09 01:42:36 +01:00
reger
6eb7d27449 upd to httpclient v4.5.3 2017-03-08 22:35:48 +01:00
luccioman
8e77fe3860 Fixed unresolved pattern case in search results progress bar.
This is a fix for mantis 715 (http://mantis.tokeek.de/view.php?id=715).

A possible path scenario that could leading to this case :
 - YaCy is running low in memory
 - a search is requested
 - before the end of search results rendering, the cleanup job runs and
deletes the running search event from the cache because of short memory
 - then yacysearchitem renders with "-UNRESOLVED_PATTERN-" parameter
values passed to the statistics() JavaScript function
2017-03-08 10:27:18 +01:00
luccioman
79df5bb20a Fixed settingsAck_p.html back link for case where referrer is stripped. 2017-03-07 12:27:27 +01:00
reger
18c7563dbe Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages
by using icu.ULocale for languages not already covered (ICU normalizes 
to ISO639-1 2 char codes).
Add test class
Use DublinCore vocabulary declarations in DCEntry and SurrogateReader 
for easier usage debugging, 
Init SurrogateReader.inputSource on first use.
2017-03-05 02:26:10 +01:00