Commit Graph

13599 Commits

Author SHA1 Message Date
luccioman
c867a52d96 Upgraded Solr dependencies from 6.6.2 to 6.6.3 2018-04-05 18:15:45 +02:00
luccioman
69690c13a0 Optionally allow external Solr server with self-signed certificate
This is necessary when you want to attach to a dedicated external Solr
server protected with basic http authentication and requested over https
but having only a self-signed certificate.
2018-04-04 18:16:26 +02:00
Marc Nause
1e4ceaac3f Removed seed URLs pointing to server low.audioattack.de since it will not be updated anymore. 2018-04-03 23:19:05 +02:00
luccioman
b882f85900 Fixed NPE case in Solr select servlet on external Solr only setup
Regression introduced with commit
0d7625ecfb
2018-04-03 15:36:17 +02:00
luccioman
6784c9be68 Updated external Solr setup basic instructions 2018-04-03 15:34:44 +02:00
luccioman
211f3d04ab Added hint message inciting to check accounts settings on fresh install
When unrestricted access from localhost is set and the accounts config
page has not been visited at all.
2018-04-02 19:48:11 +02:00
luccioman
2fd4d05e2f Added a shared Java constant for setting key server.servlets.called 2018-04-02 15:16:10 +02:00
luccioman
033f7c4c00 Adjusted localhost/qualified account admin access informational texts.
Following remarks from @etam on issue #170
2018-04-02 15:04:56 +02:00
luccioman
05702c2ced Adjusted api table query matching strategies
When inlined (for example in the CrawlProfileEditor_p.html page) :
search only on the comment, as the url is not visible

On regular display : search on comment OR url, instead of comment AND
url. Otherwise searching on comments terms is almost useless as these
terms are not necessarily present in the url.
2018-03-30 11:12:48 +02:00
luccioman
65451a3d62 Fixed start record on the last api table results page
When the last results page size was lower than maximumRecords, results
from the previous page where displayed again.
2018-03-30 10:53:06 +02:00
luccioman
86c902b853 Enable api table page navigation with search query
Applied the same default results page size as when a type filter is
defined for proper and consistend page navigation when combining type
filter and search query.
2018-03-30 10:21:42 +02:00
luccioman
9c7faa04d8 Display the total number of matching items when filtering on table API
Notably for a proper page navigation of the crawl scheduler table
(CrawlProfileEditor_p.html page).
2018-03-29 14:24:25 +02:00
luccioman
311e91ff77 Added hint to clarify results rendered dates and 'Sort by date' switch 2018-03-27 18:05:20 +02:00
luccioman
90dc580158 Fixed initial ViewFile mode and suggestions links from previous commit 2018-03-27 08:25:40 +02:00
luccioman
0b6aed4de6 Keep the selected view mode when typing a new URL in the ViewFile page
Otherwise, when interested in viewing `Link List` for example, each time
you typed a new URL, `Parsed Sentences` view mode was selected as
default and you had to selected again the view mode you are insterested
in.
2018-03-27 07:42:26 +02:00
luccioman
db55eaa673 Updated link to Solr Function Queries documentation page 2018-03-26 16:26:46 +02:00
luccioman
ba9cd14516 Removed hard-coded patch for Solr 5.0 on ranking boost function
The current default boost function
(`recip(ms(NOW,last_modified),3.16e-11,1,1)`) for the Date ranking
profile is indeed working fine.
What can trigger the error `unexpected docvalues type NUMERIC for field
'last_modified'` is the previous default boost function (quite old now)
or any custom one using the Solr `ord` or `rord` functions on the
last_modified field.
Then the problem was that the migration code in the Switchboard supposed
to detect the old date boost function was incorrect (one trailing right
parenthesis in excess), so the deprecated function remained.

This fixes issue #169.
2018-03-26 16:24:27 +02:00
luccioman
7496df93c3 Fixed error 414 (URI Too Long) when manually selecting to many RSS items
Switched form method to HTTP POST to prevent this.
2018-03-23 10:49:39 +01:00
luccioman
fb3032c530 Added a crawl filtering possibility on documents Media Type (MIME) 2018-03-23 10:28:19 +01:00
luccioman
90d4802082 Updated link URL to IANA Media Types with https 2018-03-23 10:23:54 +01:00
luccioman
e45afedee4 Added support for enclosures (media links) to the RSS loader 2018-03-21 08:22:29 +01:00
luccioman
29166e7837 Updated Docker readme base image links from java (deprecated) to openjdk 2018-03-21 07:37:53 +01:00
luccioman
aaefd5219c Reduce log verbosity of RSS loader on feed items with no link 2018-03-20 10:09:17 +01:00
luccioman
cf62b571bd Added RSS reader support for enclosure feed item sub element.
Enclosure element (see
http://www.rssboard.org/rss-specification#ltenclosuregtSubelementOfLtitemgt
) can be seen for example in podcasts feeds.
2018-03-20 07:38:29 +01:00
luccioman
e5f5de0fc7 Added some JavaDoc to the RSSMessage class. 2018-03-19 11:15:31 +01:00
reger
a57a04a003 upd to commons-codec-1.11 2018-03-19 02:02:35 +01:00
luccioman
0d7625ecfb Handle Solr fields restrict and alias in YaCy html and exml writers
Thus allowing for example to read more easily the local Solr index full
metadata in HTML by restricting if desired to some fields of interest.

See Solr documentation about the 'fl' (Field List) parameter at
https://lucene.apache.org/solr/guide/6_6/common-query-parameters.html#CommonQueryParameters-Thefl_FieldList_Parameter
2018-03-16 11:35:42 +01:00
luccioman
3da2739bbd Parse and index more common audio metadata text tag fields. 2018-03-15 09:59:57 +01:00
luccioman
846aba00fa Added parsing of URLs eventually present in audio metadata tags 2018-03-13 23:08:52 +01:00
Michael Peter Christen
187075b878 added nav filter 2018-03-10 15:46:53 +01:00
luccioman
bcbd0ae1a4 Enabled partial parsing of audio resources. 2018-03-01 20:50:44 +01:00
luccioman
fda0189613 Updated audio file extensions with ones recently added to audioTagParser 2018-02-28 13:46:40 +01:00
luccioman
978e2be95b Let a chance for other parsers on audioTagParser error
As done in all other parsers, eventually falling back in the end to the
genericParser which creates a minimal index entry.
2018-02-28 12:27:17 +01:00
luccioman
9e5846a26e Small fix on svg parser error message 2018-02-28 12:23:52 +01:00
luccioman
11611dbdcf Reuse existing File copy function to handle audio parser tmp files 2018-02-28 11:58:32 +01:00
luccioman
f77f8f40f9 Factored audio parser tag processing 2018-02-28 08:19:13 +01:00
luccioman
9a7a353d0e Removed some unnecessary intermediate list creation on array copy. 2018-02-28 07:49:40 +01:00
luccioman
fb6457f5bc Fixed NPE case when on audio resource parsed with null tag 2018-02-28 07:31:32 +01:00
luccioman
c3ff50c17a Updated the list of audio file formats supported by the audioTagParser
Follows upgrade to Jaudiotagger dependency to version 2.2.5.
2018-02-27 18:04:12 +01:00
luccioman
5753ce0ac5 Upgraded Jaudiotagger dependency from 2.0.3 to 2.2.5 2018-02-26 09:17:26 +01:00
reger
aaa0ec6613 upd to commons-compress-1.16.1 2018-02-23 19:17:09 +01:00
luccioman
07e8628853 Added HTML5 embedded audio for results playing on supporting browsers
Restricted to authenticated or localhost users only to prevent
redistribution license issues.
2018-02-23 11:41:50 +01:00
luccioman
1b90479a76 Added missing vocabulary navigator increment on results from RWI 2018-02-23 11:36:03 +01:00
luccioman
46c9da6428 Allow creation of vocabularies from remote CSV file URLs. 2018-02-21 08:41:13 +01:00
luccioman
17c7a85f18 Make StreamResponse usable in Java try-with-resources statements 2018-02-21 08:38:35 +01:00
luccioman
348d07a999 Enforced controls on vocabulary editing operations. 2018-02-20 12:22:54 +01:00
luccioman
2532db2ce6 Vocabulary editor : use accessible labels and CSS for elements position 2018-02-20 11:22:34 +01:00
luccioman
ac14437316 Vocabulary_p.html : richer semantics for HTML tables
Also replaced deprecated attributes
2018-02-19 15:15:02 +01:00
luccioman
b67742336e Provide user interface messages on vocabulary creation read/write errors 2018-02-19 11:48:40 +01:00
luccioman
ea57763294 Mark vocabulary name field as required using html instead of JavaScript 2018-02-19 09:35:44 +01:00