Commit Graph

13845 Commits

Author SHA1 Message Date
luccioman
60b520fb13 Cleaned up Spanish translation after merge of PR #238
* Fixed some indentation
* Removed untranslated entries
2018-12-20 15:02:07 +01:00
luccioman
cd72515188 Merge pull request #238 from ivanhercaz/esLang
[WIP] Spanish translation
2018-12-20 14:57:14 +01:00
luccioman
440d9f2fa0 Exclude peers with empty or disabled RWI from remote RWI search 2018-12-20 14:53:01 +01:00
luccioman
2f75e2d9c8 Fixed a case of NullPointerException on disconnected RWI data structure 2018-12-17 14:12:21 +01:00
luccioman
55b059bfb5 Fixed wkhtmltopdf integration in Alpine flavor Docker image
wkhtmltopdf (used for snapshot generation) is now built from source,
as its package is only available on the Alpine edge branch and is not
compatible with the current Alpine (3.8) stable base image used for
YaCy.
2018-12-17 10:34:51 +01:00
luccioman
0cf96031ac Fixed Alpine flavor Docker image (wkhtmltopdf moved to community repo)
See also 10a2b3a8cd
2018-12-14 21:07:06 +01:00
luccioman
7728de4ba6 Added support for pdf snapshots rendering to Snap package 2018-12-12 08:43:26 +01:00
luccioman
08ea0b0397 Added a configurable timeout to wkhtmltopdf calls for pdf snapshots
Necessary to prevent blocking the indexing workflow when some
wkhtmltopdf renderings fail without terminating
2018-12-11 22:31:31 +01:00
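The timeout idea in the commit above can be illustrated with a minimal sketch. This is not YaCy's Java code: the function name `run_with_timeout` and its parameters are hypothetical, and Python is used only to show the general technique of bounding an external tool call so that a hanging renderer cannot block the caller.

```python
import subprocess
import sys


def run_with_timeout(command, timeout_seconds):
    """Run an external command; return True if it exited in time, False on timeout.

    Hypothetical helper for illustration only (not part of YaCy).
    """
    try:
        # subprocess.run kills the child process and raises TimeoutExpired
        # when the time limit is exceeded, so the caller is never blocked
        # longer than timeout_seconds (plus cleanup).
        subprocess.run(command, timeout=timeout_seconds, check=False)
        return True
    except subprocess.TimeoutExpired:
        return False


if __name__ == "__main__":
    # Stand-in for a renderer that never terminates: a child that sleeps.
    hanging = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_with_timeout(hanging, 0.5))
```

A configurable limit like this trades completeness (some slow renderings are abandoned) for a workflow that keeps moving.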
luccioman
7c7b38cb5a Updated the down.sh script, fixing the same kind of issue as in PR #260 2018-12-10 18:23:46 +01:00
luccioman
b3d8407462 Updated up.sh to support passing the target peer address as a parameter 2018-12-10 17:53:12 +01:00
luccioman
1b3cb3820e Fixed broken searchall shell scripts 2018-12-10 17:51:15 +01:00
luccioman
8be829ca6b Merge pull request #261 from agnelvishal/patch-3
Update searchall1.sh

This should have been done earlier after removal of up1.sh script in commit 3d9ce9cd04.
2018-12-08 15:49:55 +01:00
luccioman
d38c6565e1 Merge pull request #260 from agnelvishal/patch-2
Update up.sh

This looks like a missed follow-up to earlier commit 65e6ae52fb.
2018-12-08 15:33:56 +01:00
Agnel Vishal
c8ad904123 Grammar change (#258)
Grammar change to Readme APIs section
2018-12-07 21:15:05 +01:00
luccioman
3fb449b3b6 Properly resolve relative URLs against document URL in html base tags
Fixes issue #256
2018-12-06 20:18:00 +01:00
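The resolution order described in the commit above can be sketched as follows. This is an illustration in Python rather than YaCy's Java parser, and the function name `resolve_link` is hypothetical: a relative `href` in a `base` tag is first resolved against the document URL, and links are then resolved against the result.

```python
from urllib.parse import urljoin


def resolve_link(document_url, base_href, link_href):
    """Resolve link_href for a page whose <base> tag carries base_href.

    Hypothetical helper for illustration only (not part of YaCy).
    """
    # A relative base href is itself resolved against the document URL.
    # An empty or missing base href leaves the document URL as the base.
    effective_base = urljoin(document_url, base_href or "")
    return urljoin(effective_base, link_href)


if __name__ == "__main__":
    print(resolve_link("http://example.org/dir/page.html", "../assets/", "img.png"))
```

The example URLs are placeholders chosen for the sketch, not taken from issue #256.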
Agnel Vishal
82cd681b2d Update searchall1.sh
The up1.sh does not exist. up.sh does the work
2018-12-06 16:09:10 +05:30
Agnel Vishal
15f76367f4 Update up.sh
IP addresses are not selected if "&ip=" is present
2018-12-06 16:07:38 +05:30
luccioman
73a6e45524 Extended detection of external tools used for Snapshots generation
This enables detecting the wkhtmltopdf and ImageMagick convert executables
when they are on the system PATH, in addition to common installation paths.
2018-12-06 09:53:08 +01:00
luccioman
dbacff2044 Added a command to set the admin password from the Snap package
Fixes issue #254
2018-12-01 11:04:09 +01:00
luccioman
3b383bff63 Removed shell temporary debug echo message 2018-12-01 10:55:15 +01:00
luccioman
3061b58c8c Allow setting the admin password with the script when the YaCy server is not running
Still using the ConfigAccounts_p API when the server is running.
2018-11-28 21:44:49 +01:00
luccioman
ede5cc0a2b Improvements to passwd.sh script
- Use the configured administrator user name instead of always
  defaulting to "admin"
- Do not echo the password in clear text
- Check the password minimum size as will be applied in
  ConfigAccounts_p
- Let user type a password when not provided as a parameter
2018-11-27 11:41:00 +01:00
luccioman
106bfd0e5f Also check yacy.conf file existence in shell scripts where relevant 2018-11-27 11:36:40 +01:00
luccioman
17ad1f7e65 Added support for custom DATA path in shell scripts
When the YACY_DATA_PATH environment variable is set, shell scripts will
now use the given path instead of relative ../DATA which remains the
default when the variable is not set.

Necessary in the context of the Snap package (see issue #254), as YaCy is
started with startYACY.sh and an absolute DATA parent path as a parameter.
2018-11-26 18:26:52 +01:00
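The fallback rule from the commit above (use YACY_DATA_PATH when set, otherwise the relative ../DATA) can be sketched as below. The real change is in shell scripts; this Python version only illustrates the logic, and `resolve_data_path` is a hypothetical name. Treating an empty value like an unset variable is an assumption of this sketch, not something the commit message states.

```python
import os

# Default named in the commit message.
DEFAULT_DATA_PATH = "../DATA"


def resolve_data_path(environ=None):
    """Return the DATA path to use, preferring the YACY_DATA_PATH variable.

    Hypothetical helper for illustration only (not part of YaCy).
    """
    env = os.environ if environ is None else environ
    # Assumption: an empty value is treated the same as an unset variable.
    return env.get("YACY_DATA_PATH") or DEFAULT_DATA_PATH


if __name__ == "__main__":
    print(resolve_data_path())
```

Passing the environment as a parameter keeps the function easy to check without modifying the real process environment.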
luccioman
07730fe040 Updated the JRE URL from 8u181 to 8u191 for the MS Windows installer 2018-11-21 17:59:43 +01:00
luccioman
4196101379 Enable soft autocommit in default Solr config
Since the upgrade from Solr 5.5 to Solr 6.6 (commit 6fe7359), hard
autocommits were still enabled to regularly persist the Solr index to
the file system, but new index entries were no longer automatically made
available for use by the application (soft autocommit).
Therefore, YaCy features such as index statistics, which do not perform
an explicit commit (as recommended by Solr documentation), were no longer
accurate.
Soft autocommit is now restored as a default, with a time period
expected to be sufficient for accuracy while adding only a reasonable
system load overhead.

Fixes issue #251
2018-11-19 08:49:13 +01:00
luccioman
8f1722b755 Updated mirror repository source url used in alpine flavor Docker image
Switching to a mirror providing a valid certificate
2018-11-19 08:40:17 +01:00
luccioman
7dc1f60619 Fixed detection of absolute data folder path on MS Windows 2018-11-18 10:08:20 +01:00
luccioman
595e144797 Trace a message on incomplete proper server finish when killing process 2018-11-15 17:32:22 +01:00
luccioman
43d86be37b Added a link to YaCy Gitter chat room in Readme
As suggested by PR #159 and reminded by @ivanhercaz in issue #247
2018-11-13 17:50:29 +01:00
luccioman
4129d712a7 Added details to the keystore configuration properties documentation 2018-11-13 07:50:27 +01:00
luccioman
9daeea823b Fixed concurrency issue on cache used for circles rendering
Without synchronization lock, concurrent rendering of images including
circles could lead to glitches as reported in issue #248
2018-11-10 22:00:49 +01:00
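The kind of fix described in the commit above, guarding a shared cache with a synchronization lock, can be sketched as follows. This is a Python illustration, not YaCy's Java rendering code: the class `CircleCache`, its fields, and the way circle points are computed are all hypothetical.

```python
import math
import threading


class CircleCache:
    """Cache of precomputed circle points, safe for concurrent use.

    Hypothetical class for illustration only (not part of YaCy).
    """

    def __init__(self):
        self._lock = threading.Lock()
        self._points_by_radius = {}
        self.computations = 0  # counts cache misses, for demonstration

    def get(self, radius):
        # The lock makes "look up, compute if missing, store" one atomic
        # step, so no thread can observe or overwrite a half-built entry.
        with self._lock:
            points = self._points_by_radius.get(radius)
            if points is None:
                points = self._compute(radius)
                self._points_by_radius[radius] = points
            return points

    def _compute(self, radius):
        self.computations += 1
        # Upper half of the circle, one point per integer x coordinate.
        return tuple(
            (x, round(math.sqrt(radius * radius - x * x)))
            for x in range(-radius, radius + 1)
        )
```

Holding the lock during computation serializes cache misses, which is a simple choice that favors correctness over maximum parallelism.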
Michael Peter Christen
c347e7d3f8 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git 2018-11-08 14:42:52 +01:00
Michael Peter Christen
848e9304d9 evil bots may crawl harder 2018-11-08 14:42:40 +01:00
luccioman
8e72863a7f Merge pull request #250 from theel0ja/patch-1
Improved formatting of markdown
2018-11-08 08:39:32 +01:00
luccioman
a997133260 Fixed gzip decompression regression on index transfer APIs
Processing of gzip encoded incoming requests (on /yacy/transferRWI.html
and /yacy/transferURL.html) had stopped working since the upgrade to Jetty
9.4.12 (see commit 51f4be1).

To prevent any conflicting behavior with Jetty internals, the
GzipHandler provided by Jetty is now used to decompress incoming gzip encoded
requests rather than the previously used custom GZIPRequestWrapper.

Fixes issue #249
2018-11-07 14:52:42 +01:00
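What decompressing a gzip encoded incoming request amounts to can be sketched as below. The commit itself relies on Jetty's GzipHandler in Java; this Python version only illustrates the general step, and the function name `decode_request_body` and the dictionary representation of headers are assumptions of the sketch.

```python
import gzip


def decode_request_body(headers, body):
    """Return the request body, decompressed if Content-Encoding is gzip.

    Hypothetical helper for illustration only (not part of YaCy or Jetty).
    """
    encoding = ""
    # HTTP header names are case-insensitive, so compare in lower case.
    for name, value in headers.items():
        if name.lower() == "content-encoding":
            encoding = value.strip().lower()
    if encoding == "gzip":
        return gzip.decompress(body)
    return body


if __name__ == "__main__":
    payload = b"example request body"
    compressed = gzip.compress(payload)
    print(decode_request_body({"Content-Encoding": "gzip"}, compressed) == payload)
```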
luccioman
e85f231bdf Fixed termination of Host browser and link structure Solr query threads
Under some conditions (especially when reaching timeout), concurrent Solr
query tasks used by the /HostBrowser.html and /api/linkstructure.json
never terminated, thus leaking resources, as reported by @Vort in issue
#246
2018-11-06 10:10:09 +01:00
Elias Ojala
4a126881bb Improved formatting of markdown 2018-11-05 09:27:17 +02:00
luccioman
260ac11c65 Limit length of initially visible text in link structure graph nodes
To slightly improve the readability of graphs with a large number of nodes.
2018-10-31 07:43:42 +01:00
luccioman
5a8d9abd8a Upgraded d3js dependency from 3.4.4 to 5.7.0 2018-10-28 10:07:46 +01:00
luccioman
9f8e1994a4 Added missing CSS width units to some HostBrowser.html styling 2018-10-26 09:11:23 +02:00
luccioman
0b1d2cb0dd Fixed "TypeError: table.tBodies[0] is undefined" host browser JS error
Traced in browser console when a host details table is empty.
2018-10-26 08:55:25 +02:00
luccioman
fcf6b16db4 Added new crawler attribute for finer control over Media Type detection
The new "Media Type detection" section in the advanced crawl start page
allows choosing between:
- not loading URLs with an unknown or unsupported file extension without
checking the actual Media Type (relying on the Content-Type header for now).
This was the old default behavior: faster, but not really accurate.
- always cross-checking the URL file extension against the actual Media Type.
This allows proper parsing of URLs ending with an apparently odd file
extension, but which actually have a supported Media Type such as
text/html.

Sample URLs with misleading file extensions added as documentation in
the crawl start page.

Fixes issue #244
2018-10-25 10:42:12 +02:00
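The two detection modes described in the commit above can be sketched as a single decision function. This is a simplified Python illustration, not YaCy's crawler code: the name `should_load`, the `cross_check` flag, and both sets of supported values are hypothetical. Accepting URLs with a supported or absent extension without a further check is also an assumption of this sketch.

```python
import posixpath
from urllib.parse import urlparse

# Hypothetical sample values, not YaCy's actual parser registry.
SUPPORTED_EXTENSIONS = {"html", "htm", "txt", "pdf"}
SUPPORTED_MEDIA_TYPES = {"text/html", "text/plain", "application/pdf"}


def should_load(url, content_type, cross_check):
    """Decide whether a URL should be loaded and parsed.

    Hypothetical helper for illustration only (not part of YaCy).
    """
    extension = posixpath.splitext(urlparse(url).path)[1].lstrip(".").lower()
    # Assumption: no extension or a supported one is accepted directly.
    if not extension or extension in SUPPORTED_EXTENSIONS:
        return True
    if not cross_check:
        # Old default: reject on the extension alone (fast, less accurate).
        return False
    # Cross-check mode: trust the Content-Type header over the extension.
    media_type = (content_type or "").split(";", 1)[0].strip().lower()
    return media_type in SUPPORTED_MEDIA_TYPES


if __name__ == "__main__":
    print(should_load("http://example.org/report.xyz", "text/html; charset=UTF-8", True))
```

The cross-check mode costs an extra look at the response headers, which is the accuracy versus speed trade-off the commit message describes.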
luccioman
88d0ed676c Render http status instead of null responses on snapshot api errors 2018-10-19 10:12:10 +02:00
luccioman
a83a56473e Added support for PDF snapshots generation when running on MS Windows 2018-10-18 12:41:57 +02:00
luccioman
18d07538ad Upgraded Apache Ant from 1.10.1 to 1.10.5 in Docker alpine image flavor 2018-10-17 10:04:48 +02:00
luccioman
053df1f312 Added support for snapshots generation to Docker images 2018-10-17 08:42:31 +02:00
luccioman
92e10d7d1c Added a crawl start hint message on whether wkhtmltopdf is available
As this tool is required to produce pdf snapshots
2018-10-16 08:02:43 +02:00
luccioman
8852c97cee Added basic styling for cleaner rendering of missing image snapshots
For the output of the Solr snapshots writer
2018-10-15 18:19:57 +02:00
luccioman
746e0e788d Render a relevant HTTP status code on snapshot image rendering error
Instead of a null response body which is not very helpful.
2018-10-14 10:30:30 +02:00