Commit Graph

716 Commits

Author SHA1 Message Date
Michael Peter Christen
910a496c9f replaced http links with https
Some checks failed
CI Script to build on self-hosted server / build (push) Has been cancelled
2024-07-21 18:02:58 +02:00
sgaebel
66cf7d4ca5 disables autowarm of filtercache, corrects luceneMatchVersion 2024-06-21 19:10:12 +02:00
Michael Peter Christen
8eb0d490aa migrated solr to 9.0
This is a major step because solr removed support for embedded solr
instances in 9.0 and we want to keep it because we want to ship
YaCy with an embedded solr. It was necessary to add parts of solr
code into YaCy to make this migration possible. Further on with
Solr 9.1 they removed even more parts which are required for embedded
operation, therefore we cannot migrate yet further without big
changes.
If you are running a YaCy instance with Solr 8.x, the migration should
be done automatically. If not you require to first migrate to a YaCy
version 1.93 with Solr 8.x to migrate to Solr 8 data.
2024-05-19 17:34:57 +02:00
Michael Peter Christen
13fbff0bff Added a RAG Proxy for AI Chat with YaCy
RAG (Retrieval Augmented Generation) is a method to combine a search
engine with a LLM (Large Language Model). When a new prompt is
submitted, a search engine injects knowledge from a search into the
content. This is done using a reverse proxy between the Chat Client and
the LLM. In this case, we used the following software:

LLM Backend - Ollama:
https://github.com/ollama/ollama
Install ollama and then load two required LLM models
with the following commands:
ollama pull phi3:3.8b
ollama pull llama3:8b

Chat Client - susi_chat:
https://github.com/susiai/susi_chat
just clone the repository and the open the file
susi_chat/chat_terminal/index.html
in your browser. This displays a chat terminal.
In this terminal, run the following command:
host http://localhost:8090
This sets the LLM backend to your YaCy peer.

Then start YaCy. It will provide the LLM endpoint to the client
while using ollama in the backend. It then injects search results
only from the local Solr index, not from the p2p network (so far).
2024-05-19 17:19:09 +02:00
pr0vieh
35620762ac bring defaults for recrawlindex to init config 2023-12-09 01:32:31 +01:00
Michael Christen
d097a642c2
Merge pull request #615 from okybaca/logging2
Logging unclutter
2023-12-03 16:40:21 +01:00
pr0vieh
dfb2b79609 Add setting for DHT receive loadprereq insted of hardcoded load < 2.0 2023-12-03 01:27:36 +01:00
okybaca
1c5fca9a58 changed network operation log category from YACY to NETWORK 2023-11-26 12:24:09 +01:00
okybaca
2f44fc0257 added some logging prefixes to yacy.logging 2023-11-25 18:39:08 +01:00
Michael Peter Christen
dc6f218520 set the default password for the admin account to "yacy" 2023-10-21 12:09:19 +02:00
Michael Peter Christen
88cd17ea57 migrated solr from 8.9.0 to 8.11.2; activated also migration script. A YaCy index with solr 8.9.0 will automatically be migrated to 8.11.2. This is a preparation step to migrate to 9.0.0 soon. 2023-09-01 18:24:52 +02:00
Michael Peter Christen
5afcba162b updated libraries 2023-07-27 22:55:46 +02:00
Michael Peter Christen
1c0f50985c fixed documentation and some details of handling of keywords 2023-04-04 12:41:12 +02:00
Michael Peter Christen
b1199e97f8 enabling new update location release.yacy.net
with new version numbers
2022-12-05 14:26:17 +01:00
Michael Peter Christen
5d9587fc1d removed deprecated networks 2022-12-05 12:43:47 +01:00
Michael Peter Christen
d49f937b98 added iso,apk,dmg to extension-deny list
see also https://github.com/yacy/yacy_search_server/issues/510
zip is not on the list because it can be parsed
2022-10-05 16:28:50 +02:00
Michael Peter Christen
761dbdf06d increases log history length to 10000
implements https://github.com/yacy/yacy_search_server/issues/512
2022-10-05 16:09:28 +02:00
Michael Peter Christen
60c9986a0e new release file names with date and git hash
...without reference to 9000ish SVN
2022-10-04 15:31:47 +02:00
Michael Peter Christen
ede9efe50c write git revision number to yacyBuild.properties 2022-10-04 01:50:56 +02:00
Michael Christen
347b676b76 changed system to load build properties 2022-10-03 10:12:47 +02:00
Michael Peter Christen
9228fcd481 updated seedlist 2022-09-28 23:52:58 +02:00
Michael Peter Christen
fc98ca7a9c removed ContentControl servlet and functinality
This was not used at all (as I know) and was blocking a smooth
integration of ivy in the context of an existing JSON parser.
2022-09-28 17:25:04 +02:00
Burkhard
a6a9828181
Merge pull request #440 from lfuelling/master
Add setting for public facing port
2022-02-11 08:09:17 +01:00
reger24
417899dda2 Correction of last commit 9dace71aea
accidently commented out kaskelix
2022-02-10 05:55:47 +01:00
reger24
9dace71aea Update yacy.net release download location
to //download.yacy.net/ in network.*.unit files

@Orbiter   for the latest avail. releases (v1.924 ...tar.gz)  the *.tar.gz.sig  file is missing,
so download fails with error "Download of releas .... failed"
2022-02-10 05:51:11 +01:00
Michael Peter Christen
96e44e11bb added more bootstrap addresses 2022-01-28 13:26:51 +01:00
Lukas Fülling
e8a00007f6 add setting for public facing port 2022-01-11 17:10:48 +01:00
lifeofguenter
870319e769
Fix typo + remove dead seeds 2021-12-27 14:12:17 +01:00
sgaebel
1cdc55a425 lets SOLR merge bigger segments (up to 50GB)
+ some setting to reduce caches
2021-10-31 11:33:42 +01:00
Michael Peter Christen
49cae8ca62 network bootstraping addresses update 2021-10-25 18:32:57 +02:00
Michael Peter Christen
be0aebad84 fixes https://github.com/yacy/yacy_search_server/issues/424 2021-10-04 14:38:49 +02:00
Michael Peter Christen
8084960392 disabled citation index
that was created but never used
2021-09-15 18:46:37 +02:00
admin
9b7668fa58 reduced memory footprint during indexing/crawling 2021-08-24 12:24:52 +02:00
Michael Peter Christen
e6a87e0426 enhanced crawler
a main problem when crawling is long waiting time cuased by crawl-delay
values from robots.txt entries. that attribute is not supported by
google and interpreted by yandex and bing in different ways. In large
crawls there is always one host which blocks the whole crawl with
extreme large values. YaCy now still obeys crawl-delay but limits them
to 10 seconds.
Additionally the blocking logic when loading new robots.txt was analyzed
and a deadlock was removed. Furthermore the construction of new queue
lists was redesigned and it was ensured that always a large list of
different hosts for host-balancing is provided for the loader.
2021-08-17 15:23:21 +02:00
Michael Peter Christen
15b7461bc7 removed Xms java memory startup parameter
We will use the default value for now on.
This is much better for resource economy and fits better into a
container/docker/kubernetes strategy.
Furthermore, a small memory footprint is essential for the usage on
small devices like RaspberryPi.
2021-07-19 20:04:11 +02:00
jfhs
2135d259e3 Replace hardcoded html/xml entities with a file, support decoding all defined HTML entities 2021-03-30 22:24:54 +02:00
Michael Peter Christen
8b4394a6c5 fixes for solr 8.8.1 migration
- replace new guava 30 with older 25 because that is the correct
dependency for solr 8.8.1. The newer one did actually not work!
- index will be crated in a DATA/INDEX/freeworld/SEGMENTS/solr_8_8_1
subfolder. The older solr_6_6 index is not touched but also not
migrated. The index starts with fresh (empty) content.
- Older indexes must be migrated by hand (export/import) so far until a
better solution is found.
- Large schema adoptions for lucene 8.8.1
2021-03-08 13:39:27 +01:00
Michael Peter Christen
96592a10cf added option to set yacy configuration values using environment
variables
To use that feature, set an environment variable with prefix "yacy." and
suffix identical to the yacy configuration attribute name.
Additionaly we implemented a way to set a peer name using the setting
"network.unit.agent". This can therefore now be used to set a peer name
with the java call parameter
-Dyacy.network.unit.agent=anonymous
The purpose for this feature is the ability to set peer names in
mass-deployed kubernetes clusters to the same name to prevent that we
are flooding peer name statistics with auto-deployment-generated names.
2021-01-24 22:50:37 +01:00
Michael Peter Christen
ca10f0afca fixed optional default PW 2020-12-29 20:19:07 +01:00
Michael Peter Christen
baad56d83d beautified default peer names 2020-12-14 02:08:49 +01:00
Michael Peter Christen
43a9f4f574 updated solr 6.6.6 -> 7.7.3
dropped GSA support (GSA API is still in YaCy Grid)
The 6.6.6 solr index works without migration also with 7.7.3
2020-12-12 02:06:43 +01:00
Michael Peter Christen
c0d9a3e9a7 turned HostBrowser into a admin-only page, now called IndexBrowser
This was required because spiders and bots crawled through this page and
created load on the peer without use for the user or the YaCy network.
2020-12-11 00:50:52 +01:00
Michael Peter Christen
39f87f7f28 added a hint to the default settings how to set a default password 2020-12-09 02:42:05 +01:00
parnikkapore
a251727b96
Typo fix 2020-01-20 20:11:03 +07:00
Michael Christen
cb20aa7e54 removed donation message in search result column 2019-10-17 01:35:44 +02:00
Michael Christen
ab467b1764 fixed css profile name 2019-10-14 01:53:09 +02:00
Michael Peter Christen
dddf5930fa more space for sponsoring 2019-09-29 00:26:48 +02:00
Michael Peter Christen
897582d23b updated seedlist bootstrap locations 2019-09-25 22:51:25 +02:00
luccioman
5a3d5cb92c Upgraded Solr config files with the ones provided by Solr release
Fixes #292
2019-04-16 10:25:48 +02:00
luccioman
a5771b1f14 Made SNI extension user configurable without the need for server restart
TLS Server Name Indication (SNI) extension activation can now be
configured with the new Settings_p.html?page=httpClient administration
page.
SNI extension is also now enabled by default, as in 2019 the
unrecognized_name(112) alert is more properly handled by major web
servers TLS implementations, following the RFC 6066 standard.

Related YaCy issues : #153 #189 and #272
JDK 1.7 bug :
https://bugs.java.com/bugdatabase/view_bug.do?bug_id=7127374
Apache httpd issue :
https://bz.apache.org/bugzilla/show_bug.cgi?id=56241
RFC 6066 : https://tools.ietf.org/html/rfc6066#section-3
2019-04-14 15:41:13 +02:00