Commit Graph

14432 Commits

Author SHA1 Message Date
Michael Peter Christen
c2b6b6e7b9 Fixed a large number of problems in the ZIM reader.
This library was not prepared for large data because it lacked long
data types for pointers. I had to modify the code base in a fundamental
way:
- proofreading,
- unclustering,
- refactoring,
- adapting names to https://wiki.openzim.org/wiki/ZIM_file_format,
- changing the exception handling,
- extending to more attributes as defined in the spec (bugfix for MIME type
loading),
- bugfix to long parsing (prevented reading of large files).
The code is still very inefficient and requires more attention.
However, the format is very useful for YaCy as there are numerous data
sources for ZIM files.
2023-10-27 15:49:23 +02:00
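
The "long parsing" item in the commit above concerns 64-bit pointers. The following is a minimal sketch, not the code from that commit, of how a little-endian 64-bit value (the byte order the ZIM specification uses) can be decoded in Java. The class and method names are hypothetical. One common source of such bugs, assumed here purely for illustration, is shifting an `int` instead of a `long`, which corrupts any offset beyond 32 bits.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical helper, not part of YaCy or zimreader-java.
public class LittleEndianLong {

    // Decodes 8 bytes starting at offset as a little-endian 64-bit value.
    // The cast to long before shifting matters: an int shift wraps its
    // shift distance at 32 bits and would corrupt large offsets.
    public static long readLong(final byte[] buf, final int offset) {
        long value = 0L;
        for (int i = 0; i < 8; i++) {
            value |= ((long) (buf[offset + i] & 0xFF)) << (8 * i);
        }
        return value;
    }

    // Same decoding using the JDK's ByteBuffer.
    public static long readLongNio(final byte[] buf, final int offset) {
        return ByteBuffer.wrap(buf, offset, 8)
                .order(ByteOrder.LITTLE_ENDIAN)
                .getLong();
    }

    public static void main(final String[] args) {
        // 5 GiB = 0x0000000140000000, stored least significant byte first
        final byte[] raw = {0x00, 0x00, 0x00, 0x40, 0x01, 0x00, 0x00, 0x00};
        System.out.println(readLong(raw, 0));
        System.out.println(readLongNio(raw, 0));
    }
}
```

Both methods return the same value for the same input; the manual loop only makes the widening step explicit.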
Michael Peter Christen
5ba5fb5d23 upgraded pdfbox to 3.0.0 2023-10-27 12:05:24 +02:00
Michael Peter Christen
c10944bd4a updated bcmail-jdk15on 1.75 to bcmail-jdk18on 1.67 2023-10-27 11:08:19 +02:00
Michael Peter Christen
1fefae9baf integrated the source code of an openzim file format reader. These are
the raw format reader files, not yet integrated into YaCy; integration
may follow as a next step. The zim file format is documented at
https://openzim.org and the reader code was taken from the archived,
unmaintained repository at https://github.com/openzim/zimreader-java
2023-10-27 10:59:06 +02:00
Michael Peter Christen
4308aa5415 removed the concept of empty passwords meaning "no password used",
because we now start YaCy with a default password (yacy).
This affects all functions that check the current state of
password protection and handled the empty-password situation,
including the warnings to set a password in case none is set (which
can no longer be the case).
2023-10-25 22:56:06 +02:00
Michael Peter Christen
2c60ff14bb fixed default pw comparison 2023-10-25 13:59:02 +02:00
Michael Peter Christen
4da320bebf added a warning message in ConfigBasic in case that the default password
was not changed.
2023-10-24 23:36:26 +02:00
Michael Peter Christen
7830268be1 fix 756c817b5a
must be applied to all code where a transaction token is generated.
2023-10-21 13:00:49 +02:00
Michael Peter Christen
dc6f218520 set the default password for the admin account to "yacy" 2023-10-21 12:09:19 +02:00
Michael Peter Christen
756c817b5a fix for https://github.com/yacy/yacy_search_server/issues/544 2023-10-21 11:45:26 +02:00
Michael Christen
bab1cfc7ea
added required build tools installation 2023-10-20 16:09:47 +02:00
Michael Peter Christen
03bf259601 fix for https://github.com/yacy/yacy_search_server/issues/363
We still need to set the load in the process because a demand for higher
crawl speed may require increasing the maximum load limit. However,
following the criticism in the bug, we never reduce the load limit
again.
2023-10-16 18:26:47 +02:00
Michael Christen
5bc09af426
Merge pull request #600 from okybaca/scheduler-sort
UI: modified link to Process Scheduler in left menu
2023-10-16 13:00:24 +02:00
okybaca
4c1eb34e85 modified link to Process Scheduler in left menu 2023-10-10 08:30:04 +02:00
Michael Peter Christen
aeb4c7a660 removed warnings during normal build 2023-10-04 22:00:30 +02:00
Michael Peter Christen
095a444aa7 removed wiki links and added more shields badges 2023-09-30 18:16:38 +02:00
Michael Peter Christen
ca2a21008a added screenshots 2023-09-30 13:07:18 +02:00
Michael Christen
961d3cc8af
Merge pull request #597 from joestr/issue/574-fix-mac-script
Fix macOS script
2023-09-28 21:10:49 +02:00
Michael Christen
a035b21f63
Merge pull request #598 from joestr/improvement/remove-travis-yml
Remove .travis.yml
2023-09-28 21:10:04 +02:00
Joel Strasser
b29c0ef133
remove .travis.yml since YaCy is not built on Travis CI anymore 2023-09-27 21:29:22 +02:00
Joel Strasser
09783ae89e
apply patches from @HenryLoenwind 2023-09-27 19:56:08 +02:00
Michael Peter Christen
94db89a757 small remaining changes in readme 2023-09-26 16:15:58 +02:00
Michael Peter Christen
0c4478cd71 migrated jetty to 9.4.52.v20230823 2023-09-26 16:15:42 +02:00
Michael Peter Christen
938724caa8 new development on-boarding process in eclipse with changes for ivy 2023-09-26 16:07:59 +02:00
mchristen
8fc51f66c6 fixed a test class which prevented compilation on latest jvm 2023-09-26 15:39:34 +02:00
Michael Christen
bda118af5d
Merge pull request #594 from joestr/master
Match more YaCy versions
2023-09-26 09:39:15 +02:00
Joel Strasser
53bafa1544
consistent formatting in string concatenation 2023-09-25 23:31:55 +02:00
Joel Strasser
22c4188001
additionally match release stub for YaCy version 2023-09-25 22:41:04 +02:00
Michael Peter Christen
4a5820eb03 7zip parser was removed previously.
see also https://github.com/yacy/yacy_search_server/issues/491
2023-09-03 20:23:23 +02:00
Michael Peter Christen
ff8fe7b6a4 fix for ',' or '.' appearing within a word or number. This will not
tokenize the query into parts around that character to make it possible
to search for numbers or version numbers.
2023-09-03 11:37:25 +02:00
Michael Peter Christen
0689f4f0ae Check if the character is a minus sign and is followed by a letter or a
digit. Treat it as part of the word/number.
2023-09-03 10:22:03 +02:00
Michael Peter Christen
5db97a8928 parser can now separate numbers from words even when they are not
separated by a space, e.g. 4.7Ohm
2023-09-02 19:15:22 +02:00
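
The three tokenizer commits above (ff8fe7b6a4, 0689f4f0ae, 5db97a8928) describe rules that can be sketched together. This is an illustrative reading of those commit messages only, not YaCy's actual tokenizer: the class name, the method name, and the details of each rule (for example, that only a number followed by a letter is split, while "mp3" stays whole) are assumptions.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the YaCy implementation.
public class QueryTokenizerSketch {

    public static List<String> tokenize(final String text) {
        final List<String> tokens = new ArrayList<>();
        final StringBuilder cur = new StringBuilder();
        boolean hasLetter = false; // does the current token contain a letter?
        for (int i = 0; i < text.length(); i++) {
            final char c = text.charAt(i);
            final char next = i + 1 < text.length() ? text.charAt(i + 1) : ' ';
            if (Character.isLetter(c)) {
                // A number directly followed by a letter is split:
                // "4.7Ohm" becomes "4.7" and "Ohm".
                if (cur.length() > 0 && !hasLetter
                        && Character.isDigit(cur.charAt(cur.length() - 1))) {
                    flush(tokens, cur);
                }
                hasLetter = true;
                cur.append(c);
            } else if (Character.isDigit(c)) {
                cur.append(c);
            } else if ((c == '.' || c == ',') && cur.length() > 0
                    && Character.isLetterOrDigit(next)) {
                // '.' or ',' inside a word or number does not split it.
                cur.append(c);
            } else if (c == '-' && Character.isLetterOrDigit(next)) {
                // A minus followed by a letter or digit belongs to the token.
                cur.append(c);
            } else {
                // Any other character ends the current token.
                flush(tokens, cur);
                hasLetter = false;
            }
        }
        flush(tokens, cur);
        return tokens;
    }

    // Appends the pending token, if any, and clears the buffer.
    private static void flush(final List<String> tokens, final StringBuilder cur) {
        if (cur.length() > 0) {
            tokens.add(cur.toString());
            cur.setLength(0);
        }
    }

    public static void main(final String[] args) {
        System.out.println(tokenize("resistor 4.7Ohm"));
        System.out.println(tokenize("version 1.2.3, well-known -5"));
    }
}
```

Under these assumptions a query such as "version 1.2.3" keeps the version number as one token, so it can be searched for as a unit.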
Michael Peter Christen
079eafe7f1 removed 7zip from eclipse classpath 2023-09-02 11:44:33 +02:00
Michael Peter Christen
e3797de7de enhanced the word tokenizer to recognize numbers in a proper way 2023-09-01 20:10:08 +02:00
Michael Peter Christen
88cd17ea57 migrated solr from 8.9.0 to 8.11.2; activated also migration script. A YaCy index with solr 8.9.0 will automatically be migrated to 8.11.2. This is a preparation step to migrate to 9.0.0 soon. 2023-09-01 18:24:52 +02:00
Michael Peter Christen
0089f234f4 added npe protection 2023-09-01 12:18:47 +02:00
Michael Peter Christen
8285fe715a tab to spaces for classes supporting the condenser.
This is a preparation step to make changes in condenser and parser more
visible; no functional changes so far.
2023-09-01 11:00:42 +02:00
Michael Peter Christen
ce4a2450da fixed workflow for ci process/2 2023-08-31 18:05:58 +02:00
Michael Peter Christen
a3ca4eac08 fixed workflow for ci process 2023-08-31 18:03:04 +02:00
Michael Peter Christen
6bd5f49c41 Migrated from java 8 to java 11. This step is required to upgrade certain packages, most importantly solr, which will be migrated from 8.9 to 9.x 2023-08-31 17:52:30 +02:00
Michael Christen
376bcfd54c
Merge pull request #588 from okybaca/crawlurl
modified crawl list so the URL links to external URL
2023-08-28 22:29:04 +02:00
Michael Christen
d353202489
Merge pull request #589 from okybaca/restartbuild
added restartYACY.sh so it's included in release package
2023-08-28 22:22:26 +02:00
okybaca
1de37bc60b added restartYACY.sh so it's included in release package 2023-08-28 13:17:30 +02:00
okybaca
08b769f63a modified crawl list so the URL links to external URL 2023-08-28 13:01:45 +02:00
Michael Peter Christen
195bd2e444 extended the maximum header size to 16k to prevent http error 431 2023-08-19 15:21:24 +02:00
Michael Peter Christen
0554056c63 added .txt search result page (just replace '.html' with '.txt' in yacysearch.html page to get a url list) 2023-08-19 14:57:31 +02:00
Michael Christen
117e2d0663
Merge pull request #580 from okybaca/restartyacy
added a restartYACY.sh script
2023-08-09 16:32:43 +02:00
Michael Christen
c3809fbc36
Merge pull request #575 from okybaca/readme-ant
added link to faq and more ant build instructions to README
2023-08-09 16:30:31 +02:00
Michael Christen
eeeec0b8fe
Merge pull request #585 from okybaca/stopYacyDots
instead of WAITING lines, WAITING FOR STOP and dots are echoed
2023-08-09 16:28:17 +02:00
Michael Christen
2907981f87
Merge pull request #584 from okybaca/darkgreenskin
added a dark green color scheme (skin) based on 28c3
2023-08-09 16:27:20 +02:00