yacy_search_server/source
Michael Peter Christen 7db0534d8a Added a zim parser to the surrogate import option.
You can now import zim files into YaCy by simply moving them
to the DATA/SURROGATE/IN folder. They will be fetched and after
parsing moved to DATA/SURROGATE/OUT.
There are exceptions where the parser is not able to identify the
original URL of the documents in the zim file. In that case the file
is simply ignored.
This commit also carries an important fix to the pdf parser and an
increase of the maximum parsing speed to 60000 PPM which should make it
possible to index up to 1000 files in one second.
2023-11-05 02:16:40 +01:00
..
net/yacy Added a zim parser to the surrogate import option. 2023-11-05 02:16:40 +01:00
org Added a zim parser to the surrogate import option. 2023-11-05 02:16:40 +01:00