yacy_search_server/htroot
Michael Peter Christen 7db0534d8a Added a zim parser to the surrogate import option.
You can now import zim files into YaCy by simply moving them
to the DATA/SURROGATE/IN folder. They will be fetched and after
parsing moved to DATA/SURROGATE/OUT.
There are exceptions where the parser is not able to identify the
original URL of the documents in the zim file. In that case the file
is simply ignored.
This commit also carries an important fix to the pdf parser and an
increase of the maximum parsing speed to 60000 PPM which should make it
possible to index up to 1000 files in one second.
2023-11-05 02:16:40 +01:00
..
api replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
env modified link to Process Scheduler in left menu 2023-10-10 08:30:04 +02:00
jquery
js modified crawl list so the URL links to external URL 2023-08-28 13:01:45 +02:00
p2p moved more servlets to new location 2022-10-02 22:57:58 +02:00
portalsearch Fetch result pages one by one when scrolling in portal search widget 2018-08-28 15:49:30 +02:00
processing
proxymsg moved more servlets to new location 2022-10-02 22:57:58 +02:00
yacy refactoring - moved htroot/yacy classes 2022-10-02 22:26:53 +02:00
AccessGrid_p.html
AccessTracker_p.html
AccessTracker_p.xml
autoconfig.pac
Autocrawl_p.html
Blacklist_p.html Enhance notability of current blacklist by diff color in header 2022-02-06 09:43:59 +01:00
BlacklistCleaner_p.html
BlacklistImpExp_p.html
BlacklistTest_p.html
Blog.html
Blog.rss
Blog.xml
BlogComments.html
Bookmarks.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Bookmarks.rss
CacheResource_p.html
Collage.html
compare_yacy.html Update htroot compare_yacy servlet 2022-01-29 08:53:02 +01:00
ConfigAccountList_p.html
ConfigAccounts_p.html removed concept of empty passwords as "no passwords used", 2023-10-25 22:56:06 +02:00
ConfigAppearance_p.html added stub of rc3assembly style 2021-02-09 20:30:10 +01:00
ConfigBasic.html added a warning message in ConfigBasic in case that the default password 2023-10-24 23:36:26 +02:00
ConfigHeuristics_p.html Prevent entering empty OpenSearch URLs in ConfigHeuristics_p.html 2018-08-06 12:07:47 +02:00
ConfigHTCache_p.html
ConfigLanguage_p.html
ConfigNetwork_p.html
ConfigParser_p.html Added a zim parser to the surrogate import option. 2023-11-05 02:16:40 +01:00
ConfigPortal_p.html Allow JS resorting of search results by unauthenticated users 2019-04-03 14:21:53 +02:00
ConfigProfile_p.html
ConfigProperties_p.html Hide password values from visible HTML in the Advanced Config page 2018-09-21 09:59:32 +02:00
ConfigRobotsTxt_p.html
ConfigSearchBox.html
ConfigSearchPage_p.html fixes showing metadata from Searchresult, by removing defType=edismax 2021-03-21 00:06:26 +01:00
ConfigUpdate_p.html
ConfigUser_p.html
Connections_p.html
ContentAnalysis_p.html
ContentIntegrationPHPBB3_p.html
CookieMonitorIncoming_p.html
CookieMonitorOutgoing_p.html
CookieTest_p.html
CrawlCheck_p.html
Crawler_p.html Added a zim parser to the surrogate import option. 2023-11-05 02:16:40 +01:00
Crawler_p.json
CrawlMonitorRemoteStart.html
CrawlProfileEditor_p.html
CrawlProfileEditor_p.xml added canonical filter 2023-01-16 14:50:30 +01:00
CrawlResults.html Added and updated hint messages about remote crawler status 2018-07-06 11:30:30 +02:00
CrawlStartExpert.html added canonical filter 2023-01-16 14:50:30 +01:00
CrawlStartScanner_p.html allow network scans for non-standard http/https ports 2021-01-11 00:28:24 +01:00
CrawlStartSite.html new limitation documentation 2020-12-22 16:33:12 +01:00
DictionaryLoader_p.html Update link to Moby in DictionaryLoader_p.html 2022-02-11 03:19:30 +01:00
favicon.bmp
favicon.ico
favicon.png
goto_p.html
Help.html
index.html Fixed encoding of '+' character on search pages links 2018-08-20 18:44:04 +02:00
IndexBrowser_p.html IndexBroswser chart last 100 days - see https://github.com/yacy/yacy_search_server/issues/453 2022-02-04 06:09:08 +01:00
IndexBrowser_p.xml turned HostBrowser into a admin-only page, now called IndexBrowser 2020-12-11 00:50:52 +01:00
IndexControlRWIs_p.html
IndexControlURLs_p.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
IndexControlURLs_p.xml
IndexCreateLoaderQueue_p.html
IndexCreateParserErrors_p.html
IndexCreateQueues_p.html Add Sorting functionality to Crawler Queue Table 2022-01-09 16:06:14 +01:00
IndexDeletion_p.html adding hint how to shrink the disk size after an index deletion. 2021-01-06 22:02:00 +01:00
IndexExport_p.html calculating the correct size of an export. 2021-09-16 01:05:09 +02:00
IndexFederated_p.html fixed doku link 2021-08-03 16:57:24 +02:00
IndexImportJsonList_p.html stub for jsonlist index importer web page 2022-10-23 12:22:31 +02:00
IndexImportMediawiki_p.html Added a link to MediaWiki dumps summary in import page for convenience 2018-08-08 08:11:02 +02:00
IndexImportOAIPMH_p.html
IndexImportOAIPMHList_p.html
IndexImportWarc_p.html
IndexReIndexMonitor_p.html adds deleting during recrawl 2020-07-09 19:32:16 +02:00
IndexReIndexMonitor_p.json
IndexSchema_p.html
IndexShare_p.html
jslicense.html removed ymarks 2021-09-16 22:23:51 +02:00
Load_MediawikiWiki.html
Load_PHPBB3.html
Load_RSS_p.html
mediawiki_p.html
Messages_p.html
Messages_p.rss
Messages_p.xml
MessageSend_p.html
Network.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Network.json fix for bad json 2019-11-06 17:28:11 +01:00
Network.xml
News.html
News.rss
opensearchdescription.xml
Performance_p.html feature https://github.com/yacy/yacy_search_server/issues/434 2021-12-26 23:33:31 +01:00
PerformanceConcurrency_p.html
PerformanceMemory_p.html updated solr 6.6.6 -> 7.7.3 2020-12-12 02:06:43 +01:00
PerformanceMemory_p.xml
PerformanceQueues_p.html
PerformanceQueues_p.xml
PerformanceSearch_p.html
ProxyIndexingMonitor_p.html
QuickCrawlLink_p.html
QuickCrawlLink_p.xml
RankingRWI_p.html Replaced RWI ranking JQuery sliders with standard HTML range inputs 2018-08-28 08:34:23 +02:00
RankingSolr_p.html
rct_p.html
RegexTest.html
RemoteCrawl_p.html
robots.txt turned HostBrowser into a admin-only page, now called IndexBrowser 2020-12-11 00:50:52 +01:00
rssTerminal.html
SearchAccessRate_p.html Allow JS resorting of search results by unauthenticated users 2019-04-03 14:21:53 +02:00
ServerScannerList.html
Settings_Crawler.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Debug.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_HttpClient.inc Settings_HttpClient.inc spelling correction 2022-04-07 17:33:16 +03:00
Settings_MessageForwarding.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_p.html Made SNI extension user configurable without the need for server restart 2019-04-14 15:41:13 +02:00
Settings_Proxy.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_ProxyAccess.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Referrer.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Seed_UploadFile.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Seed_UploadFtp.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Seed_UploadScp.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_Seed.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
Settings_ServerAccess.inc add setting for public facing port 2022-01-11 17:10:48 +01:00
Settings_UrlProxyAccess.inc Enforced access controls to System settings pages 2018-09-19 09:18:36 +02:00
SettingsAck_p.html add setting for public facing port 2022-01-11 17:10:48 +01:00
sharedBlacklist_p.html Blacklist import from file, exclude comment lines 2022-02-05 17:38:29 +01:00
ssitest.html
ssitest.inc
ssitestservlet.html
Status_p.inc removed concept of empty passwords as "no passwords used", 2023-10-25 22:56:06 +02:00
Status.html changed link to new forum location 2022-02-03 13:27:06 +01:00
Steering.html changed link to new forum location 2022-02-03 13:27:06 +01:00
suggest.json
suggest.xml
Supporter.html
Surftips.html fixed lock image 2020-12-20 23:18:50 +01:00
Surftips.rss
Table_API_p.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Table_RobotsTxt_p.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Tables_p.html
Tables_p.xml
terminal_p.html
test.html
test.xml
Threaddump_p.html
Trails.html
Translator_p.html Move sub-menu UI Translations from public Status to secure Sys Administration 2022-02-08 22:42:11 +01:00
TransNews_p.html fixed html error 2022-10-02 23:42:54 +02:00
User.html
ViewFile.html turned HostBrowser into a admin-only page, now called IndexBrowser 2020-12-11 00:50:52 +01:00
ViewLog_p.html increases log history length to 10000 2022-10-05 16:09:28 +02:00
ViewLog_p.json
ViewProfile.html
ViewProfile.rdf
ViewProfile.vcf
ViewProfile.xml
Vocabulary_p.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Vocabulary_p.xml
WatchWebStructure_p.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
Wiki.html
WikiHelp.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
yacyinteractive.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
yacysearch_location.html replaced all the links to legacy legacy wiki to legacy wiki 2023-10-29 13:12:24 +01:00
yacysearch_location.kml
yacysearch_location.rss
yacysearch_location.xml
yacysearch.atom
yacysearch.html enhanced search result design 2019-09-28 22:11:11 +02:00
yacysearch.json
yacysearch.rss
yacysearch.txt added .txt search result page (just replace '.html' with '.txt' in yacysearch.html page to get a url list) 2023-08-19 14:57:31 +02:00
yacysearch.xsl
yacysearchitem.atom
yacysearchitem.html Add option to add host to default blacklist from search result 2022-02-09 19:42:04 +01:00
yacysearchitem.json Improved the Image search page to have bigger thumbnails, use a bigger area for results and a smaller left sidebar. 2021-12-26 23:41:04 -07:00
yacysearchitem.txt added .txt search result page (just replace '.html' with '.txt' in yacysearch.html page to get a url list) 2023-08-19 14:57:31 +02:00
yacysearchitem.xml Improved the Image search page to have bigger thumbnails, use a bigger area for results and a smaller left sidebar. 2021-12-26 23:41:04 -07:00
yacysearchlatestinfo.json
yacysearchpagination.html Properly render the href attribute of the active page button 2019-03-09 08:28:39 +01:00
YaCySearchPluginFF.gif
YaCySearchPluginFF.html
YaCySearchPluginFF.src
yacysearchtrailer.html Render a relevant message and status on blocked search requests 2019-04-05 11:06:09 +02:00
yacysearchtrailer.json
yacysearchtrailer.txt added .txt search result page (just replace '.html' with '.txt' in yacysearch.html page to get a url list) 2023-08-19 14:57:31 +02:00
yacysearchtrailer.xml