Commit Graph

51 Commits

Author SHA1 Message Date
Michael Peter Christen
a6ad1d6fd1 update to search tests (use yacy interface and a bugfix) 2012-12-27 03:15:50 +01:00
orbiter
f86d469973 more search command tools 2012-12-10 21:01:14 +01:00
orbiter
1228a5798d you can now search for '*' to get just ALL entries in the search index
as result list. This makes sense if you intend to search just by using
the navigation tools to cut the data set into navigation 'slices'.
2012-12-10 20:55:11 +01:00
Michael Peter Christen
68c7ed5ce9 added a shell script which can be used to delete the api action steering
table. This may be necessary if the api is called by remote command and
the recordings are not used. Then they can be deleted frequently by
calling this clear command using a cron job
2012-10-17 00:44:16 +02:00
Michael Peter Christen
ed803708ab added a shell script which can be used to add a rss feed to the index.
All pages linked in the rss feed are added. The process is not repeated
automatically. If you want to repeat this, add the command to a cron
job.
2012-10-17 00:31:59 +02:00
Michael Peter Christen
f3fc8eac80 fixed clear scripts 2012-10-11 10:16:37 +02:00
Michael Peter Christen
640339ee21 added the indexrestore.sh script, which must be called with the path of
the index dump. It is the reverse of indexdump.sh: it takes the
output of indexdump.sh as input to restore an index.
Now it should be possible to transfer a complete YaCy Solr index from
one peer yacy1 to another peer yacy2 with the following command:
yacy2/bin/indexrestore.sh `yacy1/bin/indexdump.sh`
2012-09-25 00:28:20 +02:00
Michael Peter Christen
15ea053c3a - added xml output in IndexControlURLs to get the storage path of index
dump commands
- adjusted the apicall.sh script to get the downloaded text as output to
stdout which is necessary to parse the content out of it
- added indexdump.sh script which creates a solr dump and prints out the
storage path for the index dump
- added synchronization to the Fulltext class to prevent data from being
stored to a non-existing solr index while this index is disabled during
the storage of the dump
2012-09-25 00:19:52 +02:00
Roland 'Quix0r' Haeder
aef9dd0350 - removed cleaning of blacklist cache on startup
- added cleaning of blacklist cache if cache is modified in interface
- extended cache saving to all cache types
- moved cache location to DATA/LISTS
- fixed static file path which was relative to the application path but
should be relative to data path - which is different in debian and mac
implementations
2012-07-10 13:08:16 +02:00
Michael Peter Christen
cc1b6762bb root, not yacy 2012-06-24 10:58:09 +02:00
Michael Peter Christen
2589158f44 changed recommended line in /etc/crontab for high-availability 2012-06-24 10:57:18 +02:00
Michael Peter Christen
2823e33531 fixes for wrong port number in shell scripts 2012-05-29 13:08:27 +02:00
Michael Peter Christen
8b974905ee changed log-in text for all servlets with authentication:
- added hint how to set the password using a shell script
- added a shell script to change the password
2012-05-24 13:24:31 +02:00
Michael Peter Christen
6c4f8fdc44 removed superfluous files 2012-04-30 11:09:54 +02:00
Lotus
e14eb9de82 checkalive.sh: try to fetch only once (default: 20) 2012-03-12 09:30:44 +01:00
Michael Peter Christen
5e18f54a8c added shell script to get a servlet. this is the same as apicall.sh but it prints the result to stdout 2012-01-31 23:21:49 +01:00
Michael Christen
7a5ff9b808 added checkalive script that can be called frequently to restart crashed
peers automatically
2012-01-02 02:09:49 +01:00
orbiter
3ac6fb0baf added dump check script
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7978 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-09-28 21:18:49 +00:00
orbiter
fabda9ad31 added script that can be used to delete a single url from the index
call:
bin/deleteurl.sh <url>


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7969 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-09-21 23:33:44 +00:00
orbiter
2e8c8ebf54 fix for clear commands
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7939 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-09-08 12:54:10 +00:00
low012
24e76a7b69 *) Replaced occurrences of "Wikimedia" with "MediaWiki" where applicable. (Thanks to the folks of 0x20.be for pointing this out.)
*) Added description of where to place MediaWiki dump for import.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7905 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-08-28 00:16:36 +00:00
orbiter
d0d6123b18 added a deploy script that can be used to deploy yacy releases into the current release for testing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7768 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-01 19:52:05 +00:00
orbiter
265b7ce4f9 removed pause in search test
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7767 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-06-01 19:49:44 +00:00
orbiter
752576b521 - localsearch test script also does a snippet-fetch
- killYACY.sh does not need a sleep between kill -3 and kill -9

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7750 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-05-27 12:08:45 +00:00
orbiter
f6077b3cc0 added more attributes for html parser and enhanced data structures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7679 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-04-28 13:09:01 +00:00
orbiter
9b25d07295 - added geo information parsing to html parser
- extended metadata information in index with geolocalisation
- added display of location in yacydoc and ViewFile

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7629 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-30 00:49:47 +00:00
orbiter
18ec7fe53c added a clearall.sh script that deletes the complete index and everything else that belongs to crawling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7611 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-20 08:36:29 +00:00
orbiter
a50f28e6e7 - fixed missing save operation for peer name change
- fixed import of mediawiki dump files
- added script to add mediawiki dump files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7609 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-19 23:52:09 +00:00
orbiter
88773e4daa changed the default port from 8080 to 8090
see also: http://forum.yacy-websuche.de/viewtopic.php?p=21683#p21683

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7454 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 10:54:13 +00:00
orbiter
5dcb838293 - removed thread overhead when calling dns services
- fixed localsearch (changed it by accident)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7306 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 00:29:32 +00:00
orbiter
becc463d8a enhanced did-you-mean
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 00:25:19 +00:00
orbiter
fc924f024e import of oai sources from a list using a command line interface:
if you have a list of oai servers you can import them all using the linux command:
bin/importOAIList.sh <name-of-oai-list-file>


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7141 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-13 10:13:34 +00:00
orbiter
3197ca42ed preparations to move the HTCache into cora:
- move the header framework classes to cora
- move the ARC caching classes to cora
- refactoring of code to call these classes from cora

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7068 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-08-23 12:32:02 +00:00
orbiter
b68deb407a - moved test data from /bin to /test/words
- refactoring of stopYACY.sh by introduction of /bin/apicall which is able to call any api file with attached authorization

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6691 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-22 20:14:16 +00:00
orbiter
5d7045387b added more word lists and a multi-access search test tool for high-performance query testing:
run searchtestmulti.sh; then 10 concurrent processes fire 1000 requests each to the local peer.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6045 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 22:01:48 +00:00
orbiter
8ce5bb4f31 added shell scripts that list host addresses
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5793 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-10 09:45:22 +00:00
orbiter
51ea865569 small fix for localsearch shell script
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5792 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-10 09:44:03 +00:00
orbiter
d39d420b39 performance hacks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5376 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-12-03 15:38:29 +00:00
orbiter
bb5c898441 enhancements to localsearch behavior
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5131 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-09 10:24:42 +00:00
orbiter
42e2d195ac added hint from http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1294
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-08 22:37:58 +00:00
f1ori
f0012554de * improved localsearch.sh:
  * read port from DATA/SETTINGS/yacy.conf
  * use curl or wget, depending on what is available


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4972 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-07-06 16:48:41 +00:00
orbiter
994c609cf8 added new shell script to do a web search from the terminal
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4916 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-06-11 21:33:36 +00:00
orbiter
e9dd0e6831 added a test script for profiling of local search speed and a list of search words
current result: 120 milliseconds for a single search during a sequence of 100 searches,
while the crawler is running with 500 PPM!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4913 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-06-11 10:36:37 +00:00
orbiter
724bbdf9b2 refactoring of RSS reader
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4736 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 21:31:07 +00:00
orbiter
0702dd2507 added a profiling script to analyse search process timing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4733 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 13:28:18 +00:00
orbiter
40621a5663 enhancements in ranking preparation and fixed problem with parser/mime recognition
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1132 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-27 11:55:24 +00:00
orbiter
0e25020f51 added first generation and usage of YBR index-files. Enhanced overall ranking of search results.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-22 15:17:05 +00:00
orbiter
0ec54d9c5f enhanced CR-file handling and added first RCI-evaluation tests
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-20 18:55:35 +00:00
orbiter
88e3234393 fine-tuning of rci-generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1105 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-18 02:00:25 +00:00
orbiter
a12759c1bf first try to implement a rci-computation from cr-files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1103 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-17 16:17:56 +00:00