Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance http://yacy.net/
Go to file
orbiter 764a40e37d speed enhancements for crawler and url retrieval (affects also search speed)
- concurrency for LURL-fetching: this can be done using a concurrent lookup into the separated url databases. Concurrency is possible because there is no IO during lookup. The more LURL-Tables are present, the better is the speedup. More CPUs will increase speed
- because a large number of LURL-lookups are made during crawling (for double-check), the LURL-Lookup speed enhancements enhances also crawling speed
- search speed also profits from LURL-lookup enhancement
- changed some flushing parameters in word index caching which should make better use of large word index caches and should speed up indexing
- removed flush chunksize parameter, because this was only useful for IO path enhancement feature which was removed some weeks ago to prevent blocking and deadlocks during search requests

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4628 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-31 15:41:19 +00:00
addon changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
bin anhancements in ranking preparation and fixed problem with parser/mime recognition 2005-11-27 11:55:24 +00:00
defaults speed enhancements for crawler and url retrieval (affects also search speed) 2008-03-31 15:41:19 +00:00
htroot speed enhancements for crawler and url retrieval (affects also search speed) 2008-03-31 15:41:19 +00:00
lib - added NEAR operator (must be written in UPPERCASE in search query) 2008-01-08 20:12:31 +00:00
libt *) New lib directory containing libraries only needed for testing (e.g. junit) 2006-10-19 05:21:15 +00:00
libx Add another external dependency from PDFBox package ("Bouncy Castle"). This is necessary for parsing of some encrypted PDF files. 2007-11-27 23:13:26 +00:00
locales update 2008-03-18 17:53:54 +00:00
ranking/YBR added YBR ranking files 2006-03-05 09:00:25 +00:00
skins - removed dashed line from default skin (looks much better!) 2008-02-02 11:30:47 +00:00
source speed enhancements for crawler and url retrieval (affects also search speed) 2008-03-31 15:41:19 +00:00
test joined anomic.net.URL, plasmaURL and url hash computation: 2007-09-05 09:01:35 +00:00
.classpath some code enhancements and bugfixes 2008-03-09 23:48:24 +00:00
.project - plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method) 2007-05-18 23:13:44 +00:00
AUTHORS more generics 2008-01-19 00:40:19 +00:00
build.properties speed enhancements for crawler and url retrieval (affects also search speed) 2008-03-31 15:41:19 +00:00
build.xml fixed build problem see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=956&hilit= 2008-03-17 06:53:20 +00:00
ChangeLog finished refactoring of searchtemplates. 2007-01-18 10:42:36 +00:00
COPYRIGHT *) changed name of COPYING and removed email address as suggested in the forum 2006-02-17 17:08:59 +00:00
gpl.txt initial load with yacy 0.36 2005-04-07 19:19:42 +00:00
httpd.mime - plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method) 2007-05-18 23:13:44 +00:00
killYACY.sh fixed killYACY.sh 2005-11-15 13:54:54 +00:00
readme.txt *) fixed more links 2007-09-01 11:24:23 +00:00
startYACY_noconsole_Win9x.bat Added scripts for Windows ME and 98 as requested in http://www.yacy-forum.de/viewtopic.php?t=839. 2005-08-05 18:18:50 +00:00
startYACY_noconsole.bat changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
startYACY_Win9x.bat Added scripts for Windows ME and 98 as requested in http://www.yacy-forum.de/viewtopic.php?t=839. 2005-08-05 18:18:50 +00:00
startYACY.bat changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
startYACY.command *) fixed more links 2007-09-01 11:24:23 +00:00
startYACY.sh removed debug from startYACY.sh *ups* 2008-03-17 13:02:39 +00:00
stopYACY_Win9x.bat Added scripts for Windows ME and 98 as requested in http://www.yacy-forum.de/viewtopic.php?t=839. 2005-08-05 18:18:50 +00:00
stopYACY.bat *) solving problems with wrong classpath 2005-06-10 06:18:55 +00:00
stopYACY.command Many additions/corrections to the German language file 2006-05-12 21:25:46 +00:00
stopYACY.sh changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
yacy-svn-4.spec changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
yacy.badwords.example *added translation for 2007-01-17 16:58:48 +00:00
yacy.logging more information (BASE64) 2008-01-12 00:24:24 +00:00
yacy.nsi changed handling of default values and database paths: 2008-03-16 22:31:54 +00:00
yacy.stopwords erased stopwords. We need a different solution here. 2007-04-03 12:43:40 +00:00
yacy.stopwords.de german stopwords. 2005-09-15 16:30:16 +00:00
yacy.yellow performance setting for remote indexing configuration and latest changes for 0.39 2005-07-22 13:56:19 +00:00

README for YaCy (C) by Michael Peter Christen; mc@anomic.de
---------------------------------------------------------------------------
Please visit www.yacy.net for latest changes or new documentation.
YaCy comes with ABSOLUTELY NO WARRANTY!
This is free software, and you are welcome to redistribute it
under certain conditions; see file gpl.txt for details.
---------------------------------------------------------------------------

WHAT IS THIS?

This is a Peer-to-Peer - based Web Search Engine.
There is no search central, the YaCy users create a web search network.
You can also use this software to set up your own search portal.


WHERE IS THE DOCUMENTATION?

The complete documentation can be found at:
(English)  http://yacy.net/
(Wiki:de)  http://www.yacy-websuche.de/wiki/index.php/De:Start
(Wiki:en)  http://www.yacy-websearch.net/wiki/index.php/En:Start


WHAT CAN I DO WITH THIS SOFTWARE?

- search the web (automatically using all other YaCy peers)
- crawl the web (and you contribute to the global web index)
- set up your own search portal
- use it as your personal web server
- use it as your web proxy (..and visited pages are indexed)
- many more


DEPENDENCIES? WHAT OTHER SOFTWARE DO I NEED?

You need java 1.4.2 or later to run YaCy.
Please download it from http://java.sun.com
NO OTHER SOFTWARE IS REQUIRED!
(you don't need apache, tomcat or mysql or whatever)


HOW DO I START THIS SOFTWARE?

Startup and Shutdown of YaCy:

- on Linux:
to start: execute startYACY.sh
to stop : execute stopYACY.sh

- on Windows:
to start: double-click startYACY.bat
to stop : double-click stopYACY.bat

- on Mac OS X:
to start: double-click startYACY.command (alias possible!)
to stop : double-click stopYACY.command


HOW DO I USE THIS SOFTWARE, WHERE IS THE ADMINISTRATION INTERFACE?

YaCy is a server process that can be administrated and used
with your web browser: open

   http://localhost:8080

There you can see your personal search and administration interface.


ANY MORE CONFIGURATIONS?

- after startup, you see the configuration page in your web browser.
  just open http://localhost:8080
  all you have to do (should do) is to enter a password for your peer

- You can use YaCy as your web proxy. This is an option, you don't need to do that.
  Simply configure your internet connection to use a proxy at port 8080.



CONTACT:

If you have any questions, please do not hesitate to contact the author:
Send an email to Michael Christen (mc@yacy.net) with a meaningful subject
including the word 'yacy' to prevent that your email gets stuck
in my anti-spam filter.

If you like to have a customized version for special needs,
feel free to ask the author for a business proposal to customize YaCy
according to your needs. We also provide integration solutions if the
software is about to be integrated into your enterprise application.

Germany, Frankfurt a.M., 19.07.2007
Michael Peter Christen