Commit Graph

245 Commits

Author SHA1 Message Date
Michael Peter Christen
f07e5fb553 release 1.2 2012-11-07 23:14:45 +01:00
orbiter
a1227879a9 release 1.1 2012-08-24 23:59:10 +02:00
Michael Peter Christen
23226676c6 FOR THE BRAVE.. this is a forced migration to solr which is now ready
for production as a replacement of the metadata-db.
This intermediate release 1.041 will switch on the previously optional
solr index and the old metadata-db will still work as it did before.
Solr+metadata are accessed in mixed mode, no migration is done yet.
If this causes not a catastrophe until the end of the weekend, we will
do a YaCy 1.1 main release containing this as default.
2012-08-16 18:17:47 +02:00
Michael Peter Christen
2160f9a819 Release 1.04 2012-07-09 00:13:59 +02:00
Michael Peter Christen
15f4551d88 Release 1.03 2012-06-19 08:51:26 +02:00
Michael Peter Christen
9c51db4243 Release_1.02 2012-02-25 12:59:19 +01:00
sixcooler
d14ee8e464 Revision 9000+ hack
do not handle the revision in build.properties anymore
(9000 as fallback)
build-date from git-HEAD (instead when build is fired)
(orginal build-date as fallback)
2011-12-07 04:20:49 +01:00
Michael Christen
7afcdcd573 release 1.01 - now with virtual svn number 9000 2011-12-07 01:03:08 +01:00
Michael Christen
9cd469e6d6 added pull request from als plus an NPE fix 2011-12-04 12:15:03 +01:00
daburna
73925b3c24 I am back with a small update for the German translation 2011-12-03 15:50:10 +01:00
Al Sutton
8993cac4d8 Initial performance improvements 2011-11-30 11:15:54 +00:00
orbiter
4fb5dfc312 YaCy 1.0 :)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-11-27 16:57:33 +00:00
orbiter
6b70393d1d - new java version 1.6
- replaced old gif animator by java 1.6 gif animator

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7388 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-20 22:51:50 +00:00
orbiter
9fc940aa35 release 0.99
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7309 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 13:20:51 +00:00
orbiter
efa59250f8 release 0.98 for SuMa-eV Demo tomorrow
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7224 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-05 17:52:54 +00:00
orbiter
2c7edea35e - better shutdown behavior for the GUI (waits until data is written if GUI is killed)
- release 0.97

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7135 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-10 12:47:24 +00:00
orbiter
a33f39832e - small change in display of use cases
- explain usage of ftp, smb and file search domains

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6913 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-06-06 23:26:04 +00:00
orbiter
fc5efcc05a enhanced and fixed OAI-PMH import
- now importing OAI-PMH server list fron two sources
- simultanous import from several servers (even > 2000)
- check buttons on OAI-PMH server list to select multiple servers for import start
- it is possible to select all servers at once for import
- imported XML data is gzipped after import from surrogate reader

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6847 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-04-30 14:03:51 +00:00
orbiter
881a1065ce version number step to 0.94
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6679 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-02-17 21:56:56 +00:00
orbiter
a44112b562 - moved index cleaner to blacklist submenu, because the index cleaner cleans the index with the blacklist
- version switch to 0.93 to reflect advancements

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6516 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-12-03 14:26:30 +00:00
orbiter
735e2737e3 * added index segments
This is a major change in the organization of indexes.
Please consider a back-up of your data before you run this update.
All existing index files will be moved and renamed to a new position.
With this change, it will be possible to maintain different indexes for different purposes and it will be possible to have a distinction between DHT-in and DHT-out specific indexes. Tenants may also have their own index, and it may be possible to have histories and back-ups of indexes. This is just the beginning, many servlets must be adopted after this change, but all functions that had been there should still work.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6389 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-10-09 14:44:20 +00:00
f1ori
8544cfd5a6 * remove seperate build-files for parsers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6170 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-03 20:06:06 +00:00
orbiter
6fc18b9d51 removed unused ant targets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 15:01:42 +00:00
orbiter
69b573a8ac - next release cycle
- doku update for phpBB3
- added more files for search test scripts

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 05:53:57 +00:00
orbiter
2b3e3ef6df release for linuxtag
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6128 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-23 21:38:39 +00:00
orbiter
99fa265e1d fix for search bug caused by tenant patch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6125 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 22:31:29 +00:00
orbiter
1457bfce16 added updateYACY.sh to release build
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6085 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-16 21:02:41 +00:00
orbiter
d58b395993 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15693#p15693
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6049 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-11 09:38:25 +00:00
orbiter
a5d481eab1 enhanced navigation
- fixed too early computation of navigation
- moved navigation rendering to yacysearchtrailer
- added more asserts

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6006 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 22:45:28 +00:00
orbiter
4b4bddca00 added new submenu to crawler menu: import of phpbb3 forum postings from mysql
- yacy can import phpbb3 posts without crawling
- all data is written as surrogate
- indexed surrogate files can be re-used

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5985 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 14:53:23 +00:00
orbiter
27eb8d62cb - new development cycle
- removed temporary configuration with safe setting for indexer threads (=1) and replaced it with best value computed during performance tests (1/2 of number of processors)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5963 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-18 21:20:06 +00:00
orbiter
fdb87a0616 release 0.8
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5962 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 22:23:36 +00:00
orbiter
a642d6a7b5 - added navigation icons for search result pages
- modified result page rendering to use new icons instead of numbers
- set different default values in yacy.init for higher indexing performance; removed pro-values
- modified WatchCrawler to accept 30000 PPM instead of only a maximum of 6000 PPM

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5952 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-14 23:11:10 +00:00
orbiter
c01d6f43e1 - fixed problem with thread dump if no arguments are given
- rejecting peers that are older than 6 hours (not-seen during 6 hours)
- 0.78, targeting 0.8 at the end of the week

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5948 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-11 22:26:21 +00:00
orbiter
f678472f46 fix for quote problem in json output
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5895 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-27 22:27:02 +00:00
orbiter
138422990a - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated
- added some debugging output to balancer to find a bug
- removed unused classes for index collection handling
- changed some default values for the process handling: more memory needed to prevent OOM

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5856 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-22 22:39:12 +00:00
orbiter
89ec3acb3e - full abstraction of index content type: the kelondro full text index may now also contain indexes about other content than text, i.e. navigation indexes or reverse linking indexes.
- during index joins all word positions are maintained: better ranking for word distance possible; exact phrase match can be implemented soundly


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5804 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-15 06:34:27 +00:00
orbiter
c2359f20dd refactoring: better abstraction of reference and metadata prototypes.
This is a preparation to introduce other index tables as used now only for reverse text indexes. Next application of the reverse index is a citation index.
Moved to version 0.74

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5777 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-03 13:23:45 +00:00
orbiter
a9cea419ef Integration of the new index data structure IndexCell
This is the start of a testing phase for IndexCell data structure which will replace
the collections and caching strategy. IndexCall creation and maintenance is fast, has
no caching overhead, very low IO load and is the basis for the next data structure,
index segments.

IndexCell files are stored at DATA/<network>/TEXT/RICELL
With this commit still the old data structures are used, until a flag in yacy.conf is set.
To switch to the new data structure, set
useCell = true
in yacy.conf. Then you will have no access any more to TEXT/RICACHE and TEXT/RICOLLECTION

This code is still bleeding-edge development. Please do not use the new data structure for
production now. Future versions may have changed data types, or other storage locations.
The next main release will have a migration feature for old data structures.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5724 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-03-17 13:03:27 +00:00
f1ori
c545fcb9fa * add class to handle keys and signatures
* fix bug in serverCharBuffer
* add build-target to sign tar.gz (run ant dist sign)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5665 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-03-02 13:29:50 +00:00
orbiter
c25c334b75 replaced old DHT transmission method with new method. Many things have changed! some of them:
- after a index selection is made, the index is splitted into its vertical components
- from differrent index selctions the splitted components can be accumulated before they are placed into the transmission queue
- each splitted chunk gets its own transmission thread
- multiple transmission threads are started concurrently
- the process can be monitored with the blocking queue servlet
To implement that, a new package de.anomic.yacy.dht was created. Some old files have been removed.
The new index distribution model using a vertical DHT was implemented. An abstraction of this model
is implemented in the new dht package as interface. The freeworld network has now a configuration
of two vertial partitions; sixteen partitions are planned and will be configured if the process is bug-free.
This modification has three main targets:
- enhance the DHT transmission speed
- with a vertical DHT, a search will speed up. With two partitions, two times. With sixteen, sixteen times.
- the vertical DHT will apply a semi-dht for URLs, and peers will receive a fraction of the overall URLs they received before.
  with two partitions, the fractions will be halve. With sixteen partitions, a 1/16 of the previous number of URLs.
BE CAREFULL, THIS IS A MAJOR CODE CHANGE, POSSIBLY FULL OF BUGS AND HARMFUL THINGS.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-02-10 00:06:59 +00:00
orbiter
063f74ae03 new release cycle. next stable will be 0.8
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5496 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-14 18:19:52 +00:00
orbiter
4e7881096f release 0.7
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5494 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-13 23:50:02 +00:00
orbiter
9ef77d57f5 added an access control to the search interface using white/blacklists:
in the network configuration, you can configure a whiteliste and a blacklist
- blacklistet clients cannot search
- whitelistet client get never any search restrictions
- for all other clients: apply DoS search restrictions
Please see the example configuriation in yacy.network.freeworld.unit
by default, all clients from localhosts get whitlistet.
If you have your own YaCy network, please put all the IPs of your peers into the whitelist

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-12 10:55:48 +00:00
orbiter
c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
The cache is now flushed only for one second every ten seconds. During a crawl the cache
fills up completely, and is only flushed if space is needed for more documents.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5446 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-06 13:51:59 +00:00
orbiter
abdd4aa414 added a index dump for blob heaps:
this will increase the shutdown time for at most some seconds, but will speed up the start-up

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5419 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-12-29 21:36:27 +00:00
orbiter
d39d420b39 performance hacks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5376 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-12-03 15:38:29 +00:00
orbiter
45ad1c3dd5 - re-activated concurrent iterator for EcoFiles
- added javadoc for new concurrent intialization in kelondroBytesLongMap
- switched default value for commons storage to false
- version step

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5361 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-11-23 18:25:40 +00:00
orbiter
867d0f2f56 removed some unnecessary pause delays
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5346 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-11-14 23:36:33 +00:00
orbiter
3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
- implemented vertical DHT acceptance ("my own DHT") to accept new targets
- added new target computation for global search: addresses vertical targets also
- enhanced remote crawling: collection of remote crawl urls if queue has less than 100 entries (was: 0 entries)
- better performance value computations for PPM selection in network configuration

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5319 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-11-06 10:07:53 +00:00