5667 Commits

Author SHA1 Message Date
orbiter
4d4315687f fix for problem with concurrency in host navigator, bug reported by wsb
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5993 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-29 10:52:50 +00:00
orbiter
88426912ad more refactoring to make the segment object easier to use and to be prepared to integrate author navigation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5992 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-29 10:03:35 +00:00
lotus
d813fd26ed reset sent/received counters on index delete
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5991 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-28 15:49:42 +00:00
orbiter
99bf0b8e41 refactoring of plasmaWordIndex:
divided that class into three parts:
- the peers object is now hosted by the plasmaSwitchboard
- the crawler elements are now in a new class, crawler.CrawlerSwitchboard
- the index elements are the core of the new segment data structure, which is a bundle of different indexes for the full text and (in the future) navigation indexes and the metadata store. The new class is now in kelondro.text.Segment

The refactoring is inspired by the roadmap to create index segments, that is, the option to host different indexes on one peer.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5990 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-28 14:26:05 +00:00
orbiter
876746602d catch problems of file hash computation, see also:
http://forum.yacy-websuche.de/viewtopic.php?p=15245#p15245

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5989 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-28 10:08:36 +00:00
orbiter
fec6f9054f some refactoring of search methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5988 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 23:51:34 +00:00
orbiter
d793bb0d76 the MySQL lib was not included in releases; moved library from libx to lib
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5987 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 20:23:28 +00:00
orbiter
3d4b826ca5 migration of all databases that use the deprecated BLOBTree format into the BLOBHeap format. Old databases are migrated automatically.
This removes the last very IO-intensive data structures which were still used for Wiki, Blog and Bookmarks. Old database files will still remain in the DATA subdirectory but can be deleted manually if no major bugs appear during migration. There is no need for any user action, all migration is done automatically.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5986 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 15:04:04 +00:00
orbiter
4b4bddca00 added new submenu to crawler menu: import of phpbb3 forum postings from mysql
- yacy can import phpbb3 posts without crawling
- all data is written as surrogate
- indexed surrogate files can be re-used

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5985 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 14:53:23 +00:00
apfelmaennchen
f1a9253baa see: http://forum.yacy-websuche.de/viewtopic.php?f=9&t=1904#p15198
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5984 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 08:56:55 +00:00
apfelmaennchen
369bb78a7a - added "visibility: hidden;" for yacysearch.html sidebar
- this only works with JavaScript, sorry

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5983 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 07:30:59 +00:00
apfelmaennchen
16a226b016 removed debug-alert() from last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5982 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 07:13:45 +00:00
apfelmaennchen
01334686bd small fixes for yacysearch.html sidebar (display=3)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5981 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-27 07:06:13 +00:00
orbiter
d8284046b0 enhanced speed of site navigation computation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5980 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-26 22:30:20 +00:00
orbiter
c72a5cf326 added stub for PHPBB3 extraction code using direct access to mySQL
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5979 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-26 15:58:49 +00:00
orbiter
e735d3a69f fix for http://forum.yacy-websuche.de/viewtopic.php?p=15175#p15175
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5978 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-26 15:03:50 +00:00
orbiter
63a0255166 - refactoring: added new content package, which will contain connector classes for different types of data sources to import texts into the YaCy index
- refactoring: migrated data objects for the new connector classes
- added a DAO interface class to specify an abstract interface for database retrieval connector methods

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5977 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-26 07:44:22 +00:00
orbiter
f246928c20 first attempt to add 'real' Navigation to yacy search results: host navigation
- after a search is started, it is analysed how many hits are in each site
- this can be done really efficiently, because the navigation information is hidden in the url hash and can be computed very fast
- the search result shows a column on the right with the hosts and the hits per host
- after a click on a host the search is modified using the efficient site: - operator

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5976 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-25 22:27:34 +00:00
orbiter
54b9e99c01 - more information about peer tags
- peer tag is by default '*'

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5975 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-25 21:43:33 +00:00
orbiter
26a46b5521 increased default maximum file size for database files to 2GB
Other file sizes can now be configured with the attributes
filesize.max.win and filesize.max.other
The default maximum file size for non-Windows operating systems is now 32GB

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5974 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-25 06:59:21 +00:00
lotus
eb36c9a092 hint for Java in start script
this can usually only be seen if it isn't installed

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5973 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-24 17:06:37 +00:00
orbiter
addecdb18c simplified code, removed one unused method in all implementing classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5972 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 23:53:01 +00:00
borg-0300
47fce9020c small change (Orbiter's wish)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5971 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 17:51:52 +00:00
borg-0300
e07b14e5d7 finally a working fix for 5960
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5970 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 16:07:04 +00:00
borg-0300
3ebb904d2c fix for 5960, http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2119
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5969 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 11:47:57 +00:00
lotus
734680dc70 initialize the ResourceObserver in its own thread
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5968 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 08:30:34 +00:00
orbiter
e005cfea37 fix for bug in -incell option of URLAnalysis
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5967 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-21 06:57:03 +00:00
orbiter
a7e392f31b The collection index will not be supported any more.
Existing indexes based on the old index collections must be migrated with YaCy 0.8
- removed index collection classes and all migration tools
- added an 'incell' reference collection feature in URL analysis


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5966 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-20 14:51:26 +00:00
orbiter
a2f48863fc - added prototype for navigation index
- refactoring of word index prototype
(no functional changes so far)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5965 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-20 09:00:24 +00:00
lotus
47fd226bdb proper parsing of sentences
does not affect tokens/words

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5964 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-19 16:41:27 +00:00
orbiter
27eb8d62cb - new development cycle
- removed temporary configuration with safe setting for indexer threads (=1) and replaced it with best value computed during performance tests (1/2 of number of processors)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5963 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-18 21:20:06 +00:00
orbiter
fdb87a0616 release 0.8
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5962 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 22:23:36 +00:00
orbiter
41dd31cad2 replaced new navigation icons with same images but smaller resolution (16x16 instead of 128x128)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5961 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 22:22:29 +00:00
orbiter
b7457d3807 patch for http://forum.yacy-websuche.de/viewtopic.php?p=14720#p14720
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5960 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 21:44:02 +00:00
orbiter
bffbe43e09 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14522#p14522
fix for http://forum.yacy-websuche.de/viewtopic.php?p=14955#p14955

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5959 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 21:15:06 +00:00
orbiter
f133d6065c fix for http://forum.yacy-websuche.de/viewtopic.php?p=14955#p14955
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5958 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 18:28:33 +00:00
low012
16d28496ac *) YaCy will load the status page once it is up again after a restart or update if JavaScript is enabled.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5957 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-17 16:28:39 +00:00
lotus
82af994041 added missing loglevel
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5956 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-16 08:21:51 +00:00
orbiter
ad9762746d no exception in case of uniq() time-out, see also
http://forum.yacy-websuche.de/viewtopic.php?p=13177#p13177

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5955 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-15 23:07:10 +00:00
orbiter
1efe686e3f fix for http://forum.yacy-websuche.de/viewtopic.php?p=13960#p13960
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5954 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-15 22:51:15 +00:00
lotus
13fb84ab81 you can define the default number of displayed search results with search.items
this applies only to requests through the classic-style page

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5953 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-15 14:48:34 +00:00
orbiter
a642d6a7b5 - added navigation icons for search result pages
- modified result page rendering to use new icons instead of numbers
- set different default values in yacy.init for higher indexing performance; removed pro-values
- modified WatchCrawler to accept 30000 PPM instead of only a maximum of 6000 PPM

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5952 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-14 23:11:10 +00:00
low012
012157c87f *) fixed "select all" and "deselect all"-button
This bug was introduced when search boxes were added to the header of each page and there may be more pages which are affected.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5951 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-12 22:32:30 +00:00
orbiter
f2e4d156e8 removed debug messages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5950 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-12 22:03:33 +00:00
orbiter
709bfc2cd4 added a memory check in http post protocol
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5949 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-12 20:23:55 +00:00
orbiter
c01d6f43e1 - fixed problem with thread dump if no arguments are given
- rejecting peers that are older than 6 hours (not seen for 6 hours)
- 0.78, targeting 0.8 at the end of the week

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5948 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-11 22:26:21 +00:00
orbiter
a49edd9415 fix for bug in search with site: constraint
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5947 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-11 21:20:23 +00:00
orbiter
d3ca038979 fixed domain graph applet (visible in Terminal)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5946 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-11 21:12:30 +00:00
orbiter
04ec42e1d0 removed index Transfer, because it does not make sense any more to do a complete index transfer using the DHT methods. It is far easier and faster to just copy the whole index files from the source to the target peer. To do this, copy all files from DATA/INDEX/<network>/TEXT/RICACHE and DATA/INDEX/<network>/TEXT/METADATA from the source to the destination peer.
Future versions of YaCy will have different file structures than given here, because of the new index segment concept, where each peer may have several indexes (called segments) for each network definition. A copy from one peer to another will then create a new index segment.
To track changes for this feature, see forum discussion at
http://forum.yacy-websuche.de/viewtopic.php?p=14728#p14728

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5945 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-10 21:00:39 +00:00
orbiter
c1e5fad9a7 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14767#p14767
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5944 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-10 20:50:46 +00:00