orbiter
9a8a87612d
added new qph column to search tracker servlet
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3854 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-10 22:02:17 +00:00
orbiter
e07458bad4
added time-out function to web analysis
...
the default time-out is 1 second
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3852 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-10 20:00:44 +00:00
hydrox
4a1bc4743a
*)News-entries with blacklisted URLs are now ignored
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3849 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-10 08:05:18 +00:00
allo
6074264267
dynamic rights.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3847 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-09 19:34:09 +00:00
theli
99062c0c9e
*) SOAP should support authentication against the user-DB now (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3846 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-09 15:49:04 +00:00
theli
339153d40e
*) favicons that are specified in the document content via html link-tags
...
are now detected and displayed on the search page (requested by allo).
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3845 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-09 15:22:37 +00:00
allo
854eb1492f
.yacy /.yacyh urls for the feedreader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3844 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-09 12:56:08 +00:00
allo
7a5b22a0b8
Integration of FeedReader in Bookmarks.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3841 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 23:27:42 +00:00
karlchenofhell
6265d321bd
- more constants
...
- display why global search is not available on search page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3839 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 20:01:16 +00:00
allo
7921f07c9d
userDB fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3837 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 16:11:10 +00:00
allo
7b2e1bb8f2
Feedparser with reflection.
...
TODO: This needs a special build.xml entry
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3832 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 14:31:09 +00:00
karlchenofhell
8bff810d19
- fixed logging output of serverMemory.request()
...
- don't start up if DATA/yacy.running exists as this is usually a sign of an already started yacy-instance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3831 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 12:45:03 +00:00
rramthun
18a5380ee3
*) situation-dependent lock-buttons for search-page
...
*) removed one unused import and a double definition of "ogg" as media-type
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3817 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-07 15:26:41 +00:00
karlchenofhell
f05ca43780
- the wiki-parser works for remote wiki-code now, not displaying links anymore as if they were local (ViewProfile comment)
...
- fixed wrong link to CrawlStart on Status-page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3816 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-07 11:35:48 +00:00
karlchenofhell
9d6605a83c
- fixed NPE in Blacklist Cleaner during deletion of more than one double entries
...
- don't display responseHeader1.db in CacheAdmin_p anymore
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3814 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-06 23:36:38 +00:00
orbiter
594ff95955
:-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3801 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-06 11:34:39 +00:00
orbiter
4ca797401e
fix for ConcurrentModificationException
...
see http://www.yacy-forum.de/viewtopic.php?p=36566#36566
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3800 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-06 10:36:04 +00:00
orbiter
7b904e0077
integrated robots.txt crawlDelay into the crawl balancer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3797 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-06 07:53:56 +00:00
orbiter
52cb033f01
- slightly different painting of web structure picture:
...
hosts that have many own connections are painted farer away (this is not yet cato's idea, this will be implemented in another step)
- doc update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3796 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 15:32:43 +00:00
karlchenofhell
30c3d909b1
- fixed charset problem in ConfigProfil_p.html (use accept-charset="UTF-8" in forms)
...
- fixed wrong XML output if no peers are known in Network.xml
- simplified parsing of table properties in wikiCode and ZTableToken
- reimplemented GC heuristics. They are needed to constantly ensure that an amount of free memory is available which is higher than Java's max. limit for performing a Full GC (please use serverMemory.request(long, boolean) rather than serverMemory.available(long, boolean) to provide data for averaging over the last GCs)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3793 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 11:37:19 +00:00
allo
6c9df13552
more debugging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3791 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 20:30:40 +00:00
allo
4392ee0c51
BugFix for typo and wrong include
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3789 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 16:06:58 +00:00
allo
d1e1580223
Surftips Blacklist
...
Blacklists List Hardcoded instead of only updated on firststart / migration.java
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3788 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 15:36:10 +00:00
theli
e1a5babff1
*) Logging GUI handler: line-size is now set to max-size if max-size was exceeded
...
See: http://www.yacy-forum.de/viewtopic.php?p=36355
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3786 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 21:23:32 +00:00
(no author)
94cc9f05f5
*) Improvements for restart via update wrapper
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3785 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 15:25:13 +00:00
hydrox
44bac7dea1
*) blog-comments can now be moderated
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3778 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 06:02:55 +00:00
borg-0300
2ab020445a
bugfix, i think - http://www.yacy-forum.de/viewtopic.php?t=4059
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3777 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 17:03:02 +00:00
(no author)
f89517203d
*) SOAP: new function to get the Performance Settings of Queues and Processes
...
No items left in the yadmin SOAP-TODO :-)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3776 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 15:47:58 +00:00
(no author)
92351c4dcb
*) SOAP: bookmarks list now indicates if a bookmark is private (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3775 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:54:56 +00:00
allo
957a25afff
getRight(rightName) instead of get...Right()
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:48:20 +00:00
(no author)
e0e46d3aec
*) SOAP: new function doGarbageCollection (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3773 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:32:41 +00:00
(no author)
1efe607c34
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3771 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 11:24:59 +00:00
low012
a0149317ac
*) fixed bug where headlines were added to directory of a wiki page multiple times ( http://www.yacy-forum.de/viewtopic.php?t=4034 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3762 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-25 16:36:09 +00:00
(no author)
ef24bed406
Sorry...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3760 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:25:07 +00:00
(no author)
a29cb2e1af
blupp
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3759 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:14:46 +00:00
theli
651b05ba43
*) wsdl file updated (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3758 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:07:58 +00:00
orbiter
0ae6664ad8
enhanced web structure picture
...
- hand-over of get properties from web front-end to graphics generation
- added depth-control buttons
- added marking of anchor-points to highlight relation order
- enhanced ymage graphics library
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3757 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 15:28:03 +00:00
orbiter
a585b4d41b
added web structure image
...
see http://localhost:8080/WatchWebStructure_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3747 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-22 15:20:50 +00:00
orbiter
33ad0c8246
added a web structure computation and logging:
...
- all web page parsing operations will now increase a web structure file
- the file is computed in memory and dumped at shutdown-time to PLASMASB/webStructure.map in readable form (not a database)
- the file can be used externally to analyse the link structure of the crawled pages
- the web structure can also be retrieved using a xml-interface at http://localhost:8080/xml/webstructure.xml
- the short-term purpose is the computation of a link-graph image (before linuxtag!)
- a long-term purpose could be a decentralized computation of the citation rank
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3746 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-22 08:13:48 +00:00
karlchenofhell
7904175338
- sorry for typos
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3743 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 16:22:46 +00:00
karlchenofhell
baa9402b97
- wiki-parser is now configurable via the config setting wikiParser.class which holds the class-name for the parser to use
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3742 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 16:19:25 +00:00
karlchenofhell
601fc7d1c5
- added source to J7Zip-modifed.jar and it's license (changelog is still to come)
...
- moved HTML-*replace-methods from wikiCode to de.anomic.data.htmlTools
- prepared use of different wiki parsers as suggested here: http://www.yacy-forum.de/viewtopic.php?p=34444#34444
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3741 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 13:29:12 +00:00
karlchenofhell
0a64047081
- plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method)
...
- added 7zip parser
- added 'text/sgml' to realtime parseable mimetypes (sometimes returned by the mime type parser)
- added new cached output stream class, very suitable for parsers because of limited memory
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3740 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 23:13:44 +00:00
theli
b1680ab71f
*) bugfix for ArrayIndexOutOfBoundsException in robots-parser (thanks to low012)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 13:39:08 +00:00
theli
b30e64daab
*) passing homepath to serverLog.configureLogging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3738 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 13:04:26 +00:00
theli
9a4375b115
*) robots.txt: adding support for crawl-delay
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3737 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 13:00:42 +00:00
orbiter
11ac7688d5
reverted a part of last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3736 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 17:52:11 +00:00
orbiter
b3f97b5c38
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3735 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 17:45:39 +00:00
orbiter
3c5ff7f735
adopted kelondroBytesIntMap to kelondroIntBytesMap
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3734 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 15:20:15 +00:00
orbiter
5551ff5306
enhanced index storage data structure kelondroBytesIntMap
...
this stores now two index structures, one for data that is aquired during start-up
and one for data that is aquired during run-time. This reduces the grow factor, and should reduce the memory amount in case that a index-reorganisation happens.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3733 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 14:36:56 +00:00
orbiter
872eb46cb9
some redesign of the handling of the index for kelondroFlexTable
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3732 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 10:48:26 +00:00
karlchenofhell
086239da36
- added servlet: remote crawler queue overview
...
- added servlet: crawl profile editor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3731 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-16 10:11:25 +00:00
karlchenofhell
8ba81e0995
- added some comments (will get more in the near future)
...
- added missing <label> to the search field in Network.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3728 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-15 15:48:28 +00:00
allo
65a8a9fc58
fix for nullpointer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3726 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-14 16:56:13 +00:00
orbiter
b05e2314cf
another dht selection fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3725 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-14 12:52:39 +00:00
karlchenofhell
03c6551b0c
- fix for http://www.yacy-forum.de/viewtopic.php?t=3747
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3724 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-14 12:44:44 +00:00
orbiter
b28e5d0ee9
protection against wrong word hash length
...
see http://www.yacy-forum.de/viewtopic.php?p=35657#35657
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3723 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-14 10:00:23 +00:00
orbiter
e897eb9b4a
fix for DHT selection target
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3722 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-14 08:17:36 +00:00
orbiter
0384b8771b
fix for http://www.yacy-forum.de/viewtopic.php?p=35700#35700
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3719 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-13 19:37:16 +00:00
orbiter
578c2ef130
release 0.52
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3715 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-11 22:12:29 +00:00
orbiter
756a27049b
last-minute-feature 'newbie-selection' for workshop purpose:
...
for remote search, always select all peers that are less than a day old
(should be removed someday in the future if load is too high, which could mean when pph > 100)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3712 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-11 19:55:19 +00:00
orbiter
46367afaaa
update of memory-protection values
...
see http://www.yacy-forum.de/viewtopic.php?p=35539#35539
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3709 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-11 18:02:48 +00:00
rramthun
ea87fe5d78
*) Updated German translation
...
*) Changed "Lost Handle" error to warning (masses of it if deleting crawl-profile)
*) Removed unnecessary code from Windows script
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3708 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-11 17:48:22 +00:00
orbiter
85035dc319
addition to svn 3699: check send/receive if p2p-mode is activated
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3701 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-10 13:27:38 +00:00
orbiter
26f05d1fd0
avoid division by zero if search is done for no words
...
this case is relevant if the bluewords (yacy.blue) are used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3698 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-09 22:10:12 +00:00
orbiter
2fa8b50e54
reverting svn 3691+3692
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3696 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-09 19:31:40 +00:00
orbiter
139c59ebbd
- fixed dht selction problem: the seed tables used a wrong ordering
...
- cleaned some code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3693 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-09 17:59:36 +00:00
orbiter
22a0e9f117
more timeout-control
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3692 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-09 14:53:17 +00:00
orbiter
24db55a541
added timeout for httpd-sockets during read
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3691 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-09 14:30:01 +00:00
orbiter
f3fee4d445
fix for http://www.yacy-forum.de/viewtopic.php?p=35322
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3689 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-08 09:19:36 +00:00
orbiter
7f56c8d4aa
fixed some seed selection details
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3685 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-07 22:22:35 +00:00
orbiter
e602436fda
fixed problem with cluster routing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3684 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-07 20:48:24 +00:00
orbiter
0831034e07
fixed non-termination bug for robinson remote crawl peer selection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3681 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-07 14:37:50 +00:00
orbiter
d6480dc670
fix for long transfer pauses
...
see http://www.yacy-forum.de/viewtopic.php?p=35243#35243
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3672 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-06 21:43:20 +00:00
theli
cb43ae11ba
*) Bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3668 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-06 12:57:22 +00:00
theli
0b5fc3c28c
*) moving date functions to serverDate class
...
*) Sitemap-parser
- logging added
- parsing of modDate added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3667 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-06 12:36:49 +00:00
theli
6f46245a51
*) Bookmarks: Ajax icon is displayed while loading title
...
*) First version of a sitemap parser added
- currently only autodetection of sitemap files is supported
*) DB-Import restructured
- pause/resume should work again now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3666 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-06 09:52:04 +00:00
orbiter
111ba9e359
- fixed some width problems in new status page
...
- fixed deadlock in dns cache
- added termination security for DHT peer selection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3660 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-05 23:18:00 +00:00
theli
74dd6cac95
*) signal yacy shutdown to updater
...
*) some javadoc added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3658 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-05 16:20:01 +00:00
theli
43748f87fb
*) changes required for the uploader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3655 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-05 15:41:05 +00:00
rramthun
e12e934ade
*) Fixed broken compile process.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3650 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-04 21:33:37 +00:00
rramthun
d6811ac243
*) Moving tar.jar from libx to lib
...
*) Enhanced interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3649 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-04 19:46:23 +00:00
theli
469583ea80
*) new interface class. should be implemented by the updater to allow communication between the updater and yacy
...
(not yet functional)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3648 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-04 14:22:10 +00:00
orbiter
191ef16499
fixed wrong ordering that caused bad dht selection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3646 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-04 14:11:50 +00:00
orbiter
7cf8981a98
- added debugging code for wrong DHT target iterator
...
- restricted distance constraint from 0.4 to 0.2
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3644 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-03 22:57:55 +00:00
orbiter
dd44a1394f
disabled automatic performance setting change
...
- during crawl start
- each indexing cycle
- for delay values
- for short memory cycles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3634 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-02 15:39:27 +00:00
orbiter
b9add5cf37
some bugfixes:
...
- dht iterator start point
- wordIndex synchronization
- surftipps url check
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3633 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-02 14:20:43 +00:00
orbiter
06b6e35484
fix for a null pointer exception if clusters are not defined
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3632 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-02 12:26:29 +00:00
orbiter
35c660654d
more debugging lines to fix bug for
...
http://www.yacy-forum.de/viewtopic.php?p=34935#34935
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3629 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-30 23:05:19 +00:00
orbiter
47e90f31b2
fix for deadlock in plasmaWordIndex.addPageIndex
...
synchronization for class method not necessary
see also: http://www.yacy-forum.de/viewtopic.php?p=34959#34959
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3628 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-30 22:30:09 +00:00
orbiter
81844e85b2
- fixed more cluster routing problems
...
- fixed a problem in remote search when balancer caused shift process to wait too long
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3627 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-30 00:39:53 +00:00
orbiter
304ed3f4d2
fix for remote crawl requests in clusters
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3626 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-29 22:52:07 +00:00
orbiter
1979a167d3
fixed problem with cast
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3625 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-29 22:31:44 +00:00
orbiter
e48189c710
enhanced cluster routing
...
- cluster definitions can now contain an addition for local ip addresses
- cluster-cluster communication uses the local ip address instead the global address, if one is given
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3624 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-29 22:05:34 +00:00
orbiter
b33cef421e
better routing for public clusters
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3620 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-29 00:08:38 +00:00
theli
7c902996b5
*) changes required for the uploaderWrapper
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3618 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-28 16:04:37 +00:00
orbiter
f73e1e3af9
fixed bugs in remote search setting for public clusters
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3615 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-28 00:12:41 +00:00
orbiter
485bf1ea83
bugfix for robinson/remote crawl bug
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3614 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-27 21:35:43 +00:00
orbiter
62c947b4aa
next try to fix deadlock in plasmaWordIndex
...
see also:
http://www.yacy-forum.de/viewtopic.php?p=34821#34821
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3607 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-27 12:26:36 +00:00
orbiter
871ee1ce0f
one step closer to automatic updates:
...
automatically aquire release information from download archives
web pages from latest.yacy-forum.net and yacy.net are retrieved, parsed,
links wihin are analysed, sorted and the most recent developer and main
releases are provided as direct download link on the status page, if it was
discovered that a more recent version than the current version is available.
This process is done only once during run-time of a peer, to protect our
download archives from DoS by YaCy peers.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-27 09:23:44 +00:00
theli
ec225f9ab6
*) SOAP: adding methods to get the comment and MD5 checksum of a single file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3604 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-27 05:10:31 +00:00
theli
2399ed817c
*) robots.txt parser now extracts the sitemap-URL (will be used later)
...
*) some javadoc added
*) junit testclass for robots.txt parser added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3602 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-26 15:42:38 +00:00
orbiter
fa012789b2
tried to fix a deadlock problem durin shutdown
...
see also:
http://www.yacy-forum.de/viewtopic.php?p=34753#34753
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3601 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-26 15:15:40 +00:00
orbiter
e192f616a2
collection of small bugfixes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3600 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-26 14:28:57 +00:00
orbiter
64a6d6e5e6
added new set iterator (needed for last commit)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3599 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-26 09:52:37 +00:00
orbiter
f8de19fb2f
robinson cluster: added client-side protocol implementation
...
- the network configuration page shows a new option: robinson clusters
- when a global search is made, all robinson peers are excluded, but:
- robinson peers/clusters that provide peer tags and where search words match
such tags, they are included in global search. Therefore, robinson peers/clusters
support the global yacy network with their indexes, without doin DHT-exchange
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3598 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-26 09:51:51 +00:00
theli
50e1e61fa5
*) SOAP: adding functions to rename and move files
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3595 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-25 05:36:28 +00:00
theli
63a004abff
*) bugfix for Nullpointerexception
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-25 05:22:09 +00:00
orbiter
29fe2beac7
possibly fixed a deadlock
...
cannot find forum link now for that
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3593 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-24 21:55:57 +00:00
(no author)
4f4d3d71dd
*) Faster appearance of ConfigBasic by bypassing UPNP-scan in case of existing external connects
...
*) Marked two deprecated source-points
*) Added possibility to dump words from indexing to file. Should not affect performance in the current form.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3592 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-24 16:33:31 +00:00
orbiter
657585fe0d
network functions for robinson peers: server-side protection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3591 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-24 15:11:12 +00:00
orbiter
89c1511738
- added new Network Configuration menu, can be found in basic settings
...
- new cluster functions will be available in this menu, but currently not enabled,
because corresponding interface methods are not ready yet
- shifted remote crawl settings to new network configuration menu
- shifted DHT distribution/receive to the new network configuration menu
- adopted some string constants
- added cluster configuration settings to yacy.init
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3589 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-23 20:47:07 +00:00
orbiter
62b79aa0a9
bugfix for http://www.yacy-forum.de/viewtopic.php?p=34558#34558
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-20 21:38:12 +00:00
orbiter
2f3b518169
temporary patch for startup-problem:
...
http://www.yacy-forum.de/viewtopic.php?t=3854
This is a serious problem that is caused by the database bug between 0.511 - 0.513
which produced a large number of double-entries in the RWI index. The uniq()-method
tries to fix this, and it does not terminate when the index is large and the number
of double-occurrences is also large. This patch does simply implement a time-controlled
termination, which does not heal the inconsistency problem. The uniq-method itself
is correct and does not need a bugfix, the non-termination is simply caused by the large number
of data that is shifted during the process. It was possible to reproduce this behaviour
in a test environment.
A real fix would need to:
- enhance the uniq()-method by using a recursive, binary segmentation of the array to be fixed
- uniq() must report the entries that are double
- the double-entries must be deleted from the collection index (from the index and the collections) to heal the problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3583 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-20 07:53:58 +00:00
rramthun
e6fb6426a3
*) Some cosmetical changes and corrections
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3582 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-19 16:16:54 +00:00
orbiter
ba525ebf52
- re-enabled path optimization that was disabled during testing
...
- re-implemented index load/extend optimization that was removed from kelondroFlexTable,
this is now part of kelondroIntBytesIndex
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3580 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-19 14:55:19 +00:00
orbiter
595ee10468
fixed datatabase inconsistency bugs
...
inserted many debug lines
added a huge number of asserts
extended database test methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3579 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-19 13:37:02 +00:00
orbiter
ca79362b9d
disabling auto-setting of remote crawl performance
...
see also http://www.yacy-forum.de/viewtopic.php?t=3849
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3577 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-18 18:36:06 +00:00
orbiter
7a7a1c7c29
fight against problems with remove-methods and synchronization
...
- some bugs may have been fixed with wrong removal operations
- removed temporary storage of remove-positions and replaced by direct deletions
- changed synchronization
- added many assets
- modified dbtest to also test remove during threaded stresstest
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3576 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-17 15:15:47 +00:00
orbiter
b6a5f53020
removed double synchronization from kelondroRecords.USAGE to prevent thread locking.
...
The method synchronization should be sufficient
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3574 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-15 21:13:54 +00:00
(no author)
6186185775
*) Moved some comments to javadoc
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3573 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-14 10:11:37 +00:00
orbiter
063063aa0c
fix for 100% cpu bug during dht selection
...
see also: http://www.yacy-forum.de/viewtopic.php?p=34068#34068
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-13 13:40:19 +00:00
michitux
4990909178
Some bugfixes, new layout/style for image search results:
...
* removed divide by zero bug when 20_dhtdistribution_busysleep is 0
* replaced German comment with wrong charset in source/de/anomic/plasma/plasmaCrawlBalancer.java by an English one
* replaced the table-fix for floating behind snipped images by a br with clear
* removed unnecessary old xhtml-files (were not in use, they were created when we weren't having xhtml for testing)
* new layout for image-search results: replaced the old one with spans and tables inside (not valid) with new divs, now each image snippet container has the same size
TODO:
* the ids of the snippetLoading-divs aren't valid because ids must start with an alphabetic letter or an underscore, they have to be prefixed
* in the returned snippet-xml is an unresolved pattern for status (the status is only set for text snippets)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3566 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-12 18:21:17 +00:00
orbiter
78d04bcbcf
fixed bug in search statistics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3562 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-12 09:24:56 +00:00
orbiter
b79b4082e2
completed search exclusion:
...
- exclusion on index-level (not only from search snippets)
- exclusion hand-over at remote search protocol
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3556 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-10 12:27:03 +00:00
auron_x
d66b0276e3
*) removed log-output for PPM-calc
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3553 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-07 10:03:12 +00:00
low012
4400acc27d
*) created new 8 bit oldschool style font for possible future use
...
*) main method is generalization of main method of ymageFontGenerator:
it does not matter how many lines of how many bits a font is made of
as long as the values stay the same within the font -> use this class as
a template for your own font generators and be a happy camper
*) main method checks if font is valid (96 characters, all letters must have
same number of lines and same number of bits per line)
*) ***** I have not checked if the result is really a valid font so far. *****
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3552 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 21:05:16 +00:00
orbiter
25070822a5
fix for http://www.yacy-forum.de/viewtopic.php?p=33925#33925
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3551 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 19:08:59 +00:00
orbiter
06a7978730
moved url pattern matching for search to better place
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3550 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 15:08:14 +00:00
orbiter
159bd0cab5
diverses; b.o. fix for http://www.yacy-forum.de/viewtopic.php?p=33914#33914
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3549 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 14:58:29 +00:00
orbiter
cdc7b77a62
fix for http://www.yacy-forum.de/viewtopic.php?p=33916#33916
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3548 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 14:47:45 +00:00
orbiter
40c14a4f0e
- better implementation of search query properties
...
- basic protection against start-up problems when database files are corrupted
- auto-delete of not-critical databases during startup when load error occurs
- on-the-fly reset option for all database tables
- automatic on-the-fly reset for seed tables during enumeration exceptions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3547 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-05 10:14:48 +00:00
theli
1696606b7f
*) changing loglevel of "PPM-Calculation" message
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3545 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-04 13:09:59 +00:00
auron_x
f30bf1683e
*) corrected spelling of captcha
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3544 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-04 10:13:58 +00:00
orbiter
fcdf000fbc
bugfix for http://www.yacy-forum.de/viewtopic.php?p=33838#33838
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3543 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 22:08:40 +00:00
low012
d7edc9740b
*) added correct (c) and Last*-data
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3542 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 20:58:45 +00:00
low012
ee241f32e6
*) very basic capcha class (see coding sections of forum for more details)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3541 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 20:54:14 +00:00
orbiter
6e7340ef52
added exclusion search
...
(you can now search and exclude words from the result with '-')
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3540 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 15:35:29 +00:00
orbiter
e4734a8b6b
fix for fix in SVN 3537
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3539 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 14:42:29 +00:00
orbiter
356033aceb
fixed bug with continuous reset of balancer file index
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3537 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 12:36:24 +00:00
orbiter
ba2c307ab3
optimized memory allocation in kelondroRow.Entry
...
such an entry cannot be instantiated without allocation of new byte[]; instead
it can re-use memory from other kelondroRow.Entry objects.
during bugfixing also other bugs may have been solved, maybe the INCONSISTENCY problem
could have been solved. One cause can be missing synchronization during bulk storage
when a R/W-path optimization is done. To test this case, the optimization is currently
switched off.
More memory enhancements can be done after this initial change to the allocation scheme.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3536 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-03 12:10:12 +00:00
theli
24ea4ca631
*) adding first version of postscript parser
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3535 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-01 15:02:07 +00:00
theli
def0d6124e
*) trying to solve SecurityManager problem during init of soap engine
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3534 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-31 16:01:52 +00:00
theli
75eb65028a
*) adding a test if a seucrity manager is active
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3533 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-31 14:30:09 +00:00
orbiter
210ede8230
added a class for byte-array management. This was the result of a very large experiment
...
to replace byte[] objects within kelondro. Frequent System.arraycopy are common when
kelondroRow.Entry objects are handled. This class may be used to prevent this.
However, experimental replacement of byte[] by kelondroByteArray in kelondroRow.Entry
resulted in complete re-write of large parts of kelondro. This experiment did not
completely lead to a result, because then the interface to kelondro had to be changed
also from byte[] to kelondroByteArray, which may have caused a rewrite of large parts
of YaCy. The experiment is therefore abanonded, but this class remains here without
any function but possibly for future use.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3531 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-30 08:44:43 +00:00
theli
1b7fda12ee
*) SOAP: separate function to get the active/passive/potential peer list
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3526 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-28 07:34:44 +00:00
orbiter
6488ec8a80
no deletions in index in case that snippet-loading fails and there is no network connection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3525 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-27 08:21:45 +00:00
orbiter
847349358b
less memory usage during collectionIndex-rebuild
...
should also speed up that process a little bit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3524 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-27 08:21:03 +00:00
auron_x
8ef3ad12a7
*) fix for rare bug in PPM-calc
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3523 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-25 21:46:03 +00:00
auron_x
00bc0c1b47
*) new logging for PPM-Calculation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3522 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-25 20:24:12 +00:00
auron_x
5941577076
*) added some logging to PPM-Calculation to find a rare bug
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3521 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-25 14:56:42 +00:00
orbiter
5c3afb3202
added option to configure a path to a secondary index location.
...
this shall be used to store a fragment of the index on another physical device,
to split IO load and enhance access speed. The index is splitted in such a way
that the LURLs are stored to the secondary location, and the RWIs to the primary
location. This is especially useful for environments where symbolic links are
not possible and may cause IO access even if there is no write access to the
device which hosts the symbolic link.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3519 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-24 15:28:17 +00:00
theli
c2e6afbd69
*) bugfix: setting mimeType properly for dir listing with e.g. "?format=xml"
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3516 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-23 05:37:19 +00:00
orbiter
242c19b480
completed TLD categorization
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3515 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-22 13:52:00 +00:00
hydrox
b99f9d870d
*) fixed double selection of peers for the same DHT-chunk.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3513 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-22 09:08:38 +00:00
theli
f20b596dc0
*) adding servlet to display all deployed SOAP Services
...
- soap related servlets are located in htroot/soap
*) new serverContext class for soap
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3511 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-22 08:30:57 +00:00
theli
75d90834a2
*) adding additional file extension for powerpoint
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3507 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 16:18:58 +00:00
orbiter
2cb16824e3
removed support for old database structures.
...
The new collection index will be more generalized to support other indexes
i.e. YBR block-rank computation. A clean-up of the many conditions to support
the old database was necessary.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3506 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 15:35:35 +00:00
theli
81b4598487
*) peer profile can now be displayed as vcard
...
e.g. http://localhost:8080/ViewProfile.vcf?hash=localhash
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3504 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 15:08:18 +00:00
orbiter
3688ec33e5
release 0.51
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3501 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 14:00:17 +00:00
theli
1f61c13697
*) RSS-parser extracts the author tags now
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3500 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 13:35:32 +00:00
orbiter
602ac42010
fix for OOM case when a kelondroTree Node cache grows
...
See also: http://www.yacy-forum.de/viewtopic.php?p=33275#33275
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3499 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 13:26:18 +00:00
theli
b374812f01
*) adding rpm packager as author
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3498 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 13:09:12 +00:00
theli
40ce33e664
*) adding RSS feed for yacy news
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3496 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 12:22:18 +00:00
theli
589cbd8cbf
*) replacing all yacy-news-category strings with corresponding constants
...
Note: please use these constants from now on
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3495 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-21 11:09:15 +00:00
allo
f4af360f7c
bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3494 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-20 15:37:19 +00:00
orbiter
7af188ff9a
fix for http://www.yacy-forum.de/viewtopic.php?p=33089#33089
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3491 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-19 11:59:29 +00:00
orbiter
5bbf010107
removed synchronization of size() method from numerous classes to avoid thread locking
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3490 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-18 19:45:23 +00:00
orbiter
6b9eea3932
- removed differentiation between longTitle and shortTitle; this cannot be used for search results,
...
and it is difficult to get both types from all document types
- added some author parsing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3489 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-18 12:33:19 +00:00
orbiter
a738b57b31
added author tag to indexing content
...
enhanced composition of title tag
TODO: insert author information for external parsers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3488 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-17 01:18:34 +00:00
orbiter
6be57983a8
another update to the crawl balancer
...
can now alternate between top and bottom of the crawl stack
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3487 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 16:54:54 +00:00
orbiter
91cdc1493f
removed query to NAT or responder in case that no other peer is there.
...
this is not needed any more, there are enough peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3486 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 15:21:24 +00:00
orbiter
4783a30910
- fixed a flush problem in balancer
...
- return to idle divisor in RWI RAM cache flush
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3485 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 15:16:26 +00:00
theli
91c2a042a7
*) bugfix for wrong proxy traffic accounting
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3484 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 13:52:48 +00:00
orbiter
861f41e67e
redesigned NURL-handling:
...
- the general NURL-index for all crawl stack types was splitted into separate indexes for these stacks
- the new NURL-index is managed by the crawl balancer
- the crawl balancer does not need an internal index any more, it is replaced by the NURL-index
- the NURL.Entry was generalized and is now a new class plasmaCrawlEntry
- the new class plasmaCrawlEntry replaces also the preNURL.Entry class, and will also replace the switchboardEntry class in the future
- the new class plasmaCrawlEntry is more accurate for date entries (holds milliseconds) and can contain larger 'name' entries (anchor tag names)
- the EURL object was replaced by a new ZURL object, which is a container for the plasmaCrawlEntry and some tracking information
- the EURL index is now filled with ZURL objects
- a new index delegatedURL holds ZURL objects about plasmaCrawlEntry obects to track which url is handed over to other peers
- redesigned handling of plasmaCrawlEntry - handover, because there is no need any more to convert one entry object into another
- found and fixed numerous bugs in the context of crawl state handling
- fixed a serious bug in kelondroCache which caused that entries could not be removed
- fixed some bugs in online interface and adopted monitor output to new entry objects
- adopted yacy protocol to handle new delegatedURL entries
all old crawl queues will disappear after this update!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3483 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 13:25:56 +00:00
hydrox
9b5fb3908d
*) a peer-message are now created when a blog-comment is written
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3480 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-15 12:58:17 +00:00
orbiter
581db87237
more debug code for
...
http://www.yacy-forum.de/viewtopic.php?p=33009#33009
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3479 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-14 15:04:06 +00:00
orbiter
81c4cc6bf7
better debugging of balancer failure
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3478 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-14 12:02:56 +00:00
orbiter
dd06d4cada
more logging to better trace bug
...
http://www.yacy-forum.de/viewtopic.php?p=33001#33001
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3477 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-14 09:36:54 +00:00
orbiter
96b79bf86d
redesigned remove method in kelondroRowSet
...
This should fix also numerous bugs like
http://www.yacy-forum.de/viewtopic.php?p=31077#31077
(java.lang.ArrayIndexOutOfBoundsException in kelondroRowCollection.removeShift)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3476 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-14 08:55:05 +00:00
orbiter
9f929b5438
better snippet handling in case of snippet load fail
...
see also http://www.yacy-forum.de/viewtopic.php?p=31096#31096
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-13 22:18:36 +00:00
auron_x
d451ad48d3
*) improved peerloadgraphic:
...
- unnecessary (0 %) pieces are removed
- percent-values of each thread displayed in legend
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3474 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-12 19:08:17 +00:00
orbiter
a5d668c0c6
added speed-buttons for easy performance setting
...
appears in crawl start and on indexing monitor page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-12 16:24:28 +00:00
orbiter
5b0a84ce09
fix for synchronization deadlock with flushMissNameCache.
...
see also: http://www.yacy-forum.de/viewtopic.php?p=32939#32939
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3472 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-12 09:06:57 +00:00
karlchenofhell
e2ac5f62bd
- Code hübscher machen [von NNs TODO]
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3471 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-11 19:53:14 +00:00
allo
f04097c3dd
integrated tor-patch for crawling, if yacyDebugMode is set.
...
(replaces: http://yacy.deruwe.de/overlay/net-misc/yacy-tor/files/disable_dns_checks-svn3132.patch )
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3470 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-11 18:43:11 +00:00
auron_x
22fe14f292
*) first version of Peerload-graphic
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3469 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-11 17:04:11 +00:00
orbiter
432d7d4e9c
better catch
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3468 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-10 23:38:08 +00:00
orbiter
8f7e8b6ee2
auto-delete for not-fixable db error in crawl stacker.
...
see also http://www.yacy-forum.de/viewtopic.php?p=32906#32906
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3467 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-10 23:31:36 +00:00
orbiter
7a52b07fcc
better memory protection during freemen cycle
...
see also http://www.yacy-forum.de/viewtopic.php?p=32903#32903
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3466 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-10 23:22:37 +00:00
orbiter
6faa262259
fix for NURL-fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3465 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 14:30:53 +00:00
orbiter
909d7a8ae9
fixed wrong implemented row iterator in kelomdroFlexSplitTables
...
this has no effect, until now this iterator was only used on
the Index Administration page.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 13:55:26 +00:00
orbiter
a1fb8358b2
lets make a well-formed http link so that other crawlers don't have a problem to follow this link :-)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3463 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 12:35:54 +00:00
orbiter
4edb70f68b
added yacybot info-page from Roland
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3462 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 12:26:31 +00:00
orbiter
3ef77d2030
fix for http://www.yacy-forum.de/viewtopic.php?p=29878#29878
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3461 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 12:14:25 +00:00
orbiter
3bb3df3fc0
fix for http://www.yacy-forum.de/viewtopic.php?p=32298#32298
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 12:03:53 +00:00
orbiter
243a2f831b
fixed problem with not found NURL-hashes
...
The cause for this problem could still not be found, but the effect
is handled much better. The NURL-pop will continue automatically until
it found a hash that can be found.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3458 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 11:07:20 +00:00
orbiter
6ad39bae1e
fixed shutdown problem
...
this fixes the 'inconsistency' messages during start-up
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3457 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 08:48:47 +00:00
orbiter
38b93f8cb8
bugfix for my last commit:
...
iterator did not consider secondary start point in case of rotation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3456 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 22:07:17 +00:00
karlchenofhell
264a82eec8
- fix for http://www.yacy-forum.de/viewtopic.php?t=3657
...
- fix for http://www.yacy-forum.de/viewtopic.php?p=32758#32758
- Diff takes any objects now, not only strings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3455 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 22:04:15 +00:00