Commit Graph

1954 Commits

Author SHA1 Message Date
orbiter
764a40e37d speed enhancements for crawler and url retrieval (affects also search speed)
- concurrency for LURL-fetching: this can be done using a concurrent lookup into the separated url databases. Concurrency is possible because there is no IO during lookup. The more LURL-Tables are present, the better is the speedup. More CPUs will increase speed
- because a large number of LURL-lookups are made during crawling (for double-check), the LURL-Lookup speed enhancements enhances also crawling speed
- search speed also profits from LURL-lookup enhancement
- changed some flushing parameters in word index caching which should make better use of large word index caches and should speed up indexing
- removed flush chunksize parameter, because this was only useful for IO path enhancement feature which was removed some weeks ago to prevent blocking and deadlocks during search requests

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4628 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-31 15:41:19 +00:00
orbiter
b2150057d2 removed unnecessary cleanup method
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4625 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-30 20:32:08 +00:00
f1ori
76eac114ed * define global javascript-variable with var to get rid of warnings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4624 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-30 19:51:19 +00:00
low012
b63cf2fc1c *) added button to Crawl Profile Editor to delete all terminated crawl jobs (only visible if there are terminated crawl jobs)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4620 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-30 15:15:56 +00:00
f1ori
2aed6bb3f7 * return valid xml in xml-bookmarks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4619 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-30 14:58:29 +00:00
low012
4c3f1b67ad *) refactoring of Blacklist_p.java (moving entries might be slightly slower, but the code is more tidy now)
*) added edit functionality for blacklist entries

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4616 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-29 20:39:46 +00:00
f1ori
466d49e90c * added login-parameter to be able to force authentication
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4613 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-29 11:10:04 +00:00
f1ori
c67350f138 * use putXML with forXML-parameter to ensure urls are valid xml (problem was & in url)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4611 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-28 22:50:33 +00:00
f1ori
98a03249f3 - optimize selectall-javascript-function (in firefox more then 10 times faster now)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4610 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-28 16:28:36 +00:00
orbiter
0241d070bc added concurrency to indexing process:
- the methods {parsing, semantic analysis (condensing), structure analysis (web structure)} in the serialized indexing path had been made concurrent.
- four BlockingQueues handle concurrency and hand-over of the indexing objects, the last object in the queue is stored into a blockingQueue of maximum size 1 to serialize the process for storage (which uses IO and therefore here should not be deserialized)
- a concurrency of (CPUs + 1) is default. Single-CPU users will profil from the change because large files cannot block the indexing process any more.
- removed the secondary indexing thread, which is superfluous now. Concurrency is default for all users.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4609 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-28 11:56:28 +00:00
orbiter
bca87f1e38 - refactoring of serverThreads: renaming to distinguish busy-threads and blocking-threads
- added blockingThreads which are threads that are not driven by pause times but by BlockingQueue lookup

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-27 12:03:16 +00:00
orbiter
968c775025 - preparation of parsing/indexing queue for concurrent execution
- remote crawl receipts are now transmitted concurrently in separate threads (makes remove crawls much faster!)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4605 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-26 22:43:38 +00:00
orbiter
7f9f639d20 - refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
- refactoring of word/phrase handling: word abstraction from condenser becomes part of index element handling
- removed unused code parts from condenser

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4603 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-26 15:37:49 +00:00
orbiter
d6050b9ffb - separated the LURL data storage and Crawl result stack for process supervision.
this is another step to enable multiple, concurrent fulltext-indexes
- another try to make the yacy-httpc more stable

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4602 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-26 14:13:05 +00:00
low012
8e889de50b *) Added Lotus' patch (http://forum.yacy-websuche.de/viewtopic.php?t=979), user will be taken back to last opened page after making changes in Advanced Settings.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4601 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-25 20:40:54 +00:00
orbiter
93633abed8 - removed some debugging code from search process - should speed up now
- added some profiling code to search event - more time details in PerformanceSearch_p.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-23 00:55:04 +00:00
orbiter
541b817502 refactoring of switchboard queueing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4591 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-22 01:28:37 +00:00
orbiter
3e44293f07 - fixed a problem with thread pools in row collection
- added a line-viewing feature in threaddump	

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-20 14:21:58 +00:00
danielr
e43051b125 - fixed Threaddump output (html-escaped ie. <init>)
- in EcoFS converted comments to javadoc


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-20 10:20:55 +00:00
orbiter
433ff855f7 - fixed another concurrency problem in collection sorting
- fixed a typing problem that was introduced in svn 4579 and caused the crawl monitor to fail

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4585 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-19 23:47:24 +00:00
apfelmaennchen
5fde618337 changed display of y-marks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4583 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-19 19:54:35 +00:00
daburna
1618bd5505 added submenu
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4580 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-17 14:51:04 +00:00
orbiter
7150b463ff changed handling of default values and database paths:
- the default files yacy.init and for the network definition is now moved to the path defaults
- the httpProxy.conf is renamed to yacy.conf
- the DATA/INDEX/PUBLIC is renamed to the actual network nickname, which should be freeworld or sciencenet
more menu entries
- added apfelmaennchens alternative search page to the menu
- added the new thread dump page to the server log menu point as submenu
modifications
- modified the thread dump page: sorting by thread type

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4575 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 22:31:54 +00:00
apfelmaennchen
54cb097ea4 added .trigger("update") after paging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4574 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 22:05:50 +00:00
apfelmaennchen
82f17ccee2 just an example sidebar
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4573 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 21:44:51 +00:00
apfelmaennchen
3c710f22cd added server side driven pagination for search tabs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 21:44:20 +00:00
orbiter
7fd094fcbe small bug in ftpc: did cot compile in Java 1.5
Please set compiler to Java 1.5-compliance

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 13:41:49 +00:00
lulabad
0a593a4e4a small changes to Threaddump Servlet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4569 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 13:15:23 +00:00
lulabad
995b5a2b27 added Threaddump Servlet, see http://forum.yacy-websuche.de/viewtopic.php?f=9&t=862
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4568 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-16 09:03:52 +00:00
lulabad
d1b9f50b94 added "delete Release" to Update Page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4564 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-15 10:37:20 +00:00
lulabad
3d122c125c added "current installed release" to Update Page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4563 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-15 09:22:07 +00:00
apfelmaennchen
368b8735b5 added 'close tab' function
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4560 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-14 15:59:56 +00:00
orbiter
fa1090113d - next try to fix the networking problem:
set the maximum transfer size to less than MTU=1500-52: buffer size <= 1448
- some refactoring of transfer methods (naming)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4558 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-14 00:16:04 +00:00
apfelmaennchen
f63bd26268 fixed search performance / dynamic display of results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4557 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-12 21:24:53 +00:00
orbiter
5530b8e1ca reverted changes to yacy protocol classes: they caused the sciencenet to loose connections
a comparisment with the main release 0.57 had been made: this showed a stable network
This is an emergency operation to ensure availability of the sciencenet network.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4553 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-12 00:05:18 +00:00
orbiter
b3d4571a5d fix for bad opensearch file format
did you know that yacy has a built-in rss feed reader? try:
http://localhost:8080/FeedReader_p.html?url=http://www.tagesthemen.de/xml/rss2
it can be used to fetch search results from other yacy peers:
http://localhost:8080/FeedReader_p.html?url=http://sciencenet.fzk.de:8080/yacysearch.rss?search=yacy

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4551 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-11 12:45:27 +00:00
orbiter
b4ed937f1e - modified zone navigation (does still not work correctly)
- added dht switch in network definition
- 0.574

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4550 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-11 11:09:38 +00:00
apfelmaennchen
c75fa90206 adjusted display of search results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4547 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-10 23:23:04 +00:00
apfelmaennchen
7a902424af adjusted display of search results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4546 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-10 23:22:34 +00:00
orbiter
e0c481decb not class file in SVN .. I guess it is a mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4545 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-10 19:05:06 +00:00
lulabad
8c15fbf7d1 simple and fast fix for LogStatistics_p
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4544 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-10 18:52:01 +00:00
orbiter
9eddc1506b - one try to fix the httpd problem
- fix for handling of collection index that appears when removing elements
- added another navigation method (stub, not working yet)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4543 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 23:58:22 +00:00
apfelmaennchen
b4b370586a fixed the box headings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4541 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 17:25:09 +00:00
apfelmaennchen
f7a0804e83 small optical change for the sidebar
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4540 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 13:04:34 +00:00
apfelmaennchen
c5f378c7a4 additional images
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4539 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:56:11 +00:00
apfelmaennchen
6ebc9b7325 additional images
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4538 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:55:26 +00:00
apfelmaennchen
3c686e4e0e for testing puposes - new user interface based on jQuery and Ajax
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4537 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:54:37 +00:00
apfelmaennchen
f238478cc3 for testing puposes - new user interface based on jQuery and Ajax
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4536 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:54:13 +00:00
apfelmaennchen
cb8625ca67 for testing puposes - new user interface based on jQuery and Ajax
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4535 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:53:50 +00:00
apfelmaennchen
2b43ea9f9d for testing puposes - new user interface based on jQuery and Ajax
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4534 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-09 12:53:00 +00:00