Commit Graph

5722 Commits

Author SHA1 Message Date
orbiter
cffef67dc5 added a short info line about the latency monitor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6048 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 23:03:29 +00:00
orbiter
733385cdd7 enahnced database access times by removal of unnecessary synchronization.
added also more hacks that resulted from high-volum query testing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6047 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 23:02:42 +00:00
apfelmaennchen
5a7dec880e - some improvements for: http://forum.yacy-websuche.de/viewtopic.php?f=9&t=1904#p15668
- portalsearch: introduced yconf.load_js and yconf.load_css
- yacysearch.html still having problems with focus after sidebar is loaded
- yacysearchtrailer.json seems not to be valid json for ?nav=all

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6046 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 22:11:31 +00:00
orbiter
5d7045387b added more word lists and a multi-access search test tool for high-performance query testing:
run searchtestmulti.sh; then 10 concurrent processes fire 1000 requests each to  the local peer.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6045 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 22:01:48 +00:00
orbiter
398e210fef removed synchronization in logging that causes deadlocks in high-performance environments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6044 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 19:17:30 +00:00
orbiter
db3a06dd81 removed cookie handling in httpc:
- no need to do cookie handling in proxy, this was switched off so far
- no need for cookies in crawler, this was switched on (by mistake)
This fix was needed for a case where a web server flooded the crawler with cookies and caused a complete blocking of the httpc.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6043 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 16:11:09 +00:00
orbiter
1c54ae4a63 some small changes in HandleMap Testing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6042 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 15:02:52 +00:00
orbiter
b21e9149f5 another fix for navigation results, the json result format and searches with yacyinteractive
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6041 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 12:41:15 +00:00
orbiter
15c5406b9c fixed yacyinteractive
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6040 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 07:24:45 +00:00
orbiter
2c5554c912 small enhancements in search result computation speed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6039 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-09 15:22:23 +00:00
orbiter
e0b3984805 added navigation keys for site and author facets to remote search interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6038 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-09 09:07:52 +00:00
orbiter
27fa6a66ad - completed the author navigation
- removed some unused variables

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6037 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-08 23:30:12 +00:00
orbiter
a9a8b8d161 - added display of author navigation (usage of that navigator not yet implemented
- added a synchronization in pdf parser which should help to avoid deadlocks that occur when displaying several search results pointing to pdf sources
- fixed smaller bugs in navigation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6036 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-08 22:01:26 +00:00
orbiter
c879783008 added steering of navigator computation:
- by default the navigator computation if off for servlet yacysearch.html, but:
- the servlet is called by default with a option to switch navigator results on
this will prevent that metasearch users will get slow results that are caused by unnecessary computations

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6035 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-07 22:51:15 +00:00
orbiter
c079b18ee7 - refactoring of IntegerHandleIndex and LongHandleIndex: both classes had been merged into the new HandleMap class, which handles (key<byte[]>,n-byte-long) pairs with arbitraty key and value length. This will be useful to get a memory-enhanced/minimized database table indexing.
- added a analysis method that counts bytes that could be saved in case the new HandleMap can be applied in the most efficient way. Look for the log messages beginning with "HeapReader saturation": in most cases we could save about 30% RAM!
- removed the old FlexTable database structure. It was not used any more.
- removed memory statistics in PerformanceMemory about flex tables and node caches (node caches were used by Tree Tables, which are also not used any more)
- add a stub for a steering of navigation functions. That should help to switch off naviagtion computation in cases where it is not demanded by a client

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6034 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-07 21:48:01 +00:00
orbiter
bead0006da replaced tmp file extensions by prt
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6033 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 18:09:58 +00:00
orbiter
3189f9cd39 fixed problem with DCEntry initialization
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6032 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 18:00:50 +00:00
orbiter
a704d82280 patch for problem with digest
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6031 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 16:53:16 +00:00
orbiter
3029ef6eb3 fixed a bug that was recently inserted which caused that no idx and gap files were written.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6030 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 16:43:58 +00:00
orbiter
b6e274f211 omit most of forced crawl delays by using a separat delay table which flushes delayed URLs at the correct time
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6029 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 16:20:27 +00:00
orbiter
d50be59088 - added a automatic re-construction of the domain stack after 10 minutes. this includes then urls to the domain stack that were left over in case of stack size limitations when the domain stack was created the last time
- changed the busy sleep time for the crawl thread to 30 millisecons. This is sufficient to crawl with 2000 PPM.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6028 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 09:34:44 +00:00
orbiter
5fdba0fa51 - fixed a not working selection rule in balancer
- more security about crawl-delay, be more fail-save
- better logging in case of long forced crawl-delays

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6027 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 08:46:59 +00:00
orbiter
f5602404d5 another speed boost for the balancer
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6026 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 02:37:04 +00:00
orbiter
95e8cbd1c3 new fully redesigned balancer and bugfixes regarding lost profile handles and killed crawls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6025 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-06 01:56:31 +00:00
orbiter
c062385552 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15555#p15555
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6024 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-05 18:18:16 +00:00
orbiter
42ae40b9f6 some bugfixes to database close() methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6023 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 22:43:46 +00:00
orbiter
a0c53abbe1 - wait until local results are computed during search, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2167&hilit=&p=15521#p15521
- show only x+1 pages in page navigator

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6022 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 20:58:47 +00:00
orbiter
94f3d90af2 added a hint about regular expressions in crawl start
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6021 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 20:03:26 +00:00
orbiter
9bfd22f65d fix for http://forum.yacy-websuche.de/viewtopic.php?p=15523#p15523
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6020 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 19:57:25 +00:00
orbiter
1c77db670f re-designed response format for navigation:
- changed json and rss response templates


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6019 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 10:54:49 +00:00
orbiter
15fad767c0 some refactoring of topic generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6018 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 23:49:06 +00:00
orbiter
f28f62fb21 added servlet for easy wiki content and search window integration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6017 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 22:22:20 +00:00
lotus
efe97f446a better proxy configuration in case of remote proxy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6016 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 19:03:03 +00:00
orbiter
cc49aedf12 - fixed problem with remote search NPE
- more abstraction for search requests

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6015 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 08:49:54 +00:00
f1ori
9e18abc2ac * fix charset detection, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2137
* why has this been uncommented???


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6014 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 20:54:13 +00:00
orbiter
c38c852090 modified access method to get index entries out of a array of BLOBs:
iterate them, then merge; not collect them and merge then.
This should use less memory and may behave better in an environment with many queries.
To ensure that too many queries will not cause total blocking,
a time-out of one second was also added. After the time-out
the index data that was collected so far is returned.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6013 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 16:53:45 +00:00
apfelmaennchen
55ff919b5d - yacysearchtrailer.html ... just an idea for a timeline
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6012 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 16:47:39 +00:00
orbiter
ab06a6edd2 renamed topwords to topics and enhanced computation methods of topics
topics will now only be computed using the document title, not the document url,
because the host navigator is now responsible for statistical effects of urls.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6011 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 15:20:10 +00:00
lotus
61d9e131b4 better/new proxy auto config
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6010 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 12:18:29 +00:00
apfelmaennchen
0d44a6d503 - yacy portalsearch experiments with navigation in sidebar (topwords & domains)
- not yet functional ...

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6009 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 11:02:36 +00:00
apfelmaennchen
9f9a1b4ad8 - yacysearchtrailer.html small temporary work around for jquery-css display bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6008 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 09:08:34 +00:00
apfelmaennchen
b0e2d854e0 - fixed sidebar for yacysearch.html & yacysearchtrailer.html
- @orbiter: please do not use <h2> or <h1> tags in the context of the sidebar!!!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6007 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-02 07:32:45 +00:00
orbiter
a5d481eab1 enhanced navigation
- fixed too early computation of navigation
- moved navigation rendering to yacysearchtrailer
- added more asserts

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6006 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 22:45:28 +00:00
orbiter
3ca1f109c4 added more jquery themes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6005 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 21:49:18 +00:00
orbiter
3ea399ec91 fix for absolute paths for repository path
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6004 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 10:54:41 +00:00
lotus
6b92155eb6 corrected spelling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6003 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 09:48:08 +00:00
orbiter
5eac607166 fixed configuration of repository path
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6002 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-01 00:13:23 +00:00
apfelmaennchen
daee735ad7 - fix for yacysearch.html
- navigation/sidebar JavaScript is now also triggerd by #(navigation)# instead of display=3

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6001 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-31 07:29:24 +00:00
apfelmaennchen
8fe69da2bb - some fixes for prev. post
- better resizing and dragging

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6000 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-30 17:06:47 +00:00
apfelmaennchen
0eb3bffe97 - added 'drawer' (sidebar) for future navigational items to yacyui-portalsearch.js
- http://forum.yacy-websuche.de/viewtopic.php?f=9&t=1904#p15311

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5999 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-05-30 14:32:29 +00:00