Commit Graph

2743 Commits

Author SHA1 Message Date
orbiter
c6c97f23ad - added cache usage properties to crawl start
- added special rule to balancer to omit forced delays if cache is used exclusively
- extended the htCache size by default to 32GB

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6241 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-24 11:54:04 +00:00
orbiter
161d2fd2ef redesign of access to the HTCache (now http.client.Cache):
- better control to the cache by using combined request-header and content access methods
- refactoring of many classes to comply to this new access method
- make shure that the cache is always written if something was loaded
- some redesign of the process how http response results are feeded into the new indexing queue
- introduction of a cache read policy:
 * never use the cache
 * use the cache if entry exist
 * use the cache if the proxy freshness rule confirmes
 * use only the cache and go never online
- added configuration options for the crawl profiles to use the new cache policies. There is not yet a input during crawl start to set the policy but this will be added in another step.
- set the default policies for the existing crawl profiles. If you want them to appear in your default profiles you must delete the crawl profiles database; othervise the policy is 'proxy freshness rule'
- enhanced some cache access methods in such a way that unnecessary retrievals are omitted (i.e. for size computation). That should reduce some IO but also a lot of CPU computation because sizes were computed after decompression of content after retrieval of the content from the disc.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6239 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-23 21:31:51 +00:00
lulabad
da43164dd6 fix for UNRESOLVED_PATTERN see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2300
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6238 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-23 06:02:36 +00:00
orbiter
1d8d51075c refactoring:
- removed the plasma package. The name of that package came from a very early pre-version of YaCy, even before YaCy was named AnomicHTTPProxy. The Proxy project introduced search for cache contents using class files that had been developed during the plasma project. Information from 2002 about plasma can be found here:
http://web.archive.org/web/20020802110827/http://anomic.de/AnomicPlasma/index.html
We stil have one class that comes mostly unchanged from the plasma project, the Condenser class. But this is now part of the document package and all other classes in the plasma package can be assigned to other packages.
- cleaned up the http package: better structure of that class and clean isolation of server and client classes. The old HTCache becomes part of the client sub-package of http.
- because the plasmaSwitchboard is now part of the search package all servlets had to be touched to declare a different package source.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6232 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-19 20:37:44 +00:00
orbiter
5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency.
- The indexing queue was a historic data structure that was introduced at the very beginning at the project as a part of the switchboard organisation object structure. Without the indexing queue the switchboard queue becomes also superfluous. It has been removed as well.
- Removing the switchboard queue requires that all servlets are called without a opaque generic ('<?>'). That caused that all serlets had to be modified.
- Many servlets displayed the indexing queue or the size of that queue. In the past months the indexer was so fast that mostly the indexing queue appeared empty, so there was no use of it any more. Because the queue has been removed, the display in the servlets had also to be removed.
- The surrogate work task had been a part of the indexing queue control structure. Without the indexing queue the surrogates needed its own task management. That has been integrated here.
- Because the indexing queue had a special queue entry object and properties attached to this object, the propterties had to be moved to the queue entry object which is part of the new indexing queue withing the blocking queue, the Response Object. That object has now also the new properties of the removed indexing queue entry object.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6225 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-17 13:59:21 +00:00
orbiter
b332dfad67 - inserted request object into response object which carries this now instead generating new objects
- fixed a problem with the crawler introduced in SVN 6216

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6222 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-15 23:08:35 +00:00
orbiter
ca72ed7526 -removed superfluous crawl cache
-refactoring of crawler classes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6221 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-15 21:07:46 +00:00
orbiter
13c63f4082 a set of small fixes to crawling behaviour
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6216 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-15 14:15:51 +00:00
orbiter
b2263bc720 enhanced document type recognition
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6209 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-14 11:01:05 +00:00
orbiter
7d493cf8cc moved parser configuration in separate servelet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6207 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-14 06:57:13 +00:00
orbiter
3f113f38a8 removed unused imports
removed unused libs from eclipse class path

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6201 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-13 10:19:10 +00:00
lotus
9f083bb6b2 check filetype before loading (no more mp4 loading)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6200 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-12 16:50:11 +00:00
f1ori
f814e0fa81 enable warnings and fix most of it
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6196 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-11 21:01:27 +00:00
f1ori
8931c8d6b4 improvments to debianpackage:
* autoupdate completely disabled, display hint
* restart-button in interface works!

* moved all build-Variables to yacyBuildProperties
* fixed some warnings


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6195 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-11 17:03:22 +00:00
orbiter
4b74ad0a46 fixed setting of parser configuration servlets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6191 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-10 15:02:34 +00:00
orbiter
57a88d435b redesign of parser mime type detection and parser steering
There is now a mime-blacklist instead of a mime-whitelist

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6190 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-10 14:22:17 +00:00
orbiter
21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6188 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-09 22:25:31 +00:00
orbiter
8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished)
- added migration of code of settings pages (hmm.. does not work correctly yet, sorry)
- more refactoring
- removed more unused code

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6187 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-09 20:56:30 +00:00
orbiter
499723891d removed all non-http daemons; they had not been used and may be a potential security risk.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6185 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-08 22:24:34 +00:00
orbiter
0e8647d62f refactoring of search classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6184 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-08 22:14:57 +00:00
orbiter
dafffd0153 refactoring of parsers and document processing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6182 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-08 21:48:08 +00:00
apfelmaennchen
ba7905aaa8 PortalSearch:
- removed buttons
- auto load on scroll
- some small fixes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6178 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-08 19:01:06 +00:00
lotus
ec2970cc76 higher dht distribution speed by default
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6168 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-03 13:28:19 +00:00
orbiter
995da28c73 all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
There is no migration. All crawls must be restarted.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6167 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-02 17:01:23 +00:00
orbiter
409538e17a code cleanup and code simplifcation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6161 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 22:20:55 +00:00
orbiter
154bbc3364 code cleanup: call of static methods directly to the class
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6155 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 13:01:35 +00:00
orbiter
222850414e simplification of the code: removed unused classes, methods and variables
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6154 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-30 09:27:46 +00:00
orbiter
9a674d8047 - After the removal of the Tree class some code simplifications are possible. This affects mostly the Records class, which can be refactored and the result of the refactoring results in a reduced number of classes.
- The EcoTable was renamed to Table.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6151 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-28 21:37:37 +00:00
f1ori
7eb3bff5b3 * workaround for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2220&hilit=#p16128
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6143 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-26 14:05:39 +00:00
apfelmaennchen
d69d89caed portalsearch: better navigator call href='javascript:...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6142 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-26 10:26:45 +00:00
apfelmaennchen
477a4e14f9 updated styles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6141 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-26 09:11:02 +00:00
apfelmaennchen
0a38dd41b3 replacing styles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6140 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-26 09:03:25 +00:00
apfelmaennchen
565230d2a0 another atempt to fix the widget on linuxtag.de
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6139 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 15:12:55 +00:00
apfelmaennchen
f92daf9cc5 fixed the "1 to 10 of 10" bug in portalsearch for less than 10 results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6138 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 15:03:16 +00:00
apfelmaennchen
e51ba46b8a css fix for forum search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6137 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 09:31:25 +00:00
apfelmaennchen
0923f20993 font-size fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6136 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 08:38:48 +00:00
apfelmaennchen
edef81b41d again a css fix for portalsearch :-(
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6135 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 08:21:55 +00:00
apfelmaennchen
32d0ee964f more css fixes ;-)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6134 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 08:17:40 +00:00
apfelmaennchen
cfd3d672a7 portalsearch css fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6133 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 08:10:37 +00:00
apfelmaennchen
a71bb7178d small fix for portalsearch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6132 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 08:07:17 +00:00
orbiter
69b573a8ac - next release cycle
- doku update for phpBB3
- added more files for search test scripts

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-25 05:53:57 +00:00
apfelmaennchen
121728643b - css fix for navigators
- maybe this fixes the display problems on linuxtag.de

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6129 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-24 22:41:35 +00:00
orbiter
93c69fa1cb - added hints to integrate a yacy search in phpBB3
- added also a phpBB3 crawl start with optimized crawl attributes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 23:38:15 +00:00
orbiter
99fa265e1d fix for search bug caused by tenant patch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6125 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 22:31:29 +00:00
apfelmaennchen
3164d9a205 small fix for favicons
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6124 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 21:20:32 +00:00
apfelmaennchen
edcb78dcb6 updated documentation / removed faviconize
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6123 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 18:38:42 +00:00
apfelmaennchen
b6058a7db1 yacyui-portalsearch:
- more bug fixes
- moved from faviconize to YaCy's favicons

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6122 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 18:20:12 +00:00
orbiter
96d4e9383a added new jquer version in wiki search widget snippet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 13:19:09 +00:00
orbiter
57af311627 fix for wrong urls in navigator when a tenant is used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 12:25:18 +00:00
lotus
01853ce849 removed "illegal character"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6117 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:40:55 +00:00
lotus
187ee4d06e another IE fix (also same names in html and js)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6116 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:24:01 +00:00
lotus
48051fef4b another fix for IE
http://forum.yacy-websuche.de/viewtopic.php?p=16030#p16030

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6115 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:15:00 +00:00
lotus
37c162053d fix for IE & WatchCrawler (were same names in js and html)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6114 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:03:44 +00:00
orbiter
f2db0d248b fix for multi-word search (makes a replace-all)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6113 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 06:37:11 +00:00
apfelmaennchen
7fcffdcb3c - yacyui-portalsearch: fix for multiple search terms
- http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2209#p16015

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6112 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 05:43:54 +00:00
orbiter
b07e518bcc added faviconcode to json search output
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6111 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-21 20:57:18 +00:00
apfelmaennchen
f091ad3e23 - hopefully a fix for the 'koblenz'-bug
- http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2209#p16005

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-21 20:24:39 +00:00
apfelmaennchen
d268bb5108 yacyui-portalsearch: auto open navigator sidebar
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-21 12:53:14 +00:00
apfelmaennchen
a22103560c yacyui-portalsearch waits with execution until all svripts have been loaded. It checks the load status every 200ms!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6106 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-21 07:54:42 +00:00
apfelmaennchen
49955cc7b9 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2209&p=15984#p15984
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6104 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-20 22:40:27 +00:00
apfelmaennchen
2bb020a7a5 yacyui-portalsearch: small improvement to navigators
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6101 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-20 12:08:35 +00:00
apfelmaennchen
fee88d5fa3 maybe this fixes: http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2209#p15967
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6100 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-20 11:41:44 +00:00
orbiter
a119860b82 moved IndexImportWikimedia into different menu position
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6094 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-19 14:03:28 +00:00
orbiter
fc69a76197 update to web structure picture:
- allow bigger size
- better instructions for api usage

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6089 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-17 12:37:31 +00:00
orbiter
ae015e8e98 refactoring of blob package classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6088 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-17 09:58:15 +00:00
orbiter
be1c7ddc64 refactoring of search classes -- moved Ranking Profile to search package
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6086 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-16 21:45:40 +00:00
orbiter
b5bc399cea added necessary synchronization for logging statistics (causes deadlock)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6083 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-16 10:37:13 +00:00
lulabad
e377a1e9a1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1969
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6082 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-16 08:16:59 +00:00
orbiter
ce1adf9955 serialized all logging using concurrency:
high-performance search query situations as seen in yacy-metager integration showed deadlock situation caused by synchronization effects inside of sun.java code. It appears that the logger is not completely safe against deadlock situations in concurrent calls of the logger. One possible solution would be a outside-synchronization with 'synchronized' statements, but that would further apply blocking on all high-efficient methods that call the logger. It is much better to do a non-blocking hand-over of logging lines and work off log entries with a concurrent log writer. This also disconnects IO operations from logging, which can also cause IO operation when a log is written to a file. This commit not only moves the logger from kelondro to yacy.logging, it also inserts the concurrency methods to realize non-blocking logging.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6078 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-15 21:19:54 +00:00
lotus
aec3e7995a autoconfig.pac can be used to browse .yacy-domains only
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6077 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-15 19:48:11 +00:00
orbiter
4e825852d2 added stub for phpBB3 search integration guide
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6076 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-15 11:52:57 +00:00
orbiter
bc6dd8194b refactoring: moved search query class to new search package
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6075 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-15 11:49:00 +00:00
orbiter
f348190566 tried to insert a database dump import method to the phpBB3 import function. Reason: imports or large database dumps are cannot be handled with phpMyAdmin and this should be an easy way to the database dumps into a mySQL database where it can be exported again with the phpBB3 content integration adapter. Completion or removal of this function stub will follow before next main release.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6065 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 23:03:40 +00:00
orbiter
945777aa80 replaced rwi term counting method by one that computes the maximum of the blobs that contibute to the RWI. An addition of the blob sizes is wrong/incorrect and does not reflect the real size. Truncation the size operation to the maximum of all blobs is also incorrect, but not as wrong as the sum of all blob sizes wich double-counts many rwi entries.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6064 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 22:59:54 +00:00
apfelmaennchen
303ccda69f small fix for "did you mean"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6063 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 11:11:30 +00:00
orbiter
7c4d1d471c hand-over of more specific object
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6062 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 10:22:25 +00:00
apfelmaennchen
9150bc0f7d - don't show empty "did you mean"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6061 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 07:02:50 +00:00
apfelmaennchen
6c116be536 - set default &meanCount=5
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6060 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 06:49:17 +00:00
apfelmaennchen
09acfa66d1 - improved "did you mean"
- added &meanCount= to query string
- &meanCount=0 ==> no suggestion, no performance loss
- sorting suggestions by sb.indexSegment.termIndex().count()

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6059 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-13 06:20:05 +00:00
apfelmaennchen
54a48b4184 - added "did you mean" to search page
- currently works for single word queries only!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6057 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-12 20:36:03 +00:00
apfelmaennchen
31360ba40c - Updated ConfigLiveSearch.html
- added documentation for load_js and load_css

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6056 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-12 05:57:08 +00:00
apfelmaennchen
ab09d8ebb3 - small noscript fix
- noscript is now functionall but ugly

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6055 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-11 22:10:02 +00:00
apfelmaennchen
55ef9ae12a small fix for last post
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6054 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-11 21:42:34 +00:00
apfelmaennchen
36dc9b09ac - partial update to jquery-1.3.2
- partial update to jquery-ui-1.7.2
- yacyportalsearch fixed sidebar for navigators


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6053 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-11 21:34:39 +00:00
orbiter
cffef67dc5 added a short info line about the latency monitor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6048 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 23:03:29 +00:00
orbiter
733385cdd7 enahnced database access times by removal of unnecessary synchronization.
added also more hacks that resulted from high-volum query testing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6047 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 23:02:42 +00:00
apfelmaennchen
5a7dec880e - some improvements for: http://forum.yacy-websuche.de/viewtopic.php?f=9&t=1904#p15668
- portalsearch: introduced yconf.load_js and yconf.load_css
- yacysearch.html still having problems with focus after sidebar is loaded
- yacysearchtrailer.json seems not to be valid json for ?nav=all

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6046 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 22:11:31 +00:00
orbiter
b21e9149f5 another fix for navigation results, the json result format and searches with yacyinteractive
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6041 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 12:41:15 +00:00
orbiter
15c5406b9c fixed yacyinteractive
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6040 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-10 07:24:45 +00:00
orbiter
e0b3984805 added navigation keys for site and author facets to remote search interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6038 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-09 09:07:52 +00:00
orbiter
27fa6a66ad - completed the author navigation
- removed some unused variables

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6037 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-08 23:30:12 +00:00
orbiter
a9a8b8d161 - added display of author navigation (usage of that navigator not yet implemented
- added a synchronization in pdf parser which should help to avoid deadlocks that occur when displaying several search results pointing to pdf sources
- fixed smaller bugs in navigation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6036 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-08 22:01:26 +00:00
orbiter
c879783008 added steering of navigator computation:
- by default the navigator computation if off for servlet yacysearch.html, but:
- the servlet is called by default with a option to switch navigator results on
this will prevent that metasearch users will get slow results that are caused by unnecessary computations

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6035 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-07 22:51:15 +00:00
orbiter
c079b18ee7 - refactoring of IntegerHandleIndex and LongHandleIndex: both classes had been merged into the new HandleMap class, which handles (key<byte[]>,n-byte-long) pairs with arbitraty key and value length. This will be useful to get a memory-enhanced/minimized database table indexing.
- added a analysis method that counts bytes that could be saved in case the new HandleMap can be applied in the most efficient way. Look for the log messages beginning with "HeapReader saturation": in most cases we could save about 30% RAM!
- removed the old FlexTable database structure. It was not used any more.
- removed memory statistics in PerformanceMemory about flex tables and node caches (node caches were used by Tree Tables, which are also not used any more)
- add a stub for a steering of navigation functions. That should help to switch off naviagtion computation in cases where it is not demanded by a client

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6034 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-07 21:48:01 +00:00
orbiter
a0c53abbe1 - wait until local results are computed during search, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2167&hilit=&p=15521#p15521
- show only x+1 pages in page navigator

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6022 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 20:58:47 +00:00
orbiter
94f3d90af2 added a hint about regular expressions in crawl start
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6021 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 20:03:26 +00:00
orbiter
1c77db670f re-designed response format for navigation:
- changed json and rss response templates


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6019 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-04 10:54:49 +00:00
orbiter
15fad767c0 some refactoring of topic generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6018 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 23:49:06 +00:00
orbiter
f28f62fb21 added servlet for easy wiki content and search window integration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6017 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 22:22:20 +00:00
lotus
efe97f446a better proxy configuration in case of remote proxy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6016 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-03 19:03:03 +00:00