Commit Graph

3873 Commits

Author SHA1 Message Date
orbiter
a995b95367 tried a fix for the httpd access bug (too many unclosed sessions)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6362 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-30 13:18:02 +00:00
orbiter
e1fba41cad better logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6361 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 21:52:17 +00:00
orbiter
2275f885a8 possible fix for concurrency problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6360 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 21:40:50 +00:00
low012
a6a3090c3d *) blacklist cleaner supports usage of regular expressions now
*) refacored BlacklistCleaner_p.java for better readability
*) moved check of validity of patterns to the Balcklist implementation since patterns might be valid in one implementation, but not in another
*) added method to check validity to Blacklist interface
*) fixed some minor issues like typos or wrong whitespaces
*) set subversion properties for a whole bunch of files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6359 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 21:28:49 +00:00
orbiter
5a93807781 improved web cache speed:
- removed one computation out of a synchronization
- removed one not necessary has() call


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6358 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:41:05 +00:00
orbiter
2e8b2867ff double performance of store method because it avoids one 'has'
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6357 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:23:44 +00:00
orbiter
afda5b1adc new join method for indexes (not yet used)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6356 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:16:24 +00:00
orbiter
65b66c2c18 better handling of array files of length 0
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6355 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:13:44 +00:00
orbiter
1957b5797a fix for seed generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6354 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:05:36 +00:00
orbiter
432154f725 new strategy for concurrent database index key retrieval
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6353 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-29 08:04:00 +00:00
orbiter
a11cd9f80f - removed reverse name lookup for http access logging (grr..)
- removed a synchronization in seed info string generation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6351 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-28 15:23:15 +00:00
orbiter
2e6bdce086 - added more logging to balancer
- changed balancer logic slightly

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6350 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-27 22:35:22 +00:00
orbiter
1171a72006 fix for deadlock as seen in http://forum.yacy-websuche.de/viewtopic.php?p=17521#p17521
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6343 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-24 19:14:35 +00:00
orbiter
031e6eefbd some updates to dublin core, metadata browsing, file indexing and parser stability
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6342 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-24 12:54:45 +00:00
hermens
62a7341c4d Fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2204
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6341 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-24 11:38:15 +00:00
low012
f65bfaa9af *) Removed base tag from errror page. This has been added by myself a long time ago as a workaround for some weird behavior of my router, but as it turns out, it does more bad than good in general: If HTTPS is used for communication with YaCy, entering a wrong passwort led to an errror page with a form which would send username and password unencrypted with the user possibly being unaware of this.
*) changed some comments, added some annotations, added SVN properties here and there

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6340 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-23 21:26:14 +00:00
orbiter
e4797ebcde fix for http://forum.yacy-websuche.de/viewtopic.php?p=17509#p17509
corrupted files are ignored

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6339 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-22 22:00:45 +00:00
orbiter
efa7fb34f0 better oom-awareness of miss-cache in cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6338 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-22 15:01:29 +00:00
orbiter
3e9dcfc204 fix for http://forum.yacy-websuche.de/viewtopic.php?p=17504#p17504
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6337 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-22 14:39:06 +00:00
orbiter
c3a4aee255 some redesign with a possible fix for the ReferenceContainerCache.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6336 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-22 14:33:57 +00:00
orbiter
aca8a78eb8 fix for shutdown of DocumentIndex objects
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6333 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-21 15:24:47 +00:00
orbiter
23ab6fbca4 - navigation appear at correct position when opengeodb-results are also presented after a search
- show an about box if about.headline and about.body is set

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6332 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-20 22:10:45 +00:00
orbiter
4db34eea73 fix for OOM problem in kelondro Cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6331 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-19 07:50:46 +00:00
orbiter
8ea1d7ab59 fix for wrong assert condition in search abstract generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6330 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-18 21:49:38 +00:00
orbiter
fbd77bd77c git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6328 6c8d7289-2bf4-0310-a012-ef5d649a1542 2009-09-18 12:33:35 +00:00
orbiter
54c7cbf1d9 - fast result for local search in case that less than 10 hits exists
- small change in display of RAM in profiling

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6326 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-18 09:19:52 +00:00
orbiter
28d4b921b6 different approach for file search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6325 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-18 07:54:16 +00:00
orbiter
f99f86c5c5 added concurrency to file indexing class
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6324 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-17 22:37:21 +00:00
orbiter
902d16cf6c fixes to parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6323 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-17 22:00:19 +00:00
orbiter
4a1c852435 fix in usage of RAM copy for Table objects and some cosmetics in asserts.
This bug affected Tables in case that a removeOne() was called and a RAM copy of the table was active. It may happen for peer owners with a lot of RAM assigned to YaCy. The bug appeared especially during crawling when the balancer tried to get new entries from the crawl queue.
This bug may help to solve report at
http://forum.yacy-websuche.de/viewtopic.php?p=17417#p17417
and will be tracked there

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6322 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-17 20:41:50 +00:00
lotus
dce450e2e0 possible fix for "hung" doc-documents
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6320 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-17 17:08:32 +00:00
orbiter
e627f75415 one more fix to badwords and stopwords
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6316 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-15 11:47:50 +00:00
orbiter
721b88efbd - fixed a problem loading blacklists with new yacycore.jar
- fixed badwords and stopwords initialization

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6315 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-15 11:46:02 +00:00
orbiter
80d5005044 fixed seed upload methods - replaced reflection with direct instantiation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6314 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-15 09:47:02 +00:00
orbiter
68465c37af added a convenience class to add files into a YaCy index
to make this possible, the yacyURL must be able to process file:// urls, which has also been implemented
testing of the new class resulted in some bugfixes in other classes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6313 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-14 21:17:42 +00:00
orbiter
2e41e10ffd - updates to yacyVersion parser (remove old targets)
- added javadoc target to built script (does not work yet without errors)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6312 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-14 18:08:40 +00:00
orbiter
27d00285aa - added a new file reader cache that may serve as full-file-copy of blob database files. This is not yet used
- removed class FileWriter and replaced all usage of that class with CachedFileWriter

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6309 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-09 21:28:23 +00:00
orbiter
fd6b9cb7dc refactoring of IO access classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6308 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-09 20:49:16 +00:00
orbiter
d64569aa39 reuturn only recommendations of words that have a greater count than the original word
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6307 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-08 13:57:52 +00:00
orbiter
604c37927f used comparator for did-you-mean that uses index sizes for comparisment, but:
- limit comparisment to only the first 10 elements that had been sorted before without IO
- added a size cache to index computation because the size is computed at least twice in set comparator


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6306 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-08 13:48:17 +00:00
orbiter
a58d9cae7d - show location name in geolocalization search result
- added link from location icon to openstreetmap browser with coordinates

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6305 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-08 10:18:03 +00:00
orbiter
573d03c7d7 added configuration to enable ram table copy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6304 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-07 20:30:57 +00:00
orbiter
3be54e1891 fix to rule when to use a ram table copy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6302 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-07 19:22:12 +00:00
orbiter
700218846c disabled or removed sleep calls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6301 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-07 18:50:44 +00:00
orbiter
342c5d0fd4 fixed city name detection: finds now also substrings of city names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6300 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-06 20:44:02 +00:00
orbiter
18aa0609ca fix for caching of word hash computation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6299 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-05 22:05:18 +00:00
orbiter
a10a6cce45 patch for http://forum.yacy-websuche.de/viewtopic.php?p=17289#p17289
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6298 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-05 21:52:50 +00:00
low012
53bbdfd19a *) setting SVN keywords
*) minor changes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6297 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-05 20:41:21 +00:00
low012
25f6145934 *) preventing null pointer exception in case empty search word or only one character is enterd or all search words are removed by filters
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6296 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-05 20:31:39 +00:00
low012
248f3fd9b5 *) cleaned up code for better readability
*) added a few copyright notices
*) removed redundancy in constructors of ListToken

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6295 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-09-05 20:04:23 +00:00