Commit Graph

1304 Commits

Author SHA1 Message Date
theli
56e4dbeb71 *) displaying current active + current idle threads in PerformanceQueues_p.html now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1470 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 15:17:04 +00:00
theli
859c6a88f5 *) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1467 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 16:51:29 +00:00
allo
7197f171d3 better ppm calculation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 11:51:27 +00:00
orbiter
f2b18cede9 AND-bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1461 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 03:58:38 +00:00
orbiter
b946e28e61 some ranking enhancements
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 02:48:27 +00:00
rramthun
6c02f889f7 Cosmetic changes.
Corrected version numbering as described in http://www.yacy-websuche.de/wiki/index.php/De:Versionsnummern

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1453 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 15:12:44 +00:00
theli
b191f06d16 *) Adding additional logging message to locate problems with stackcrawl threads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1452 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 14:24:29 +00:00
theli
5c56b9ed59 *) catch exceptions that could occur during url decoding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1451 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 13:57:49 +00:00
theli
d9bcd73d93 *) Bugfix for exception
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1448 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 12:15:59 +00:00
theli
f5abfe8d57 *) more failsafe threadpools
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1446 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 09:37:43 +00:00
orbiter
47344e8df0 removed referrer fake (too many complaints, too less use)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1444 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 18:54:46 +00:00
hermens
ad0de69607 Yet another bug fix for svn 1441. It should work now.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1443 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:32:04 +00:00
hermens
58fd40e1c1 Aaargh
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1442 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:28:03 +00:00
hermens
b08af0c2cb *) Force download of seed file when checking upload success
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1441 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:25:48 +00:00
hermens
66c889138e *) Bugfix: Principals are reported back as 'principal', so IWasAccessed should also be true
*) make it easier to include legacy peers switching between timezones +0100 and +0200



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1438 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 01:50:24 +00:00
orbiter
a56fefe0d3 added missing forced-flush for index cache
see http://www.yacy-forum.de/viewtopic.php?p=15732#15732

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1434 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 16:24:15 +00:00
hermens
78bcb8014a *) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
(For wider ranges enough suitable targets are not probable)
*) Migrate Indexes from ClassicDB back to AssortmentCluster if transfer fails
*) Remove class iterateFiles from plasmaWordIndex
   (The class iterateFiles from plasmaWordIndexClassicDB is used instead)



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1430 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 14:58:15 +00:00
hermens
861aae678d *) cleanup cacheAge database when cleaning up the HTCache
*) Log directory deletes with level Fine



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1427 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 14:21:01 +00:00
orbiter
b9d73f63e7 replaced String object in loop detection by byte[] to omit String-generation
which could cause locks.
See http://www.yacy-forum.de/viewtopic.php?p=15738#15738

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1425 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 13:12:38 +00:00
low012
927c2c3709 *) Fixed a minor bug in code for tables. {|border"1" did not work, {| border"1" did. Now the space is not needed anymore.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1423 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 11:15:20 +00:00
theli
75aad0fe66 *) Bugfix for URLs containing spaces
See: http://www.yacy-forum.de/viewtopic.php?t=1640

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1422 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 09:30:58 +00:00
theli
754a35877f *) Changing robots parser cxclusion policy
- crawling is now allowed if server returned a 403 statuscode 
     when trying to download the robots.txt
   See: http://www.yacy-forum.de/viewtopic.php?t=1612

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1421 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 08:53:30 +00:00
hermens
a2e2d583f9 *) small bugfix regarding peerPingMaxRunning
*) beautify log



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1419 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 01:16:22 +00:00
theli
b4e2efef10 *) first test of new iteration function
ATTENTION: please don't use it at the moment

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1418 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 17:20:30 +00:00
rramthun
a4e90c4b11 Fixed spelling bug.
I think this is important for other programmers, who don't make the same mistake as the original author.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1417 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 15:08:39 +00:00
low012
c45517db46 *) replaced code for table with better version (by kane)
*) split replaceHTML into replaceCharacters and replaceHTMLonly, replaceHTML can still be used to ensure compatibility



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1416 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 13:59:40 +00:00
orbiter
eabf4a0386 fix for null pointer exception during shut-down
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1415 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 13:45:14 +00:00
orbiter
47843e69e2 auto-reset for switchboard queue stack
bugfix for http://www.yacy-forum.de/viewtopic.php?p=15684#15684

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1414 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 12:41:08 +00:00
orbiter
a70970f993 fixed increment in content iterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1413 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 12:11:32 +00:00
hermens
62ab8d18c1 *) Bugfix for peer sorting method. This seems to cause funny side effects in the SeedDB
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1412 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 01:42:26 +00:00
hermens
75b268f16d *) use majority voting for peer type decision
*) reduce the number of peer pings sent out
see: http://www.yacy-forum.de/viewtopic.php?t=1748



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1411 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 23:14:37 +00:00
orbiter
d6581c445b added content iterator for corrupted database files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1406 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 17:12:16 +00:00
theli
ecdc1f7547 *) Bugfix for crawling URLs with query parameters
See: http://www.yacy-forum.de/viewtopic.php?p=14065
*) Preparation for http://www.yacy-forum.de/viewtopic.php?t=1719

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1405 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 16:39:10 +00:00
low012
eb80156233 *) added Kane's code for tables
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1403 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 15:41:49 +00:00
low012
ef22fa8bf2 *) beautifying code and a little comment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1401 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 14:20:38 +00:00
orbiter
fc4ae899f7 added word-position to ranking (this is only a first step)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1395 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 02:16:09 +00:00
orbiter
bb2095fe39 assortment files are now not deleted, but shifted to a backup directory.
See also: http://www.yacy-forum.de/viewtopic.php?p=15458#15458

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1394 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 01:03:01 +00:00
orbiter
7366e39dd3 tried to fix 100% CPU bug.
See http://www.yacy-forum.de/viewtopic.php?p=15569#15569

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1393 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 00:50:05 +00:00
orbiter
f14d49fae9 enhancements, bugfixes and additions to word index attribute storage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1392 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 00:07:00 +00:00
allo
4d33020f56 Migration to WORK
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1389 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-21 16:39:57 +00:00
rramthun
1e5feedf0e Fix for http://www.yacy-forum.de/viewtopic.php?p=15547#15547
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1388 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-21 11:20:26 +00:00
orbiter
f4ffa9aee5 - implemented more attributes to index entries
- implemented hand-over of new word index attributes during remote search
- implemented word-distance computation during search

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1382 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 15:14:21 +00:00
theli
026dcdfcc0 *) Bugfix from mbirth for ftpc bug
See: http://www.yacy-forum.de/viewtopic.php?p=15496#15496
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1381 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 13:37:42 +00:00
rramthun
34a7802a36 .bat files create DOS-window with title "YaCy" as suggested in http://www.yacy-forum.de/viewtopic.php?p=15470#15470
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1380 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 12:01:12 +00:00
orbiter
90b940e90e fixed position storage problem.
Now the word position is properly stored.
No use of that now, but can be used for better ranking.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1378 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 00:26:55 +00:00
orbiter
0371494010 tried to add word position to index
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1377 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-19 14:13:39 +00:00
orbiter
f1cfee7703 removed tabs from condenser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1376 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-19 12:24:35 +00:00
hermens
37791fd529 *) Close indexEntities when "found not enough peers for distribution"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1375 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 20:47:05 +00:00
orbiter
3ec7177f0f added seed flag 'AcceptCitationReference'
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1374 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 14:52:24 +00:00
borg-0300
c5b6154136 added CRDistOn = true/false
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1372 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 02:18:23 +00:00
low012
ee010c36ae *) fixed bug where ampersands were replaced by replaceHTML even if they were part of &.+;
*) fixed bug in htmlentities (cAseSenSitivity!)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1371 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 01:48:34 +00:00
orbiter
71d5c2b2ca better control for target peer selection for RWI transfer
see also http://www.yacy-forum.de/viewtopic.php?p=15343#15343

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1370 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 00:30:07 +00:00
hermens
ca7407b7e1 *) Don't change maxTime if zero or negative
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1363 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 14:47:21 +00:00
hermens
84fce71d11 *) correct merging of indexes when enumerating downward
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1361 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 11:44:55 +00:00
low012
7d5af75d11 *) Second try: replaced replaceHTML again. There should be no problem this time.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1359 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 01:00:38 +00:00
orbiter
d45b1162e2 signature fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1358 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 00:43:38 +00:00
orbiter
c51e18c4ad - added signatures to ordering
- added storage of orderings to database
- orderings can be read from database when opening a database

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1357 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 00:31:27 +00:00
low012
c6d58d9b9a *) taking back changes for replaceHTML
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1356 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 21:43:32 +00:00
low012
8af5df7883 *) Replaced replaceHTML by a more versatile method (code by Kane). Please test for possible side effects if you use this method in any of your classes.
*) Added a little bit of javadoc
*) Replaced tabs by spaces and beautyfied the code here and there.

The code is saved in UTF-8 encoding. I hope that does not cause any trouble.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1354 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 21:15:06 +00:00
allo
fd1dd6859b code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1353 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:40:02 +00:00
theli
99031b10ae *) Bugfix for recursive translation problem with file separators
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1352 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:32:24 +00:00
allo
1f9ff68f45 bugfix for the notdir in the recursive functions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1351 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:16:50 +00:00
theli
da81d7db17 *) Adding possibility to close open connections via Connections_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1350 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 14:46:33 +00:00
allo
f8ee05372f better Logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1344 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 22:06:35 +00:00
allo
5531d18676 translations with relative Path
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1342 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 18:52:14 +00:00
allo
fe2d983c3e recursive Translations!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1341 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 11:56:20 +00:00
orbiter
3d7c8aaeae removed confusing method
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1339 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 02:08:11 +00:00
orbiter
4cd0c45a77 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1337 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 01:06:54 +00:00
hermens
971247b78f - rotate merged indexes after merging
see: http://www.yacy-forum.de/viewtopic.php?t=1717
- fix -rwihashlist to correctly shutdown



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1336 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 23:59:04 +00:00
(no author)
001513cc1f Now custom httpHeader can be created
and filled with cookies and so on.

This header one can set into serverObjects

Check CookieTest.html and CookieTest.java for details.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1334 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 22:50:04 +00:00
(no author)
55f3232219 Patch for the Coockie management.
Version 0.1

Start Yacy, go to localhost:8080/CookieTest.html
Play around with cookies
Look into CookieTest.java to See, how it works

This behavior will be changed 
such that httpHeader will be responsible for the cookies in the future



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1332 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 21:29:04 +00:00
orbiter
e2ff1767b5 fix for last DHT distribution bug-fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1330 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 16:58:27 +00:00
allo
2cd01eb1aa bugfix for dates.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1329 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 15:26:10 +00:00
allo
d5aaaf500f bookmarksDB:
lowercase Tags; if you create a Tag with Uppercase chars, friendlyName will be set.
So The Tag is displayed Uppercase, but stored Lowercase.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1328 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 15:10:24 +00:00
orbiter
060e5a0df0 fixed problem with DHT target peer selection:
- shifted selection in front of distribution
see http://www.yacy-forum.de/viewtopic.php?p=15131#15131


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1327 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 14:32:48 +00:00
theli
7c22afe3de *) Bugfix for NullpointerException in deleteOldHTCache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1326 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 10:37:42 +00:00
(no author)
873cff20e9 handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1324 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:15:07 +00:00
(no author)
1d3249e787 handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1323 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:14:39 +00:00
(no author)
76f356315d handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1322 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:14:12 +00:00
orbiter
b21b9df2d0 added section headlines generation to html parser
can be viewed in cache control, but is not yet included to indexing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1320 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 20:21:34 +00:00
rramthun
c4487deba9 Minor changes collected over some time.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1319 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 19:15:31 +00:00
allo
13c978d3bc only lowercase tags
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1317 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 17:41:02 +00:00
orbiter
de359a9431 patched possible 100% cpu bug.
See http://www.yacy-forum.de/viewtopic.php?p=15067#15067

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1316 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 16:50:34 +00:00
allo
6822dce57b Using Orbiters function for auth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1315 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 16:21:08 +00:00
orbiter
38b19ff56a another bugfix to the correctedNodeIterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1314 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 15:19:09 +00:00
orbiter
45c44ca9c8 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1717
(no rotation)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1313 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 07:24:03 +00:00
orbiter
c137683a5c added patch to correct tree-iterator
(symptoms: at the end of the iteration,
more elements are iterated that are not in the
correct order)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1312 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-11 13:47:16 +00:00
orbiter
2028403670 - consolidated different orderings to kelondroNaturalOrder
- added another iteration method to rwihash-enumeration


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1309 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-11 00:32:44 +00:00
orbiter
9544c47684 added some UTF-8 handling.
hope this will help somehow.. for shure not THE solution to our UTF-8 problem


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1308 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-10 16:48:59 +00:00
borg-0300
9d8dca750e BUGFIX for my last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1306 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-10 10:38:11 +00:00
borg-0300
5449193167 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706 (i hope)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1304 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 23:19:05 +00:00
borg-0300
2a23f5d419 F..., Sorry, no time, later
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1303 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:26:39 +00:00
borg-0300
3a2d13786e bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1302 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:18:41 +00:00
borg-0300
23a39698a7 last commit removed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1301 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:07:11 +00:00
borg-0300
3abd843cdb bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1300 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 18:54:21 +00:00
allo
c3284c27f5 this Code is now in passwordAuth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1299 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 11:50:48 +00:00
theli
1c7f0789c9 *) Allow crawling of dynamic urls if configured
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1298 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:31:06 +00:00
theli
15b4e16591 *) correcting problems with comments while using the urlRedirector in command mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1297 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:12:23 +00:00
theli
94e54c4450 *) Removing url hash from loaded list before inserting the url into the crawling queue
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1296 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:05:45 +00:00
theli
99861ff2c1 *) Crawling depth can now be configured via command string in urlRedirector mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1295 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 09:34:51 +00:00
orbiter
a125fee9ce fixed iterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1294 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:55:47 +00:00
orbiter
33f8920e1c new database format: splitted trees, using an ordering to split the tree
The new format can be tested with the dbtest

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1293 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:46:56 +00:00
orbiter
d83cc3084a fixed small problem when password cannot be decrypted with base64
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1292 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:16:34 +00:00
borg-0300
dc0999ec9c adapted to new HTCache structure
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1290 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-05 00:33:59 +00:00
allo
6df8080cf2 tiny fix for recursive Translations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1289 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 21:59:09 +00:00
allo
4e4bd4662d redirectors fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1288 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 17:40:18 +00:00
allo
b63bcf25d7 moving string2vector and vector2string to listsmanager
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1286 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 13:09:43 +00:00
orbiter
10a095d6b5 - added base64 ordering methods
- added coding interface

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1285 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 01:45:23 +00:00
orbiter
9086261476 refactoring of base64 encoding:
the kelondro database needs specific information about the order of
base64-encoded keys. Since no other package depends on base64
(only the httpd uses base64 for encryption, but does not need to encode these strings)
it is good to move base64 encoding to the new ordering classes in kelondro.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1284 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 00:39:00 +00:00
orbiter
7c0d7ed4f8 introduction of generic order interface for kelondro
this is needed for (to be introduced) splitted tree-files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1283 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 12:35:39 +00:00
borg-0300
b24fcc8ca4 oom
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1281 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 11:08:45 +00:00
borg-0300
7da232b5b9 HTCache Reset if necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1280 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 11:02:02 +00:00
borg-0300
4f18f24d81 small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1278 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-02 22:30:40 +00:00
borg-0300
c652527620 YaCy removes now the old HTCACHE data
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1277 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-02 22:24:19 +00:00
borg-0300
69f65210e2 ".yacy" has its own directory;
glad new year :)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1275 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-01 13:07:57 +00:00
allo
351fffc129 DATA/WORK for user-created content
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1274 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-31 11:47:52 +00:00
allo
a81cc9d969 no DATA/DATA to avoid confusion.
increasing version number

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1273 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-31 11:13:26 +00:00
allo
ba96cefe0c packages for xml/*
bugfix for servlets with packages from theli.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1272 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 13:57:51 +00:00
borg-0300
b95c5d5781 BUGFIX for URLs how "/../" ...;
new port handling;

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1271 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 12:58:36 +00:00
allo
9cce3c5709 dates Table for bookmarksdb(needed for del.icio.us api)
Files in DATA/DATA
Migration: move bookmarks.db from SETTINGS in DATA

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1270 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 12:34:44 +00:00
hermens
11fe95832e avoid division by zero when index transfer is extremely fast
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1269 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-29 20:01:05 +00:00
allo
2e2fa99501 bookmarksManager:
-gettitle_p.xml and AJAX to use it
-classpath change
httpc:
-simple wget function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1267 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-29 17:45:50 +00:00
allo
ca3a513aee del.icio.us style api
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1265 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 18:26:26 +00:00
allo
75b9163242 xmlapi for bookmarktags
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1263 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 18:07:57 +00:00
borg-0300
c7abd60782 last commit removed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1262 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 11:21:40 +00:00
borg-0300
8eb4181737 BUGFIX for regular expression
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1261 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 10:32:21 +00:00
borg-0300
106cd29981 regular expression shortened
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1260 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 10:00:13 +00:00
borg-0300
51433a121f added regular expression for '//';
* http://www.yacy-forum.de/viewtopic.php?t=1666
source text removes, url.getFile() has no ref's;

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1259 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 09:44:52 +00:00
allo
4c824cacba Description Field
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1258 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-27 20:47:16 +00:00
allo
867573de71 rebuildTags to rebuild the tags.db from the bookmarks.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1257 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-27 15:20:21 +00:00
allo
2982715a04 fix for Nullpointer Exception when adding a bookmark
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1256 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-27 13:03:13 +00:00
allo
09c95544c0 edit Function
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1255 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 22:10:12 +00:00
allo
0619051c4b timestamps
sorted by timestamps

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1254 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 21:03:42 +00:00
allo
d5e27009b1 bookmarks:
-bugfixes
-deleting

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1253 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 19:48:19 +00:00
allo
87ea300d71 bookmarks interface
-bugfix for tags
-defaulttag, if no tag is given

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1251 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 19:29:26 +00:00
allo
d0d1855520 tagView
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1250 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 18:14:40 +00:00
allo
ece2844385 bookmarkManager
tagView, Public Flag, next-page

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1249 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 15:40:09 +00:00
allo
4ac0fd328a First Version of the Bookmarksmanager
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1248 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-26 14:21:01 +00:00
theli
d7b6dcbe2e *) Bugfix for MalformedURL problem if Location header is empty.
See: http://www.yacy-forum.de/viewtopic.php?p=14325#14325

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1247 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-25 13:56:11 +00:00
allo
28f890ee3e Important SECURITY FIX!
The UserDB Passwordcheck was buggy since 1138, any password allows access with the rights of 
the user!

USE SVN or disable all Users(take away all Rights)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1246 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-24 21:16:19 +00:00
allo
0f1212feb9 userDB.hasAdminrights to check adminRights.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1245 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-22 14:05:05 +00:00
orbiter
b3dca06bb1 added location column to network pages.
The location is computed from the userAgent string of connecting peers.
Therefore this information is not available right after start-up.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1241 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-22 01:01:46 +00:00
hermens
5b3e01bd3c avoid division by zero when importing very small indexes (<100 entries)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1238 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-20 12:03:34 +00:00
orbiter
60fc34d9e8 iterartion bug:
http://www.yacy-forum.de/viewtopic.php?p=14182#14182


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1235 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-19 00:40:06 +00:00
low012
6c91921e9b *) yacyWiki: more versatile distinction between real external links/links that point to DATA/HTDOC
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1232 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-18 21:24:55 +00:00
borg-0300
b7f9adc2c9 new filters added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1231 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-18 01:15:25 +00:00
rramthun
45ebe477ee Bugfix for http://www.yacy-forum.de/viewtopic.php?p=14175
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1224 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-17 11:13:44 +00:00
low012
eb0f511767 *) yacyWiki: easy way to link to files on local peers (external links)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1223 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-16 15:49:25 +00:00
theli
79667a172e *) Bugfix for additional parser problem
See: http://www.yacy-forum.de/viewtopic.php?p=14146#14146

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1221 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-16 09:10:15 +00:00
theli
b6be828d15 *) Bugfix: Share subdirectory couldn't be views because of LinkageErrors
See: http://www.yacy-forum.de/viewtopic.php?t=1634

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1218 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-16 08:39:48 +00:00
orbiter
c4dd39e8f2 fixing/changing wiki image inline code
for local images hosted within DATA/HTDOCS


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1217 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-16 00:54:24 +00:00
low012
80b3e85356 *) yacyWiki: Images in a share on the same peer can be addressed by path and name only, no complete URL needed anymore. (As requested by theli in http://www.yacy-forum.de/viewtopic.php?t=809)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1215 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-15 21:58:13 +00:00
theli
8c594841a8 *) Bugfix for incorrectly indexing of URLs that were requested with Cookies in the
Request header
   See: http://www.yacy-forum.de/viewtopic.php?p=14077

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1214 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-15 15:30:24 +00:00
orbiter
b5d02d649a fixed bug caused strange search result behaviour
(results from remote peers had not been saved propery after search)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1213 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-15 13:21:42 +00:00
orbiter
4500506735 fixed some bugs concerning url entry retrieval and intexControl interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1212 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-15 10:31:00 +00:00
orbiter
83a34b838d * added Object allocation monitor on performanceMemory page
* added some final statements
* changed shutdown sequence order

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1211 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-14 13:04:43 +00:00
orbiter
4ff3d219e8 increased delay for cacheScan start and slowed down scan process
to provide more time to other tasks

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1210 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-13 21:15:52 +00:00
orbiter
3031903d50 re-design of RAM cache flush into assortment cluster
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1209 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-13 16:00:20 +00:00
orbiter
676e384b45 changed synchronization to fix 'contains loops' bug in kelondroTree
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1208 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-13 11:58:55 +00:00
orbiter
3cc02fe749 enhanced buffered write by combination of several chunks into one chunk
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1206 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-13 00:52:27 +00:00
orbiter
0c762daf4b better startup failure handling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1205 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-12 23:59:58 +00:00
orbiter
f27f9ecf15 * activated write buffer for databases.
This should increase IO performance and reduce HD activity
* bugfixes for new exception-on-failure policy
* bugfixes for new IOChunks
* new Object pool for database write-buffer


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1204 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-12 14:11:59 +00:00
orbiter
c59d1b2f5e - Tests with write buffer (new class kelondroBufferedIOChunks, not yet active)
- minor bugfixes


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1203 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-12 00:19:28 +00:00
orbiter
ca7e60279d introduced IOChunks.
This is an abstraction of chunked IO-processes.
It will help to synchronize access to IO-ports.
Furthermore this is a preparation for upcoming chunk-oriented write buffers.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1201 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-11 02:36:49 +00:00
orbiter
bb79fb5d91 - changed handling of error cases retrieving urls from database
(no more NULL values are returned, instead, an IOException is thrown)
- removed ugly damagedURLS implementation from plasmaCrawlLURL.java
  (this inserted a static value into the Object which is not really a good style)
- re-coded damagedURLS collection in yacy.java by catching an exception and evaluating the exception message
to do:
- the urldbcleanup feature must be re-tested


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1200 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-11 00:25:02 +00:00
orbiter
fed92d364b introduced USAGE object for counter synchronization in kelondroRecords
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1199 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-10 02:11:24 +00:00
hermens
35cf6712b2 *) fixes for httpd
- don't send Body on HEAD requests
  - don't send a Last-modified: date, that is later then Date:
  - Use Cache-control instead of Pragma with HTTP/1.1
  - don't send header with HTTP/0.9



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1198 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-09 17:35:45 +00:00
theli
e7d16ef831 *) Corrections in jMimeMagic MagicRule-file to detect some special rss feeds
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1196 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 23:25:43 +00:00
orbiter
3086e38bb1 added getRemoved method on demand from theli for migration purpose
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1195 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 23:22:57 +00:00
theli
386d9e45d8 *) Bugfix for code cleanup
- Code must be in finally block, otherwise it does not work if an error occurs!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1193 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 22:16:49 +00:00
orbiter
356231b033 changed only for testing with new hastable implementation.
Core funktions are unchanged.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1192 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 21:38:34 +00:00
theli
5a1d45715d *) Bugfix for parser configuration bug
- it was not possible to disable all parsers
   See: http://www.yacy-forum.de/viewtopic.php?t=1579

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1191 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 21:35:40 +00:00
orbiter
1041f4946a experiments with locks to remove synchronization
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1189 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 12:30:55 +00:00
orbiter
288b2c353e debugging of kelondroRecords ... possibly synchronization bugs found
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1188 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-08 02:18:44 +00:00
rramthun
a1061495d4 Fixed some spelling mistakes and added some text which (should) make it easier to understand the options.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1187 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 19:47:21 +00:00
orbiter
0cdc58aaea fixed indexing of local domains.
see http://www.yacy-forum.de/viewtopic.php?p=13680#13680

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1186 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 14:26:43 +00:00
orbiter
76b97e4d2a integrated geo-snap DBStressTest.java in dbtest
this is still beta.
It uses serverInstantThreads instead of Java 1.5 code for multiple threads

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1185 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 13:33:39 +00:00
hermens
ec1202edbe *) Fixes for httpd
- Fix for local timezone in http header
    See: http://www.yacy-forum.de/viewtopic.php?t=836
  - Allow static content to be cached by browser
    See: http://www.yacy-forum.de/viewtopic.php?t=1311


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1184 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 13:26:27 +00:00
theli
e1c2d8ec5f *) Speedup "removed from queue"
See: http://www.yacy-forum.de/viewtopic.php?p=13442#12188

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1183 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 11:27:44 +00:00
hydrox
96930f0d2b *)added function to removed malformed URLs from urlHash.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1182 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 11:10:08 +00:00
orbiter
d007d14905 re-insert of migrateSwitchConfigSettings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1180 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 10:31:48 +00:00
theli
8862b6ba4b *) Corrections for code cleanup 1175
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1179 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 08:15:39 +00:00
orbiter
13fdebc50d added authentication for link deletion in search result
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1177 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-07 00:36:05 +00:00
orbiter
37f88b4017 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1176 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 23:51:29 +00:00
orbiter
ec2b39c1ce code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1175 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 22:30:15 +00:00
orbiter
76618442e0 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1173 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 21:21:14 +00:00
orbiter
8f1f2daa5e implemented interactive link deletion of search results.
next steps: attach voting and restrict to administrator
to see the deletion button, move the mouse pointer to the left of a search result

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1172 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 16:15:21 +00:00
theli
6d0f7e6988 *) Adding missing file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1171 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 11:20:06 +00:00
theli
44fa94ac52 *) Modifications for dbImport functionality
- dbImporter threads are now shutdown by the switchboard on server shutdown
   - adding possibility to pause a importer thread via GUI
   - Bugfix for abort function
     See: http://www.yacy-forum.de/viewtopic.php?p=13363#13363

*) Modification of content parser configuration
   - now it's possible to configure which parsers should be enabled for the proxy,
     crawler, icap, etc. separately
   - 

*) htmlFilterContentScraper.java
   - adding regular expression to normalize URLs containing /../ and /./ parts

*) httpc.java
   - adding functionality to unzip gzipped content
   - requested by roland: should be used later to allow gzipped seed lists

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1170 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-06 10:41:19 +00:00
orbiter
dc778659fb fixed problem with time-out during result joint which caused OR behavior instead of AND beahvior
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1167 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 15:48:45 +00:00
orbiter
3d8a5ae652 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1166 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 14:24:13 +00:00
allo
ae6a4650bc reenabling debugMode (60 Seconds timeout for *all* http connections)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1165 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 11:14:11 +00:00
theli
bb1f73ec15 *) Bugfix for code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1164 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 09:48:11 +00:00
orbiter
7920e1547d code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1163 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 09:13:13 +00:00
theli
972b8ddd2a *) minor changes: using constants properly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1162 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 09:18:47 +00:00
theli
1c3750de57 *) Bugfix for code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1161 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 09:15:05 +00:00
theli
64478b1f02 *) Adding possibility to delete crawler queue entries using regular expressions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1160 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 09:11:28 +00:00
orbiter
1d6a6d1f85 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1159 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-05 00:17:12 +00:00
orbiter
a04930f025 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-04 23:51:28 +00:00
low012
90b0eb144e just a typo...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1155 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-03 09:58:00 +00:00
theli
129b15f3e1 *) Correcting logging output of db importer thread
See: http://www.yacy-forum.de/viewtopic.php?t=1555

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1154 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-02 11:56:12 +00:00
theli
7028412c54 *) Bugfix for NullpointerException in Java Finalizer
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1153 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-02 10:57:31 +00:00
orbiter
420d56ce79 extended db-testing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1152 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-02 01:51:37 +00:00
orbiter
ecf765ec33 temporary fix to make jrpm extension compilable with my netbeans environment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1151 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-01 23:03:54 +00:00
rramthun
8c0d982191 1.) Fix from Martin (he ist not at home...)
2.) Search button now gets blocked if clicked and JS active.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1150 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-01 18:26:39 +00:00
theli
b604654c25 *) Adding possibility to do a settings migration on yacy startup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1149 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-01 13:39:38 +00:00
theli
8ed0aaae8d *) Adding content Parser for RPM Files
- at the moment only the metadata is extracted

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1147 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-01 10:47:00 +00:00
theli
0f769215b5 *) urlRedirector now uses http head requests to determine the mimetype
of a resource before it checks if a URL has to be rejected or not.
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1146 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-30 04:52:22 +00:00
theli
b62c36d2ab *) Bugfix for urlRedirector
- media extension was not parsed correctly

*) Displaying urlRedirector connections in Connections_p.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1145 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-29 23:45:17 +00:00
allo
8f4259f1e3 md5Auth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1144 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-29 21:31:06 +00:00
theli
818d37ce44 *) Removing getSimpleName
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1143 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-29 12:50:13 +00:00
theli
b35c5a48bf *) First version of urlRedirector.pl script
- with this script it's possible to pass URLs from squid
     to yacy via the squid redirector interface
   - this URLs are then used by YaCy to feed the crawler

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1141 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-29 12:27:03 +00:00
theli
bdf30117c1 *) Redesign of parser configuration
- restructuring of mimeTypes based on the parsers
   - displaying parser usage count
   - displaying human readably parser names
   - displaying parser version information

*) httpdFileHandler.java
   - adding possibility to support "streaming" servlets
     which are special servlets that can communicate with
     the client via the connection streams autonomous
   - the name of these new servlet types must end with the 
     file extension .stream
   - this feature will be needed by the yacy ScreenSaver
     class to fetch statistic data from the peer without the
     need to reconnect to the server all the time

*) Adding human readable names and version information for
   all supported parsers

*) plasmaParser.java
   - adding new structure to store parser statistic data

*) Adding openDocument parser
   - can be used to parse odt files

*) jmimemagic
   - adding rules to detect openDocument formats properly

*) serverLog.java
   - adding functions that can be used to query if a given
     logging level is enabled or not.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1140 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-29 07:27:58 +00:00
allo
b86d1085e2 passwordAuth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1138 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-28 22:35:41 +00:00
theli
5bf70e6e14 *) Bugfix for serverClassLoader.java
- Classloading didn't work properly if there are multiple classes with the same name
   - This could occure because the yacy servlets have no package name defined and therefore
     are all in the same (default) package.

*) Bugfix for Duplicated Class Error
   See: http://www.yacy-forum.de/viewtopic.php?t=1341

  

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1135 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-28 10:15:25 +00:00
theli
d4ac3e25b1 *) Bugfix for file system link bug during detection of invalid URLs
See: http://www.yacy-forum.de/viewtopic.php?p=13301

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1134 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-28 07:17:43 +00:00
orbiter
adf75bc9fa better logging for invalid file path detection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1133 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-27 22:55:30 +00:00
orbiter
40621a5663 anhancements in ranking preparation and fixed problem with parser/mime recognition
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1132 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-27 11:55:24 +00:00
theli
c650b112ea *) Bugfix for relative URL Bug in Crawler
See: http://www.yacy-forum.de/viewtopic.php?p=13266#13266

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-27 06:35:23 +00:00
theli
7e670894d9 *) Suppressing stackTraces in proxyError message for "connect timed out" errors
See: http://www.yacy-forum.de/viewtopic.php?t=1504
*) Increasing default http client timeout

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1129 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-25 00:40:35 +00:00
theli
4e73035aef *) Bugfix for "too many open files" during index distribution
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1128 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-24 21:47:16 +00:00
allo
d8afe60e07 Bugfix for last Bugfix ;-).
host/port were set to originaladdress instead of the correct values for the new Url.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 14:05:25 +00:00
orbiter
1b656f6b31 correction of bug from svn 1123
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1125 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 12:07:07 +00:00
orbiter
f57e2d67f5 shortened network overview (less columns fit easier on page)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1124 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 11:57:30 +00:00
allo
24d15eb0e8 moving the redirector code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1123 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 07:52:36 +00:00
allo
787c368696 synchronized redirectors and using the port.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1122 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 07:37:15 +00:00
orbiter
85282b1d98 enhanced YBR recognition and search result heuristics
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-23 01:40:02 +00:00
allo
4776f3f815 squid like redirctors
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1120 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-22 22:07:29 +00:00
orbiter
b9cc9029e3 added ybr selection for remote search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-22 16:10:24 +00:00
orbiter
0e25020f51 added first generation and usage of YBR index-files. Enhanced overall ranking of search results.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-22 15:17:05 +00:00
allo
52a0237bf2 using Filetemplates for #[metas]# and other static includes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1116 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-21 08:33:54 +00:00
theli
90d6c6223b *) Adding color codes to network graphic legend
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1114 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-21 08:13:01 +00:00
orbiter
bfe51c7228 added generation of domain-list
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1112 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-21 01:30:30 +00:00
orbiter
0ec54d9c5f enhanced CR-file handling and added first RCI-evaluation tests
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-20 18:55:35 +00:00
theli
99fb26e499 *) Suppressing stackTraces in proxyError message for harmless errors
See: http://www.yacy-forum.de/viewtopic.php?t=1504

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1108 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-20 15:37:23 +00:00
theli
c2fe3a1670 *) Updating jMimeMagic Ruleset
- to detect some special formated html documents correctly
   - adding rule to detect vCards
*) plasmaParser now supports parsing of files that have a supported fileExtension
   but a unsupported mimeType because the webserver has set it incorrectly to text/plain
*) Adding vCard new Parser


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-20 14:39:58 +00:00
orbiter
88e3234393 fine-tuning of rci-generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1105 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-18 02:00:25 +00:00
orbiter
a12759c1bf first try to implement a rci-computation from cr-files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1103 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-17 16:17:56 +00:00
orbiter
4a8e8f269e refactoring of cr-processing; new kelondro class to handle the attribute file format
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1100 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-17 12:08:04 +00:00
orbiter
24dc0e0760 implemented cr-file processing and further transmission steps
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1099 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-17 01:59:01 +00:00
low012
5cd1e9cef4 *) fixed some dirty code, idea analog to bit stuffing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1098 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-16 23:58:13 +00:00
orbiter
022530df7e small bugfix in kelondroTree
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1097 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-16 22:36:36 +00:00
orbiter
9d9a87f445 limited htcache storage length
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1096 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-16 18:40:44 +00:00
theli
8e308cf50e *) Possibility to change the server port on-the-fly.
- Now it's possible to change the server port without the need to restart the whole server.
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1089 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 15:03:15 +00:00
theli
d0dfccdb77 *) Making CrawlStacker pool configurable via GUI and config file
See: http://www.yacy-forum.de/viewtopic.php?t=1448

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1087 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 12:46:22 +00:00
theli
3631cb1f6d *) deleting empty entities during index selection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1086 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 12:23:46 +00:00
theli
ca26aab9b1 *) More debugging output for migrateWords
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1085 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 11:55:09 +00:00
theli
9b35ae9027 *) Correcting wrong % values on IndexTransfer_p page
See: http://www.yacy-forum.de/viewtopic.php?p=12646 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1084 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 09:52:50 +00:00
theli
e6bf9d90a5 *) Fixing Problems with MalformedURLs during Word Selection
- removing (lurl.toString() == null) comparison because toString() is never null
   - adding (lurl.url() == null) condition because url() is null if we have selected a word entry with
     a malformed URL

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1083 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 09:07:00 +00:00
theli
86a9210264 *) indexing queue slots are now configurable via config file
See: http://www.yacy-forum.de/viewtopic.php?t=1480

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1081 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 08:25:46 +00:00
theli
3c11d7b81c *) Bugfix for minimizeUrlDB
- function didn't work correctly because of new url hash structure
   See: http://www.yacy-forum.de/viewtopic.php?p=12753#12753

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1080 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 07:35:04 +00:00
orbiter
9913049009 fixed outOfMemory bug caused by loops in kelondroTree during enumeration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1079 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-15 01:20:05 +00:00
allo
f8f9d509d5 removed dead Code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1078 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-14 12:48:14 +00:00
allo
5918d3985e removed Debug Statements
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1076 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-14 11:47:46 +00:00
theli
bbb936b9ea *) Bugfix for not human readable content of PDFs while viewing the URL Content via GUI
- This Bug also affects the snippet generation on non html/text documents
   See: http://www.yacy-forum.de/viewtopic.php?t=1472

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1075 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-14 10:25:43 +00:00
theli
445e3a620f *) Avoid rejecting of html content by the crawler when the file extension is not set properly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1074 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-14 10:19:40 +00:00
orbiter
a3fd0069f5 fixed bug in kelondroTree node iteration (rotation did not work)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1072 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-14 00:23:20 +00:00
theli
fd58d5f8e6 *) Adding possibility to specify the interface / IP-Address where YaCy should bind to.
- e.g. Port = 192.168.0.1:8080
          Port = #eth0:8080
          Port = 8080

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1071 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-13 17:03:52 +00:00
allo
889de6686c Migration in yacyVersion
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1070 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-13 15:12:48 +00:00
theli
444a5a9368 *) Bugfix for Entries with null url in GlobalQueue
See: http://www.yacy-forum.de/viewtopic.php?p=12675#12675

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1069 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-13 14:59:38 +00:00
allo
3bbb932fa2 Bugfix for nullpointerexception.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1067 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-13 09:55:14 +00:00
allo
4976f97956 Logout Function on User.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1065 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-12 15:29:18 +00:00
allo
b74e990c74 better Logout function
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1064 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-12 12:09:45 +00:00
borg-0300
ebac51df52 restore defaultRemoteProfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1063 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-12 11:38:35 +00:00
allo
4daa4b00e9 logout function
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1062 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 22:41:36 +00:00
borg-0300
5778428455 move cutUrlText to nxTools,
max length from URLs(title) on searchpage now 120 chars


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1060 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 13:40:53 +00:00
borg-0300
9158845c3b bugfix for snippet text null bytes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1059 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 13:27:36 +00:00
orbiter
6e81f2580d try to fix bug with storage of settings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1058 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 08:41:13 +00:00
orbiter
f763923e0a added missing files for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1057 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 08:02:46 +00:00
theli
9649d08171 *) More tolerant robots parser
- converting tabs to spaces
   - cutting of '*' in the disallow section

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1056 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-11 07:49:54 +00:00
orbiter
79818a320f introduced citation-rank transmission protocol and activate transport for anonymisation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1055 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-10 23:48:20 +00:00
theli
6f8d7d3bcd *) Adding first version of YaCy bookmarklet
- this can be used to easily crawl a webpage which is currently opened in the browser
   - to get the bookmarklet javascript simply call http://localhost:8000/QuickCrawlLink_p.html
     and drag and drop the link shown to your Browsers Toolbar/Link-Bar.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1053 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-08 12:14:51 +00:00
theli
7e0647f692 *) Bugfix for userDB usage during authentication
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1052 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-08 10:17:12 +00:00
theli
93cadb47b9 *) More tolerant robots parser for robots-files which missing empty lines between rule blocks
See: http://www.yacy-forum.de/viewtopic.php?p=12471

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1048 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-08 07:41:25 +00:00
orbiter
02f8013013 auto-delete of corrupted word files during word-migration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1047 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-07 14:57:37 +00:00
orbiter
d2731418bf added creation of global ranking files and changed url normal form usage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1046 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-07 12:33:02 +00:00
theli
6f9f8ed8f8 *) Automatic Reset of Stack Crawler DB on startup errors
See: http://www.yacy-forum.de/viewtopic.php?t=1432

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1045 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-07 12:19:05 +00:00
theli
fb766413d1 *) Changes on httpc dns caching
- Bugfix: old dns cache did not handle case insensitive hostnames correctly. 
   - adding a possibility to set domain name patterns defining hostnames that should not be cached by the httpc dns cache
     e.g. borg-300.dyndns.org
     This can be done by setting the new httpc.nameCacheNoCachingPatterns property
   - using httpc.dnsResolve wherever possible within the sourcecode
     [httpd.java,plasmaCrawlStacker.java]

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1044 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-07 10:57:54 +00:00
orbiter
bc420c62f6 fixed htcache path generation (never change a running system)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1041 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-07 01:31:11 +00:00
borg-0300
795f488222 new urlNormalform version
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1040 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 22:35:56 +00:00
orbiter
c86d801b0f removed dyndns domains from dns caching
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1039 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 22:12:08 +00:00
orbiter
6dc42a2392 detecting of loops in kelondroTree during last/first-Node search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1038 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 21:06:55 +00:00
borg-0300
17d2830394 see: http://www.yacy-forum.de/viewtopic.php?t=1416
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1037 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 17:24:57 +00:00
theli
dd24f0252f *) Searchword highlighting for info page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1036 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 06:27:17 +00:00
theli
f9fb284fb7 *) Better handling of robots.txt files with incorrect keywords
See: http://www.yacy-forum.de/viewtopic.php?p=12292#12292

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1035 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-06 06:01:08 +00:00
borg-0300
a1406f4617 urlNormalform: no logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1034 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-05 16:21:04 +00:00
borg-0300
72cde1d894 getCachePath: no logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1033 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 22:47:13 +00:00
borg-0300
1fbd72f9e0 rename "index.html" to "ndx"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1032 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 22:39:33 +00:00
borg-0300
cd1107d85e added support for URLs with '?&'
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1030 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 17:25:15 +00:00
borg-0300
5fb2b017cb small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1029 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 16:37:56 +00:00
borg-0300
60e869f236 bugfix for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1028 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 15:52:12 +00:00
borg-0300
544e4ea90e small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1027 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 14:11:46 +00:00
borg-0300
00ab4d8723 cleaned, small change, Properties
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1026 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-04 13:41:51 +00:00
theli
b8ceb1ffde *) Adding better https support for crawler
- solving problems with unkown certificates by implementing a dummy trust Manager
   - adding https support to robots-parser 
   - Seed File can now be downloaded from https resources
   - adapting plasmaHTCache.java to support https URLs properly

*) URL Normalization
   - sub URLs are now normalized properly during indexing
   - pointing urlNormalForm function of plasmaParser to htmlFilterContentScraper function
   - normalizing URLs which were received by a crawlOrder request

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1024 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-03 15:28:37 +00:00
borg-0300
d2507c6081 rename setJunior()... to orJunior()...,
added javadoc, 
added getPeerType(), setIP(), setPort(String port)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1023 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-03 14:16:16 +00:00
borg-0300
e3179a6394 added getOwnSeedFile()
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1022 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-03 14:07:58 +00:00
borg-0300
a803a509ae bugfix: port handling in HTCache
grogram flow, cleared up


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1021 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-03 12:39:24 +00:00
theli
3d0dfd4df4 *) Using StringBuffer instead of String concatenation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1018 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-03 08:08:37 +00:00
low012
452db479cd *) bugfix: "21" was displayed as &quot;21&quot; in yacyWiki
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1017 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-02 23:28:54 +00:00
hydrox
cb69047b91 *)cleanup access static methods and fields
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1016 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-02 17:56:26 +00:00
hydrox
56b9f34411 *)removed unused imports
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1015 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-02 16:30:45 +00:00
orbiter
5f68b6886b introduced new url-hashes for better ranking computation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1013 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-02 00:54:55 +00:00
orbiter
aadace1285 fixed network image in search performance monitor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1012 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-11-01 00:49:13 +00:00
orbiter
bb369c98de fixed search result ordering by date
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1011 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-31 17:17:48 +00:00
hydrox
295aff52a3 *)added offline-browsing-support (onlineMode=0)
*)online-mode now can be changed in Status.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1010 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-31 12:25:40 +00:00
orbiter
4d1e56e4d9 fixed intermission-bug (removed 'break for intermission' of httpd-thread)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1009 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-31 10:46:13 +00:00
orbiter
b058ecf0bc refactoring of image-generation; added experimental PNG encoder (not active now)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1008 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-31 02:43:55 +00:00
orbiter
7a9fc653e9 fixed 100% cpu bug in wikiCode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1007 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-30 14:27:09 +00:00
orbiter
1aa707d357 fix for http://www.yacy-forum.de/viewtopic.php?p=11866#11866
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1005 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-29 15:30:06 +00:00
orbiter
d42531e1b2 added auto-reset for NURL-DBs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1004 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-29 13:53:32 +00:00
allo
5526d2ff73 Basic WatchCrawler.html, which loades the Queues using AJAX.
ajax.js: generic Ajax-Functions
xml/queues/indexing_p*: the Backend


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1002 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-29 09:47:30 +00:00
allo
92c49b406b adminAuth with userDB and adminAuthenticated (fix for statuspage)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1001 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 18:06:36 +00:00
rramthun
27f180f24b Update of YaWoStat to 0.2.
Now does not try to make 400000! operations to load a 4MB textfile :-/

Program is not finished yet.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1000 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 16:27:49 +00:00
theli
3b5d0eb053 *) Synchronizing robots.txt downloads to avoid parallel downloads of the same file by separate threads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@998 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 09:10:48 +00:00
theli
723e056c48 *) Bugfix for ClassCastException during SessionPool.close
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@996 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 07:25:22 +00:00
theli
e58e85363d *) Bugfix for ConcurrentModificationException while operating on seed properties
*) Bugfix for YACY database inconsistency (no more elements available in db '...seed.new.db'), re-set of db.
   See: http://www.yacy-forum.de/viewtopic.php?p=11836#11836
        http://www.yacy-forum.de/viewtopic.php?p=11814#11814

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@995 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 07:15:00 +00:00
orbiter
d656e2b433 added a memory-profile chart generation to database performance testing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@993 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-28 01:22:11 +00:00
theli
ec3af327f7 *) Bugfix for Proxy-Authentication against remote proxy
See: http://www.yacy-forum.de/viewtopic.php?p=11804#11804

*) Adding first version of db test for mysql
   NOTES:
   - db user + db + db table must be created before starting the test
   - db table must be empty. Entries can not be updated at the moment
   - db connection properties must be changed in the sourcecode at the moment
   TODOs:
   - accepting connection properties via command line
   - implementing update + remove + read operations
   - 'maybe' adding code to create db + table if it doesn't exists

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@991 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-27 11:28:37 +00:00
orbiter
5b0911d7ea added new performance menu for search sequence configuration and monitoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@990 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-27 00:42:08 +00:00
orbiter
4fa942511b de-serialized read and write access
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@989 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-26 23:05:20 +00:00
orbiter
1ff0ced515 integration of an interface class for abstract access of kelondro indexed structures like kelondroTree and kelondroHashtable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@987 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-26 12:37:22 +00:00
allo
ada06b0674 bugfix for Networkimage from Hydrox
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@986 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-26 10:08:37 +00:00
allo
e3f87f9dab Bugfix for not parsable Numbers (NumberFormatException when loading users)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@985 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-25 12:30:07 +00:00
theli
d3ad712418 *) Bugfix for Seed file upload problem via ftpc
See: http://www.yacy-forum.de/viewtopic.php?p=11662#11662

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@984 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-25 10:45:14 +00:00
theli
9a2afe88d4 *) Deactivating unlimited timeout for persistent connections because this
could cause problems with clients which do not shutdown persistent 
   connections properly.
   - Setting timeout for idle persistent connections to 30 minutes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@983 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-25 10:21:00 +00:00
orbiter
1aa4ba8b62 added post-search filtering of redundant urls (longer than existing cited)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@982 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-25 09:06:00 +00:00
low012
8e7b214e41 *) 5 inverted commas are needed instead of only 4 to strongly emphasise a text (closer to Wikipedia's syntax now)
*) Tried to fix nested <pre> and [= issues, but did not succeed due to sleepyness. Will continue work when I'm awake and have some time.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@981 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-24 22:37:10 +00:00
orbiter
8d827cdb30 tried to fix problems with order of network list by last-seen (which could also improve the network picture)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@980 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-24 14:07:43 +00:00
low012
1e6cf9fd41 *) indented texts between escape symbols behave like indented preformatted texts now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@979 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-24 13:21:57 +00:00
theli
446e7e8bef *) Bugfix for Seed-Upload - Permission denied problem
See: http://www.yacy-forum.de/viewtopic.php?p=11648#11648

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@978 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-24 08:54:46 +00:00
orbiter
097009d910 experimental visualization of DHT access during global search (temporary)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@977 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-24 00:34:15 +00:00
orbiter
4dcbc26ef1 introduction of search profiles; very experimental
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@976 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 17:50:27 +00:00
theli
6c48c3ce39 *) Bugfix for ArithmeticException during IndexTransfer
See: http://www.yacy-forum.de/viewtopic.php?t=1362

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@974 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 16:07:44 +00:00
low012
fd1b08303e fixed a bug theli pointed out (indented <pre>-blocks)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@973 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 14:04:21 +00:00
theli
525c8dcbd4 *) Adding Traffic Statistic for Crawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@972 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 12:35:48 +00:00
theli
9a5ab62928 *) Adding yacy specific X-YACY-Index-Control header which can be used by clients
to disallow yacy to index the response that belongs to the request where 
   X-YACY-Index-Contro is set to "no-index"

*) Bugfix for Seed-List download via Remote Proxy.
   Now the pragma and cache-control http headers of the request are properly set to "no-cache" 
   See: http://www.yacy-forum.de/viewtopic.php?p=11639#11639

*) Bugfix for http-Proxy
   yacy has ignored "no-cache"- pragma and cache-control http headers that were send in requests.
   Now, these request headers are evaluated properly

TODO: Missing evaluation of "no-store" request headers

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@971 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 10:35:05 +00:00
theli
539f9e57aa *) Disabling RemoteProxy if remoteProxyName is null or empty
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@970 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 09:03:40 +00:00
theli
5a25ad9109 *) Bugfix for useRemoteProxy4YACY and useRemoteProxy4SSL check
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@969 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-23 08:59:11 +00:00
theli
02d9af1a70 *) Restructuring and extending of Remote Proxy Support
- remote proxy configuration can now be "really" changed on the fly and takes effect immediately
   - adding possibility to disable remote proxy usage for yacy->yacy communication
   - adding possibility to disable remote proxy usage for ssl
   - restructuring proxy configuration so that it is stored in a single place now

*) Adding possibility to import a foreign word DB (or even more of them in parallel) 
   at runtime into the peers DB
   - this can be done by calling IndexImport_p.html 
   - ATTENTION: please not that at the moment this thread must be aborted via gui
     before a normal server shutdown is done. 
   - TODO: integrating IndexImport Thread into normal server shutdown
   - TODO: Adding posibility to import crawl-queues, etc. from foreign peers
   - TODO: removing old import function from yacy.java and calling the new routines instead

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@968 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-22 13:28:04 +00:00
low012
222607ef0f *) added <pre>-tag for preformatted text to yacyWiki (as requested by theli)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@967 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-22 13:00:36 +00:00
allo
117a424d00 bugfix for sharing png/gif files in WWW/SHARE
http://www.yacy-forum.de/viewtopic.php?p=11565


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@966 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-20 14:58:15 +00:00
theli
af3060938b *) Bugfix for manual peer ping functionality
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@965 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-20 11:43:43 +00:00
theli
6e3201b74d *) Bugfix in httpc.java
- Requestheader was not passed to the underlying post function properly
   - Bug seems not to have caused any side-effect until yet

*) Bugfix for manual peer ping functionality

*) Bugfix for UnresolvedPattern Problem if an Exception occurred in a servlet.
   See: http://www.yacy-forum.de/viewtopic.php?t=1353

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@963 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-20 09:55:12 +00:00
borg-0300
58b670201d now, changed HTCacheSize needs no restart
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@961 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 17:59:54 +00:00
allo
f97c303ebd rights for Admin and Proxy.
Adminrights are OR(old auth or new).
Proxyrights are AND(you need Proxyrights and a not reached Timelimit)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@960 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 12:20:08 +00:00
allo
97de600a68 another bugfix for share/www.
Now you can Use share/ and not only share/dir.html


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@958 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 11:42:15 +00:00
allo
df147d0fb0 rightsmanagement for upload/download User - preparations for proxy and admin User
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@957 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 11:26:35 +00:00
allo
2dfd6bf36a fix for networkimage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@956 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 10:25:56 +00:00
allo
ec10220d57 Fix for last Commit: .class Files in htroot, not in the dir of the localized HTML-Files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@955 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 07:17:49 +00:00
allo
4db2080188 Bugfix for www and share.
http://www.yacy-forum.de/viewtopic.php?p=11486


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@954 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-19 06:52:43 +00:00
theli
40777556c5 *) Connection Tracking
- adding automatic refresh
   - accepts new parameter nameLookup which can be used to deactivate 
     yacy-peer name lookup (because we have problems with this on large seed-dbs)

*) ViewFile
   New page that can be used to view 
   - original content 
   - plain text content 
   - parsed content
   - parsed sentences 
   of a webpage specified by there url hash
   Mainly for debugging purpose at the moment

*) Robots.txt 
   Bugfix for if-modified-since usage
   TODO: synchronization of downloads to avoid loading the same robots-file 
   multiple times in parallel by different threads

*) Shutdown
   Better abortion of transferRWI and transferURL sessions on server shutdown

*) Status Page
   Adding icon to start/stop crawling via status page

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@950 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-18 07:45:27 +00:00
borg-0300
bcb0d6d5ff changed setLastSeen(long rd) to setLastSeen();
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@949 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-17 23:23:17 +00:00
borg-0300
cdbaf637fb added
- getIP(), getJunior(), getSenior(), getPrincipal(); 
- setIP(), setJunior(), setSenior(), setPrincipal(), setLastSeen(long rd); 
- isPeerOK(), isOnline(String type);
next try to remove hello.class java.util.ConcurrentModificationException:null ;)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@948 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-17 23:12:40 +00:00
borg-0300
e642a5d8b7 more constants
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@947 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-17 15:46:12 +00:00
borg-0300
7b510b558a small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@946 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-17 13:59:50 +00:00
borg-0300
ee7d847869 small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@945 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-17 13:57:54 +00:00
borg-0300
d77b982083 small fix for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@944 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-16 00:15:23 +00:00
borg-0300
b00cd5640b bugfix for 'hello.class java.util.ConcurrentModificationException:null'
finals

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@943 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-15 23:37:37 +00:00
rramthun
a98bafb939 Changes to german language file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@941 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 20:36:45 +00:00
allo
43a127ff3a allow httpsTunnels to other Ports than 443. (if secureHttps=false)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@940 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 12:51:56 +00:00
allo
93c1cb9226 ipAuth works. thx theli.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@939 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 12:01:41 +00:00
allo
6430fa520e bugfix for broken HTDOCS
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@938 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 11:50:11 +00:00
allo
4320425a17 ipAuth (this does not work yet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@937 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 10:53:50 +00:00
theli
95abdeb685 *) Bugfix for nextElement function of URL Enumerator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@936 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-14 08:51:02 +00:00
orbiter
6260942590 changed search process: received indexes are now buffered and written to wordIndex after search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@934 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 13:57:15 +00:00
theli
b7e21ec107 *) Adding DB import function which allows to import an foreign yacy DB (from directory PLASMADB)
into the DB of an other peer.
   ATTENTION: not tested very well. please use this with care and always make a db backup

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@932 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 12:31:32 +00:00
borg-0300
7ee03acce0 new function cutUrlText added to shortens the URLs on IndexMonitor.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@931 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 12:05:39 +00:00
allo
b88a9584f8 New Errorpage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@928 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 07:47:57 +00:00
theli
b177a80bb7 *) Bugfix for sendRespondError StackOverFlowException problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@927 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 07:29:14 +00:00
theli
219acc1e8f *) Bugfix for wrong http version in response to http/1.0 requests
See: http://www.yacy-forum.de/viewtopic.php?t=1312

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@926 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 06:30:13 +00:00
orbiter
bc56a88cc8 further refactoring of search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@925 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-13 00:05:30 +00:00
allo
0f2f783e46 no no-cache for mediaExts
see http://www.yacy-forum.de/viewtopic.php?p=11210#11210


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@924 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 20:45:14 +00:00
allo
7ca60f97bf localization Support for Includes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@923 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 12:44:05 +00:00
orbiter
d29dfb0a12 refactoring of search / preparation for better search methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@921 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 12:28:49 +00:00
allo
9ff2bd2f0e allow surfing, if no timelimit set.
thx, theli


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@920 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 10:58:21 +00:00
theli
0ae166c522 *) Small changes to Index Transfer.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@919 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 10:19:01 +00:00
theli
461374e175 *) Restricting amount of files that yacy is allowed to open during index transfer/distribution
This option is configurable via config file and is set per default to 800
   See: http://www.yacy-forum.de/viewtopic.php?p=11137#11137

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@918 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 09:38:40 +00:00
theli
c8a35a0130 *) Adding new connection tracking page (currently only for incoming connections)
*) Displaying statistic for incoming connections on status page
*) Bugfix for Loop-Access Bug when trying to access the yacy page while yacy is configured as proxy
   See: http://www.yacy-forum.de/viewtopic.php?p=6826
*) Bugfix for Referer Bug
   See: http://www.yacy-forum.de/viewtopic.php?p=11098#11098
*) Adding reverse Name lookup for yacy-domain names (used by the connection tracking page)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@916 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-12 08:17:43 +00:00
allo
141417821e limitless Accounts (set timelimit = 0)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@914 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-11 20:56:51 +00:00
orbiter
b80b2fbdcc crawling peers now produce waves in network graphic
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@912 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-11 13:06:07 +00:00
orbiter
b45ffecd39 log to fix http://www.yacy-forum.de/viewtopic.php?p=11111#11111
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@911 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-11 07:46:14 +00:00
orbiter
10d3627c90 changed word cache flush scheduling and removed possible locks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@910 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-11 07:06:33 +00:00
orbiter
dced5c761e added 'corona' for crawling peers in network graphic
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@909 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 16:21:09 +00:00
orbiter
4cf0d1eb6d less memory occupation in ImagePainter
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@908 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 15:02:53 +00:00
allo
6bd9348bf1 small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@907 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 14:25:18 +00:00
orbiter
aae5af9686 imagePainter may now paint arbitrary large circles; small update to network graphics
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@906 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 13:48:25 +00:00
allo
be8e95d86a debug printlns removed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@905 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 13:38:13 +00:00
allo
f1ff33177d reset Timelimits on Daychange
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@904 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 13:06:03 +00:00
theli
54a97a7355 *) IfesL: Suppressing "Broken pipe" stacktrace in log file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@903 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 11:31:46 +00:00
theli
a9e25c26e1 *) adding new sendRespondError method to httpd which accepts a template include file
for individual error messages

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@902 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 10:33:09 +00:00
allo
5605cc8018 TimeLimits
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@901 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 10:21:25 +00:00
orbiter
6a72f06c40 resizable network picture + greater on click
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@900 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 10:08:28 +00:00
orbiter
839db8869c added high/low priority for index adding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@899 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 09:28:28 +00:00
theli
3966b38360 *) Bugfix for userDB time limit calculation
See: http://www.yacy-forum.de/viewtopic.php?p=10949

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@898 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 09:17:19 +00:00
theli
1688be8590 *) plasmaSwitchboard.java
adding more verbose logging output for db initialization
*) httpdFileHandler.java
   adding cache for servlet response methods


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@897 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 09:13:17 +00:00
theli
e3a586d7bd *) Using serverByteBuffer instead of ByteArrayOutputStream
to speedup httpdFileHandler

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@896 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 07:15:57 +00:00
orbiter
e9eb5e4b56 refactoring of index-entity join methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@895 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 00:45:18 +00:00
orbiter
258fd9eb8e adding missing file for websearch refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@894 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 00:33:25 +00:00
orbiter
77ae30063d refactoring of websearch process
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@893 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-10 00:32:15 +00:00
orbiter
579b22d8ff small update to network drawing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@892 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-09 23:11:17 +00:00