Commit Graph

1080 Commits

Author SHA1 Message Date
hermens
56936139ae Fix logging in kelondroTree:
*) Use java.util.logging.* (via kelondroRecords) without de.anomic.server.logging.serverLog
*) Log "CORRECTING ITERATOR" only in debug mode
   (happens way too often to be a useful warning and is corrected anyway)
*) Log when correctedNodeIterator stumbles on incorrectly ordered data
   (now this one deserves throwing a warning at the user)



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1503 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-31 02:01:32 +00:00
orbiter
3834675084 fixed bug that caused wrong behavior of search result preparation
(second search on same topic resulted in less links)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1502 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-31 01:20:28 +00:00
hermens
31c8476b5d plasmaWordIndexCache.getContainer:
*) Also get entries from cache
*) calculate available remaining time for backend.getContainer correctly



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1501 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-31 01:04:23 +00:00
hermens
bb1664b63e *) Remove workaround from SVN 1472: It is not needed anymore
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1500 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-31 00:45:48 +00:00
orbiter
ec5d88664a tried to fix serverSwitch synchronization problems
see also: http://www.yacy-forum.de/viewtopic.php?p=16110#16110

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1499 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 23:07:20 +00:00
orbiter
3419b3bcdd fix for bug that caused the peer-counter problem.
See http://www.yacy-forum.de/viewtopic.php?p=16016#16016
The kelondroDyn now uses a generic fill character.
kelondroDyn-Tables containing peer/word/url-hashes must not use '_'
as fill character.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1498 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 22:16:55 +00:00
allo
9bc62bfdac typo
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1497 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 20:57:11 +00:00
allo
cdc595ab67 another ppm fix. no more unknown ppm
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1496 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 20:42:02 +00:00
hermens
4f43816ec0 *) Fix wrong class cast in indexSize()
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1495 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 17:09:07 +00:00
orbiter
a7f0adf6fa bugfix in entity iterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1490 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 12:50:40 +00:00
orbiter
fa90c3ca7a - removed some usage of indexEntity
- changed index collection process: indexes are not first flushed to indexEntity,
  but now collected directly from ram cache and assortments

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1489 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 12:42:06 +00:00
orbiter
aea3e00864 cleanup: removed unused temporary index management in indexEntity.
This is replaced by indexContainers

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1486 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 01:18:25 +00:00
orbiter
03c65742ba changes towards the new index storage scheme:
- replaced usage of temporary IndexEntity by EntryContainer
- added more attributes to word index
- added exact-string search (using quotes in query)
- disabled writing into WORDS during search; EntryContainers are used instead


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1485 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-30 00:42:38 +00:00
rramthun
84a00e5673 Use YaCy logging instead of something I don't understand.
Problem was: YaCy under Linux wrote every CORRECTING ITERATOR message to syslog, and your log files get VERY big if you run YaCy 24/7.
Approx. 20MB/day.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1483 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-29 16:04:20 +00:00
theli
ab7a911bb3 *) Trying to solve pool not open problem
See: http://www.yacy-forum.de/viewtopic.php?t=1798

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1482 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-29 08:54:19 +00:00
allo
a6245a302f even better ppm ;-)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1481 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 19:22:30 +00:00
hydrox
d665f3c39c *) fixed Threadnames for stackCrawl-Threads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1480 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 19:06:21 +00:00
theli
3d5347bc8e *) changing loglevel for some messages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1479 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 17:40:24 +00:00
theli
0fcd113c42 *) last bugfix part. Seems to work now for the stackCrawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1478 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 17:25:19 +00:00
theli
b9c9eaeb44 *) next try to do a bugfix :-((
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1477 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 17:05:47 +00:00
theli
4b4b93c413 *) next try to do a bugfix :-(
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1476 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 16:55:05 +00:00
theli
d9fbad71b9 *) next try to do a bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 16:38:25 +00:00
theli
6da97bd2e4 *) next bugfix for threadpool problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1474 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 16:31:40 +00:00
theli
bea2b9edee *) further redesign of threadpools to solve too many thread problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 16:18:07 +00:00
hermens
2d1283da34 This is an extremely ugly workaround for an incompatibility between yacySeed hashes and kelondroDyn keys
See: http://www.yacy-forum.de/viewtopic.php?p=15955#15955



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1472 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 15:26:56 +00:00
theli
784fd50437 *) more verbose thread names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1471 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 15:26:47 +00:00
theli
56e4dbeb71 *) displaying current active + current idle threads in PerformanceQueues_p.html now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1470 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-28 15:17:04 +00:00
theli
859c6a88f5 *) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1467 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 16:51:29 +00:00
allo
7197f171d3 better ppm calculation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 11:51:27 +00:00
orbiter
f2b18cede9 AND-bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1461 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 03:58:38 +00:00
orbiter
b946e28e61 some ranking enhancements
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-27 02:48:27 +00:00
rramthun
6c02f889f7 Cosmetic changes.
Corrected version numbering as described in http://www.yacy-websuche.de/wiki/index.php/De:Versionsnummern

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1453 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 15:12:44 +00:00
theli
b191f06d16 *) Adding additional logging message to locate problems with stackcrawl threads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1452 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 14:24:29 +00:00
theli
5c56b9ed59 *) catch exceptions that could occur during url decoding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1451 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 13:57:49 +00:00
theli
d9bcd73d93 *) Bugfix for exception
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1448 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 12:15:59 +00:00
theli
f5abfe8d57 *) more failsafe threadpools
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1446 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-26 09:37:43 +00:00
orbiter
47344e8df0 removed referrer fake (too many complaints, too little use)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1444 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 18:54:46 +00:00
hermens
ad0de69607 Yet another bug fix for svn 1441. It should work now.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1443 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:32:04 +00:00
hermens
58fd40e1c1 Aaargh
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1442 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:28:03 +00:00
hermens
b08af0c2cb *) Force download of seed file when checking upload success
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1441 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 13:25:48 +00:00
hermens
66c889138e *) Bugfix: Principals are reported back as 'principal', so IWasAccessed should also be true
*) make it easier to include legacy peers switching between timezones +0100 and +0200



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1438 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-25 01:50:24 +00:00
orbiter
a56fefe0d3 added missing forced-flush for index cache
see http://www.yacy-forum.de/viewtopic.php?p=15732#15732

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1434 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 16:24:15 +00:00
hermens
78bcb8014a *) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
(For wider ranges, finding enough suitable targets is unlikely)
*) Migrate Indexes from ClassicDB back to AssortmentCluster if transfer fails
*) Remove class iterateFiles from plasmaWordIndex
   (The class iterateFiles from plasmaWordIndexClassicDB is used instead)



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1430 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 14:58:15 +00:00
hermens
861aae678d *) cleanup cacheAge database when cleaning up the HTCache
*) Log directory deletes with level Fine



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1427 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 14:21:01 +00:00
orbiter
b9d73f63e7 replaced String object in loop detection by byte[] to avoid String generation,
which could cause locks.
See http://www.yacy-forum.de/viewtopic.php?p=15738#15738

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1425 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 13:12:38 +00:00
low012
927c2c3709 *) Fixed a minor bug in code for tables. {|border"1" did not work, {| border"1" did. Now the space is not needed anymore.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1423 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 11:15:20 +00:00
theli
75aad0fe66 *) Bugfix for URLs containing spaces
See: http://www.yacy-forum.de/viewtopic.php?t=1640

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1422 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 09:30:58 +00:00
theli
754a35877f *) Changing robots parser exclusion policy
- crawling is now allowed if server returned a 403 statuscode 
     when trying to download the robots.txt
   See: http://www.yacy-forum.de/viewtopic.php?t=1612

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1421 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 08:53:30 +00:00
hermens
a2e2d583f9 *) small bugfix regarding peerPingMaxRunning
*) beautify log



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1419 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-24 01:16:22 +00:00
theli
b4e2efef10 *) first test of new iteration function
ATTENTION: please don't use it at the moment

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1418 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 17:20:30 +00:00
rramthun
a4e90c4b11 Fixed spelling bug.
I think this is important so that other programmers don't make the same mistake as the original author.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1417 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 15:08:39 +00:00
low012
c45517db46 *) replaced code for table with better version (by kane)
*) split replaceHTML into replaceCharacters and replaceHTMLonly, replaceHTML can still be used to ensure compatibility



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1416 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 13:59:40 +00:00
orbiter
eabf4a0386 fix for null pointer exception during shut-down
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1415 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 13:45:14 +00:00
orbiter
47843e69e2 auto-reset for switchboard queue stack
bugfix for http://www.yacy-forum.de/viewtopic.php?p=15684#15684

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1414 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 12:41:08 +00:00
orbiter
a70970f993 fixed increment in content iterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1413 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 12:11:32 +00:00
hermens
62ab8d18c1 *) Bugfix for peer sorting method. This seems to cause funny side effects in the SeedDB
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1412 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-23 01:42:26 +00:00
hermens
75b268f16d *) use majority voting for peer type decision
*) reduce the number of peer pings sent out
see: http://www.yacy-forum.de/viewtopic.php?t=1748



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1411 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 23:14:37 +00:00
orbiter
d6581c445b added content iterator for corrupted database files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1406 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 17:12:16 +00:00
theli
ecdc1f7547 *) Bugfix for crawling URLs with query parameters
See: http://www.yacy-forum.de/viewtopic.php?p=14065
*) Preparation for http://www.yacy-forum.de/viewtopic.php?t=1719

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1405 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 16:39:10 +00:00
low012
eb80156233 *) added Kane's code for tables
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1403 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 15:41:49 +00:00
low012
ef22fa8bf2 *) beautifying code and a little comment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1401 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 14:20:38 +00:00
orbiter
fc4ae899f7 added word-position to ranking (this is only a first step)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1395 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 02:16:09 +00:00
orbiter
bb2095fe39 assortment files are now not deleted, but shifted to a backup directory.
See also: http://www.yacy-forum.de/viewtopic.php?p=15458#15458

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1394 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 01:03:01 +00:00
orbiter
7366e39dd3 tried to fix 100% CPU bug.
See http://www.yacy-forum.de/viewtopic.php?p=15569#15569

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1393 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 00:50:05 +00:00
orbiter
f14d49fae9 enhancements, bugfixes and additions to word index attribute storage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1392 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-22 00:07:00 +00:00
allo
4d33020f56 Migration to WORK
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1389 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-21 16:39:57 +00:00
rramthun
1e5feedf0e Fix for http://www.yacy-forum.de/viewtopic.php?p=15547#15547
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1388 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-21 11:20:26 +00:00
orbiter
f4ffa9aee5 - implemented more attributes to index entries
- implemented hand-over of new word index attributes during remote search
- implemented word-distance computation during search

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1382 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 15:14:21 +00:00
theli
026dcdfcc0 *) Bugfix from mbirth for ftpc bug
See: http://www.yacy-forum.de/viewtopic.php?p=15496#15496
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1381 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 13:37:42 +00:00
rramthun
34a7802a36 .bat files create DOS-window with title "YaCy" as suggested in http://www.yacy-forum.de/viewtopic.php?p=15470#15470
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1380 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 12:01:12 +00:00
orbiter
90b940e90e fixed position storage problem.
Now the word position is properly stored.
No use of that now, but can be used for better ranking.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1378 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-20 00:26:55 +00:00
orbiter
0371494010 tried to add word position to index
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1377 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-19 14:13:39 +00:00
orbiter
f1cfee7703 removed tabs from condenser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1376 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-19 12:24:35 +00:00
hermens
37791fd529 *) Close indexEntities when "found not enough peers for distribution"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1375 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 20:47:05 +00:00
orbiter
3ec7177f0f added seed flag 'AcceptCitationReference'
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1374 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 14:52:24 +00:00
borg-0300
c5b6154136 added CRDistOn = true/false
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1372 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 02:18:23 +00:00
low012
ee010c36ae *) fixed bug where ampersands were replaced by replaceHTML even if they were part of &.+;
*) fixed bug in htmlentities (cAseSenSitivity!)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1371 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 01:48:34 +00:00
orbiter
71d5c2b2ca better control for target peer selection for RWI transfer
see also http://www.yacy-forum.de/viewtopic.php?p=15343#15343

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1370 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-18 00:30:07 +00:00
hermens
ca7407b7e1 *) Don't change maxTime if zero or negative
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1363 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 14:47:21 +00:00
hermens
84fce71d11 *) correct merging of indexes when enumerating downward
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1361 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 11:44:55 +00:00
low012
7d5af75d11 *) Second try: replaced replaceHTML again. There should be no problem this time.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1359 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 01:00:38 +00:00
orbiter
d45b1162e2 signature fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1358 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 00:43:38 +00:00
orbiter
c51e18c4ad - added signatures to ordering
- added storage of orderings to database
- orderings can be read from database when opening a database

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1357 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-17 00:31:27 +00:00
low012
c6d58d9b9a *) taking back changes for replaceHTML
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1356 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 21:43:32 +00:00
low012
8af5df7883 *) Replaced replaceHTML by a more versatile method (code by Kane). Please test for possible side effects if you use this method in any of your classes.
*) Added a little bit of javadoc
*) Replaced tabs by spaces and beautified the code here and there.

The code is saved in UTF-8 encoding. I hope that does not cause any trouble.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1354 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 21:15:06 +00:00
allo
fd1dd6859b code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1353 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:40:02 +00:00
theli
99031b10ae *) Bugfix for recursive translation problem with file separators
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1352 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:32:24 +00:00
allo
1f9ff68f45 bugfix for the notdir in the recursive functions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1351 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 17:16:50 +00:00
theli
da81d7db17 *) Adding possibility to close open connections via Connections_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1350 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-16 14:46:33 +00:00
allo
f8ee05372f better Logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1344 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 22:06:35 +00:00
allo
5531d18676 translations with relative Path
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1342 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 18:52:14 +00:00
allo
fe2d983c3e recursive Translations!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1341 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 11:56:20 +00:00
orbiter
3d7c8aaeae removed confusing method
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1339 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 02:08:11 +00:00
orbiter
4cd0c45a77 code cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1337 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-14 01:06:54 +00:00
hermens
971247b78f - rotate merged indexes after merging
see: http://www.yacy-forum.de/viewtopic.php?t=1717
- fix -rwihashlist to correctly shutdown



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1336 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 23:59:04 +00:00
(no author)
001513cc1f Now custom httpHeader can be created
and filled with cookies and so on.

This header can then be set in serverObjects

Check CookieTest.html and CookieTest.java for details.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1334 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 22:50:04 +00:00
(no author)
55f3232219 Patch for the Cookie management.
Version 0.1

Start YaCy, go to localhost:8080/CookieTest.html
Play around with cookies
Look into CookieTest.java to see how it works

This behavior will be changed 
such that httpHeader will be responsible for the cookies in the future



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1332 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 21:29:04 +00:00
orbiter
e2ff1767b5 fix for last DHT distribution bug-fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1330 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 16:58:27 +00:00
allo
2cd01eb1aa bugfix for dates.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1329 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 15:26:10 +00:00
allo
d5aaaf500f bookmarksDB:
lowercase tags; if you create a tag with uppercase chars, friendlyName will be set.
So the tag is displayed in uppercase, but stored in lowercase.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1328 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 15:10:24 +00:00
orbiter
060e5a0df0 fixed problem with DHT target peer selection:
- shifted selection in front of distribution
see http://www.yacy-forum.de/viewtopic.php?p=15131#15131


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1327 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 14:32:48 +00:00
theli
7c22afe3de *) Bugfix for NullpointerException in deleteOldHTCache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1326 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-13 10:37:42 +00:00
(no author)
873cff20e9 handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1324 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:15:07 +00:00
(no author)
1d3249e787 handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1323 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:14:39 +00:00
(no author)
76f356315d handle UTF-8 correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1322 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 21:14:12 +00:00
orbiter
b21b9df2d0 added section headlines generation to html parser
can be viewed in cache control, but is not yet included to indexing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1320 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 20:21:34 +00:00
rramthun
c4487deba9 Minor changes collected over some time.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1319 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 19:15:31 +00:00
allo
13c978d3bc only lowercase tags
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1317 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 17:41:02 +00:00
orbiter
de359a9431 patched possible 100% cpu bug.
See http://www.yacy-forum.de/viewtopic.php?p=15067#15067

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1316 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 16:50:34 +00:00
allo
6822dce57b Using Orbiter's function for auth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1315 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 16:21:08 +00:00
orbiter
38b19ff56a another bugfix to the correctedNodeIterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1314 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 15:19:09 +00:00
orbiter
45c44ca9c8 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1717
(no rotation)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1313 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-12 07:24:03 +00:00
orbiter
c137683a5c added patch to correct tree-iterator
(symptoms: at the end of the iteration,
more elements are iterated that are not in the
correct order)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1312 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-11 13:47:16 +00:00
orbiter
2028403670 - consolidated different orderings to kelondroNaturalOrder
- added another iteration method to rwihash-enumeration


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1309 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-11 00:32:44 +00:00
orbiter
9544c47684 added some UTF-8 handling.
hope this will help somehow... for sure not THE solution to our UTF-8 problem


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1308 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-10 16:48:59 +00:00
borg-0300
9d8dca750e BUGFIX for my last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1306 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-10 10:38:11 +00:00
borg-0300
5449193167 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706 (i hope)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1304 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 23:19:05 +00:00
borg-0300
2a23f5d419 F..., Sorry, no time, later
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1303 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:26:39 +00:00
borg-0300
3a2d13786e bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1302 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:18:41 +00:00
borg-0300
23a39698a7 last commit removed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1301 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 19:07:11 +00:00
borg-0300
3abd843cdb bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1300 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 18:54:21 +00:00
allo
c3284c27f5 this Code is now in passwordAuth
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1299 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 11:50:48 +00:00
theli
1c7f0789c9 *) Allow crawling of dynamic urls if configured
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1298 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:31:06 +00:00
theli
15b4e16591 *) correcting problems with comments while using the urlRedirector in command mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1297 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:12:23 +00:00
theli
94e54c4450 *) Removing url hash from loaded list before inserting the url into the crawling queue
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1296 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 10:05:45 +00:00
theli
99861ff2c1 *) Crawling depth can now be configured via command string in urlRedirector mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1295 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 09:34:51 +00:00
orbiter
a125fee9ce fixed iterator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1294 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:55:47 +00:00
orbiter
33f8920e1c new database format: split trees, using an ordering to split the tree
The new format can be tested with the dbtest

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1293 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:46:56 +00:00
orbiter
d83cc3084a fixed small problem when password cannot be decrypted with base64
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1292 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-09 00:16:34 +00:00
borg-0300
dc0999ec9c adapted to new HTCache structure
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1290 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-05 00:33:59 +00:00
allo
6df8080cf2 tiny fix for recursive Translations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1289 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 21:59:09 +00:00
allo
4e4bd4662d redirectors fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1288 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 17:40:18 +00:00
allo
b63bcf25d7 moving string2vector and vector2string to listsmanager
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1286 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 13:09:43 +00:00
orbiter
10a095d6b5 - added base64 ordering methods
- added coding interface

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1285 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 01:45:23 +00:00
orbiter
9086261476 refactoring of base64 encoding:
the kelondro database needs specific information about the order of
base64-encoded keys. Since no other package depends on base64
(only the httpd uses base64 for encryption, but does not need to encode these strings)
it is good to move base64 encoding to the new ordering classes in kelondro.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1284 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-04 00:39:00 +00:00
orbiter
7c0d7ed4f8 introduction of generic order interface for kelondro
this is needed for (to be introduced) split tree-files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1283 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 12:35:39 +00:00
borg-0300
b24fcc8ca4 oom
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1281 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 11:08:45 +00:00
borg-0300
7da232b5b9 HTCache Reset if necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1280 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-03 11:02:02 +00:00
borg-0300
4f18f24d81 small change
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1278 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-02 22:30:40 +00:00
borg-0300
c652527620 YaCy now removes the old HTCACHE data
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1277 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-02 22:24:19 +00:00
borg-0300
69f65210e2 ".yacy" has its own directory;
happy new year :)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1275 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-01-01 13:07:57 +00:00
allo
351fffc129 DATA/WORK for user-created content
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1274 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-31 11:47:52 +00:00
allo
a81cc9d969 no DATA/DATA to avoid confusion.
increasing version number

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1273 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-31 11:13:26 +00:00
allo
ba96cefe0c packages for xml/*
bugfix for servlets with packages from theli.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1272 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 13:57:51 +00:00
borg-0300
b95c5d5781 BUGFIX for URLs like "/../" ...;
new port handling;

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1271 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 12:58:36 +00:00
allo
9cce3c5709 dates table for bookmarksdb (needed for del.icio.us api)
Files in DATA/DATA
Migration: move bookmarks.db from SETTINGS in DATA

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1270 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-30 12:34:44 +00:00
hermens
11fe95832e avoid division by zero when index transfer is extremely fast
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1269 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-29 20:01:05 +00:00
allo
2e2fa99501 bookmarksManager:
-gettitle_p.xml and AJAX to use it
-classpath change
httpc:
-simple wget function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1267 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-29 17:45:50 +00:00
allo
ca3a513aee del.icio.us style api
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1265 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 18:26:26 +00:00
allo
75b9163242 xmlapi for bookmarktags
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1263 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-28 18:07:57 +00:00