borg-0300
7626823519
BUGFIX for last 'commit'
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@635 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 23:43:27 +00:00
borg-0300
971756e8dd
the delete size is smaller
...
See: http://www.yacy-forum.de/viewtopic.php?t=1084
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@634 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 23:35:00 +00:00
theli
0471019606
*) IndexCreateIndexingQueue_p.html now also shows indexing jobs that are currently in process
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@633 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 22:05:20 +00:00
borg-0300
cc493ef8c1
Added change from Hermes
...
See: http://www.yacy-forum.de/viewtopic.php?t=1050
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@629 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 11:18:41 +00:00
theli
bead8a32aa
*) IndexCreate_p.java:
...
Crawler StartURLs will now also added to the errorURL-DB if an error occures on this url
*) kelondroStack.java, plasmaSwitchboardQueue.java
Adding method which returns a list of all entries in the queue. This list is used by IndexCreate_p.java
instead of an iterator to display the indexing-list.
Advantages: avoid concurrent modifications of the list while displaying it.
Speedup because now we have to access only one sync function instead of multiple ones
(one for each entry)
*) IndexCreateIndexingQueue_p.java
Using new list() function of plasmaSwitchboardQueue
*) httpdFileHandler.java
If a servelet returns the special value "LOCATION" the httpFileHandler does a Redirection of
the Browser to the URL specified by the servelet. This can e.g. be used when a http get request is
used insead of a post request, but a refresh should not be allowed.
*) IndexCreateWWWLocalQueue_p.html
Now it's possible to delete single entries of the local crawler queue
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@626 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 07:52:46 +00:00
theli
48aaf703cc
*) Adding additional logging output to detect crawling problems
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@625 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 06:55:21 +00:00
theli
59b8a98c7e
*) Bugfix for suppressing of stacktrace in log on crawler error "MalformedURLException"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8840
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@623 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 06:31:30 +00:00
borg-0300
c1d7527929
better cache cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@621 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 13:07:08 +00:00
theli
2e6df95786
*) adding toString method
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@620 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 10:43:03 +00:00
theli
4fd5b95b1f
*) Renaming Logger function names to reflect the proper Java Logging API Loglevels
...
- please use logFine instead of logDebug
- please use logSevere instead of logFailure and logError
See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@615 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 21:32:59 +00:00
theli
6adf8a4bde
*) Renaming Logger function names to reflect the proper Java Logging API Loglevels
...
- please use logFine instead of logDebug
- please use logFailure instead of logError
See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@614 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 21:10:39 +00:00
theli
f19c09b227
*) Suppress stacktrace on crawler error for "MalformedURLException"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8733#8733
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@613 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 20:25:07 +00:00
theli
cc1df08069
*) Adding missing synchronized blocks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@608 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 14:57:32 +00:00
borg-0300
bf14e6def5
*) proxyCache, proxyCacheSize can be changed under 'Proxy Indexing'
...
- path now are absolute
*) move path check from plasmaHTCache to plasmaSwitchboard
- only one path check when starting
*) small other
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 12:50:30 +00:00
theli
9b818b1ce3
*) Pausing Crawlers if there is not enough space on disk
...
See: http://www.yacy-forum.de/viewtopic.php?p=8648
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@603 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 09:43:27 +00:00
theli
b33094e925
*) Trying to solve "Too many open files bug"
...
*) Temp.Bugfix for "Bug in Index Restore"
See: http://www.yacy-forum.de/viewtopic.php?p=8647#8647
Orbiter: Please take a look
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@602 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 09:07:42 +00:00
theli
34790acf02
*) Bugfix for suppressing of stacktrace in log on crawler error "unknown host"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8615#8615
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@600 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 06:24:23 +00:00
theli
af7b8f75bd
*) Making proxyAccessLogging configureable via yacy.logging file
...
- logging can be disabled now
- logging directory / filelimit / rotation count can be configured now
See: http://www.yacy-forum.de/viewtopic.php?t=965&postdays=0&postorder=asc&start=30#8280
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@595 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-29 11:31:58 +00:00
theli
2a081c9ee5
*) Adding additional logging message for "NURL.entry() == null" Bug
...
See: http://www.yacy-forum.de/viewtopic.php?p=8446
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@591 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-28 05:39:26 +00:00
theli
cb1f11c96b
*) Suppress stacktrace on crawler error for "Unknown Host"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8431
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@590 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-28 05:08:26 +00:00
theli
e338a13de3
*) Suppress stacktrace on crawler error for "Read timed out"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8433
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@589 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 18:48:02 +00:00
theli
2e43e744de
*) Suppress stacktrace on crawler error for "connect timed out"
...
See: http://www.yacy-forum.de/viewtopic.php?p=8420
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@588 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 04:53:25 +00:00
theli
36cbe04e3e
*) Bugfix for Crawler Redirection Bug
...
See: http://www.yacy-forum.de/viewtopic.php?p=8422
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 04:36:13 +00:00
theli
b70de495a0
*) Remembering Crawler-isPaused setting
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-25 09:51:24 +00:00
theli
e569a84dc0
*) Using the same configuration settings for all indexing threads on server Startup
...
See: http://www.yacy-forum.de/viewtopic.php?p=8349
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@584 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-24 09:34:04 +00:00
theli
17be77a468
*) Bugfix for "Crawler data will not be removed from htcache if content parsing failed"
...
See: http://www.yacy-forum.de/viewtopic.php?t=965&highlight=ramdisk
*) Making ACCEPT_LANGUAGE configureable for crawler
See: http://www.yacy-forum.de/viewtopic.php?p=8327
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@583 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-24 07:47:42 +00:00
theli
5f55dff297
*) Bugfix for "Binäre Nullen auf der page: Index Creation: Indexing Queue"
...
See: http://www.yacy-forum.de/viewtopic.php?p=6877#6877
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@577 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 08:37:42 +00:00
allo
eb6365c069
local Bootstrapping bug.
...
use yacyDebugMode=true to allow local bootstrapping
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-22 12:13:19 +00:00
theli
330eae7cf3
*) Normalizing CrawlerStartURL now before crawling is started
...
*) CrawlWorker also does a URL normalization now before following the redirection URL
*) CrawlWorker removes redirection URL correctly from noticeURL stack now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:52:46 +00:00
theli
ab894d26bc
*) Bugfix for "plasmaSwitchboard.deQueue: null" Bug (hopefully)
...
See: http://www.yacy-forum.de/viewtopic.php?p=8135#8135
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:48:37 +00:00
theli
eaf9f26cc3
*) Bugfix for NULL PROFILE HANDLE 'null' Bug:
...
See: http://www.yacy-forum.de/viewtopic.php?p=7855#7855
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@569 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:16:19 +00:00
rramthun
4cb382decb
Adding changes by borg-0300 from http://www.yacy-forum.de/viewtopic.php?t=997
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@565 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-20 17:05:01 +00:00
theli
ec4c70d722
*) If there are at most 10 entries left while doing an index transfer, these entries will also be appended
...
to the index list
|> D 2005/08/18 10:00:02 PLASMA Selected partial index (33 from 37 URLs, 0 not bound) for word fSuQM0xAJK1G
See: http://www.yacy-forum.de/viewtopic.php?t=970
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@556 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-18 10:04:45 +00:00
theli
d4a045d7b1
*) Trying to solve "de.anomic.plasma.plasmaSwitchboard.deQueue': null" Bug
...
See: http://www.yacy-forum.de/viewtopic.php?p=7791
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@555 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-18 06:40:26 +00:00
theli
ea9a992f05
*) Before the crawler retries to download a URL it checks if the server is already doing a shutdown
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@554 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-17 11:36:48 +00:00
theli
ea26b84eed
*) Bugfix for http://www.yacy-forum.de/viewtopic.php?t=954
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@553 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-17 10:12:52 +00:00
theli
0c8a48e2cb
*) converting php Session ID to lower case in funktion isCGI
...
See: http://www.yacy-forum.de/viewtopic.php?p=7671#7671
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@552 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-17 05:50:18 +00:00
orbiter
e616395c3b
latest changes and cut for 0.40
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@548 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-16 15:15:19 +00:00
orbiter
c47bb1182d
bugfix for assortment initialization error
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@547 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-16 11:43:14 +00:00
theli
4654eae4e2
*) adding php Session ID to argument in funktion isCGI
...
See: http://www.yacy-forum.de/viewtopic.php?p=7671#7671
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@546 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-16 11:33:31 +00:00
orbiter
25f632dbd9
more DHT bugfixes and better logging of DHT effects
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@542 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-16 00:31:15 +00:00
orbiter
5cb00889d9
enhancements to dht selection, search and search presentation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@540 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-15 01:12:25 +00:00
orbiter
ba0a486328
moved printStackTrace() to logging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@539 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-14 23:35:18 +00:00
orbiter
3094045d34
fix for http://www.yacy-forum.de/viewtopic.php?p=7454#7454
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@536 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-14 16:16:11 +00:00
orbiter
cd10370992
several bugfixes and dht selection / logging improvement
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@531 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-14 00:57:30 +00:00
orbiter
3610fe6b3a
see http://www.yacy-forum.de/viewtopic.php?p=7410#7410
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@530 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-13 22:04:18 +00:00
orbiter
c8a7a85ce2
fix for http://www.yacy-forum.de/viewtopic.php?p=7384#7384
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@529 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-13 21:41:22 +00:00
orbiter
6594541ef5
fix for http://www.yacy-forum.de/viewtopic.php?p=7361#7361
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@526 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-12 22:53:49 +00:00
orbiter
7db543a9fa
fixes for several dht misbehaviours
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@524 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-12 22:14:24 +00:00
orbiter
5716f8521d
bug fixes for word ordering and dht index selection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@521 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-12 14:06:47 +00:00