Commit Graph

398 Commits

Author SHA1 Message Date
theli
bead8a32aa *) IndexCreate_p.java:
Crawler StartURLs will now also added to the errorURL-DB if an error occures on this url
*) kelondroStack.java, plasmaSwitchboardQueue.java
   Adding method which returns a list of all entries in the queue. This list is used by IndexCreate_p.java 
   instead of an iterator to display the indexing-list. 
   Advantages: avoid concurrent modifications of the list while displaying it. 
               Speedup because now we have to access only one sync function instead of multiple ones 
               (one for each entry)
*) IndexCreateIndexingQueue_p.java
   Using new list() function of plasmaSwitchboardQueue
*) httpdFileHandler.java
   If a servelet returns the special value "LOCATION" the httpFileHandler does a Redirection of 
   the Browser to the URL specified by the servelet. This can e.g. be used when a http get request is
   used insead of a post request, but a refresh should not be allowed.
*) IndexCreateWWWLocalQueue_p.html
   Now it's possible to delete single entries of the local crawler queue

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@626 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 07:52:46 +00:00
theli
48aaf703cc *) Adding additional logging output to detect crawling problems
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@625 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 06:55:21 +00:00
theli
59b8a98c7e *) Bugfix for suppressing of stacktrace in log on crawler error "MalformedURLException"
See: http://www.yacy-forum.de/viewtopic.php?p=8840

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@623 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-09-01 06:31:30 +00:00
low012
2ee4f9f4e5 *) added escape characters [= and =] to yacyWiki
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@622 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 14:43:55 +00:00
borg-0300
c1d7527929 better cache cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@621 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 13:07:08 +00:00
theli
2e6df95786 *) adding toString method
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@620 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 10:43:03 +00:00
theli
ebbd063c92 *) Making mimeTable static final
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@619 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-31 09:22:55 +00:00
theli
60e77dcc60 *) Adding additional loglevel finer + finest according to Thread http://www.yacy-forum.de/viewtopic.php?p=8750#8750
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@618 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 22:37:48 +00:00
borg-0300
fa54b5f38d cleanup spaces
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@617 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 22:24:38 +00:00
theli
4fd5b95b1f *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
- please use logFine instead of logDebug
   - please use logSevere instead of logFailure and logError
   See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@615 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 21:32:59 +00:00
theli
6adf8a4bde *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
- please use logFine instead of logDebug
   - please use logFailure instead of logError
   See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@614 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 21:10:39 +00:00
theli
f19c09b227 *) Suppress stacktrace on crawler error for "MalformedURLException"
See: http://www.yacy-forum.de/viewtopic.php?p=8733#8733

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@613 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 20:25:07 +00:00
theli
cc1df08069 *) Adding missing synchronized blocks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@608 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 14:57:32 +00:00
theli
0dfa8b62e2 *) Changing Proxy-Useragent string according to thread http://www.yacy-forum.de/viewtopic.php?p=8183#8183
A typical useragent string now e.g. looks like: 
   Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.10; YaCy 0.401/00602; yacy.net) Gecko/20050716 Firefox/1.0.6

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@607 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 13:41:47 +00:00
borg-0300
bf14e6def5 *) proxyCache, proxyCacheSize can be changed under 'Proxy Indexing'
- path now are absolute
*) move path check from plasmaHTCache to plasmaSwitchboard
   - only one path check when starting
*) small other

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 12:50:30 +00:00
theli
3dfda1c9da *) More verbose output on ftp-seed-upload failure
See: http://www.yacy-forum.de/viewtopic.php?p=8000#8000

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@605 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 12:18:41 +00:00
theli
e3aa3a2d75 *) Bugfix for ProxyAccess Logger
URL was accidentally logged without the parameters  

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@604 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 11:37:54 +00:00
theli
9b818b1ce3 *) Pausing Crawlers if there is not enough space on disk
See: http://www.yacy-forum.de/viewtopic.php?p=8648

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@603 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 09:43:27 +00:00
theli
b33094e925 *) Trying to solve "Too many open files bug"
*) Temp.Bugfix for "Bug in Index Restore"
   See: http://www.yacy-forum.de/viewtopic.php?p=8647#8647
   Orbiter: Please take a look



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@602 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 09:07:42 +00:00
theli
b67f008eb8 *) Trying to solve "Too many open files bug"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@601 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 06:45:49 +00:00
theli
34790acf02 *) Bugfix for suppressing of stacktrace in log on crawler error "unknown host"
See: http://www.yacy-forum.de/viewtopic.php?p=8615#8615

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@600 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-30 06:24:23 +00:00
theli
a812fb86cc *) Port Forwarding Feature does not detect broken connection properly.
Therefor a test-request was added to the isConnected function to detect broken connections
   and to keep open connections alive


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@596 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-29 11:39:10 +00:00
theli
af7b8f75bd *) Making proxyAccessLogging configureable via yacy.logging file
- logging can be disabled now
   - logging directory / filelimit / rotation count can be configured now
   See: http://www.yacy-forum.de/viewtopic.php?t=965&postdays=0&postorder=asc&start=30#8280

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@595 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-29 11:31:58 +00:00
theli
8132a44305 *) Better error handling if yacy SOAP extension is not installed
See: http://www.yacy-forum.de/viewtopic.php?t=1040

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-29 08:24:07 +00:00
allo
66ebce1109 use staticIP more often
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@592 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-28 16:55:52 +00:00
theli
2a081c9ee5 *) Adding additional logging message for "NURL.entry() == null" Bug
See: http://www.yacy-forum.de/viewtopic.php?p=8446

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@591 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-28 05:39:26 +00:00
theli
cb1f11c96b *) Suppress stacktrace on crawler error for "Unknown Host"
See: http://www.yacy-forum.de/viewtopic.php?p=8431

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@590 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-28 05:08:26 +00:00
theli
e338a13de3 *) Suppress stacktrace on crawler error for "Read timed out"
See: http://www.yacy-forum.de/viewtopic.php?p=8433

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@589 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 18:48:02 +00:00
theli
2e43e744de *) Suppress stacktrace on crawler error for "connect timed out"
See: http://www.yacy-forum.de/viewtopic.php?p=8420 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@588 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 04:53:25 +00:00
theli
36cbe04e3e *) Bugfix for Crawler Redirection Bug
See: http://www.yacy-forum.de/viewtopic.php?p=8422

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-27 04:36:13 +00:00
theli
b70de495a0 *) Remembering Crawler-isPaused setting
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-25 09:51:24 +00:00
theli
e569a84dc0 *) Using the same configuration settings for all indexing threads on server Startup
See: http://www.yacy-forum.de/viewtopic.php?p=8349

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@584 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-24 09:34:04 +00:00
theli
17be77a468 *) Bugfix for "Crawler data will not be removed from htcache if content parsing failed"
See: http://www.yacy-forum.de/viewtopic.php?t=965&highlight=ramdisk
*) Making ACCEPT_LANGUAGE configureable for crawler
   See: http://www.yacy-forum.de/viewtopic.php?p=8327

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@583 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-24 07:47:42 +00:00
theli
858cb983d7 *) Printout date and system name on proxy error page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@581 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 11:32:36 +00:00
theli
4a2f06053f *) Bugfix for "Gehäuselautsprecher" Bug
If de.anomic.server.logging.ConsoleOutErrHandler.ignoreCtrlChr is set to true all control chars except 
   space,tab,newline, are replaced with spaces
   See: http://www.yacy-forum.de/viewtopic.php?p=5528

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@579 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 10:48:24 +00:00
theli
dc7c5237fb *) Bugfix for "Gehäuselautsprecher" Bug
If de.anomic.server.logging.ConsoleOutErrHandler.ignoreCtrlChr is set to true all control chars except 
   space,tab,newline, are replaced with spaces
   See: http://www.yacy-forum.de/viewtopic.php?p=5528

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@578 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 10:45:44 +00:00
theli
5f55dff297 *) Bugfix for "Binäre Nullen auf der page: Index Creation: Indexing Queue"
See: http://www.yacy-forum.de/viewtopic.php?p=6877#6877

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@577 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 08:37:42 +00:00
theli
cb97d2972e *) Bugfix for "peer not accessible via .yacy name if Transparent Proxy Support is enabled" bug
See: per Browser nicht erreichbare Peers

*) Proxy Error Page now displays the Peer Name on top

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@575 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-23 07:27:58 +00:00
allo
60074b4301 more DebugMode(60 Secs Timeout)
needed for Yacy with tor.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@573 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-22 14:02:34 +00:00
allo
eb6365c069 local Bootstrapping bug.
use yacyDebugMode=true to allow local bootstrapping


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-22 12:13:19 +00:00
theli
330eae7cf3 *) Normalizing CrawlerStartURL now before crawling is started
*) CrawlWorker also does a URL normalization now before following the redirection URL
*) CrawlWorker removes redirection URL correctly from noticeURL stack now

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:52:46 +00:00
theli
ab894d26bc *) Bugfix for "plasmaSwitchboard.deQueue: null" Bug (hopefully)
See: http://www.yacy-forum.de/viewtopic.php?p=8135#8135

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:48:37 +00:00
theli
eaf9f26cc3 *) Bugfix for NULL PROFILE HANDLE 'null' Bug:
See: http://www.yacy-forum.de/viewtopic.php?p=7855#7855

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@569 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 22:16:19 +00:00
rramthun
0864ea367d Added preformated changelog.txt
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@567 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-21 10:38:22 +00:00
allo
41aa3ae72e provide a virtuell Headerfield IS_ADMIN.
This allows Serverlets to check Admin Status.
http://www.yacy-forum.de/viewtopic.php?t=1003


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@566 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-20 21:22:05 +00:00
rramthun
4cb382decb Adding changes by borg-0300 from http://www.yacy-forum.de/viewtopic.php?t=997
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@565 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-20 17:05:01 +00:00
low012
ecfdc4928a *) better implementation of definition lists
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@564 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-20 12:57:31 +00:00
rramthun
26289e955c Updates of the language file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@563 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-20 12:09:27 +00:00
theli
0d0bc02f38 *) Printing stacktrace into log on seed-upload error
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@562 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-19 22:59:33 +00:00
theli
115c4edfcc *) Adding additional logging statements to help debugging seed-upload problems
See: http://www.yacy-forum.de/viewtopic.php?t=975&highlight= 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@561 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-08-19 09:32:30 +00:00