Commit Graph

223 Commits

Author SHA1 Message Date
orbiter
311e627363 blocking of blacklisted urls in indexReceive and small changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@397 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-11 15:36:10 +00:00
orbiter
2f0d7ea8d3 removed htcache stati (superfluous now)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@396 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-09 00:33:34 +00:00
orbiter
277048501e bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@395 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-08 16:24:07 +00:00
orbiter
8b89c46afe fixed problem with cache write
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@394 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-08 16:06:35 +00:00
orbiter
455ae9f55f fixed htcache-store problem and due-time for remote crawls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@393 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-08 15:17:50 +00:00
theli
55d10b864c *) further improvements in shutdown behaviour
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@392 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 22:03:52 +00:00
theli
96386d6fc6 *) further improvements in shutdown behaviour
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@391 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 21:44:31 +00:00
theli
0b95c9c434 *) Bugfix for Thread.getID() usage + PeerPing-Shutdown Deadlock
See:
   - http://www.yacy-forum.de/viewtopic.php?p=4937
   - http://www.yacy-forum.de/viewtopic.php?p=4939

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@390 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 21:00:11 +00:00
theli
13eeaa08f3 *) httpc.java:
   - pending httpc actions can now be interrupted on server shutdown
   - this is possible because of a newly introduced registration mechanism for
     open sockets
*) yacyCore.java
   - blocking peerPing threads can now be interrupted on server shutdown
*) serverCore.java
   - restructuring shutdown code 
*) error.html
   - port number is now set correctly if port forwarding was enabled


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@389 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 13:58:54 +00:00
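The registration mechanism for open sockets mentioned in commit 13eeaa08f3 can be illustrated with a minimal sketch. This is not the code from httpc.java: the class name `OpenSocketRegistry` and its methods are hypothetical. The idea is that every open connection registers itself, and the shutdown path closes whatever is still registered. For a `java.net.Socket`, closing it from another thread makes a thread blocked in I/O on that socket fail with a `SocketException`, which is what lets a pending action be interrupted.

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical registry; the real httpc.java mechanism may differ.
class OpenSocketRegistry {
    // Concurrent set, so worker threads can register while shutdown iterates.
    private final Set<Closeable> open = ConcurrentHashMap.newKeySet();

    void register(Closeable connection) {
        open.add(connection);
    }

    void unregister(Closeable connection) {
        open.remove(connection);
    }

    int size() {
        return open.size();
    }

    /** Closes every registered connection and returns how many closed cleanly. */
    int closeAll() {
        int closed = 0;
        for (Closeable connection : open) {
            try {
                connection.close();
                closed++;
            } catch (IOException e) {
                // Shutdown path: nothing useful can be done with a failed close.
            }
            open.remove(connection);
        }
        return closed;
    }
}
```

A worker would call `register(socket)` right after connecting and `unregister(socket)` in a `finally` block; the shutdown hook calls `closeAll()`.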
theli
f57b60cd60 *) Correcting Problems with htmlFilterContentScraper.java
Tag name comparison was case sensitive, therefore e.g.
      <A href="test.txt">test</A>
   was not parsed correctly.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@387 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 13:32:49 +00:00
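The case-sensitivity problem described in commit f57b60cd60 can be sketched as follows. This is an illustration only, not the code from htmlFilterContentScraper.java; the class `TagNameMatch`, the method names, and the tag set are assumptions. Normalizing the tag name once before the lookup makes `<A>` and `<a>` behave the same.

```java
import java.util.Locale;
import java.util.Set;

// Hypothetical helper; names and tag set are not taken from YaCy.
class TagNameMatch {
    // Tags are stored in lower case, so lookups must normalize first.
    private static final Set<String> LINK_TAGS = Set.of("a", "img", "base");

    /** Lower-cases with a fixed locale to avoid locale-specific case rules. */
    static String normalize(String tagName) {
        return tagName.toLowerCase(Locale.ROOT);
    }

    static boolean isLinkTag(String tagName) {
        return LINK_TAGS.contains(normalize(tagName));
    }

    public static void main(String[] args) {
        System.out.println(isLinkTag("A"));
        System.out.println(isLinkTag("a"));
        System.out.println(isLinkTag("P"));
    }
}
```

`Locale.ROOT` is used because the default locale can change how some letters are lower-cased (for example the Turkish dotless i), which would reintroduce mismatches.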
orbiter
b79070b471 fixed proxy/scraper
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@385 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 13:25:41 +00:00
orbiter
419f8fb398 fixed bugs/missing code regarding new crawl stack
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@384 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-07 01:38:49 +00:00
orbiter
112c5d3332 the new file-based indexing queue
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@382 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-06 14:50:01 +00:00
orbiter
858cd94299 replaced indexing ram-queue by file-based stack-queue
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@381 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-06 14:48:41 +00:00
allo
a31029c5de == as separator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@374 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-04 17:40:44 +00:00
allo
9f0f45bcae Warning: this is untested.
I will need this for recursive translation, and will check it while extending the Translator.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@373 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-04 15:53:45 +00:00
theli
57c30f1d78 *) bugfix for usage of httpc without gzip content encoding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@369 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-04 11:25:25 +00:00
theli
0e2c33ee55 *) Network.html/Network.java:
- Adding function to manually force peer ping to remote yacy peer
  See: Network.html?page=4
- for debugging purpose only!

*) serverAbstractThread.java:
- Adding possibility to notify a server thread via a synchronization object
- this is needed e.g. by the port forwarding feature to send a notification
  to the peerPing thread to redo peer-ping with the new ip/port Settings_p.html

*) Port Forwarding Feature (it should work now)
- adding a serverThread which is responsible to detect broken port forwarding 
  connections and to do reconnect if needed
- serverCore.java: moving port forwarding initialization into a separate function
- adding possibility to configure the ssh port
- moving configuration section on the gui into a separate fieldset
- hello.java: only trying to do a second connect to the clientIp address during
  peer handshake if either remote port forwarding is not enabled locally or
  the clientIP is not equal to any local ip

*) httpdFileHandler.java:
- print a more verbose error message

*) httpc.java
- allowing to deactivate content encoding from outside

*) plasmaCrawlWorker.java
- the crawler worker now tries to refetch the content of a website without
  gzip content encoding if a gzip error occurred

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@368 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-04 11:09:48 +00:00
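The refetch behaviour described for plasmaCrawlWorker.java in commit 0e2c33ee55 (retry without gzip content encoding after a gzip error) can be sketched like this. The `Fetcher` interface and the class `GzipFallback` are hypothetical stand-ins, not YaCy's httpc API; the sketch assumes the gzip failure surfaces as a `java.util.zip.ZipException`, which is what `GZIPInputStream` throws on malformed gzip data.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.ZipException;

// Hypothetical sketch; YaCy's crawler and httpc classes look different.
class GzipFallback {
    /** Stand-in for an HTTP client call; acceptGzip toggles content encoding. */
    interface Fetcher {
        byte[] fetch(String url, boolean acceptGzip) throws IOException;
    }

    /** Tries with gzip first; on a gzip decoding error, refetches without it. */
    static byte[] fetchWithFallback(Fetcher fetcher, String url) {
        try {
            try {
                return fetcher.fetch(url, true);
            } catch (ZipException e) {
                // Corrupt gzip stream: ask the server for the plain content.
                return fetcher.fetch(url, false);
            }
        } catch (IOException e) {
            // Any other I/O failure is not retried in this sketch.
            throw new UncheckedIOException(e);
        }
    }
}
```

Only the gzip error triggers the second request; other I/O failures propagate, so the fallback does not mask unrelated network problems.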
orbiter
5159a090b0 fixed parser bug with lowercase force (appeared in: http://spellbound.sourceforge.net/)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@367 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-03 23:33:25 +00:00
orbiter
7f7cbc5019 fixed bug with snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@365 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-03 13:37:20 +00:00
orbiter
3607159a9a fixed connection of out-dated peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@364 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-03 13:08:49 +00:00
orbiter
fbef7fed81 adapted latestVersion to float handling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@363 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-03 12:40:36 +00:00
orbiter
eb74fa0c82 fixed a bug with snippet-length
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@359 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-07-01 23:35:36 +00:00
orbiter
ec8a5ad060 fixed base implementation (verified with www.burningboard.net/archive/index.html)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@356 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 23:49:32 +00:00
orbiter
1d3cfb380a experimental implementation of base parsing in html parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@355 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 23:28:35 +00:00
orbiter
86f2aa8478 fixed seed-load date bug (evaluating server date for age computation)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@354 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 23:19:08 +00:00
orbiter
664bceced5 removed debug-lines
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@351 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 18:56:01 +00:00
orbiter
75ebdbc852 enhanced snippet-generation (case where snippet is too long)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@350 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 18:54:00 +00:00
allo
bef3aaec38 better translation system.
You only need to escape characters that have a special meaning in regexps.
Some that you have to escape: ()[]{}


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@348 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 15:28:40 +00:00
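The escaping rule from commit bef3aaec38 can be shown with a small sketch. It assumes translation keys are applied as regular expressions via `String.replaceAll`; the class `TranslationEscape` and the sample strings are hypothetical and not taken from the Translator.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of regex-based translation keys.
class TranslationEscape {
    /** Replaces every match of the regex key with the literal value. */
    static String apply(String text, String regexKey, String value) {
        // quoteReplacement keeps '$' and '\' in the value from being interpreted.
        return text.replaceAll(regexKey, Matcher.quoteReplacement(value));
    }

    public static void main(String[] args) {
        String text = "Peer(s) online";
        // Unescaped: "(s)" is a capture group, so the key matches "Peers", not "Peer(s)".
        System.out.println(apply(text, "Peer(s)", "Knoten"));
        // Escaped by hand, as the commit message asks translators to do.
        System.out.println(apply(text, "Peer\\(s\\)", "Knoten"));
        // Pattern.quote escapes the whole key when no regex features are wanted.
        System.out.println(apply(text, Pattern.quote("Peer(s)"), "Knoten"));
    }
}
```

The first call leaves the text unchanged because the unescaped key never matches the literal parentheses; the other two replace the phrase.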
orbiter
8a4f297324 fixed/enhanced snippet error-handling; suppression of results where no snippet exists
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@347 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-30 00:01:53 +00:00
orbiter
712fe9ef18 bugfixed utf-8 decoding and parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@346 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-29 22:55:37 +00:00
allo
82d78fdab7 new implementation and basic bbCode (OK, only one command ...)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@343 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-29 21:01:12 +00:00
allo
a60f5efcd6 rename
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@342 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-29 20:59:56 +00:00
theli
6e97f70549 *) httpd.java: improved error handling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@333 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-29 10:05:38 +00:00
theli
9d8c66fb5e *) adding possibility to forward received yacy-messages (htroot/yacy/message.java)
via a command-line email program (e.g. sendmail) to a configured email address
   - the configuration dialog is reachable via Settings_p.html#messageForwarding

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@332 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-29 09:50:48 +00:00
theli
08e4334c1d *) Status.java: showing amount of time since last upload of seed-file
*) hello.java: adding additional output for principal-downgrade bug
*) httpd.java, httpdFileHandler.java, httpdProxyHandler.java: improved error handling
*) yacyCore.java: trying to fix principal-downgrade bug
*) yacySeed.java: adding some constants

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@329 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-28 11:27:31 +00:00
theli
eee6322aaf *) Adding redirection support to plasmaCrawlWorker.java
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@328 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-28 08:07:41 +00:00
theli
cd279907c0 *) Adding redirection support to plasmaCrawlWorker.java
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@327 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-28 08:01:26 +00:00
theli
6697d5e52e *) correcting function mediaExtContains
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@326 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-28 06:44:31 +00:00
low012
a333715ffd These changes eliminate a way to cripple Message pages on other peers by sending special messages with HTML tags. (See forum for details.)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@325 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-27 19:52:40 +00:00
orbiter
abba8fe61b fixed utf-8 decoding in htmlFilterAbstractScraper and removed httpd timing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@323 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-26 22:40:50 +00:00
orbiter
3addf58046 enhanced snippet-loading with threads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@322 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-24 07:41:07 +00:00
orbiter
4afcf10158 added kelondroHashtable (not finished yet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@321 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 14:41:59 +00:00
orbiter
56d28a16f0 bugfixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@320 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 14:40:39 +00:00
orbiter
d6c85228a6 enhanced snippet computation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@319 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 12:12:12 +00:00
theli
d53b2393e5 *) autoconfig.java: ip address was not reported correctly when port-forwarding is on
*) hello.java: reportedip may be empty at peer startup
*) httpc.java: adding method to determine if the connection was already closed or is broken
*) httpdProxyHandler.java: trying to improve error handling
*) server/serverCore.java
- setting myseed ip-address and port correctly if port-forwarding is on
- doing a more failsafe close and adding some debugging output
*) yacyClient.java: adding some logging statements to allow a better detection of 
   "degraded to senior"-bug
*) yacyCore.java: restructuring publishMySeed
   (@Orbiter: please take a look)
- to avoid busy waiting
- to allow a graceful shutdown on server shutdown
- new seed count was not calculated correctly in the previous version
*) yacySeedDB.java: host ip and port were not initialized correctly if port-forwarding
   was activated

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@318 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 11:00:26 +00:00
theli
fafda068f9 *) allowing crawler to process resources with statuscode 203
- this is needed if yacy is behind a second proxy 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@316 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 10:00:31 +00:00
theli
aae9a433a6 *) correcting usage of supportedFileExt-List
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@315 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 07:43:59 +00:00
orbiter
1e7f062350 many bugfixes, memory leak fixes, performance enhancements; new kelondroHashtable; activated snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@313 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-23 02:07:45 +00:00
orbiter
68dc2b0c6b added kelondroArray, the basis for upcoming kelondroHash and some bug fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@311 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-21 01:17:25 +00:00