Commit Graph

5159 Commits

Author SHA1 Message Date
orbiter
85075269a6 extended fail-safe memory-managament. prevents too much allocation, too often GC and should help for the 100%CPU-bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@303 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-20 00:46:23 +00:00
orbiter
10781f36b7 changed performance menu to accept milliseconds; 200 milliseconds may be appropriate for all crawling jobs; indexing should be set to 0
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@299 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-17 11:52:21 +00:00
theli
3227a9eba5 *) Adding retry function for seed uploading
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@298 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-17 08:45:44 +00:00
theli
aea355c03c *) adding test for connection status of port forwarding feature
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@292 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-17 07:57:00 +00:00
orbiter
5d06ded005 enhanced html parser speed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@290 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-17 01:26:51 +00:00
orbiter
5a490aa065 fixed html parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@289 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 21:49:56 +00:00
orbiter
a25b5b4986 fixed possible memory leak in htmlScraper: be aware that now links can get lost; further work necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@288 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 18:31:28 +00:00
orbiter
38747857c2 correction of correction to port-forwarding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@287 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 11:49:54 +00:00
orbiter
dbda6e1e85 corrections to port forwarding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@286 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 11:40:36 +00:00
theli
d8cb3324a9 *) property "mytime" was not set correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@285 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 11:22:20 +00:00
theli
b2c5305737 *) Bugfix: Uptime Uptime #minutes (kosmetisch)
See: http://www.yacy-forum.de/viewtopic.php?p=4294

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@283 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 10:09:13 +00:00
theli
9a98988c3c *) Bugfix for SSL/NIO Bug
See: http://www.yacy-forum.de/viewtopic.php?t=516
   - removing NIO from server/serverCore.java because of massive problems
     with socket close issues
*) Adding support for remote port forwarding via sch
   @Orbiter: Please take a look into
   - hello.java
   - server/serverCore.java.publicIP()
   - yacy/yacyClient.java.publishMySeed(...)
*) Making startup loading of additional content parsers more failsafe


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@281 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 07:28:07 +00:00
orbiter
a1ffc27041 preparations for image/movie/music indexing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@280 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-16 00:31:13 +00:00
orbiter
f45dc29f35 maintenance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@279 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-15 14:59:53 +00:00
theli
a2e5018427 *) adding missing calls for function close() to avoid "too many open file" bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@273 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-13 09:56:41 +00:00
theli
3ee35b9ff9 *) Proxy-Error-Msg: align=center
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@272 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-13 09:54:36 +00:00
theli
c7d294d8d4 *) Bugfix for:
- 302 redirection Problem on Amazon Server
   - Wrong References in proxymsg/error.html
   See: http://www.yacy-forum.de/viewtopic.php?t=515

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@271 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-13 07:50:35 +00:00
orbiter
878ff0ae7b corrections
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@262 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 16:39:25 +00:00
theli
a566588e9b *) adding configuration section for new http keep-alive support
*) moving transparent proxy configuration into new config section


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@238 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 09:44:07 +00:00
theli
6bde37950f *) adding xsl stylesheet that can be used by browsers to format the rss search result in a user friendly format
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@237 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 09:36:44 +00:00
theli
e302c50390 *) Changes needed because of logging migration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@236 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 09:34:44 +00:00
theli
f9a95b5cb8 *) Displaying more user friendly Memory Usage statistic
*) Displaying traffic consumed by yacy 
   - this is not finished yet
   - at the moment only outgoing proxy traffic is counted

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@235 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 09:33:15 +00:00
theli
f8de2556f8 *) Changes needed to allow a move verbose error message
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@234 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-09 09:29:35 +00:00
orbiter
33f9315e58 implemented multithreading of indexing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@221 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-08 13:19:05 +00:00
allo
7c318f8886 Language files, not skins...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@220 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-08 11:05:04 +00:00
theli
52cf732fad *) correcting "seed-ftp-upload/Nothing changed" bug:
See: http://www.yacy-forum.de/viewtopic.php?p=3986#3986

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@219 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-08 09:54:52 +00:00
orbiter
ca3b4ccaf4 added snippet-routines (not yet finished)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@218 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-08 00:52:24 +00:00
orbiter
a73e6de005 faq/doc - Fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@215 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-07 09:28:45 +00:00
theli
9af1bf4b38 *) displaying memory usage of yacy in Status.html
*) displaying more expressive uptime information on Status.html and Network.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@214 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-07 09:05:21 +00:00
orbiter
ee0758fe4d bugfixes/empty-dir-deletion/snippet-test-activation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@212 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-07 01:05:55 +00:00
orbiter
594c591223 changes towards 0.38
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@208 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-03 02:43:35 +00:00
orbiter
d8fdc2526e added experimental snipplet-generation (to be disabled for 0.38)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@206 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-06-02 01:33:10 +00:00
orbiter
e89ded9e41 bugfixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@204 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-31 22:12:43 +00:00
orbiter
650ca3955a added flush-thread for index cache and added language-name mapping in Language_p
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@203 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-31 17:39:14 +00:00
rramthun
ec77e6faa6 updated german language file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@201 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-30 18:19:04 +00:00
rramthun
4e63456dba some corrections/enhancements to the webinterface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@198 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-29 18:23:58 +00:00
orbiter
3d8a2ff937 enhanced parallelization of local/global/remote crawling
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@197 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-29 11:56:40 +00:00
allo
8ab021b39f Your-Peer Stats added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@185 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-28 11:07:08 +00:00
allo
c6331cf088 "Comma separated Values File" for the Network Overview Page.
http://www.yacy-forum.de/viewtopic.php?t=4


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@184 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-28 10:45:42 +00:00
allo
7de98b1895 Deleted static locales because they conflict with the dynamic ones.
use Language_p.html and $rootdir/locales/*.lng to generate locales in htroot/locales


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@178 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-26 14:08:22 +00:00
allo
a8b2dccc7a New Translation System.
Use Wordlists like these:
de.lng
Network\ Overview=Netzwerk-Übersicht


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@176 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-26 10:26:32 +00:00
orbiter
21110dcd5e fixed bugs with open files and caching
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@175 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-25 13:48:48 +00:00
theli
74eb21f62e *) adding image tag into rss template
*) adding a xslt stylesheet so that the rss document can be viewed in a normal webbrowser
*) adding pubDate tag to each search item

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@173 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-25 08:47:34 +00:00
orbiter
6f09251bbc added peer-Name settings to Settings page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@172 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-24 15:47:50 +00:00
orbiter
5f90daa265 implemented localization environment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@171 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-24 14:23:04 +00:00
theli
6cc4297761 *) adding namespace for yacy topwords feature
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@170 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-24 09:29:10 +00:00
orbiter
73922f3501 experimental topword-intergartion in rss
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@167 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 21:59:57 +00:00
allo
6af3744a0b http://www.yacy-forum.de/viewtopic.php?p=3368 fixed:
Newlines are now converted correctly.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@166 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 20:21:58 +00:00
allo
6ee564c482 a really simple Interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@165 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 18:44:12 +00:00
orbiter
76dc892017 refined assortment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@159 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 13:15:28 +00:00
theli
0e1d9e9722 *) shrinking httpc linebuffer when httpc is returned to pool. This is done to free memory
*) Making Seed-Upload configuration more verbose.
*) Some Changes in SOAP Search API (not finished yet).

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 10:10:51 +00:00
theli
b625aa91fd *) Trying to solve Seed-Upload-Configuration - "Error with submitted information. Nothing changed." Bug:
see: http://www.yacy-forum.de/viewtopic.php?p=3233

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@157 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 09:12:20 +00:00
orbiter
4b01ff7548 activated assortments, removed write-queues
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@151 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-22 23:59:52 +00:00
orbiter
e26ac60c3e modified assortment data structures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@148 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-22 13:27:54 +00:00
rramthun
377fb07250 Used YaCy logging class instead of stdout
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@147 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-21 21:33:58 +00:00
rramthun
85e7c31a57 Fixed a spelling mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@145 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-21 18:37:23 +00:00
rramthun
76475f9f38 Fixed various spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@144 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-20 20:12:33 +00:00
rramthun
bfff0a96a7 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@143 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-19 18:47:03 +00:00
low012
71b9cb0c33 Just making it look like it is supposed to in Opera too. (Opera makes a difference between center and middle which most browsers don't.)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@138 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 20:42:34 +00:00
rramthun
01fb4e4eb9 Fixed problem described in http://www.yacy-forum.de/viewtopic.php?t=377
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@135 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 12:35:58 +00:00
theli
042f171022 *) Correcting problems with old seed configuration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@133 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 07:46:34 +00:00
orbiter
dfde28a244 changed kaskelix mascot
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 23:27:50 +00:00
theli
361f05978d Multiple updates regarding the yacy seedUpload facility,
optional content parsers, thread pool configuration ...

Please help me testing if everything works correct.

*) Migration of yacy seedUpload functionality
See: http://www.yacy-forum.de/viewtopic.php?t=256
- new uploaders can now be easily introduced because of a new modulare uploader system
- default uploaders are: none, file, ftp
- adding optional uploader for scp
- each uploader provides its own configuration file that will be 
  included into the settings page using the new template include feature
- Each uploader can define its libx dependencies. If not all needed libs are
  available, the uploader is deactivated automatically.

*) Migration of optional parsers
See: http://www.yacy-forum.de/viewtopic.php?t=198
- Parsers can now also define there libx dependencies
- adding parser for bzip compressed content
- adding parser for gzip compressed content
- adding parser for zip files
- adding parser for tar files
- adding parser to detect the mime-type of a file
  this is needed by the bzip/gzip Parser.java
- adding parser for rtf files
- removing extra configuration file yacy.parser
  the list of enabled parsers is now stored in the main config file

*) Adding configuration option in the performance dialog to configure
See: http://www.yacy-forum.de/viewtopic.php?t=267
- maxActive / maxIdle / minIdle values for httpd-session-threadpool
- maxActive / maxIdle / minIdle values for crawler-threadpool

*) Changing Crawling Filter behaviour
See: http://www.yacy-forum.de/viewtopic.php?p=2631

*) Replacing some hardcoded strings with the proper constants of the httpHeader class

*) Adding new libs to libx directory. This libs are
- needed by new content parsers
- needed by new optional seed uploader
- needed by SOAP API (which will be committed later)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 08:25:04 +00:00
theli
014b139c73 *) Bugfix of "There are xxx entries in the crawler queue. Showing 0 most recent entries" Bug.
see: http://www.yacy-forum.de/viewtopic.php?t=338
   see: http://www.yacy-forum.de/viewtopic.php?p=2552

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@122 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-15 10:09:15 +00:00
rramthun
f9c7deca2e Fixed some spelling mistakes and removed underline from accesskey chars
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 13:56:01 +00:00
theli
1d38599598 *) changing comment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:50:22 +00:00
theli
d2c4e9a55e *) Implementing yacy forum wishlist item: "Pause Crawling"
see: http://www.yacy-forum.de/viewtopic.php?t=48



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:41:05 +00:00
rramthun
2d751ba831 Fixed a spelling mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@117 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 20:43:08 +00:00
orbiter
287d2e6f10 further enhanced caching (new cache flush methods)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@111 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 23:33:48 +00:00
rramthun
85c2f3be8a Fixed spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 17:50:45 +00:00
orbiter
ea478f3975 enhanced indexing-caching
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 23:42:40 +00:00
orbiter
b4030e5023 implemented serverSwitchActions - action-hooks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@105 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 14:58:03 +00:00
theli
a5fec449c8 *) setting threadnames for kelondroMap:writequeue and publishSeed
so that a thread dump is more verbose

*) Moving code for transparent proxy support to a separate function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@98 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-09 08:53:39 +00:00
theli
74f12bb0f3 *) adding transparent proxy support
Now a firewall can transparently redirect all 
   http traffic through yacy.
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@96 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 22:36:26 +00:00
rramthun
982c3ebeeb Added some accesskeys to webinterface and updated help with some hints on how to use them.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@91 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 13:05:30 +00:00
rramthun
6c8e0fdbf5 Made Homepage and eMail clickable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@87 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:38:14 +00:00
orbiter
1d7fed87dc redesign of index caching - removed indexCache.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@86 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:11:18 +00:00
rramthun
3f85978519 Fixed one spelling mistake, limited input for ICQ numbers to 9 digits and made ICQ number in peer profiles clickable.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@85 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:07:43 +00:00
theli
e7f7aa0bb9 *) Import statements reorganized
Now it's easier to determine which class really uses which other class*) Reogranizing Import Statements 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@83 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-05 05:36:42 +00:00
theli
351c86d5d9 *) Migration of optional Content Parser integration
- each additional parser must be in a subpackage 
  of plasma.parser
- each parser must have its own ant build file (which will 
  be called automatically from the main build file)
- Calling the main build file results in building a separate 
  zip file for each optional parser. This zip file includes:
  + sources of the Parser.java
  + compiled classes of the Parser.java
  + needed additional libs (libx)
- To install an additional parser the user simply needs to
  extract the zip file listed above into his/her yacy directory.
- The configuration (enabling/disabling) of a parser can be done
  via the webinterface (currently the settings dialoge) and is
  done "on-the-fly". The installation can not be done "on-the-fly"
  at the moment because of classpath issues.
- The classpath of the linux startup/stop scripts is generated 
  automatically now (including all libraries from lib and libx).

*) Bugfix: File Extension was not calculated correctly by the crawler
   e.g.: file extension was accidentally: .php?param=value
   Corrected.

*) Adding additional parser for parsing of rss/atom feeds
- added needed libs to do this.

TODO:
- automatic building classpath for windows startup scripts


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@78 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 09:47:56 +00:00
orbiter
f99930c04b fixed brute-force + peer-disconnect - Bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@75 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-01 23:31:21 +00:00
orbiter
c7c6aaf06e many bug-fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@73 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-30 01:22:46 +00:00
allo
52abc456fb new Templates
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@70 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-28 19:38:35 +00:00
rramthun
56409402f0 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@66 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 18:10:40 +00:00
rramthun
9cb8779208 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@64 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 15:38:44 +00:00
rramthun
19e69f0efd Changed <head> YACY into YaCy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@58 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:34:18 +00:00
allo
4c8cc101d6 Bugfix: Do not shot the first X lines, but the last X line of log
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@57 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:10:24 +00:00
orbiter
ba16da72b4 fixed not-working kelondroRecords-Cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@56 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 14:46:59 +00:00
orbiter
7fb645b0ab enhanced crawling performance, changed memory settings, new performace options
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@51 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 23:15:40 +00:00
theli
58b1a0ba40 *) adding an new package for extra content parsers
*) adding content parser for
- pdf (using the pdf-box library)
- doc (using the textmining.org library)
*) adding a Interface for content parsers
*) adding a configuration file which can be used to configure which parser is used for which mimeType
*) Sempahore class was moved and renamed to serverSemaphore
*) Changing yacy shutdown behaviour
Buzy waiting loop for shutdown was removed and replaced with a blocking call (using the semaphore class mentioned above) to the new switchboard.waitForShutdown method.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@46 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 21:24:53 +00:00
rramthun
ff21586a27 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@43 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-22 15:29:28 +00:00
orbiter
00f223cfc1 fixed post-parsing (a case when the bluelist is empty)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@41 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 17:13:43 +00:00
theli
c9c0a1f11c *) Trying to speedup local crawling
- introduction of a threadpool for crawling
- introduction of a job queue to avoid buzy waiting for a free crawler slot

*) New classes added
- queue for receiving of crawler jobs
- semaphore class to do reader/writer synchronization (mutual exclusion)
- message object to hold all needed data about a crawler job

*) Trying to solve session-thread shutdown problem
- session thread stopped variable is now set from outside before interrupting the
  session thread.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@39 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 10:31:40 +00:00
rramthun
ce7d8c4fe0 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@37 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 16:01:14 +00:00
rramthun
570de9c4f4 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@36 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 13:28:32 +00:00
rramthun
b61567a39e Fixed spelling mistake and inserted author as described in mailinglist
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@32 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 13:56:59 +00:00
(no author)
d5ff81c636 *) Undoing last changes. Sorry.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@25 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:52:04 +00:00
(no author)
ea4849716f *) Bugfix: plasmaCondenser.getWords() was not used correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@24 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:37:27 +00:00
(no author)
0a6cf3f5e7 *) Bugfix: Reference to plasmaHTCache.Entry.urlString was not set correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@23 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:33:53 +00:00
orbiter
b9203bdb50 bug fixes and code cleaning
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@22 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 14:18:14 +00:00
(no author)
b7d4389e4b *) support for Proxy Auto-Config File generation added.
File is accessible using: 
   http://proxy:8080/autoconfig.pac

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@20 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 09:06:15 +00:00
orbiter
e7d055b98e very experimental integration of the new generic parser and optional disabling of bluelist filtering in proxy. Does not yet work properly. To disable the disable-feature, the presence of a non-empty bluelist is necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@17 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-13 15:52:00 +00:00
orbiter
a87a17a3c8 prepared generic text parser environment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@15 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-12 22:57:54 +00:00
rramthun
a6154697ba Fixed some grammar/spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:49:03 +00:00
allo
1e33de27da Log ist now skinnable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:10:32 +00:00
rramthun
f824958184 Fixed some spelling/grammar mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 09:20:50 +00:00
orbiter
7eb3c81aad name check on new peer names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 23:15:03 +00:00
allo
955d26b117 Bug: could not install from url and use the skin (because of a two names "use")
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 19:07:41 +00:00
orbiter
248077d3f0 initial load with yacy 0.36
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-07 19:19:42 +00:00