Commit Graph

8926 Commits

Author SHA1 Message Date
theli
361f05978d Multiple updates regarding the yacy seedUpload facility,
optional content parsers, thread pool configuration ...

Please help me testing if everything works correct.

*) Migration of yacy seedUpload functionality
See: http://www.yacy-forum.de/viewtopic.php?t=256
- new uploaders can now be easily introduced because of a new modulare uploader system
- default uploaders are: none, file, ftp
- adding optional uploader for scp
- each uploader provides its own configuration file that will be 
  included into the settings page using the new template include feature
- Each uploader can define its libx dependencies. If not all needed libs are
  available, the uploader is deactivated automatically.

*) Migration of optional parsers
See: http://www.yacy-forum.de/viewtopic.php?t=198
- Parsers can now also define there libx dependencies
- adding parser for bzip compressed content
- adding parser for gzip compressed content
- adding parser for zip files
- adding parser for tar files
- adding parser to detect the mime-type of a file
  this is needed by the bzip/gzip Parser.java
- adding parser for rtf files
- removing extra configuration file yacy.parser
  the list of enabled parsers is now stored in the main config file

*) Adding configuration option in the performance dialog to configure
See: http://www.yacy-forum.de/viewtopic.php?t=267
- maxActive / maxIdle / minIdle values for httpd-session-threadpool
- maxActive / maxIdle / minIdle values for crawler-threadpool

*) Changing Crawling Filter behaviour
See: http://www.yacy-forum.de/viewtopic.php?p=2631

*) Replacing some hardcoded strings with the proper constants of the httpHeader class

*) Adding new libs to libx directory. This libs are
- needed by new content parsers
- needed by new optional seed uploader
- needed by SOAP API (which will be committed later)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 08:25:04 +00:00
theli
c57306f620 *) Correcting Problem with Template Include - Unresolved filename
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@125 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 06:17:02 +00:00
allo
07ca4083d3 right Prefix for Includes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@124 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-16 16:35:43 +00:00
theli
bcdc44c065 *) Adding more constants. Needed for next commit.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@123 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-15 15:17:17 +00:00
theli
014b139c73 *) Bugfix of "There are xxx entries in the crawler queue. Showing 0 most recent entries" Bug.
see: http://www.yacy-forum.de/viewtopic.php?t=338
   see: http://www.yacy-forum.de/viewtopic.php?p=2552

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@122 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-15 10:09:15 +00:00
rramthun
f9c7deca2e Fixed some spelling mistakes and removed underline from accesskey chars
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 13:56:01 +00:00
theli
ddc5675781 *) Correcting typo
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@120 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 11:14:34 +00:00
theli
1d38599598 *) changing comment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:50:22 +00:00
theli
d2c4e9a55e *) Implementing yacy forum wishlist item: "Pause Crawling"
see: http://www.yacy-forum.de/viewtopic.php?t=48



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:41:05 +00:00
rramthun
2d751ba831 Fixed a spelling mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@117 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 20:43:08 +00:00
theli
511f1c9b25 *) Bugfix of http://www.yacy-forum.de/viewtopic.php?p=2508&highlight=#2508
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@116 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 20:21:07 +00:00
(no author)
8559c6ed5f *) removing obsolete class
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@115 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 09:18:34 +00:00
theli
4dd387aae9 *) moving constants (see last commit) to proper httpHeader class
*) migrating fileHandler + proxyHandler to use constants instead of hardcoded values

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@114 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 09:14:12 +00:00
theli
aea3479a9c *) introducing of additional constants
to improve maintainability of the sourcecode

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@113 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 08:30:30 +00:00
orbiter
e5e46eccf8 fixed version/date
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@112 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 23:38:45 +00:00
orbiter
287d2e6f10 further enhanced caching (new cache flush methods)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@111 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 23:33:48 +00:00
rramthun
85c2f3be8a Fixed spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 17:50:45 +00:00
rramthun
9e9abd1689 added OpenOffice.org MIME types
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@109 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 17:38:42 +00:00
orbiter
376b917c91 fixed shut-down by stopYACY.sh
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@108 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 10:05:17 +00:00
orbiter
ea478f3975 enhanced indexing-caching
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 23:42:40 +00:00
orbiter
0cfe94bb66 fixed last commit + added missing files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@106 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 15:54:26 +00:00
orbiter
b4030e5023 implemented serverSwitchActions - action-hooks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@105 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 14:58:03 +00:00
theli
5a651284a2 *) Adding server shutdown hook
... to allow a graceful shutdown even if the java runtime was interrupted using
   <ctrl>+<c>

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@104 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 09:44:36 +00:00
allo
77fd0fdf3f Includes, which use a normale Template Key to find the sourcefile
Syntax: #%[key]%#


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@103 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 09:23:15 +00:00
allo
abb6750090 added Include Funktion. #%include.inc%# includes other Templates
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@102 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-10 16:55:18 +00:00
theli
6f4d2e5272 *) fixing replace bug.
using 
      stringvar = stringvar.replace(xxx) 
   istead of 
      stringvar.replace()

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@101 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-10 12:17:50 +00:00
theli
cfe3d7e640 *) adding configuration option for transparent proxy support
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@100 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-10 10:11:51 +00:00
theli
f678cda514 *) correcting bug in transparent proxy support
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@99 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-10 09:22:14 +00:00
theli
a5fec449c8 *) setting threadnames for kelondroMap:writequeue and publishSeed
so that a thread dump is more verbose

*) Moving code for transparent proxy support to a separate function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@98 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-09 08:53:39 +00:00
theli
dae6f46812 *) Forgotten to do transparent proxy support also for http head and post
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@97 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 22:50:00 +00:00
theli
74f12bb0f3 *) adding transparent proxy support
Now a firewall can transparently redirect all 
   http traffic through yacy.
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@96 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 22:36:26 +00:00
theli
92a65874d6 *) fixed proxy authentication bug:
http://www.yacy-forum.de/viewtopic.php?t=174

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@95 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 21:11:40 +00:00
orbiter
10a4a2741d fixed missing close
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@94 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 20:39:11 +00:00
allo
4b9982e2b2 ...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@93 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 14:55:49 +00:00
orbiter
db1da3345d introduced singleton-database
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@92 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 14:42:09 +00:00
rramthun
982c3ebeeb Added some accesskeys to webinterface and updated help with some hints on how to use them.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@91 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 13:05:30 +00:00
theli
cbdc499ba6 *) adding many missing (File)?(Input|Output)Stream.close() calls to avoid "Too many open files bug".
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@90 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 07:24:33 +00:00
(no author)
ca93d02cca *) Recommiting makerelease file with on the
fly generation of java classpath. This
   is needed because makerelease was not merged
   correctly by mc before his last commit.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@89 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 06:36:48 +00:00
orbiter
a9b22647dc fixed bug in indexDump.stack - generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@88 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 00:11:59 +00:00
rramthun
6c8e0fdbf5 Made Homepage and eMail clickable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@87 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:38:14 +00:00
orbiter
1d7fed87dc redesign of index caching - removed indexCache.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@86 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:11:18 +00:00
rramthun
3f85978519 Fixed one spelling mistake, limited input for ICQ numbers to 9 digits and made ICQ number in peer profiles clickable.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@85 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:07:43 +00:00
theli
1dad015b0b *) Migration of Ant build files
- now the ant build file has the same functionality as the makerelease build file
- from now on the ant build files can be used instead of the makerelease build script

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@84 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-06 08:43:11 +00:00
theli
e7f7aa0bb9 *) Import statements reorganized
Now it's easier to determine which class really uses which other class*) Reogranizing Import Statements 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@83 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-05 05:36:42 +00:00
theli
2aa5fe8f50 *) Import statements reorganized
Now it's easier to determine which class really uses which other class

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@82 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-05 05:32:19 +00:00
rramthun
978c16edd3 Deleted startYACY_hidden.bat and fixed annoying spelling mistake.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@81 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-04 21:09:45 +00:00
allo
0e5961249c Installer for YaCy 0.37
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@80 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 13:41:11 +00:00
theli
5d3eefb702 *) undoing last change to yacy.init
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@79 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 10:06:01 +00:00
theli
351c86d5d9 *) Migration of optional Content Parser integration
- each additional parser must be in a subpackage 
  of plasma.parser
- each parser must have its own ant build file (which will 
  be called automatically from the main build file)
- Calling the main build file results in building a separate 
  zip file for each optional parser. This zip file includes:
  + sources of the Parser.java
  + compiled classes of the Parser.java
  + needed additional libs (libx)
- To install an additional parser the user simply needs to
  extract the zip file listed above into his/her yacy directory.
- The configuration (enabling/disabling) of a parser can be done
  via the webinterface (currently the settings dialoge) and is
  done "on-the-fly". The installation can not be done "on-the-fly"
  at the moment because of classpath issues.
- The classpath of the linux startup/stop scripts is generated 
  automatically now (including all libraries from lib and libx).

*) Bugfix: File Extension was not calculated correctly by the crawler
   e.g.: file extension was accidentally: .php?param=value
   Corrected.

*) Adding additional parser for parsing of rss/atom feeds
- added needed libs to do this.

TODO:
- automatic building classpath for windows startup scripts


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@78 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 09:47:56 +00:00
allo
1a4ad5a0ac updated the versionnummber ...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@77 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-02 19:16:45 +00:00