Commit Graph

1210 Commits

Author SHA1 Message Date
orbiter
76dc892017 refined assortment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@159 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 13:15:28 +00:00
theli
0e1d9e9722 *) shrinking httpc linebuffer when httpc is returned to pool. This is done to free memory
*) Making Seed-Upload configuration more verbose.
*) Some Changes in SOAP Search API (not finished yet).

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 10:10:51 +00:00
theli
b625aa91fd *) Trying to solve Seed-Upload-Configuration - "Error with submitted information. Nothing changed." Bug:
see: http://www.yacy-forum.de/viewtopic.php?p=3233

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@157 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-23 09:12:20 +00:00
orbiter
4b01ff7548 activated assortments, removed write-queues
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@151 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-22 23:59:52 +00:00
orbiter
e26ac60c3e modified assortment data structures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@148 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-22 13:27:54 +00:00
rramthun
377fb07250 Used YaCy logging class instead of stdout
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@147 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-21 21:33:58 +00:00
rramthun
85e7c31a57 Fixed a spelling mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@145 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-21 18:37:23 +00:00
rramthun
76475f9f38 Fixed various spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@144 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-20 20:12:33 +00:00
rramthun
bfff0a96a7 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@143 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-19 18:47:03 +00:00
low012
71b9cb0c33 Just making it look like it is supposed to in Opera too. (Opera makes a difference between center and middle which most browsers don't.)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@138 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 20:42:34 +00:00
rramthun
01fb4e4eb9 Fixed problem described in http://www.yacy-forum.de/viewtopic.php?t=377
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@135 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 12:35:58 +00:00
theli
042f171022 *) Correcting problems with old seed configuration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@133 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-18 07:46:34 +00:00
orbiter
dfde28a244 changed kaskelix mascot
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@130 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 23:27:50 +00:00
theli
361f05978d Multiple updates regarding the yacy seedUpload facility,
optional content parsers, thread pool configuration ...

Please help me testing if everything works correct.

*) Migration of yacy seedUpload functionality
See: http://www.yacy-forum.de/viewtopic.php?t=256
- new uploaders can now be easily introduced because of a new modulare uploader system
- default uploaders are: none, file, ftp
- adding optional uploader for scp
- each uploader provides its own configuration file that will be 
  included into the settings page using the new template include feature
- Each uploader can define its libx dependencies. If not all needed libs are
  available, the uploader is deactivated automatically.

*) Migration of optional parsers
See: http://www.yacy-forum.de/viewtopic.php?t=198
- Parsers can now also define there libx dependencies
- adding parser for bzip compressed content
- adding parser for gzip compressed content
- adding parser for zip files
- adding parser for tar files
- adding parser to detect the mime-type of a file
  this is needed by the bzip/gzip Parser.java
- adding parser for rtf files
- removing extra configuration file yacy.parser
  the list of enabled parsers is now stored in the main config file

*) Adding configuration option in the performance dialog to configure
See: http://www.yacy-forum.de/viewtopic.php?t=267
- maxActive / maxIdle / minIdle values for httpd-session-threadpool
- maxActive / maxIdle / minIdle values for crawler-threadpool

*) Changing Crawling Filter behaviour
See: http://www.yacy-forum.de/viewtopic.php?p=2631

*) Replacing some hardcoded strings with the proper constants of the httpHeader class

*) Adding new libs to libx directory. This libs are
- needed by new content parsers
- needed by new optional seed uploader
- needed by SOAP API (which will be committed later)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-17 08:25:04 +00:00
theli
014b139c73 *) Bugfix of "There are xxx entries in the crawler queue. Showing 0 most recent entries" Bug.
see: http://www.yacy-forum.de/viewtopic.php?t=338
   see: http://www.yacy-forum.de/viewtopic.php?p=2552

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@122 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-15 10:09:15 +00:00
rramthun
f9c7deca2e Fixed some spelling mistakes and removed underline from accesskey chars
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@121 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 13:56:01 +00:00
theli
1d38599598 *) changing comment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:50:22 +00:00
theli
d2c4e9a55e *) Implementing yacy forum wishlist item: "Pause Crawling"
see: http://www.yacy-forum.de/viewtopic.php?t=48



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-14 09:41:05 +00:00
rramthun
2d751ba831 Fixed a spelling mistake
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@117 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-13 20:43:08 +00:00
orbiter
287d2e6f10 further enhanced caching (new cache flush methods)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@111 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 23:33:48 +00:00
rramthun
85c2f3be8a Fixed spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@110 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-12 17:50:45 +00:00
orbiter
ea478f3975 enhanced indexing-caching
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 23:42:40 +00:00
orbiter
b4030e5023 implemented serverSwitchActions - action-hooks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@105 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-11 14:58:03 +00:00
theli
a5fec449c8 *) setting threadnames for kelondroMap:writequeue and publishSeed
so that a thread dump is more verbose

*) Moving code for transparent proxy support to a separate function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@98 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-09 08:53:39 +00:00
theli
74f12bb0f3 *) adding transparent proxy support
Now a firewall can transparently redirect all 
   http traffic through yacy.
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@96 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 22:36:26 +00:00
rramthun
982c3ebeeb Added some accesskeys to webinterface and updated help with some hints on how to use them.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@91 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-08 13:05:30 +00:00
rramthun
6c8e0fdbf5 Made Homepage and eMail clickable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@87 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:38:14 +00:00
orbiter
1d7fed87dc redesign of index caching - removed indexCache.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@86 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:11:18 +00:00
rramthun
3f85978519 Fixed one spelling mistake, limited input for ICQ numbers to 9 digits and made ICQ number in peer profiles clickable.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@85 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:07:43 +00:00
theli
e7f7aa0bb9 *) Import statements reorganized
Now it's easier to determine which class really uses which other class*) Reogranizing Import Statements 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@83 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-05 05:36:42 +00:00
theli
351c86d5d9 *) Migration of optional Content Parser integration
- each additional parser must be in a subpackage 
  of plasma.parser
- each parser must have its own ant build file (which will 
  be called automatically from the main build file)
- Calling the main build file results in building a separate 
  zip file for each optional parser. This zip file includes:
  + sources of the Parser.java
  + compiled classes of the Parser.java
  + needed additional libs (libx)
- To install an additional parser the user simply needs to
  extract the zip file listed above into his/her yacy directory.
- The configuration (enabling/disabling) of a parser can be done
  via the webinterface (currently the settings dialoge) and is
  done "on-the-fly". The installation can not be done "on-the-fly"
  at the moment because of classpath issues.
- The classpath of the linux startup/stop scripts is generated 
  automatically now (including all libraries from lib and libx).

*) Bugfix: File Extension was not calculated correctly by the crawler
   e.g.: file extension was accidentally: .php?param=value
   Corrected.

*) Adding additional parser for parsing of rss/atom feeds
- added needed libs to do this.

TODO:
- automatic building classpath for windows startup scripts


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@78 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 09:47:56 +00:00
orbiter
f99930c04b fixed brute-force + peer-disconnect - Bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@75 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-01 23:31:21 +00:00
orbiter
c7c6aaf06e many bug-fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@73 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-30 01:22:46 +00:00
allo
52abc456fb new Templates
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@70 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-28 19:38:35 +00:00
rramthun
56409402f0 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@66 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 18:10:40 +00:00
rramthun
9cb8779208 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@64 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 15:38:44 +00:00
rramthun
19e69f0efd Changed <head> YACY into YaCy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@58 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:34:18 +00:00
allo
4c8cc101d6 Bugfix: Do not shot the first X lines, but the last X line of log
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@57 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:10:24 +00:00
orbiter
ba16da72b4 fixed not-working kelondroRecords-Cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@56 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 14:46:59 +00:00
orbiter
7fb645b0ab enhanced crawling performance, changed memory settings, new performace options
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@51 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 23:15:40 +00:00
theli
58b1a0ba40 *) adding an new package for extra content parsers
*) adding content parser for
- pdf (using the pdf-box library)
- doc (using the textmining.org library)
*) adding a Interface for content parsers
*) adding a configuration file which can be used to configure which parser is used for which mimeType
*) Sempahore class was moved and renamed to serverSemaphore
*) Changing yacy shutdown behaviour
Buzy waiting loop for shutdown was removed and replaced with a blocking call (using the semaphore class mentioned above) to the new switchboard.waitForShutdown method.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@46 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 21:24:53 +00:00
rramthun
ff21586a27 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@43 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-22 15:29:28 +00:00
orbiter
00f223cfc1 fixed post-parsing (a case when the bluelist is empty)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@41 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 17:13:43 +00:00
theli
c9c0a1f11c *) Trying to speedup local crawling
- introduction of a threadpool for crawling
- introduction of a job queue to avoid buzy waiting for a free crawler slot

*) New classes added
- queue for receiving of crawler jobs
- semaphore class to do reader/writer synchronization (mutual exclusion)
- message object to hold all needed data about a crawler job

*) Trying to solve session-thread shutdown problem
- session thread stopped variable is now set from outside before interrupting the
  session thread.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@39 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 10:31:40 +00:00
rramthun
ce7d8c4fe0 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@37 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 16:01:14 +00:00
rramthun
570de9c4f4 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@36 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 13:28:32 +00:00
rramthun
b61567a39e Fixed spelling mistake and inserted author as described in mailinglist
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@32 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 13:56:59 +00:00
(no author)
d5ff81c636 *) Undoing last changes. Sorry.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@25 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:52:04 +00:00
(no author)
ea4849716f *) Bugfix: plasmaCondenser.getWords() was not used correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@24 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:37:27 +00:00
(no author)
0a6cf3f5e7 *) Bugfix: Reference to plasmaHTCache.Entry.urlString was not set correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@23 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:33:53 +00:00
orbiter
b9203bdb50 bug fixes and code cleaning
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@22 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 14:18:14 +00:00
(no author)
b7d4389e4b *) support for Proxy Auto-Config File generation added.
File is accessible using: 
   http://proxy:8080/autoconfig.pac

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@20 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 09:06:15 +00:00
orbiter
e7d055b98e very experimental integration of the new generic parser and optional disabling of bluelist filtering in proxy. Does not yet work properly. To disable the disable-feature, the presence of a non-empty bluelist is necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@17 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-13 15:52:00 +00:00
orbiter
a87a17a3c8 prepared generic text parser environment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@15 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-12 22:57:54 +00:00
rramthun
a6154697ba Fixed some grammar/spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:49:03 +00:00
allo
1e33de27da Log ist now skinnable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:10:32 +00:00
rramthun
f824958184 Fixed some spelling/grammar mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 09:20:50 +00:00
orbiter
7eb3c81aad name check on new peer names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 23:15:03 +00:00
allo
955d26b117 Bug: could not install from url and use the skin (because of a two names "use")
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 19:07:41 +00:00
orbiter
248077d3f0 initial load with yacy 0.36
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-07 19:19:42 +00:00