1383 Commits

Author SHA1 Message Date
orbiter
1d7fed87dc redesign of index caching - removed indexCache.db
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@86 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:11:18 +00:00
rramthun
3f85978519 Fixed one spelling mistake, limited input for ICQ numbers to 9 digits and made ICQ number in peer profiles clickable.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@85 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-07 21:07:43 +00:00
theli
e7f7aa0bb9 *) Import statements reorganized
Now it's easier to determine which class really uses which other class.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@83 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-05 05:36:42 +00:00
theli
351c86d5d9 *) Migration of optional Content Parser integration
- each additional parser must be in a subpackage 
  of plasma.parser
- each parser must have its own ant build file (which will 
  be called automatically from the main build file)
- Calling the main build file results in building a separate 
  zip file for each optional parser. This zip file includes:
  + sources of the Parser.java
  + compiled classes of the Parser.java
  + needed additional libs (libx)
- To install an additional parser the user simply needs to
  extract the zip file listed above into his/her yacy directory.
- The configuration (enabling/disabling) of a parser can be done
  via the web interface (currently the settings dialog) and is
  done "on-the-fly". The installation cannot be done "on-the-fly"
  at the moment because of classpath issues.
- The classpath of the linux startup/stop scripts is generated 
  automatically now (including all libraries from lib and libx).

*) Bugfix: File Extension was not calculated correctly by the crawler
   e.g.: file extension was accidentally: .php?param=value
   Corrected.

*) Adding additional parser for parsing of rss/atom feeds
- added needed libs to do this.

TODO:
- automatic building classpath for windows startup scripts


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@78 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 09:47:56 +00:00
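
The file-extension bugfix in 351c86d5d9 can be illustrated with a minimal sketch. It assumes the extension should come from the URL path only, so a query string such as ?param=value is never part of it. The class and method names are hypothetical and are not taken from the YaCy sources.

```java
import java.net.URI;
import java.util.Locale;

// Hypothetical helper, not YaCy code: derives a file extension from a URL.
final class FileExtensionSketch {

    /** Returns the lower-cased extension of the URL path, or "" if there is none. */
    static String extensionOf(String url) {
        // getPath() excludes the query string and the fragment.
        String path = URI.create(url).getPath();
        if (path == null) {
            return "";
        }
        // Only the last path segment can carry the extension.
        String name = path.substring(path.lastIndexOf('/') + 1);
        int dot = name.lastIndexOf('.');
        if (dot <= 0 || dot == name.length() - 1) {
            return ""; // no dot, a leading dot only, or a trailing dot
        }
        return name.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        System.out.println(extensionOf("http://example.org/dir/page.php?param=value"));
    }
}
```

With this approach the URL from the commit message yields php rather than php?param=value.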
orbiter
f99930c04b fixed brute-force + peer-disconnect bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@75 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-01 23:31:21 +00:00
orbiter
c7c6aaf06e many bug-fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@73 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-30 01:22:46 +00:00
allo
52abc456fb new Templates
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@70 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-28 19:38:35 +00:00
rramthun
56409402f0 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@66 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 18:10:40 +00:00
rramthun
9cb8779208 Fixed some spelling mistakes...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@64 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-26 15:38:44 +00:00
rramthun
19e69f0efd Changed <head> YACY into YaCy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@58 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:34:18 +00:00
allo
4c8cc101d6 Bugfix: Do not show the first X lines, but the last X lines of the log
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@57 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 15:10:24 +00:00
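
The behaviour described in 4c8cc101d6 (show the last X lines, not the first X) can be sketched as follows. This is an illustration only: it assumes the log is already held as a list of lines, and the names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not YaCy code: selects the tail of a log.
final class LogTailSketch {

    /** Returns a copy of the last 'count' lines, or fewer if the log is shorter. */
    static List<String> lastLines(List<String> log, int count) {
        if (count <= 0) {
            return Collections.emptyList();
        }
        // Start 'count' lines before the end, but never before index 0.
        int from = Math.max(0, log.size() - count);
        return new ArrayList<>(log.subList(from, log.size()));
    }
}
```

The bug the commit names would correspond to taking subList(0, count) instead of counting back from the end.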
orbiter
ba16da72b4 fixed not-working kelondroRecords-Cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@56 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-25 14:46:59 +00:00
orbiter
7fb645b0ab enhanced crawling performance, changed memory settings, new performance options
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@51 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 23:15:40 +00:00
theli
58b1a0ba40 *) adding a new package for extra content parsers
*) adding content parser for
- pdf (using the pdf-box library)
- doc (using the textmining.org library)
*) adding an interface for content parsers
*) adding a configuration file which can be used to configure which parser is used for which mimeType
*) Semaphore class was moved and renamed to serverSemaphore
*) Changing yacy shutdown behaviour
Busy waiting loop for shutdown was removed and replaced with a blocking call (using the semaphore class mentioned above) to the new switchboard.waitForShutdown method.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@46 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-24 21:24:53 +00:00
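
The shutdown change in 58b1a0ba40 replaces a polling loop with a blocking call. A minimal sketch of that idea follows. It uses java.util.concurrent.Semaphore as a stand-in for the project's own serverSemaphore class, whose API is not shown in this log. Apart from waitForShutdown, which the commit message names, every identifier here is an assumption.

```java
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;

// Illustrative stand-in for a switchboard; not the YaCy implementation.
final class ShutdownSketch {

    // Starts with zero permits, so acquire() blocks until terminate() runs.
    private final Semaphore shutdownSignal = new Semaphore(0);

    /** Blocks the caller without busy waiting until shutdown is requested. */
    void waitForShutdown() throws InterruptedException {
        shutdownSignal.acquire();
    }

    /** Timed variant: true if shutdown was requested within 'millis'. */
    boolean awaitShutdown(long millis) {
        try {
            return shutdownSignal.tryAcquire(millis, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt status
            return false;
        }
    }

    /** Requests shutdown by releasing one permit (wakes one waiter). */
    void terminate() {
        shutdownSignal.release();
    }

    public static void main(String[] args) throws InterruptedException {
        ShutdownSketch switchboard = new ShutdownSketch();
        new Thread(switchboard::terminate).start();
        switchboard.waitForShutdown();
        System.out.println("shutdown requested");
    }
}
```

Each release() wakes a single waiter in this sketch, so a design with several waiting threads would need one permit per waiter or a different primitive such as a latch.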
rramthun
ff21586a27 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@43 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-22 15:29:28 +00:00
orbiter
00f223cfc1 fixed post-parsing (a case when the bluelist is empty)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@41 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 17:13:43 +00:00
theli
c9c0a1f11c *) Trying to speedup local crawling
- introduction of a threadpool for crawling
- introduction of a job queue to avoid busy waiting for a free crawler slot

*) New classes added
- queue for receiving of crawler jobs
- semaphore class to do reader/writer synchronization (mutual exclusion)
- message object to hold all needed data about a crawler job

*) Trying to solve session-thread shutdown problem
- session thread stopped variable is now set from outside before interrupting the
  session thread.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@39 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-21 10:31:40 +00:00
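
The design described in c9c0a1f11c (a pool of crawler threads fed by a job queue, with a message object per job) can be sketched roughly as below. This is not the YaCy code: it uses a standard BlockingQueue rather than the project's own queue and semaphore classes, the actual fetch is replaced by a placeholder, and all names are hypothetical.

```java
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.LinkedBlockingQueue;

// Illustrative worker pool with a blocking job queue; not YaCy code.
final class CrawlQueueSketch {

    /** Hypothetical message object holding the data for one crawl job. */
    static final class CrawlJob {
        final String url;
        final int depth;

        CrawlJob(String url, int depth) {
            this.url = url;
            this.depth = depth;
        }
    }

    // Sentinel job that tells a worker to exit.
    private static final CrawlJob POISON = new CrawlJob("", -1);

    private final BlockingQueue<CrawlJob> jobs = new LinkedBlockingQueue<>();
    private final List<String> processed = new CopyOnWriteArrayList<>();
    private final Thread[] workers;

    CrawlQueueSketch(int workerCount) {
        workers = new Thread[workerCount];
        for (int i = 0; i < workerCount; i++) {
            workers[i] = new Thread(this::workLoop, "crawler-" + i);
            workers[i].start();
        }
    }

    private void workLoop() {
        try {
            while (true) {
                CrawlJob job = jobs.take(); // blocks until a job arrives, no polling
                if (job == POISON) {
                    return;
                }
                processed.add(job.url); // placeholder for fetching and parsing
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    void enqueue(String url, int depth) {
        jobs.add(new CrawlJob(url, depth));
    }

    /** Sends one sentinel per worker, waits for all workers, returns the handled URLs. */
    List<String> shutdownAndCollect() {
        for (int i = 0; i < workers.length; i++) {
            jobs.add(POISON);
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return processed;
    }
}
```

Because the queue is first-in, first-out, every job enqueued before shutdownAndCollect() is taken by some worker before that worker can see a sentinel.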
rramthun
ce7d8c4fe0 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@37 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 16:01:14 +00:00
rramthun
570de9c4f4 Fixed some spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@36 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-20 13:28:32 +00:00
rramthun
b61567a39e Fixed spelling mistake and inserted author as described on the mailing list
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@32 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 13:56:59 +00:00
(no author)
d5ff81c636 *) Undoing last changes. Sorry.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@25 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:52:04 +00:00
(no author)
ea4849716f *) Bugfix: plasmaCondenser.getWords() was not used correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@24 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:37:27 +00:00
(no author)
0a6cf3f5e7 *) Bugfix: Reference to plasmaHTCache.Entry.urlString was not set correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@23 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-19 06:33:53 +00:00
orbiter
b9203bdb50 bug fixes and code cleaning
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@22 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 14:18:14 +00:00
(no author)
b7d4389e4b *) support for Proxy Auto-Config File generation added.
File is accessible using: 
   http://proxy:8080/autoconfig.pac

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@20 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-15 09:06:15 +00:00
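
Commit b7d4389e4b adds generation of a Proxy Auto-Config file. The content YaCy actually serves is not shown in this log. The sketch below only illustrates the general shape of such a file: a JavaScript function named FindProxyForURL that returns a proxy directive. The Java helper name and the fallback to DIRECT are assumptions.

```java
// Hypothetical generator for a minimal PAC file; not the YaCy implementation.
final class AutoConfigSketch {

    /** Builds a PAC script that routes every request through host:port. */
    static String pacFor(String host, int port) {
        return "function FindProxyForURL(url, host) {\n"
             // "DIRECT" as a second entry lets the browser fall back if the proxy is down.
             + "    return \"PROXY " + host + ":" + port + "; DIRECT\";\n"
             + "}\n";
    }

    public static void main(String[] args) {
        System.out.print(pacFor("proxy", 8080));
    }
}
```

A browser pointed at the autoconfig.pac URL from the commit message would evaluate a script of this kind for each request.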
orbiter
e7d055b98e very experimental integration of the new generic parser and optional disabling of bluelist filtering in proxy. Does not yet work properly. To disable the disable-feature, the presence of a non-empty bluelist is necessary
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@17 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-13 15:52:00 +00:00
orbiter
a87a17a3c8 prepared generic text parser environment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@15 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-12 22:57:54 +00:00
rramthun
a6154697ba Fixed some grammar/spelling mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:49:03 +00:00
allo
1e33de27da Log is now skinnable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 10:10:32 +00:00
rramthun
f824958184 Fixed some spelling/grammar mistakes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-09 09:20:50 +00:00
orbiter
7eb3c81aad name check on new peer names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 23:15:03 +00:00
allo
955d26b117 Bug: could not install from URL and use the skin (because of two names "use")
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-08 19:07:41 +00:00
orbiter
248077d3f0 initial load with yacy 0.36
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-04-07 19:19:42 +00:00