yacy_search_server/libx
theli 351c86d5d9 *) Migration of optional Content Parser integration
- each additional parser must be in a subpackage 
  of plasma.parser
- each parser must have its own ant build file (which will 
  be called automatically from the main build file)
- Calling the main build file results in building a separate 
  zip file for each optional parser. This zip file includes:
  + sources of the Parser.java
  + compiled classes of the Parser.java
  + needed additional libs (libx)
- To install an additional parser the user simply needs to
  extract the zip file listed above into his/her yacy directory.
- The configuration (enabling/disabling) of a parser can be done
  via the webinterface (currently the settings dialoge) and is
  done "on-the-fly". The installation can not be done "on-the-fly"
  at the moment because of classpath issues.
- The classpath of the linux startup/stop scripts is generated 
  automatically now (including all libraries from lib and libx).

*) Bugfix: File Extension was not calculated correctly by the crawler
   e.g.: file extension was accidentally: .php?param=value
   Corrected.

*) Adding additional parser for parsing of rss/atom feeds
- added needed libs to do this.

TODO:
- automatic building classpath for windows startup scripts


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@78 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-05-03 09:47:56 +00:00
..
commons-logging.jar *) Migration of optional Content Parser integration 2005-05-03 09:47:56 +00:00
informa-0.6.0.jar *) Migration of optional Content Parser integration 2005-05-03 09:47:56 +00:00
informa-0.6.0.license *) Migration of optional Content Parser integration 2005-05-03 09:47:56 +00:00
jdom.jar *) Migration of optional Content Parser integration 2005-05-03 09:47:56 +00:00
log4j-1.2.9.jar *) adding directory and classes needed by the new content parsers for pdf + doc 2005-04-24 21:12:11 +00:00
PDFBox-0.7.1.jar *) adding directory and classes needed by the new content parsers for pdf + doc 2005-04-24 21:12:11 +00:00
PDFBox-0.7.1.License *) adding directory and classes needed by the new content parsers for pdf + doc 2005-04-24 21:12:11 +00:00
readme_libx.txt Fixed some spelling mistakes... 2005-04-26 15:38:44 +00:00
tm-extractors-0.4.jar *) adding directory and classes needed by the new content parsers for pdf + doc 2005-04-24 21:12:11 +00:00
tm-extractors-0.4.License *) adding directory and classes needed by the new content parsers for pdf + doc 2005-04-24 21:12:11 +00:00