mirror of https://github.com/yacy/yacy_search_server.git
synced 2024-09-22 00:00:59 +02:00
c36da90261
- When a site crawl of an FTP site is started, a special directory-tree harvester now fetches the complete directory structure of the FTP server at once.
- The harvester runs concurrently and feeds into the normal crawl queue.

Also in this commit:

- Fixed the 'start from file' crawl function.
- Added a link detector for the HTML parser. The HTML parser can now also extract links that are not enclosed in <a> tags.
- As a result, a crawl can now also be started from clear-text link files.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7367 6c8d7289-2bf4-0310-a012-ef5d649a1542
File |
---|
AbstractScraper.java |
AbstractTransformer.java |
CharacterCoding.java |
ContentScraper.java |
ContentTransformer.java |
ImageEntry.java |
Scraper.java |
ScraperInputStream.java |
ScraperListener.java |
Transformer.java |
TransformerWriter.java |
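
The commit message above mentions a link detector that lets the HTML parser pick up links that are not enclosed in <a> tags, which in turn allows a crawl to start from a clear-text link file. The following is only a minimal sketch of that idea, assuming a plain regular-expression scan over the text. The class `PlainTextLinkDetector` and its method `findLinks` are hypothetical names chosen for illustration; they are not taken from the files listed here and may differ substantially from what the commit actually implements.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of YaCy): finds URLs in text without <a> markup.
final class PlainTextLinkDetector {

    // Matches http, https and ftp URLs up to the next whitespace, quote or angle bracket.
    private static final Pattern URL_PATTERN = Pattern.compile(
            "\\b(?:https?|ftp)://[^\\s<>\"']+", Pattern.CASE_INSENSITIVE);

    // Returns the distinct URLs found in the text, in order of first appearance.
    static Set<String> findLinks(String text) {
        Set<String> links = new LinkedHashSet<>();
        Matcher matcher = URL_PATTERN.matcher(text);
        while (matcher.find()) {
            String candidate = stripTrailingPunctuation(matcher.group());
            try {
                // Keep only candidates that parse as a URI with a host part.
                if (new URI(candidate).getHost() != null) {
                    links.add(candidate);
                }
            } catch (URISyntaxException e) {
                // Not a well-formed URI: ignore this candidate.
            }
        }
        return links;
    }

    // Removes sentence punctuation that often follows a URL in running text.
    private static String stripTrailingPunctuation(String s) {
        int end = s.length();
        while (end > 0 && ".,;:!?)".indexOf(s.charAt(end - 1)) >= 0) {
            end--;
        }
        return s.substring(0, end);
    }

    public static void main(String[] args) {
        // Example input resembling a clear-text link file with surrounding prose.
        String text = "see http://example.com/a.html, and ftp://ftp.example.org/pub/ (mirror)";
        for (String link : findLinks(text)) {
            System.out.println(link);
        }
    }
}
```

A caller could pass each line of a clear-text link file to `findLinks` and hand the resulting URLs to the crawl queue. A regular expression is the simplest choice for such a scan; it will miss links without a scheme (for example `www.example.com`), which a production detector would probably need to handle.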