yacy_search_server/source/net/yacy/document/importer
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD
queue and not from virtual documents generated by the parser.
- The parser now generates nice description texts for NOLOAD entries
which shall make it possible to find media content using the search
index and not using the media prefetch algorithm during search (which
was costly)
- Removed the media-search prefetch process from image search
2012-04-24 16:07:03 +02:00
..
Importer.java migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2 2010-06-28 16:25:14 +00:00
MediawikiImporter.java - Redesigned crawler and parser to accept embedded links from the NOLOAD 2012-04-24 16:07:03 +02:00
OAIListFriendsLoader.java - tested the ARC methods 2011-11-25 14:09:25 +00:00
OAIPMHImporter.java !Important: move from Hashtable to HashMap 2012-01-09 01:29:18 +01:00
OAIPMHLoader.java stop loading via http at defined maximum of bytes - even size is unknown before loading 2011-08-01 23:28:23 +00:00
ResumptionToken.java some last-minute performance hacks 2011-11-25 11:23:52 +00:00