yacy_search_server/source/net/yacy/document/parser
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All
writings to the Metadata-DB are now also done to solr. This includes
metadata transfer during search and rwi transfer.

The new/added solr fields are:

## time when resource was loaded
load_date_dt

## date until resource shall be considered as fresh
fresh_date_dt

## id of the host, a 6-byte hash that is part of the document id
host_id_s

## ids of referrer to this document
referrer_id_ss

## the md5 of the raw source
md5_s

## the name of the publisher of the document
publisher_t

## the language used in the document; starts with primary language
language_ss

## an external ranking value
ranking_i

## the size of the raw source
size_i

## number of links to audio resources
audiolinkscount_i

## number of links to video resources
videolinkscount_i

## number of links to application resources
applinkscount_i
2012-08-05 15:49:27 +02:00
..
augment made class methods static where possible 2012-07-05 12:38:41 +02:00
html - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
images - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
rdfa made class methods static where possible 2012-07-05 12:38:41 +02:00
xml added concurrency enhancement to xml parser 2012-06-04 23:35:56 +02:00
bzipParser.java - moved triple store to net.yacy.cora.lod (should be generalized there 2012-06-11 16:48:53 +02:00
csvParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
docParser.java refactoring and new usage of SentenceReader: this class appeared as one 2012-07-04 21:15:10 +02:00
dwgParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
genericParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
gzipParser.java - moved triple store to net.yacy.cora.lod (should be generalized there 2012-06-11 16:48:53 +02:00
htmlParser.java - fixed a memory leak (or bad usage) during parsing/snippet fetch 2012-07-06 09:05:41 +02:00
mmParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
odtParser.java removed unused method parameters 2012-07-05 10:23:07 +02:00
ooxmlParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
pdfParser.java added more solr fields to integrate values from URIMetadataRow. All 2012-08-05 15:49:27 +02:00
pptParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
psParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
rdfParser.java refactoring and new usage of SentenceReader: this class appeared as one 2012-07-04 21:15:10 +02:00
rssParser.java - moved triple store to net.yacy.cora.lod (should be generalized there 2012-06-11 16:48:53 +02:00
rtfParser.java refactoring and new usage of SentenceReader: this class appeared as one 2012-07-04 21:15:10 +02:00
sevenzipParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
sidAudioParser.java cleaned unnecessary nested code 2012-07-05 08:44:39 +02:00
sitemapParser.java fix for sitemap importer: can now also import very large sitemaps within 2012-07-08 16:11:50 +02:00
swfParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
tarParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
torrentParser.java - replaced all length() == 0 and size() == 0 with isEmpty() 2012-07-10 22:59:03 +02:00
vcfParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
vsdParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00
xlsParser.java removed unused method parameters 2012-07-05 10:23:07 +02:00
zipParser.java reduced logging overhead (a bit) 2012-07-12 19:23:40 +02:00