yacy_search_server/source/net/yacy
Michael Peter Christen 101a6e6e14 Patch the citation index for links with canonical tags.
This shall fulfill the following requirement:
If a document A links to B and B contains a 'canonical C', then the
citation rank computation shall consider that A links to C and B does
not link to C.
To do so, we first must collect all canonical links, find all references
to them, get the anchor list of the documents and patch the citation
reference of these links.
2013-10-07 11:15:58 +02:00
..
contentcontrol - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
cora added two more fields source_cr_host_norm_i,target_cr_host_norm_i in 2013-09-27 16:57:05 +02:00
crawler Patch the citation index for links with canonical tags. 2013-10-07 11:15:58 +02:00
data - replaced the properties object in AnchorURL with distinct variables 2013-09-15 23:27:04 +02:00
document do not check all words with synonym library if the library is empty 2013-09-21 08:56:24 +02:00
gui Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00
interaction - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
kelondro - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
peers fix for npe during location search 2013-09-21 21:03:58 +02:00
repository self-healing of mistakenly deactivated crawl profiles. This fixes a bug 2013-09-25 18:27:54 +02:00
search Patch the citation index for links with canonical tags. 2013-10-07 11:15:58 +02:00
server - crawl profiles shall be deleted from active and passive stacks if they 2013-09-26 10:22:31 +02:00
upnp replaced StringBuffer with StringBuilder 2013-07-23 12:21:27 +02:00
utils - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
visualization added tools to visualize image generation (i.e. during testing) 2013-09-09 12:58:26 +02:00
dbtest.java - the webgraph shall store all links which appear on a web page and not 2013-09-15 00:30:23 +02:00
migration.java FieldReIndex: omit active vocabulary fields from reindex detection 2013-08-14 00:00:30 +02:00
yacy.java disabled ipv4 preference to enable ipv6-only networks like freifunk 2013-09-20 16:52:37 +02:00
YaCySearchClient.java Added 'final' for all exception blocks as this helps the Java compiler 2013-07-17 18:31:30 +02:00