Michael Peter Christen
0294a53459
- add canonical field only if requested by solr schema
...
- remove canonical url from in/outbound urls if present
2012-06-26 14:51:57 +02:00
Michael Peter Christen
3fd4a01286
added option to record urls that are forwarded to the solr index
2012-06-26 13:54:48 +02:00
Michael Peter Christen
d763e4d94b
fixed bad referer computation in SSIs which causes a NPE during host
...
computation. This error was there before the latest IPv6 hack but did
not cause a NPE. The IPv6 hack was not the cause for this bug, but it
discovered the misconfiguration of the 'referer' referrer.
2012-06-26 11:18:29 +02:00
Michael Peter Christen
358b04885e
more IPv6 hacks
2012-06-26 00:25:46 +02:00
Michael Peter Christen
96aeb127e3
generalized localhost naming.
...
this is also a preparation for a better IPv6 implementation.
2012-06-26 00:08:25 +02:00
Michael Peter Christen
77f795756c
fixing redirects and status codes: storing of status code in
...
ResponseHeader to make it available for late evaluations, like storage
in solr.
2012-06-25 18:17:31 +02:00
Michael Peter Christen
8dd469b9dd
added option to configure the autocommit delay time of solr on-the-fly
2012-06-25 14:59:46 +02:00
Michael Peter Christen
5d9bd4ddc2
Merge remote-tracking branch 'origin/master'
2012-06-25 11:37:32 +02:00
Michael Peter Christen
b9dfca4b0a
- fixed IndexFederated Servlet / a embedded Solr can now be selected
...
- added code stub for an embedded Solr but generation of Solr store is
still commented out (it works but is not yet ready for usage)
2012-06-25 11:34:38 +02:00
reger
2931726386
adjusted NetBeans classpath for new and updated libraries in lib
2012-06-24 22:50:08 +02:00
Michael Peter Christen
cc1b6762bb
root, not yacy
2012-06-24 10:58:09 +02:00
Michael Peter Christen
2589158f44
changed recommended line in /etc/crontab for high-availability
2012-06-24 10:57:18 +02:00
reger
4156d4e12b
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
2012-06-23 21:22:46 +02:00
Michael Peter Christen
7b53be141f
upgraded to pdfbox 1.7.0
...
changes in http://www.apache.org/dist/pdfbox/1.7.0/RELEASE-NOTES.txt
with many bugfixes, including performance related
2012-06-22 16:49:58 +02:00
Michael Peter Christen
fad3b14813
added jetty libraries, needed for future use as web server and as
...
application server for the solr search interface
2012-06-22 15:31:17 +02:00
Michael Peter Christen
a38b0a2c46
extended embedded solr tests to ensure that it will be usable within a
...
jetty instance
2012-06-22 11:40:02 +02:00
Michael Peter Christen
b9d42fd9c8
using com.google.common.io.Files instead of homebrew methods
2012-06-22 11:39:17 +02:00
Michael Peter Christen
a5eb91fa60
refactoring
2012-06-22 00:49:32 +02:00
Michael Peter Christen
1be0025a9c
- added test for EmbeddedSolrConnector
...
- added needed libraries for this test
this includes most (all) files needed for an embedded solr
2012-06-22 00:36:49 +02:00
Michael Peter Christen
dbdd697f4d
moved RDFaParser.xsl configuration file to defaults
2012-06-21 16:09:12 +02:00
Michael Peter Christen
90b82ce994
using guava for host resolution (non-blocking for ips) and time-out
2012-06-21 16:04:48 +02:00
Michael Peter Christen
f094936b89
added new class libraries to mac app
2012-06-21 14:59:55 +02:00
Michael Peter Christen
e12bb254b4
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-06-21 14:55:50 +02:00
Michael Peter Christen
3f55dc7c1e
- added solr core and libraries that solr needs (lucene is missing, will
...
follow later)
- added embedded solr connector which can connect to solr
programmatically (without using a server in between)
2012-06-21 14:55:38 +02:00
reger
2fccc4e883
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
2012-06-21 01:01:12 +02:00
Michael Peter Christen
c337190a00
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-06-20 20:18:10 +02:00
cominch
c63c3a4495
Show additional interaction elements in footer section on each page, if
...
activated in ConfigPortal.html.
This footer is also visible in augmented browsing proxy mode.
2012-06-20 18:04:23 +02:00
Michael Peter Christen
786be7d175
better integration of RDFaParser
2012-06-20 16:39:04 +02:00
cominch
fa98657bb3
Augmented Browsing: changed the settings page
2012-06-20 09:10:39 +02:00
cominch
751eeade0d
Merge remote-tracking branch 'original yacy/master'
2012-06-20 07:58:27 +02:00
cominch
84a11ec48c
Corrected loading of default page settings on ConfigPortal.html
2012-06-20 07:55:28 +02:00
sixcooler
bea002dc15
correct table in new look of Crawler_p
2012-06-19 13:13:00 +02:00
Michael Peter Christen
15f4551d88
Release 1.03
2012-06-19 08:51:26 +02:00
Michael Peter Christen
8738336408
set Xms lower than Xmx
2012-06-19 08:45:49 +02:00
Michael Peter Christen
de3ef8ad73
removed unimportant warnings
2012-06-19 08:45:34 +02:00
Michael Peter Christen
f7c43e964c
enable asserts only with debugging
2012-06-19 08:23:10 +02:00
Michael Peter Christen
82a682b31d
fixed problem with seed when switching network
2012-06-19 07:44:44 +02:00
reger
b89a69ae2e
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
2012-06-19 03:04:44 +02:00
cominch
6b4545d6b0
Only load tag information if necessary
2012-06-19 01:40:22 +02:00
cominch
011f8a5818
Auto Tagging: Add hyperlinks to tags (provisional)
2012-06-19 01:24:06 +02:00
Michael Peter Christen
8c544edee4
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2012-06-18 21:26:06 +02:00
Michael Peter Christen
7dc59979bc
fix for npe, possibly for http://bugs.yacy.net/view.php?id=195
2012-06-18 21:25:39 +02:00
Michael Peter Christen
1d4e206b2b
bugfix in vocabulary generation
2012-06-18 18:10:40 +02:00
cominch
2c89975378
Merge remote-tracking branch 'original yacy/master'
2012-06-18 16:16:46 +02:00
cominch
71047fe63a
Augmented browsing: CSS fix
2012-06-18 16:16:31 +02:00
Michael Peter Christen
52f5d40043
better abstraction of document model generation
2012-06-18 15:55:18 +02:00
Michael Peter Christen
8b7c4d3144
produce a rdf output containing the triplestore with yacydoc; ie:
...
http://localhost:8090/api/yacydoc.rdf?urlhash=yOiCM7Fh1hyQ
2012-06-18 15:47:54 +02:00
cominch
f7160dae5c
Merge remote-tracking branch 'original yacy/master'
2012-06-18 15:44:50 +02:00
cominch
e4555cbee3
Augmented browsing: Pass on additional action parameter
2012-06-18 15:44:01 +02:00
Michael Peter Christen
24bbe359ca
integrate also geonames library files for less cities. these are more
...
useful for tagging since less normal words are false-identified as
location
2012-06-18 15:19:57 +02:00