Commit Graph

10161 Commits

Author SHA1 Message Date
bhoerdzn
a3824dfbaa check URL on inital load, if set 2013-10-09 13:52:44 +02:00
bhoerdzn
52f49d475b add a hidden field for "crawlingstart" since jQuery omits the submit button value 2013-10-09 13:38:20 +02:00
bhoerdzn
b0c0ec2dec link recorded crawl starts back to "CrawlStartExpert_p" in "Process Scheduler" 2013-10-09 12:55:42 +02:00
bhoerdzn
d64d45361c use integer types for boolean values 2013-10-09 12:42:04 +02:00
bhoerdzn
eda123d6fd remove debugging code intercepting post requests 2013-10-09 11:51:07 +02:00
bhoerdzn
5057f27bbd fix typo in parsing "cachePolicy" parameter 2013-10-09 11:41:15 +02:00
bhoerdzn
98f5c9018d Fixed template vars for "deleteold". Fixed parsing "deleteold" parameter. Stop "setState" overwriting "deletold" state on load. 2013-10-09 11:32:17 +02:00
bhoerdzn
a6a62986d4 correct state handling for country code restriction 2013-10-09 10:42:35 +02:00
bhoerdzn
4066b85155 correctly set initial state for load filters 2013-10-09 10:36:08 +02:00
bhoerdzn
8c91c3e7cd set form boolean values to 0 & 1 instead of false & true 2013-10-09 10:05:51 +02:00
bhoerdzn
c27fabc88e fixed wrong parameter check 2013-10-09 10:00:16 +02:00
bhoerdzn
2214bf5396 Remove some post parameters, if they are set to default values, as their values are already set by YaCy. Added some documentation. 2013-10-09 09:48:00 +02:00
reger
e74f548551 make legacy http server (serverCore) implement YaCyHttpServer interface 2013-10-09 01:07:22 +02:00
reger
71d2655c02 downgrade to Jetty 8 to assure support of JRE 1.6
- introduce a YaCyHttp interface to modulize/separate http server
- adjust the Jetty version specific implementation part (in package net.yacy.http)
     - putting the version specific code in classes starting with Jetty8xxxx
     - moved existing Jetty9xxx implementation into a test class (to keep the code)
- adjust build to the changed jars
- make use of the introduced YaCyHttpServer interface in related htroot servlets

- adjust other test cases/classes
2013-10-09 00:40:48 +02:00
Michael Peter Christen
1b61bd40ed - Added new solr field url_file_name_tokens_t which stores the file name
tokens. This can be used to enhance the ranking.
- Added also a rating_i field as basis for later usage.
- enhanced the tokenization process.
2013-10-08 23:48:13 +02:00
orbiter
6efa7532d2 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2013-10-08 19:04:57 +02:00
orbiter
5f5a97bafc added the anchor text within web pages to the searcheable entities of a
web page. This can be of benefit for the ranking if these fields are
used for boosts.
2013-10-08 18:41:07 +02:00
orbiter
705b3338ee list more fields available for search and for ranking boosts 2013-10-08 18:15:35 +02:00
sixcooler
d536092fe4 fix false fill NAME_CACHE_MISS-DNS-Cache in case of a timeout
for eg. caused by massive requests when crawl from file
2013-10-08 18:02:42 +02:00
bhoerdzn
405878182f Use list template for all other option lists. Fixed some template expressions. 2013-10-08 15:04:31 +02:00
bhoerdzn
8e74098cd4 Use list template for "reloadIfOlderNumber". 2013-10-08 13:26:09 +02:00
bhoerdzn
52bad7b908 Dynamic toggling of form fields, based on passed in and selected values. This will also cut down the post string by disabling not needed fields. 2013-10-08 13:24:27 +02:00
Michael Peter Christen
78e7aadb26 removed unused initialization method 2013-10-07 23:51:28 +02:00
Michael Peter Christen
e56aa4fe93 fixed search navigation 2013-10-07 23:51:08 +02:00
Michael Peter Christen
4fbc4740df removed warnings 2013-10-07 23:41:50 +02:00
Lotus
202a9fbdad adding synonyms from German OpenThesaurus ready for use in YaCy 2013-10-07 22:02:42 +02:00
Michael Peter Christen
21aa6a0321 migration to Solr 4.5.0 2013-10-07 17:09:40 +02:00
bhoerdzn
45cf553bc3 try to guess default crawling mode, if none set 2013-10-07 13:13:22 +02:00
bhoerdzn
b4f0c822f2 assign strings before checking contents 2013-10-07 13:01:39 +02:00
Michael Peter Christen
ef31d0f279 fix for rss reader, see http://bugs.yacy.net/view.php?id=294 2013-10-07 12:59:54 +02:00
bhoerdzn
499abe8f91 set default values for string parameters 2013-10-07 12:32:23 +02:00
Jens Bertram
85316b3ac6 Merge branch 'master' into crawlexpert-post 2013-10-07 12:02:52 +02:00
bhoerdzn
42ea56eaad made crawStartExpert_p aware of post variables; extended template where needed 2013-10-07 11:25:59 +02:00
Michael Peter Christen
101a6e6e14 Patch the citation index for links with canonical tags.
This shall fulfill the following requirement:
If a document A links to B and B contains a 'canonical C', then the
citation rank computation shall consider that A links to C and B does
not link to C.
To do so, we first must collect all canonical links, find all references
to them, get the anchor list of the documents and patch the citation
reference of these links.
2013-10-07 11:15:58 +02:00
reger
daebeb93aa add call to AccessTracker to jetty security handler 2013-10-04 01:16:17 +02:00
reger
172aefaeeb adjust YaCySecurityHandler to Jetty 9 conventions
- mainly adjust prepareConstraintInfo to use the RoleInfo.setChecked as in Jetty Source distribution
- use constraint check behavior as in ConstraintSecurityHandler
  see http://git.eclipse.org/c/jetty/org.eclipse.jetty.project.git/tree/jetty-security/src/main/java/org/eclipse/jetty/security/ConstraintSecurityHandler.java?id=jetty-9.0.5.v20130813
2013-10-03 19:38:03 +02:00
orbiter
ba3c173077 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2013-10-03 18:19:02 +02:00
reger
6f9ed439d3 - expand localHostName check of AbstractRemoteHandler
to pevent request is handled as proxy request 
- make domain handler not relay on included path in resolved .yacy address
2013-10-01 03:04:32 +02:00
reger
561ea135af fix : forgot adding security handler 2013-09-30 04:35:17 +02:00
reger
f46771bdf5 upd build script from rc1/master 2013-09-30 03:47:55 +02:00
reger
c7c706fd9f merge with rc1/master 2013-09-30 03:46:39 +02:00
reger
272b196d05 update Jetty server init() to activate yacy-domain and transparent proxy handler
- adding  domain & proxy handler to a context (as it was in inital design)
     (context required for dispatcher)
- make handler context and servlet context parallel available 
     (to allow use of YaCyDefaultServlet to handle legacyServlets)
- set transparent proxy request handled after dispatch.forward to skip further handling for .yacy domain requests
2013-09-30 03:12:52 +02:00
reger
fd119deb00 fix NPE on modified since check ( Response.requestHeader allowed to be null) 2013-09-30 02:50:53 +02:00
reger
66145a0410 - add welcome file (index.html) support to YaCyDefaultServlet
- change SolrServlet default search field (&df) to text_t
2013-09-29 03:34:00 +02:00
orbiter
a3b5d84c81 Merge remote-tracking branch 'origin/master'
Conflicts:
	.classpath
2013-09-28 15:46:59 +02:00
orbiter
adfae074cf added classpath for debugging 2013-09-28 15:45:33 +02:00
Michael Peter Christen
b28d43decc added two more fields source_cr_host_norm_i,target_cr_host_norm_i in
webgraph and an addition to postprocessing to copy all cr ranking
attributes to the link edges associated to the postprocessing documents
2013-09-27 16:57:05 +02:00
Michael Peter Christen
a52f3a597e fix for canonical-from-http-header feature 2013-09-27 15:09:04 +02:00
Michael Peter Christen
2dd7c5be44 added parsing of http-canonical tags (untested, could not find an
example page)
2013-09-27 13:17:50 +02:00
Michael Peter Christen
4476dea5ba do not fail if a wrong boost key is used; instead, print only a warning
See also: http://bugs.yacy.net/view.php?id=293
2013-09-27 12:28:09 +02:00