Commit Graph

5159 Commits

Author SHA1 Message Date
Michael Peter Christen
1269e77dfa enhanced location search 2014-10-15 00:55:57 +02:00
Michael Peter Christen
75b5f24be4 make browsing of file://z: - paths in index browser easier - this will
now show the root paths on a shared drive
2014-10-13 18:33:39 +02:00
Michael Peter Christen
8ac3e9f890 fix for api icon in yacysearch_location.html 2014-10-13 16:53:00 +02:00
Michael Peter Christen
a1dd0ae62c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-10-12 23:43:32 +02:00
reger
f5967dfedf add filter to citation page and a on/off button
to display only sentences with citations,
while maintaining the sentence number.
Make the filtered list the default in search result citation link
2014-10-12 06:32:13 +02:00
Michael Peter Christen
f818f84adb more ipv6 fixes 2014-10-11 00:34:07 +02:00
Michael Peter Christen
2c2b50e65d refactoring (class name should start with uppercase letter) 2014-10-10 14:32:21 +02:00
Michael Peter Christen
14385057c2 added also the NetworkHistory servlet... 2014-10-10 14:16:16 +02:00
Michael Peter Christen
d8beafba3a fix for values in CrawlProfileEditor table and xml; now the full profile
is available in the xml.
2014-10-09 13:27:20 +02:00
Michael Peter Christen
ec95dfa2e6 fixed crawl profile xml result which did not show the correct crawl
status.
2014-10-08 18:48:57 +02:00
Michael Peter Christen
8c1a89cb34 added another decoration flag to switch off network graphics in crawler
monitor and index browser: decoration.grafics.linkstructure
Please set this to false to remove the graphics from the interface.
2014-10-08 17:12:35 +02:00
Michael Peter Christen
764e4ed673 fixed appearance of RSS icon on search result page 2014-10-08 15:48:45 +02:00
Michael Peter Christen
9b1958e8ca more ipv6 bugfixes 2014-10-08 15:21:49 +02:00
Michael Peter Christen
7817fc50c9 added a high cpu cycle monitor to PerformanceQueues 2014-10-08 15:20:43 +02:00
Michael Peter Christen
5082feb103 less volume for effect sounds 2014-10-08 15:04:35 +02:00
Michael Peter Christen
0bfc69b29b more ipv6 bugfixes 2014-10-08 12:38:56 +02:00
Michael Peter Christen
a27563e5c3 removed the atmo sound clips because they had been too large 2014-10-07 23:42:41 +02:00
Michael Peter Christen
ae58b22f5b ipv6 fixes for Network.html front page 2014-10-07 21:57:41 +02:00
Michael Peter Christen
e413beac04 fix for latest UPnP update 2014-10-07 17:52:13 +02:00
Michael Peter Christen
74957f3760 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-10-07 17:51:18 +02:00
Michael Peter Christen
2a052f446a Added an experimental audio feedback system.
This is the first element of a new 'decoration' component which may hold
switches for different external appearance parameters.
The first switch in that context is decoration.audio (as usual in
yacy.init). This value is set to false by default, that means the audio
feedback element is switched off by default. To switch it on, set
decoration.audio = true (using /ConfigProperties_p.html). You will then
hear sounds for the following events:
- remote searches
- incoming dht transmissions
- new documents from the crawler
Sound clips are stored in htroot/env/soundclips/ which is done so
because a future implementation will read these files using the http
client and with configurable urls which will make it very easy for the
user to replace the given sounds with own sounds.
2014-10-07 17:51:07 +02:00
Marc Nause
1e6e69bc40 Finished implementation of UPNP:
*) will try other ports if YaCy standard ports are not available
*) distinguish between internal and external port (not sure if this
works 100%)

Still to add: propery in config to enter own external port (in case of
manually configured NAT)
2014-10-07 13:10:06 +02:00
Michael Peter Christen
e1bc768f9d more IPv6 bugfixes 2014-10-06 17:44:27 +02:00
Michael Peter Christen
961f06c0b6 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-10-05 01:25:12 +02:00
reger
209e0f2fe8 allow url parameter in worktable apicall
allow url=wwwl?param=a&param=b (with ?, & encoded)
fix:  http://mantis.tokeek.de/view.php?id=100

fix double adding of  '&' in MultiProtocolURL.escape()
2014-10-04 04:11:48 +02:00
reger
3562b5e3a4 open rejected urls in new browser 2014-10-03 20:49:40 +02:00
reger
b0c87d8240 fix image search expand box, cut-off of 2nd capture line height
tested with IE11 and Firefox 32 (change worked for both to show 2nd line without cutting off height)

+fix charset parameter in metadataImageParser
+update start errMsgTxt to "java 1.7"
2014-10-03 01:43:05 +02:00
reger
fa99b286cc add html5 autofocus to query input field
(leave onload untouched = redundant, for IE9 http://www.w3schools.com/tags/att_input_autofocus.asp)

adjust Peer-to-Peer/ Privacy switch label 
to display "Peer-to-Peer" as 2nd switch option in active stealth mode
2014-10-01 23:53:41 +02:00
Michael Peter Christen
329262231f unresolved pattern fix 2014-10-01 15:34:43 +02:00
Michael Peter Christen
528f583d72 ipv6 fixes 2014-10-01 15:32:10 +02:00
Michael Peter Christen
e4ccca9497 fix for xss bugs found by CTF365 2014-10-01 12:22:55 +02:00
Michael Peter Christen
247e626083 IPv6 host parsing bugfixes 2014-10-01 10:21:03 +02:00
Michael Peter Christen
fe917deb2d when pinging other peers, be able to select the right IP option 2014-10-01 03:47:57 +02:00
Michael Peter Christen
65e6ae52fb IPv6-enhanced Network monitoring page 2014-10-01 03:10:39 +02:00
reger
7c1707872b search result showPicture update search parameter
used parameter &cat=image is obsolete and returns no results
- remove &cat=image and &cat=href references
- remove &tenant= references (unused)
Use contentdom=image and inurl: parameter to make showPicture link display something (open in new window because of used inurl modifier changes original query)
2014-09-30 22:22:13 +02:00
Michael Peter Christen
3073c69aee Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-09-30 14:54:06 +02:00
Michael Peter Christen
6491270b3a large IPv6 redesign of peer ping methods!
removed preferred IPv4 in start options and added a new field IP6 in
peer seeds which will contain one or more IPv6 addresses. Now every peer
has one or more IP addresses assigned, even several IPv6 addresses are
possible. The peer-ping process must check all given and possible IP
addresses for a backping and return the one IP which was successful when
pinging the peer. The ping-ing peer must be able to recognize which of
the given IPs are available for outside access of the peer and store
this accordingly. If only one IPv6 address is available and no IPv4,
then the IPv6 is stored in the old IP field of the seed DNA.
Many methods in Seed.java are now marked as @deprecated because they had
been used for a single IP only. There is still a large construction site
left in YaCy now where all these deprecated methods must be replaced
with new method calls. The 'extra'-IPs, used by cluster assignment had
been removed since that can be replaced with IPv6 usage in p2p clusters.
All clusters must now use IPv6 if they want an intranet-routing.
2014-09-30 14:53:52 +02:00
reger
0ecbf32134 update to Jetty 9.2.3 2014-09-27 23:27:05 +02:00
reger
46afdf7d21 add link to thread pool settings in status panel 2014-09-21 22:35:03 +02:00
reger
54019313e7 fix NPE in ViewFile - show snippet
on document not in index
2014-09-21 03:48:54 +02:00
reger
4873a2d3a4 adjust link to peer in Network list
(www path obsolete)
2014-09-21 00:10:20 +02:00
orbiter
3ac31614a3 added option to reverse-sort YaCy tables (internal API change only) 2014-09-18 11:11:09 +02:00
Michael Peter Christen
6d3d4c4ea6 changed the concurrent enumeration of query results in such a way that
it is now possible to get the results in two steps:
- first retrieve all IDs as given for a query
- then retieve each document individually

This was necessary for very large result sets where a query may run for
hours and is possibly terminated by a solr-internal timeout. This occurs
regulary during postprocessing and therefore this commit may fix
unwanted postprocessing terminations.
2014-09-17 13:58:55 +02:00
reger
ed0d7a80d5 modifiy description for Field-Reindex to act only on local index
http://mantis.tokeek.de/view.php?id=279
2014-09-15 23:29:42 +02:00
Michael Peter Christen
81f9b34da7 increaesed ability ot search for all images on a single server within
the p2p remote search
2014-09-15 20:33:22 +02:00
Michael Peter Christen
9b92685771 automatically add a wild card if only a search on a single domain is
done. This makes it possible to search all documents on a single domain
even if no search word is given. This is in particular interesting when
searching for all images on a single domain.
2014-09-15 13:38:53 +02:00
Michael Peter Christen
abde89438b fix for favicon 2014-09-15 13:03:08 +02:00
Michael Peter Christen
ca8b2bf099 removed www and welcome servlet, these had been demo servlets and are
not needed any more
2014-09-15 12:48:58 +02:00
reger
5247d01cd4 implement a forward to remote peer link in P2P Network list
Most links in Network.html are only available with transparent proxy = on, which is switched off by default,
to make the provided links useable in default setup a small forward servlet added (goto_p.java),
which takes the peer hash as parameter and forwards to current public ip (optional with path= parameter).
The servlet is protected ( _p ending) to assure forwarding works only for authorized YaCy users.
2014-09-14 23:26:22 +02:00
reger
de7641023c add recommended link "self" to atom feed output 2014-09-14 01:40:27 +02:00
Michael Peter Christen
805a95a98b fix for http://mantis.tokeek.de/view.php?id=467 2014-09-13 16:26:40 +02:00
Michael Peter Christen
7527ae63e7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-09-13 11:02:38 +02:00
Michael Peter Christen
b93ea4e2a6 Added the option to retrieve only the own seed or a seleceted seed
(selected by peer-hash) in the /yacy/seedlist.[json/xml] api.
Added also a xml-version of the servlet.
The own seed can be retrieved i.e. with
http://localhost:8090/yacy/seedlist.xml?my=
http://localhost:8090/yacy/seedlist.json?my=
and any other peer can be selected with
http://localhost:8090/yacy/seedlist.xml?id=<peerhash>
http://localhost:8090/yacy/seedlist.json?id=<peerhash>
2014-09-13 11:02:11 +02:00
reger
b5e0f70197 - remove repositoryPath post from ConfigBasic (obsolete)
- remove static snippetComputationTime from ResultEntry (not used)
2014-09-13 03:21:52 +02:00
Michael Peter Christen
ffc259c944 changed link to new tutorial repository (yes, Youtube..). The link does
not point to youtube directly to prevent that the referer to the peer
address is given to youtube. Instead, a forwarder address at yacy.net is
used to redirect to the tutorial repository (and can be changed later).
2014-09-13 02:15:36 +02:00
Michael Peter Christen
b0bfafa581 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-09-12 16:37:21 +02:00
Michael Peter Christen
1735dbc9d9 enhanced image search: bugfixes and performance enhancements 2014-09-12 16:37:01 +02:00
reger
1d5d0b82a6 - skip html template specific servlet post variables (show_xxx) for feeds,
- add <updated> (in required format) to atom feed
2014-09-12 02:10:18 +02:00
reger
8ed6550261 adding totalResults and id to atom feed output 2014-09-12 00:08:22 +02:00
Michael Peter Christen
7611bf79bd Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
Conflicts:
	locales/ru.lng
2014-09-10 13:24:49 +02:00
Michael Peter Christen
d3b000b089 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-09-10 13:18:15 +02:00
reger
9962b9e548 use configured search items per page if not specified in post
- remove verify=cacheonly from admin screen search box to use the configured values
  (otherwise definition overwrites configured behavior and is used for following searches what might give unexpected/confusing different results compared to using /yacysearch )
2014-09-10 00:52:37 +02:00
Michael Peter Christen
2a52c6f0f1 using htroot/api/blacklists as source folder: removed package
declaration of some classes in that folder
2014-09-08 11:36:28 +02:00
Michael Peter Christen
f510fb82dd css skin fix: visited was not set which caused bad colors on new-user
menu design.
2014-09-05 12:54:21 +02:00
Michael Peter Christen
57ce7eeff3 fixed localhost authorization and replaced the adminRealm with an info
string which is visible in the browser. That makes it possible that the
browser instructs the user how to change a forgotten admin password
(during runtime).
2014-09-02 13:15:19 +02:00
Michael Peter Christen
62f48a28d6 moved index administration up ahead of system administration to put more
importance on it. People should not feel that it is more important to
tweak any settings (which may break things) than to look into the index.
2014-09-02 12:06:56 +02:00
Michael Peter Christen
77b4c6dc5b moved Table administration and Busy Queues Config out of mini-submenu of
advanced settings to a top-menu entry. Moved the advanced setting to a
less prominent place of the submenu. Removed the table administration
from target analysis submenu because it appeared double, the table
administration is now the default in the system administration.
Sorry for inconvenience if i constantly move menues around, but this
makes just more sense and YaCy is still not finished :)
2014-09-02 12:04:44 +02:00
Michael Peter Christen
c90ae191ab moved cookie monitoring to the network monitoring submenu 2014-09-02 11:53:22 +02:00
orbiter
0947bea882 fixed wrong submenu title 2014-09-01 11:08:35 +02:00
orbiter
3ba47823cb switched position of API steering and content semantic 2014-09-01 11:04:24 +02:00
reger
0ff66118bf exclude nav-header/footer in ServerScannerList.html?embedded
fixes display of header in yacyinteractive.html
2014-08-31 05:04:19 +02:00
orbiter
301961c4c1 small fix to the welcome message 2014-08-28 16:28:06 +02:00
orbiter
2dd4b274d4 update to kaskelix 2014-08-28 12:15:11 +02:00
orbiter
46efeb6ea2 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-08-27 18:46:04 +02:00
orbiter
60f97faec6 added hint to the search api examples to remind the user that these
interfaces are examples and not actual variants of the current search
2014-08-27 18:44:20 +02:00
orbiter
f642cfbe30 added hint to the regular expression tester 2014-08-27 18:40:20 +02:00
orbiter
73ebf69ca7 changed style of info-icon to be similar to bootstrap.css glyphicons 2014-08-27 18:29:40 +02:00
reger
6654d314f1 add rss version to api/feed.rss
IE11 reports error without
2014-08-27 02:31:21 +02:00
orbiter
cbb5f06630 do not remove the index deletion option from the IndexControlURLs_p.html
servlet after a deletion happend, instead show but disable the option
when the index is empty.
2014-08-27 00:45:39 +02:00
orbiter
73c2e47de3 added a confirmation dialog to complete index deletion 2014-08-27 00:31:03 +02:00
orbiter
688c6d8954 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-08-27 00:04:36 +02:00
orbiter
500e0b9927 fix for browsing of file paths in Index Browser 2014-08-27 00:03:24 +02:00
Marc Nause
2af56fa37d Improved UPnP. (still not perfect)
*) set HTTPS port if enabled
*) improved data structures (may not be final)
*) moved UPnP to own package
2014-08-26 22:47:13 +02:00
orbiter
b3ebd38079 removed the HTDOCS repository concept because the concept to host files
on the YaCy http server is obsolete; YaCy can index file:// and smb://
paths
2014-08-26 19:02:53 +02:00
orbiter
5611d45b65 renamed Host Browser to Index Browser (gives a better hint what it
actually does)
2014-08-26 18:52:49 +02:00
reger
ec5b1d9e33 let NETWORK_WHITELIST take precedence over NETWORK_BLACKLIST
this makes it easier to config exception (for private networks),
like   blacklist= .*
        whitelist= 10\..*,127\..* .....     allows only listed ip pattern
2014-08-26 01:02:38 +02:00
reger
70bb3d1b38 update target url for yacy-portalsearch.html to search.yacy.net (peer yacportalsearch2014)
(old www.yacy-suche.de not reacheable)
2014-08-24 23:05:22 +02:00
reger
29ccbf6491 seedUploadUrl config is lost on restart if no publish event occured
-add a saveMySeed() on uploadurl changes (to keep url setting without retyping even if network down)
2014-08-21 01:26:42 +02:00
reger
e033e79826 remove old description for proxy port settings (Settings_p.html?page=ProxyAccess)
- The options were not current (only port number accepted, which is part of ConfigBasic.html)
- Deleted options and the port number input field from the proxyaccess page.
- joined both transparent proxy setup pages (Settings_Http.inc & Settings_ProxyAccess.inc) in one page
- adjustments to the related/linked pages
2014-08-20 22:45:36 +02:00
orbiter
e4e1bdeba0 added 0x40 to image of lockopen-gif image palette (light grey) 2014-08-20 00:55:31 +02:00
orbiter
7028a39abb changed lock/unlock image design 2014-08-20 00:48:48 +02:00
orbiter
b4f2a1db6e added a unlock icon for all protected pages that are unlocked because
the administrator is logged in.
2014-08-19 19:58:31 +02:00
reger
7267c76881 set default "Search Interfaces"."Solr RSS/Opensearch" query to show latest 10 addition to index 2014-08-18 02:53:03 +02:00
reger
f76d81f5c9 fix: hanging text in input fields of WatchWebStructure_p.html
in IE11
2014-08-16 03:37:36 +02:00
orbiter
cf9e7fdbb8 reverted template from latest cherry-picked commit 2014-08-15 00:06:22 +02:00
Alex
f6c7467a90 updated some french translations 2014-08-15 00:04:10 +02:00
reger
19e35a9126 add type attribute to atom feed <link> tag
(for /yacysearch.atom)
2014-08-11 21:46:39 +02:00
reger
0a2f4a0e2f eliminate lat/lon type conversion in osm
(define as double)
2014-08-10 22:35:25 +02:00
Michael Peter Christen
01bbb20666 increased default logging line count to max 2014-08-06 12:40:35 +02:00
Michael Peter Christen
9bc3e457dd fix for termination of all crawls 2014-08-05 22:23:52 +02:00
Michael Peter Christen
8d650ca225 added hint to port forwarding videos 2014-08-05 21:31:28 +02:00
reger
3963bca3b6 catch IndexControlRWIs_p error if RWI not connected 2014-08-04 00:03:42 +02:00
orbiter
2371d6b8db target linktexts must be string to enable search facets on these fields 2014-08-01 13:20:25 +02:00
Michael Peter Christen
05d58e4df0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-08-01 12:04:25 +02:00
Michael Peter Christen
98f45c9032 fix for image alt attachment to AnchorURLs in html parser. 2014-08-01 12:04:15 +02:00
orbiter
22ce4fb4dd better error handling for remote solr queries and exists-checks 2014-08-01 11:00:10 +02:00
orbiter
161a11070c yacystats is gone :( 2014-07-29 11:12:01 +02:00
Michael Peter Christen
c115f3869c enhanced snippet computation and test method in ViewFile 2014-07-28 15:42:57 +02:00
Michael Peter Christen
6e1dc444c3 added a snippet test function in ViewFile: you can now search for a
specific word on the document; the servlet returns the snippet in the
same way as it would be shown in a search result.
2014-07-24 14:59:37 +02:00
reger
29d1945c16 fix double &query parameter (index.html)
?query=word&query=
2014-07-22 21:54:46 +02:00
Michael Peter Christen
542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be
filled with the date, when the url is recognized as to be outdated. That
field was partly misinterpreted and the time interval was filled in. In
case that all the urls which are in the index shall be treated as
outdated, the field is filled now with Long.MAX_VALUE because then all
crawl dates are before that date and therefore outdated.
2014-07-22 00:23:17 +02:00
reger
7f0e757bb5 fix bookmark.rss
- channel end tag postion
- link with html entity
2014-07-21 19:26:12 +02:00
orbiter
e441831a24 reverted toString() change in AnchorURL to prevent mistakenly used
toString(). This fixes also the update link bug.
2014-07-21 15:58:29 +02:00
reger
697b9743e7 Add link to RemoteCrawl_p
suggestion http://mantis.tokeek.de/view.php?id=277
2014-07-21 02:00:05 +02:00
reger
47f201a6b8 Add Solr default query fields (&qf) to select servlet
according to the ranking profiles boost fields defined by the peer (if df/qf is not specified in query).
This allows for pretty simple queries ( q=word) without the need to know about the specific index configuration.
Making sure all relevant fields (as determined by the index owner) are searched, still maintaining the option to query specific fields
and does not relay on the duplication of text to text_t.
- add author to reset-default boost fields (support results for author nav)
2014-07-21 00:47:14 +02:00
reger
8004cfc961 fix input boostfield factor of 0.0 in RankingSolr
- input was accepted and stored but not editeable (added check factor >0.0 during edit)
- make use of some more predefined solr constants
2014-07-20 12:28:59 +02:00
reger
a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
- with search modifier /heuristic a request is send to all configured opensearch target systems (old /heuristic/blekko modifier not longer valid)
- this allows to use opensearch heuristic on individual search request (in contrast to configuration HEURISTIC_OPENSEARCH=true which sends a osd request on all global searches
- the index.html searchoption text adjusted to be displayed only if option configured
- add Archive-It to predefined systems
2014-07-20 00:00:43 +02:00
Michael Peter Christen
2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
attribute in the <a> tag for each crawl. This introduces a lot of
changes because it extends the usage of the AnchorURL Object type which
now also has a different toString method that the underlying
DigestURL.toString. It is therefore not advised to use .toString at all
for urls, just just toNormalform(false) instead.
2014-07-18 12:43:01 +02:00
Michael Peter Christen
87f8118108 added option to delete documents from the webgraph 2014-07-16 16:04:19 +02:00
Michael Peter Christen
32a2ff925c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-07-16 14:58:27 +02:00
Michael Peter Christen
d07cdd8c3b added SolrCloud access mode and configuration 2014-07-16 14:57:51 +02:00
Michael Peter Christen
8514bffc22 enhanced postprocessing status report 2014-07-16 14:57:25 +02:00
reger
f99f3d5cf2 fix button (clear list) text color in CrawlResults 2014-07-13 00:48:50 +02:00
Michael Peter Christen
b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
with metadata retrieval from connectors directly. This should cause
better usage of the cache. Automatically increase the metadata cache if
more memory is available.
2014-07-11 19:52:25 +02:00
Michael Peter Christen
62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid
double-calling of solr
2014-07-11 18:36:04 +02:00
orbiter
dab9a0786a Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-07-11 04:04:34 +02:00
orbiter
51bf5c85b0 Renamed the transmission cloud to buffer in dispatcher since the name
'cloud' was a bad idea. Changed also the accumulation process for peer
targets so that every dht chunk is not assigned the set of redundant
targets but they are assigned to redundant targets individually. This
enhances the granularity of the target accumulation and should enhance
the efficiency of the process. Finally the dht protocol client was
enriched with the ability to remove the 'accept remote index' flag from
peers or remove peers completely if they do not answer at all.
2014-07-11 04:04:09 +02:00
reger
7057e0b3e2 catch input file not found in Mediawiki import 2014-07-10 23:58:47 +02:00
Michael Peter Christen
f384fd624b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-07-07 11:11:50 +02:00
reger
ba5a59a28d make search result also avail. as atom feed via /yacysearch.atom
- fix logo in rss feed
2014-07-03 22:01:13 +02:00
orbiter
59160984cc timeline performance update 2014-07-03 13:06:29 +02:00
orbiter
54bea96e67 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-07-02 23:23:34 +02:00
Michael Peter Christen
15b2fad6a2 reverted latest change for reindexing because that works actually only
for internal Solr indexes. This is mainly caused by the fact that an
external Solr may be also a SolrCloud which do not support LukeRequests,
which are needed to request the old Schema.
2014-07-02 14:56:34 +02:00
Michael Peter Christen
841cc77391 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-07-02 14:35:02 +02:00
Michael Peter Christen
e09218129c remove check for local solr. This check was made during a time when Solr
was optional and another alternative metadata store was available. Since
that store is now removed, Solr is always available (internally or
externally)
2014-07-02 14:34:48 +02:00
orbiter
2073e69034 fix for long periods in timeline 2014-07-02 11:29:50 +02:00
reger
1f94df29e7 fix NPE in solr rss where snippet contains only the title text
and adjusted xslt, for solr snippets (&hl=true) to decode the xml encoded html <b> tag by adding disable-output-escaping
(still open item description may be double as dc: tag and rss.description tag)
2014-07-01 23:24:26 +02:00
Michael Peter Christen
8c52f0651b refactoring of AccessTracker events & timeline fix 2014-07-01 16:06:01 +02:00
Michael Peter Christen
1b279d7a7e fixed external link 2014-06-27 15:12:53 +02:00
Michael Peter Christen
74206a10c7 refactoring 2014-06-27 14:40:36 +02:00
Michael Peter Christen
36e623d8bf enhanced metadata enrichment for media file type search:
- Web servers may now deliver YaCy-specific http header field with a
title and keywords. The new http header fields are:
X-YaCy-Media-Title - to be used for media (image, audio, video) titles
X-YaCy-Media-Keywords - to be used for media (image, audio, video)
keywords
- both fields are written to document fields title and keywords and are
searched also during image search.
- to make the usage of arbitrary http header fields (including this new
fields) possible in the /api/push_p.json servlet, a new POST argument is
also introduced to push http header fields. The new POST attribute is
named "responseHeader-X" (where X is the counter). It is allowed to use
this attribute as multi-attribute several times, each can be filled with
a http header line.
- see /api/push_p.html for examples
2014-06-26 13:02:35 +02:00
reger
a88ea14e09 harmonize use of style for "delete" button
- apply the monstly used btn-danger class
2014-06-22 23:33:59 +02:00
Michael Peter Christen
8fd72b5e8b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-06-20 13:57:06 +02:00
Michael Peter Christen
81d0f01a6f added 'synchronous' and 'commit' flags in push api 2014-06-20 13:56:55 +02:00
reger
5043eff33a move page navigation below results (image search)
force page navigation to be displayed below results in image search for any number of displayed images instead to be displayed to the right of last image.
2014-06-20 01:02:43 +02:00
Marc Nause
f443cfa32d Improvements and bugfixes for recording actions of blacklist API. 2014-06-17 22:54:47 +02:00
Michael Peter Christen
0ba6b98d5b fix for broken json 2014-06-17 11:36:20 +02:00
orbiter
4177c9cf05 fix for crawl start check 2014-06-15 22:50:04 +02:00
orbiter
0bbb5040b8 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-06-15 12:38:52 +02:00
orbiter
9d5d86cd03 Added filter query options to the ranking servlet /RankingSolr_p.html.
Filter queries are not actually related to ranking, but user requests
have pointed out that specific boost queries to move results to the end
of the result list are not sufficient. Such boost filters may be better
executed as actual filter and therefore such a filter can now be
statically applied to every search request. A typical use could be the
expression "http_unique_b:true AND www_unique_b:true" which uses the
recently introduced fields http_unique_b and www_unique_b which are true
only for one of the alternatives with/without http(s) and with/without
prefix 'www.' in host names.
2014-06-15 12:38:30 +02:00
Michael Peter Christen
d2151857f1 Added collection navigation:
The collection field (can be filled i.e. in Crawl Start) can be used to
add categories to YaCy index entries. The usage of that field was
restricted to solr searches and post argument filters as implemented in
commit f7571386a3.
This commit extends collections to a full navigation option in the
standard YaCy search interface. The field is not active by default but
can be activated easily in the /ConfigSearchPage_p.html servlet (just
check the 'Collection' facet field). Collections can now be used for (at
least) two purposes:
- to provide search tenants (through post argument collection)
- to provide self-made category navigation
Search requests may now have (independently from switched on or off
collection facet) a "collection:<collection-name>" modifier attached;
firthermore collection names may use disjunctions using the '|' pipe
symbol. For example, this is a valid search request:
www collection:user|proxy
2014-06-15 12:11:23 +02:00
Michael Peter Christen
74c249288a added a push api to make it possible to upload files directly without
crawling to the YaCy indexer. Files are uploaded using POST multipart
requests; multiple file uploads are possible as well. Each file has
attached the file date and mime type which is used to get the right
parser for the submitted data. Also an url is submitted which is
assigned to the document.
The CrawlSwitchboard has a new option for default Crawl Profiles which
are assigned dynamically from the new push interface.
2014-06-12 18:10:07 +02:00
reger
c798a9d1bb fix unresolved pattern in yacysearch.rss title
and rss xml error due to html & encoding in url entries
2014-06-07 03:01:26 +02:00
Michael Peter Christen
e64be5dcad in case that the network is switched to any other than freeworld, RWIs
are disabled. This is a temporary fix. There must be a better way to
determine if RWIs are to be switched on or of.
2014-06-04 13:59:37 +02:00
Michael Peter Christen
87f171675b doing index deletions using a get string which makes it easier to
copy-paste deletion examples (see: #EuGH :( )
2014-06-04 12:09:49 +02:00
Michael Peter Christen
a2f800cd8f fix for bad String conversion 2014-06-04 12:07:07 +02:00
Michael Peter Christen
b3b174e2b8 fixed webgraph postprocessing and status display in Crawler_p servlet 2014-06-02 15:06:38 +02:00
reger
7a52a6ba3f add links to port config in status panel
- pom upd to match javadoc location
2014-06-02 02:11:54 +02:00
reger
c3e40c82fe make https port setting changeable via front end somewhere
(chosen Http Networking page /Settings_p.html?page=http )
2014-06-01 03:15:38 +02:00
Michael Peter Christen
698f053658 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-06-01 01:02:12 +02:00
Michael Peter Christen
f23c4142e0 added option to configure a custom user agent within allip networks 2014-06-01 01:02:03 +02:00
reger
8e233e2eb4 - fix typo in Message_p (defaultpath)
- use more existing switchboardconstants for getproperties
- replace depriciated call defaultservlet
2014-06-01 00:20:25 +02:00
Michael Peter Christen
8ad41a882c fixed several problems with postprocessing:
- unique-postprocessing was destroying results from other
postprocessings; removed cross-updates as they had been not necessary
- unique-postprocessing did not restrict on same protocol
- inefficient concurrent update cache was redesigned completely
- increased limits for concurrent blocking queues to prevent early
time-out
2014-05-29 13:24:24 +02:00
Michael Peter Christen
640b684bb6 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-05-28 19:19:17 +02:00
Michael Peter Christen
2f5477ea59 a try to fix the mixed up terms 'Active' -> 'Senior' and 'Passive' ->
'Junior'
2014-05-28 18:48:54 +02:00
reger
ca5437dd50 fix crawl of file:// , also http://mantis.tokeek.de/view.php?id=149
local files can be crawled (intranet mode) url parsing fixed according to  RFC 1738 (for unix and windows)
for win like file:///c:/tmp   or file://localhost/c:/tmp
for linux like file:///tmp  or file://localhost/tmp
Host is ignored and path must be absolute
2014-05-28 03:01:34 +02:00
reger
66f6797f52 make config search page layout closer to actual page appearance 2014-05-25 01:06:39 +02:00
sixcooler
5b1c4ef191 Monitoring and limit connection-count for Jetty 2014-05-22 22:16:39 +02:00
orbiter
ce1dbfeb0f fix appearance of image search thumbnails. 2014-05-22 15:01:58 +02:00
orbiter
6daae59479 switch on core.service.rwi when switching back from portal mode to p2p
mode
2014-05-22 12:55:22 +02:00
Michael Peter Christen
f0db501630 better handling of ranking parameters and new default values for date
navigation which is done using ranking in solr.
2014-05-22 03:01:07 +02:00
Michael Peter Christen
2520590b45 migrated from pdfbox 1.8.4 to 1.8.5. They have a very long bugfix list
for that update:
http://www.apache.org/dist/pdfbox/1.8.5/RELEASE-NOTES.txt
2014-05-21 22:48:41 +02:00
Michael Peter Christen
6634b5b737 debug code for index distribution testing 2014-05-21 18:20:16 +02:00
Michael Peter Christen
89e13fa34e fixed bug in test function 2014-05-21 15:31:47 +02:00
Marc Nause
4723329e29 Improved blacklist XML/JSON API. 2014-05-19 20:51:43 +02:00
reger
f91b2f51ae fix: load_Rss remove feed to many parameter for get
use form post methode
2014-05-18 22:41:09 +02:00
orbiter
c028ae9b09 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 2014-05-18 21:21:17 +02:00
reger
e31493e139 "Use remote proxy for yacy" has no function, remove option and related config item
see/fix bug http://mantis.tokeek.de/view.php?id=23
http://mantis.tokeek.de/view.php?id=189
2014-05-17 23:36:59 +02:00
reger
89e2c5e884 fix: allow enable of CrawlStartExpert.html #file 2014-05-17 22:56:15 +02:00
reger
1b37b12998 fix: CrawlStartExpert.html # From File with missing filename
- crawlName must not be empty
- crawlingFile must not be empty
2014-05-17 21:34:23 +02:00
orbiter
0d8072aa99 removed warnings 2014-05-13 22:29:05 +02:00
orbiter
be7c99dbe8 switched menu position of ConfigPortal.html and ConfigSearchBox.html 2014-05-13 08:14:56 +02:00
Michael Peter Christen
a1ac4c3b76 automatically clear graphics cache 2014-05-12 15:45:25 +02:00
reger
f87ac716f3 improve IndexDeletion by query
adding transparently text_t as pseudo default search field if no fieldname (no  : ) is included.
adressing bug report  http://mantis.tokeek.de/view.php?id=274
2014-05-12 00:12:05 +02:00
reger
e9060d31bd update to Jetty 9
besides adjustments in code it makes the servlet settings in web.xml significant.
This applies to solr, gsa and proxy servlet. There is no longer a default setup in code during init (as jetty 9 checks for double definition).
2014-05-11 01:53:11 +02:00
orbiter
b9c1a61814 added a peername=<peername> property in the seedlist API 2014-05-08 07:41:40 +02:00
orbiter
c637955e67 fix for navigation steering / p2p mode
see also:
http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5198&p=29958#p29958
2014-05-06 05:58:51 +02:00
Marc Nause
f98ccf952f Improved Blacklist API:
*) added JSON support
*) fixed Exception in case of missing parameters
*) renamed parameter for items in "add entry" and "delete entry" from
"entry" to "item" to match term in XML
2014-05-05 23:16:01 +02:00
reger
91bd384cf6 fix input-group layout on index.html
see bug http://mantis.tokeek.de/view.php?id=391
2014-05-03 21:55:10 +02:00
Marc Nause
0d88f292dc Key for parameter "blacklist name" is "list" in all servlets now. 2014-05-02 14:18:52 +02:00
reger
80e0ee92e5 adjust search page layout - search box to current style 2014-05-02 01:15:03 +02:00
reger
a81dfc27eb remove obsolet css class bookmarkfieldset 2014-05-02 00:35:54 +02:00
Michael Peter Christen
0898f0be17 input-group for main search input window 2014-04-30 06:46:06 +02:00
Michael Peter Christen
9bb616d778 enhanced HostBrowser buttons and fixed text input alignment 2014-04-30 06:21:53 +02:00
Michael Peter Christen
4a818ad72c fix for strange fail reason 2014-04-30 05:14:01 +02:00
Michael Peter Christen
a2fba6584f use submitted default userAgent if cloning a crawl 2014-04-30 05:05:02 +02:00
Marc Nause
e0822fa008 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2014-04-30 00:48:55 +02:00
Marc Nause
c97da1a0d8 First draft of a blacklist API. 2014-04-30 00:48:38 +02:00
reger
312972c586 add display filter (active/disabled) to IndexSchema_p.html config
for easier overview of schema fields
2014-04-29 22:51:01 +02:00
Michael Peter Christen
d79d7dde55 fix for result display 2014-04-29 16:24:21 +02:00