Commit Graph

5607 Commits

Author SHA1 Message Date
luccioman
d9766ca981 Fixed WatchWebStructure_p.html render to include https URLs.
As described in mantis 721 (http://mantis.tokeek.de/view.php?id=721)
WatchWebStructure_p.html failed to include in its structure view https
and other protocols and ports than default http.
2017-01-16 18:41:58 +01:00
luccioman
ed3dd5e31a Fixed webstructure.xml API used with a domain name 'about' parameter.
As described in mantis 720 (http://mantis.tokeek.de/view.php?id=720),
when requesting this API with a domain name instead of a complete URL
only HTTP references on default port were listed.
2017-01-16 16:41:06 +01:00
luccioman
0da1e6ba16 Factored code re-implementing DigestURL.hosthash() method.
This ensure consistent implementation of the url host hash generation
and easier usage finding in source code.

Also added a unit test for this function.
2017-01-16 10:18:42 +01:00
luccioman
f793d97e56 Factored common code with DigestURL.hosthash() 2017-01-13 16:05:46 +01:00
luccioman
9cea7cbb10 Detailed some Javadoc related to /api/webstructure.xml usage. 2017-01-12 17:52:47 +01:00
reger
007e2afa6e Start to rename "Augmented Browsing" to "Web Proxy ..." / "View via Proxy"
The augmented Browsing option was reduced to the web proxy functionallity.
Augmented browsing is not available and no known plan exist to reimplement
alteration of result pages with additional information.
2017-01-12 01:36:30 +01:00
luccioman
339f005ced Blacklist import and update performance improvements.
Measurement sample : import from blacklist local file containing about
15000 entries
 - before refactoring : several minutes
 - after refactoring : a few seconds!
2017-01-06 12:24:31 +01:00
luccioman
e3892b0957 Added some JavaDoc. 2017-01-06 11:23:40 +01:00
luccioman
52d05d14c6 Display result favicons only for http or https resources.
Favicon display only makes sense for http(s) websites, being public or
intranet. So I modified the favicon conditional display to verify the
result URL protocol rather than if we are in intranet mode.

Also prevented rendering an img HTML tag with empty src on other results
protocols such as ftp or file.

Fixing this thanks to priest2 report
(http://forum.yacy-websuche.de/viewtopic.php?f=23&t=5923).
2017-01-06 09:00:28 +01:00
luccioman
b154d3eb87 Added descriptive titles to Crawler_p.html speed settings.
As reported by bubul
(http://forum.yacy-websuche.de/viewtopic.php?f=23&t=5924) , LF and MH
acronyms meaning were not detailed.
Also added label tags for improved accessibility on these input fields.
2017-01-05 14:54:59 +01:00
reger
68d4dc5cc5 Complete harmonization RequestHeader getCookie with std ServletRequest
to use javax.servlet.http.Cookie parameters.
Depreciate now obsolete getHeaderCookies.
Adjust setting of MaxAge to spec if >= 0 otherwise keep default.
2017-01-02 03:04:21 +01:00
reger
396ed3c769 On negative result vote also delete document from fulltext index
(not only from dht)
2017-01-01 23:58:38 +01:00
reger
f153cc4b5d add/allow to create a bookmark of result viewed via urlproxy.
For this on the header of the viewed result a "add bookmark" button is
available (for authenticated users).
Currently the bookmark is added to a (virtual) bookmark folder "/proxy"
w/o any additional tags etc.
2016-12-23 19:03:44 +01:00
reger
7bf2bcf504 fix and prevent exception on missing required cookie name
skip cookie creation if name is empty.
2016-12-22 19:52:38 +01:00
luccioman
128c8ef8d4 Fixed title rendering having non ASCII chars in QuickCrawlLink_p.html. 2016-12-21 08:19:09 +01:00
luccioman
ee6933c004 Added a title on the previous and next page pagination buttons.
This is to clarify the meaning of these buttons for users who could
think they link respectively to the first and last results page.
2016-12-21 07:22:41 +01:00
reger
8eb6fba59c activate filetype navigator plugin and restrict config (append) of navs
to not already actives.
Dht results are now included in count this might over shoot on redundant
dht and solr, while the previous solr facet based was always low.
2016-12-21 02:04:13 +01:00
luccioman
c25e48e969 Enabled displaying results after 14th page for local search queries.
Fixes issue #90 for local queries only: Stealth mode, Portal mode or
Intranet mode. 
For P2p mode, the issue would probably be difficult to solve with
reasonable performance. This is still to dig.

Also switched some InterreputedException catch log messages to warn
level as this is normal behavior when shutting down a peer.

Fixed yacysearch buttons navbar behavior to deal correctly with total
results count or offset over 1000. Also improved the buttons navbar to
be able to navigate over 10th page for local queries.
2016-12-20 14:52:33 +01:00
reger
6be9d62ab4 show earthsearch.png in ConfigSearchPage layout on activated location
navigator (for more realistic impression)
2016-12-20 02:06:43 +01:00
reger
c50e23c495 reduce creation of empty legacy RequestHeader() in situation where null
is acceptable (less for garbage collection).
2016-12-18 02:38:43 +01:00
reger
193b2ab1fc reduce redundant declaration for simple date formatter
using predefined GenericFormatter.SIMPLE_FORMATTER
2016-12-17 23:29:57 +01:00
reger
38d676c7e4 use GenericFormatter SimpleDate for sortable column in table_API
to allow correct chronological sorting (of the date string)
fix for http://mantis.tokeek.de/view.php?id=585
2016-12-17 21:44:09 +01:00
reger
c702eb6786 del dead menu link to /repository
(directory not created in current distribution -> old)
2016-12-17 02:38:52 +01:00
luccioman
467650c042 Hardened system update checks.
When a downloaded archive release is corrupted, empty, or can not be
opened for any reason, the update script must not be launched because it
erases the existing lib/*.jar libraries.
2016-12-16 11:03:09 +01:00
luccioman
00e81fcc15 Check HTTP status when downloading a release, and report eventual error. 2016-12-15 15:30:36 +01:00
reger
8e2cef5f07 allow protocol navigator to be unselected if only one button is shown
after activating navi/facetfilter
2016-12-15 00:45:08 +01:00
luccioman
437e535e5c Fixed admin navbar rendering at various screen sizes.
Fix mantis 443 (http://mantis.tokeek.de/view.php?id=443).

Tested on :
 - Debian jessie : Firefox ESR 45.5.1
 - MS Windows 10 : Firefox 50.1.0, Chrome 55 and Edge
 - Emulated devices/adaptative views embedded in the previously
mentionned browsers
2016-12-14 12:49:41 +01:00
luccioman
b90730f956 Fixed locations search navbar overlapping issues.
This is similar to the main yacysearch navbar issues described in mantis
708 (http://mantis.tokeek.de/view.php?id=708)
2016-12-13 16:50:24 +01:00
luccioman
0714b06038 Fixed resource switch button overlapping at various screen sizes.
Fixes second part of mantis 708
(http://mantis.tokeek.de/view.php?id=708)

The bootstrap-switch component has some sizing issues with long labels,
which are not likely to be solved soon due to a lack of resources on
that project (see issue
https://github.com/nostalgiaz/bootstrap-switch/issues/419 )

This fix works by applying the following ideas :
 - labels are long, so font-size and padding are reduced on small screen
sizes using a media query
 - use relative percent width values on the component wrappers to
prevent overlapping on the neighbour content
 - disable animation because it relies on absolute pixels width values
2016-12-13 15:33:18 +01:00
luccioman
848bfc240c Fixed YaCy logo (no external mark) for the refactored navbar search.
Thanks to reger24 feedback.
2016-12-12 12:55:20 +01:00
reger
8acdc5443b prepare ConfigSearchPage servlet to append and remove navigator plugins,
keeping order of added nav's.
The search page preview template displays active navs. Therefore a select
and add button has been added below the preview (to keep it close to actual).
This should in future likely be done by drag&drop (html5 feature).
2016-12-12 02:29:15 +01:00
reger
b32bcdf344 list entries in outgoing cookie monitor one per line
for easier readability.
For this adjust outgoingCookies entry to use Cookie[] instead of String[]
2016-12-10 22:08:09 +01:00
luccioman
f37a86e1c6 Fixed yacy search navabar header overlapping at various screen sizes.
- using a icon-only admin button at small and medium screen size
- using a icon-only "Search Interfaces" button at small screen size
- hiding the YaCy brand at extra-small screen size

Fixes the header part of mantis 708
(http://mantis.tokeek.de/view.php?id=708).

Navigator button overlapping is still to fix.
2016-12-09 11:25:09 +01:00
reger
77e65016c0 use more availabe SwitchboardConstans in ProxyIndexingMonitor_p
(to easily find usage)
2016-12-07 00:39:53 +01:00
luccioman
8146b97e9b Added a unit after the vocabulary size value for easier understanding. 2016-12-05 10:58:23 +01:00
reger
de33c7e765 replace one more arbitrary CONNECTION_PROP_CLIENTIP header with std.
getRemoteAddr()
2016-12-05 00:11:03 +01:00
reger
14e73f5b9b use bootstrap button style in MessageSend_p.html
and align buttons with form
2016-12-04 22:26:02 +01:00
reger
65871d28b2 skip comparing "xxxxx" on missing authorization header in Blog servlet 2016-12-04 22:11:22 +01:00
reger
82512613f5 fix unresolved pattern in ConfiLanguage drop down list 2016-12-03 01:13:47 +01:00
luccioman
0b4e7795df Fixed JavaScript error "hs.htmlExpand is not a function".
This error occurs on /ConfigSearchPage_p.html and on search results page
when Metadata links are enabled.

The fix was to remove unnecessary use of hs.htmlExpand() which is now
part of highslide-full.js library file, currently not distributed with
YaCy (only includes highslide.js). The Metadata links work correctly and
the initial dynamic expansion offered by htmlExpand() did not bring much
usability.
2016-11-29 02:56:43 +01:00
luccioman
1f4f0eacc2 Fixed a JS undefined error case, occurring when search field is empty. 2016-11-29 02:11:44 +01:00
luccioman
ceb7588880 Converted "clone" URL links in Table_API_p.html to purely relative ones.
Again for easier YaCy integration when running behind a reverse proxy
subfolder.
2016-11-29 01:34:33 +01:00
luccioman
cca3417b87 Fixed image and favicon viewing for unauthenticated local requests.
As reported by @reger24, image and favicon viewing was broken with
unauthenticated requests on peers configured to require authentication
even from localhost.

So I unified viewing rights check in a single new function on
ImageViewer class.
2016-11-28 22:10:05 +01:00
reger
02092de3d8 remove login cookie generation for static admin ind User servlet
cookieAuth is never successful for static admin, leaving the creation and
handling for login cookies for static admin obsolete.
2016-11-26 23:28:30 +01:00
reger
49f19aff75 exclude external link icon in Collage servlet
(icons display not close to image in IE)
2016-11-26 19:53:00 +01:00
reger
a0705c049d include check to prevent adding username identical with static admin
in ConfigAccounts_p
2016-11-26 18:26:14 +01:00
luccioman
89017e17e4 Converted ajax URL to relative and added a check on the response status.
This makes YaCy easier to configure when running behind a reverse Proxy.

The check on status avoids trying to update the page with error text
content when the server returned a 404 or 500 error message for example.
2016-11-25 11:13:16 +01:00
reger
8e3e3ed191 update the older ResponseHeader patch to handle cookies,
to work directly with javax.servlet.http.Cookie (rename headerProps to
cookieStore as is only used for this).
(Re)implement set-cookie in DefaultServlet to make cookieAuthentication
work as designed.
2016-11-25 02:00:20 +01:00
luccioman
aa9ddf3c23 Added control over Robots.txt active threads maximum number.
When starting a crawl from a file containing thousands of links,
configuration setting "crawler.MaxActiveThreads" is effective to prevent
saturating the system with too many outgoing HTTP connections threads
launched by the crawler.
But robots.txt was not affected by this setting and was indefinitely
increasing the number of concurrently loading threads until most ot the
connections timed out.

To improve performance control, added a pool of threads for Robots.txt,
consistently used in its ensureExist() and massCrawlCheck() methods.
The Robots.txt threads pool max size can now be configured in the
/PerformanceQueus_p.html page, or with the new
"robots.txt.MaxActiveThreads" setting, initialized with the same default
value as the crawler.
2016-11-23 18:13:05 +01:00
reger
baf6d21cfe ConfigSearchPage, move protocol navi up to better simulate actual design.
Because here btn-group-justified screws up table column width (Explorer
and Firefox) bootstrap btn-group is used.
2016-11-23 01:33:01 +01:00