Commit Graph

13538 Commits

Author SHA1 Message Date
luccioman
6f5e55c9f0 Updated the JavaScript license information page 2017-09-13 08:35:15 +02:00
luccioman
c7149acb48 Disabled as default verbose browser console logs in yacysort.js 2017-09-13 08:23:19 +02:00
luccioman
b50700c35f Added missing copyright header to the yacysort.js file 2017-09-13 08:16:29 +02:00
luccioman
86d41f0242 Moved the JS resort specific styling to the usual YaCy CSS location 2017-09-13 08:08:38 +02:00
luccioman
9e86d183b8 Disable manual search results resorting when resorting is done with JS
Also added a constant for the js resorting setting key.
2017-09-13 07:58:05 +02:00
luccioman
4ccd38357f Trigger js resorting animations using only CSS classes.
Also added some more descriptive comments.
2017-09-13 07:41:03 +02:00
luccioman
e40a225bc1 Merge branch 'javascript-resort' of https://github.com/Scarfmonster/yacy_search_server into jsResort 2017-09-13 07:29:58 +02:00
Ryszard Goń
2af011243f Javascript re-sorting: Remove potentially breaking display property and reset max-height when animation is finished. 2017-09-11 20:02:19 +02:00
Ryszard Goń
634f52fefc Javascript re-sorting: replace jQuery show() with css animations 2017-09-10 17:20:12 +02:00
luccioman
66cb9c4ff9 Added Solr filter queries for audio, video and application domains
Inspired from the existing one used on image search, and consistent with
post filtering on content domain applied in SearchEvent.addNodes().

These filters are quite simplistic but at least audio, video or
application search now return results. Previously, when filtering on
these content domains, many results pages (and often even the first
page) were empty while the total results count suggested that results
should be available. This was because filtering on domain was only
applied AFTER requesting Solr indexes.
2017-09-08 11:16:37 +02:00
luccioman
5d3ceb31b7 Improved search navigators counters accuracy and consistency.
- added some missing increments from RWI results
- decrement relevant navigator counts when solr or RWI results are
evicted because duplicates detection or constraints checked belatedly
- do not compute facets when unnecessary to avoid unwanted CPU load
- do not increment from facets when already done
- do not rely on facets on remote solr peers requests, as most of the
time only a limited part of their total results if fetched (thus also
preventing unnecessary load on remote peers)
- use a concurrency friendly score map for the dates navigators to
prevent unwanted ConcurrentModificationExceptions

This improves the situation for the most obvious inconsistencies in
search navigators counts, but more has to be done for a true accuracy
(notably when query modifiers constraints are applied belatedly - after
the solr or RWI retrieval request - such as the content domain
constraint)
2017-09-06 16:58:40 +02:00
reger
ba0ba75998 update master.lng, IndexExport_p.html text 2017-09-05 00:51:43 +02:00
JeremyRand
ab0e50b941
Javascript re-sorting: optimize the jQuery selectors a little bit. 2017-09-03 18:09:52 +00:00
JeremyRand
86b5094970
Fix numbered page navigation from getting corrupted when statistics() runs. 2017-09-03 18:09:51 +00:00
JeremyRand
a888254769
Add UI for numbered page navigation when Javascript re-sorting is enabled. 2017-09-03 18:09:44 +00:00
JeremyRand
74333c931e
Fix the sidebar item "Wiki Name Space" with Javascript re-sorting. 2017-09-03 17:50:17 +00:00
JeremyRand
4a9e64caea
(WIP) Add numbered page navigation when Javascript re-sorting is enabled.
TODO: Add UI for selecting the number.
2017-09-03 17:50:17 +00:00
JeremyRand
6ec256dc34
(WIP) Fix the sidebar when Javascript resorting is in use.
TODO: Add some markup so that DOM traversal in the animations is less painful.
2017-09-03 17:50:16 +00:00
JeremyRand
d37df75afa
(WIP) Optionally sort HTML search items via Javascript.
TODO: Expose a GUI setting for this.
2017-09-03 17:50:08 +00:00
JeremyRand
61be709a97
Add data-ranking attribute to each HTML search item. 2017-09-03 17:44:19 +00:00
luccioman
8e4f31bdc7 Updated internal ISO 639-1 language codes with latest standards.
Includes 54 language code additions, some name modifications, and
marking a few deprecated.
2017-09-02 09:53:38 +02:00
luccioman
a28428047a Fixed count of filtered results from local solr.
Was inadequately modified in my previous related commits (making next
pages buttons unavailable in Search portal mode), as
SearchEvent.local_solr_available did not count the total filtered
results but only the ones within the currently fetched result page(s).
2017-08-31 11:24:59 +02:00
luccioman
30c2f50e0b Use final results counts in progress bar detailed statistics.
Using unfiltered detailed counts (local and remote entries found before
doubles detection and before applying query modifiers) was confusing and
inconsistent with the total count. It could let think more results are
to come in the next pages, without understanding why they are not
displayed.
2017-08-31 07:37:24 +02:00
Michael Peter Christen
2f71005a93 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git 2017-08-30 23:51:44 +02:00
Michael Peter Christen
2314f8e358 try to fix problem
with error description
http://forum.yacy-websuche.de/viewtopic.php?f=5&t=6023&p=33889&sid=37bc7aa029422be571b9266cdef43c52#p33889
2017-08-30 23:50:14 +02:00
luccioman
3c9df6e0ce Use local solr filtered results in total search results count.
This modification has indeed low incidence as eventual query modifiers
are already applied when requesting the local solr index. 
It mainly impact doublons detected with results from remote peers.

Also updated javadocs for clarification.
2017-08-30 12:23:45 +02:00
luccioman
8b25b485eb Make result action links visible when focusing them with keyboard. 2017-08-29 08:16:12 +02:00
luccioman
3e933979df Removed duplicate HTML class attribute. 2017-08-29 07:39:12 +02:00
luccioman
ce22076920 Fixed Unresolved_Pattern occurence on results favicon HTML id. 2017-08-29 07:32:33 +02:00
luccioman
a1a0515312 Added a button to manually refresh sorting of p2p search results.
As a server-side oriented alternative to the JavaScript realtime
resorting feature proposed in PR #104.
The goal is the same as in this PR : having the possibility compensate
the network latency of various peers results fetching and obtain once
possible a consistently ranked result set.
2017-08-28 19:03:51 +02:00
reger
31c99b2a8a update master.lng, RankingSolr_p.html text 2017-08-27 04:22:39 +02:00
luccioman
4eba88f2ff Removed some unnecessary uses of java.lang.reflect api.
This improves code browsing and readability, making search by references
or call hierarchy IDE features more accurate.
2017-08-24 18:47:18 +02:00
luccioman
da3dbf9ea1 Use Javadoc style comments on SearchEvent properties.
For better code readability and understanding.
2017-08-23 08:20:37 +02:00
luccioman
c6ae87168a Added unit tests on the gzip parser. 2017-08-22 14:13:00 +02:00
luccioman
169ffdd1c7 Finer control on max links to parse in the html parser. 2017-08-22 14:11:35 +02:00
luccioman
4743a104b5 Added some unit tests on FileUtils. 2017-08-22 14:06:09 +02:00
luccioman
e41d046a9d Improved parsing support for OOXML spreadsheets (.xlsx)
As reported edycop in mantis 765 (
http://mantis.tokeek.de/view.php?id=765 ), parsing of xlsx files was
quite incomplete.
Now properly support "Shared String Table" entry in Office Open XML
spreadsheets, an also detect embedded URLs.

Integrating the Apache poi-ooxml library could be an option for finer
OOXML formats support, but their SAX style parsing example (
http://poi.apache.org/spreadsheet/how-to.html#xssf_sax_api ) tends to
show that a custom SAX handler is still efficient for lightweight and
low memory footprint processing.
2017-08-21 09:38:20 +02:00
reger
51a4e03c93 Allow to stop currently running warc import (stop button) 2017-08-20 22:17:27 +02:00
luccioman
6cec2cdcb5 Use unredirected robots.txt URL when adding an entry to the table. 2017-08-16 14:21:07 +02:00
luccioman
3f0446f14b Ensure proper synchronous robots entry retrieval on first check.
Previously, when checking for the first time the robots.txt policy on a
unknown host (not cached in the robots table), result was always empty
in the /getpageinfo_p.xml api and in the /CrawlCheck_p.html page. Next
calls returned however the correct information.
2017-08-16 09:30:33 +02:00
luccioman
9da75ac76d Upgraded Docker base image from deprecated java to openjdk. 2017-08-15 21:04:36 +02:00
luccioman
b23a563065 Prevent search result failure on incomplete images information.
Complements the recent modification related to images in commit 7f395ef.

Unfortunately many documents metadata fetched from the freeworld p2p
network have only partial information about embedded images. Without
proper error handling, this made many searches in p2p mode to fail
completely.
2017-08-15 10:11:05 +02:00
Michael Peter Christen
30d71c6359 added usage of X-Real-IP http header
to identify request IPs which came through NGINX reverse proxy
configurations
2017-08-15 07:16:01 +02:00
Michael Peter Christen
f45378c11c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git 2017-08-14 20:12:26 +02:00
Michael Peter Christen
7f395ef937 added image link in search results
This should be a help to make a preview of search results.
The image is computed from the list of embedded images, it is
always the first image in that list.
In rss-type results the image is presented like
<media:content medium="image" url="https://abc.xyz/logo.png"/>
as defined in
http://www.rssboard.org/media-rss#media-content
2017-08-14 20:12:09 +02:00
luccioman
780173008e Implemented partial stream parsing of tar archives.
Also added JUnit tests for the tar parser and fixed unwanted use of the
tar parser as a fallback on files included in a tar archive.
2017-08-14 14:57:58 +02:00
luccioman
acab6a6def Also handle text content when parsing XML within limits. 2017-08-14 14:47:01 +02:00
reger
f38fb7f02c Add junit test for AbstractOperations.addOperand() 2017-08-14 02:16:43 +02:00
reger
2a07799ad1 Correction of d03e2c98ea
Fix Conjunction.addOperator to do nothing if term is empty
prevent to result in query string with repeated logical operator
like "field:term AND AND field:term"
possibliy causing out of mem in postprocessing_doublecontent
2017-08-14 01:03:15 +02:00
reger
d03e2c98ea Fix Conjunction.addOperator to do nothing if term is empty
prevent to result in query string with repeated logical operator
like "field:term AND AND field:term"
possibliy causing out of mem in postprocessing_doublecontent
2017-08-14 00:52:03 +02:00