Commit Graph

2197 Commits

Author SHA1 Message Date
lotus
029e16b653 replaced some put(String, String) by putHTML(String, String) on serverObjects respond
in htroot/ root
didn't touch htroot/xml/
this should solve potential xss issues

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5184 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-19 11:45:11 +00:00
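
Note on the commit above: the idea of routing servlet output through an escaping call can be illustrated with a minimal sketch. The class and method names below are hypothetical and this is not YaCy's actual putHTML implementation; it only shows the general technique of replacing HTML-significant characters before a value is written into a page.

```java
// Hypothetical helper for illustration only; not taken from the YaCy source.
final class HtmlEscapeSketch {

    /** Replaces the five characters that are significant in HTML with entities. */
    static String escapeHtml(String raw) {
        StringBuilder out = new StringBuilder(raw.length() + 16);
        for (int i = 0; i < raw.length(); i++) {
            char c = raw.charAt(i);
            switch (c) {
                case '&':  out.append("&amp;");  break;
                case '<':  out.append("&lt;");   break;
                case '>':  out.append("&gt;");   break;
                case '"':  out.append("&quot;"); break;
                case '\'': out.append("&#39;");  break;
                default:   out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // A value taken from a request parameter is escaped before it is echoed.
        System.out.println(escapeHtml("<script>alert(\"x\")</script>"));
    }
}
```

Escaping at the point where a value enters the response, rather than at each template, keeps the rule in one place, which is presumably why the commit swaps the call rather than patching individual pages.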
apfelmaennchen
5b2a57bfd0 - /xml/util/getpageinfo_p.xml added <desc> and <lang> tags
- changed htmlFilterContentScraper.getKeywords() to split on either the space or the comma character, not both

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5183 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-18 21:01:23 +00:00
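
Note on the commit above: one possible reading of "either space or comma, not both" is that a keywords value containing a comma is treated as comma-separated, and only otherwise split on whitespace. The sketch below implements that reading; the rule and all names are assumptions and do not reproduce htmlFilterContentScraper.getKeywords().

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; the separator rule is an assumption, not YaCy's code.
final class KeywordSplitSketch {

    /** Splits a meta-keywords value on commas if any are present, else on whitespace. */
    static List<String> splitKeywords(String content) {
        // Assumption: a comma anywhere means the whole list is comma-separated,
        // so multi-word keywords such as "search engine" stay intact.
        String separator = content.indexOf(',') >= 0 ? "," : "\\s+";
        List<String> keywords = new ArrayList<>();
        for (String part : content.split(separator)) {
            String keyword = part.trim();
            if (!keyword.isEmpty()) {
                keywords.add(keyword);
            }
        }
        return keywords;
    }

    public static void main(String[] args) {
        System.out.println(splitKeywords("peer to peer, search engine"));
        System.out.println(splitKeywords("yacy p2p search"));
    }
}
```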
orbiter
ea5de7436d added Sciencenet to the compare search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5178 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-18 10:56:18 +00:00
orbiter
1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5176 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-17 21:45:21 +00:00
orbiter
ddcf285499 - fixed a bug in performance setting (did not work with german translation)
- reduced maximum number of error url references to save some memory (this was actually a small memory leak)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5174 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-16 23:04:24 +00:00
orbiter
0cd0fee546 fixed bug with wrong proxy result enqueueing. See:
http://forum.yacy-websuche.de/viewtopic.php?p=8130#p8130
- removed the online status property. This influenced the proxy behavior and created some complexity that was not needed, because the online status was never used for what it was created for (offline browsing)
- checked all proxy identification procedures during crawling and enhanced transparency and error checking
- fixed a proxy identification routine that caused the wrong selection of the proxy result queue

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5173 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-16 21:56:23 +00:00
apfelmaennchen
e071f759d2 YaCy-UI: small optical changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5172 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-16 21:39:14 +00:00
apfelmaennchen
bbacf86fe8 - added /xml/bookmarks/posts/add_p.xml
- security fix to /xml/bookmarks/posts/delete_p.xml
- YaCy-UI: added 'add' and 'delete' bookmarks

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5171 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-16 21:38:13 +00:00
apfelmaennchen
cd1ac5bb90 - fixed security issue with /xml/util/ynetSearch.xml
- hopefully fixed YaCy-UI local search with async=false for Ajax-request

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5170 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-16 05:55:31 +00:00
orbiter
c73cf05ddd tried to fix local search in yacy-ui
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5169 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 21:56:53 +00:00
apfelmaennchen
99ff478d63 YaCy-UI: small optical fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5167 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 20:30:33 +00:00
orbiter
7e24c51fd5 - removed alternative search page in main menu in favor of rich client search page
- added necessary option to search request of yacy-ui to get snippets

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5165 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 19:00:22 +00:00
apfelmaennchen
7c5867a832 Major update to YaCy-UI
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5162 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 17:18:07 +00:00
lotus
73f233bb11 * set resource observer to 1000MB
* transparent favicon

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5160 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 12:41:27 +00:00
orbiter
1be24158a2 small fix / rendering option
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5159 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 10:22:05 +00:00
orbiter
105be67998 - some bugfixes to compare search
- redesigned input boxes: smaller, more space for result page

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5158 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 09:43:51 +00:00
orbiter
693fa2a157 - renamed Comparison to compare_yacy
- added more search engines
- some refactoring and added a list that is used to present the search engine list in a specific order
- added simpleheader and no-header options
- added the compare search to the simple header
- added default compare search page selection storage - after re-start you get the same default search engines as you selected before

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5157 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-15 09:17:05 +00:00
lotus
6216105ca5 small fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5150 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-14 18:12:26 +00:00
lotus
5e5178b5e8 please use putHTML to avoid XSS
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5149 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-14 18:08:39 +00:00
low012
b33a6cbb77 *) less disturbing elements in yacy frame
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5148 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-14 17:58:08 +00:00
f1ori
98d902b972 * remember last searchwords in Comparison_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5147 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-14 15:21:57 +00:00
orbiter
bd45c5a2bc integrated the comparison page into the main menu
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5146 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-14 10:40:01 +00:00
low012
5e0390a24c *) Ooooooooops!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5145 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-13 17:14:41 +00:00
low012
dc56c35289 *) added page to compare results of 2 search engines
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5144 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-13 16:50:01 +00:00
lotus
880d1a83e2 do not change memory and some non-defined tasks with performance profiles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5142 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-12 11:54:25 +00:00
lotus
b68d06a6e8 performance settings based on network's remote crawl speed
removed some _pro values from config

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5134 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-10 12:52:17 +00:00
orbiter
bb5c898441 enhancements to localsearch behavior
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5131 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-09 10:24:42 +00:00
orbiter
3c6e8d2015 set default ppm when network is switched
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5127 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-08 18:20:05 +00:00
orbiter
20c2d3c248 fix for bad formatting in CrawlResults
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5126 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-08 13:59:35 +00:00
lotus
b92105c8b0 do not change auto recrawl scheduler with performance profiles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5123 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-07 13:59:24 +00:00
orbiter
d09ddabd09 corrected a design mistake (5-byte hashes not necessary)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5119 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 21:28:00 +00:00
orbiter
c97d0fcee7 modified the domain list export function:
- used the new superfast domain list generation from the domain statistics
- better interactive behavior

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5118 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 20:28:36 +00:00
orbiter
77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet
- added 'delete all' button to all results of such a domain statistic output, which deletes all urls of that domain
- extended stack cleaner to clean also the statistics: they are not completely destroyed, only the smallest counting domains are removed


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5117 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 19:41:57 +00:00
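
Note on the commit above: the cleanup rule described there (keep the statistics, but drop the domains with the smallest counts) can be sketched as a counting map with a size cap. The class name, the cap parameter, and the linear scan are assumptions made for illustration; the actual stack cleaner may work differently.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of per-domain counting with lowest-count pruning.
final class DomainStatsSketch {
    private final Map<String, Integer> counts = new HashMap<>();

    /** Records one more URL seen for the given host. */
    void add(String host) {
        counts.merge(host, 1, Integer::sum);
    }

    /** Removes the lowest-count domains until at most maxEntries remain. */
    void shrinkTo(int maxEntries) {
        int limit = Math.max(0, maxEntries); // a negative cap is treated as zero
        while (counts.size() > limit) {
            String smallest = null;
            for (Map.Entry<String, Integer> e : counts.entrySet()) {
                if (smallest == null || e.getValue() < counts.get(smallest)) {
                    smallest = e.getKey();
                }
            }
            counts.remove(smallest);
        }
    }

    int count(String host) {
        return counts.getOrDefault(host, 0);
    }

    int size() {
        return counts.size();
    }
}
```

A linear scan per removal is adequate for a sketch; a real cleaner that prunes many entries at once would more likely sort once or keep a bounded priority queue.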
lotus
44bc8311af translation fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5116 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 19:26:59 +00:00
lotus
e5c0b969d6 * save performance profile speed
* fix for wrong javastart_priority after first start

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5115 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 19:12:22 +00:00
lotus
d7a16c1f30 * added shutdown on search page (this page is shown after clicking the tray icon)
* shorter, less technical words for configuration-links

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5114 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 12:51:05 +00:00
orbiter
80a7bc93d6 - added statistical evaluation about domains that appear during crawling
- added tables that show these statistics in CrawlResults web pages

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5113 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 09:59:17 +00:00
orbiter
4a4f388ca5 re-design and simplification of crawl start menu layout
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5112 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-04 07:56:29 +00:00
orbiter
4fbee21cea - added fetch-ahead again (had been removed in last commit)
- reverted default query mode to verify=false

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5111 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-03 23:50:13 +00:00
orbiter
fc03b0437a fixed an error case where a second search after a first search with a different search word failed
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5109 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-03 15:55:25 +00:00
daburna
992635c074 translation update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5107 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-03 13:44:58 +00:00
orbiter
05dbba4bab added logging conditions to all fine and finest log line calls
this prevents the overhead of generating log lines that are then not printed

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5102 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-03 00:30:21 +00:00
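
Note on the commit above: the pattern it describes is a level check in front of each fine/finest call, so that the message string is only built when it will actually be written. The sketch below uses java.util.logging from the standard library; the class, the counter, and the message format are hypothetical and only serve to make the saving visible.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration of guarding an expensive log message with a level check.
final class GuardedLoggingSketch {
    /** Counts how often the (potentially expensive) message was actually built. */
    static int messagesBuilt = 0;

    /** Stands in for costly string assembly. */
    static String describe(String url) {
        messagesBuilt++;
        return "fetched " + url + " at " + System.currentTimeMillis();
    }

    /** Builds and logs the message only if FINE is enabled on the logger. */
    static void logFetch(Logger log, String url) {
        if (log.isLoggable(Level.FINE)) {
            log.fine(describe(url));
        }
    }

    public static void main(String[] args) {
        Logger log = Logger.getLogger("sketch");
        log.setLevel(Level.INFO);
        logFetch(log, "http://example.org/");
        System.out.println("messages built at INFO: " + messagesBuilt);
    }
}
```

Without the guard, the argument to fine() is evaluated on every call even when the logger discards it, which is the overhead the commit removes.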
orbiter
d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before)
- fixed parsing of crawl-delay statements when seconds were given with float numbers
- enhanced performance of profiling (not too many loggings; not more than one per second)
- removed some debug output
- fixed wrong return type in logging
- added a logging condition in httpd to prevent that logging statements are generated when they are not written (should be added everywhere!)
- fixed wrong word distance computation in RWI management


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5101 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-02 23:49:48 +00:00
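
Note on the commit above: a robots.txt Crawl-delay value given as a fractional number of seconds (for example 0.5) fails if it is parsed as an integer. A minimal sketch of a tolerant parser follows; the class name, the millisecond return unit, and the -1 error value are assumptions and not YaCy's actual robots parser.

```java
// Hypothetical illustration of parsing a Crawl-delay line with fractional seconds.
final class CrawlDelaySketch {

    /** Returns the delay in milliseconds, or -1 if the line is not a usable Crawl-delay. */
    static long parseCrawlDelayMillis(String line) {
        int colon = line.indexOf(':');
        if (colon < 0) {
            return -1;
        }
        String key = line.substring(0, colon).trim();
        if (!key.equalsIgnoreCase("crawl-delay")) {
            return -1;
        }
        String value = line.substring(colon + 1).trim();
        try {
            // Double.parseDouble accepts both "2" and "0.5".
            double seconds = Double.parseDouble(value);
            if (Double.isNaN(seconds) || Double.isInfinite(seconds) || seconds < 0) {
                return -1;
            }
            return Math.round(seconds * 1000.0);
        } catch (NumberFormatException e) {
            return -1;
        }
    }

    public static void main(String[] args) {
        System.out.println(parseCrawlDelayMillis("Crawl-delay: 0.5"));
    }
}
```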
lotus
3fbfd5a78b * fix for non-changing offset on new search term
* dht-heap doesn't have to be deleted (5097), we simply write a new one on exit
* do not install YaCy in startup because a Windows-shutdown might corrupt something. Installing YaCy as a service would solve this.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5099 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-09-02 15:09:31 +00:00
danielr
cd19d0aee6 - added warnings for failed transferRWI (dht-in)
- fixed parseMultipart (uncompress gzipped body) (dht-in)
- fixed parseMultipart (using content-length only if uncompressed)
- better gzipped POST (chunked instead of content-length) (dht-out)



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5096 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-29 09:42:39 +00:00
orbiter
016f57d714 fixed a dead link
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5094 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-28 21:45:58 +00:00
f1ori
b8ee04daf1 fix for http://www.yacy-forum.org/viewtopic.php?f=2&t=160 (wrong url in form)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5086 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-26 18:45:19 +00:00
lotus
d9d9c522a1 addendum to last commit
moved recrawl times for standard profiles to constants
calculate new specific dates in cleanup job

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5082 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-26 13:20:18 +00:00
lotus
480497f7c9 changed recrawl
use a specific date to define old documents
this solves an unwanted recrawl-loop during a running crawl

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5081 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-25 20:31:32 +00:00
orbiter
536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
- removed distinction between header file types for http and ftp; ftp is simulated by using http properties
- removed all old resourceInfo classes that handled this distinction
- introduced a new distinction between http request and http response objects
- unified new response objects with two other object types that had been introduced elsewhere
- changed all servlet call methods to use the new http request header object type
- divided static object keys for http header properties into request and response types
- refactoring here and there (a large number of type changes and many methods merged/moved)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5079 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-25 18:11:47 +00:00