Commit Graph

3293 Commits

Author SHA1 Message Date
lotus
bbb7aea8f3 fix basic config change in portal mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7582 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-10 20:04:15 +00:00
orbiter
30aed9824a moved getBytes() to UTF8.getBytes() to use a default String encoding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7580 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-10 12:35:32 +00:00
orbiter
4d733608fb fix for broken JSON, see: http://forum.yacy-websuche.de/viewtopic.php?p=22162#p22162
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7577 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-09 20:08:20 +00:00
orbiter
7962d35425 - removed file upload function in crawl start and replaced it with an input field for a file path where the crawl start file is loaded. This was necessary to support the API steering for file crawl starts, for two reasons:
1) if the file is changed for a re-crawl this is not reflected in the steering because it would take the previously uploaded crawl start file
2) browsers do not submit the full path of the selected file even if this path is shown in the input field because of security reasons. There is no work-around or hack to make the submission of the full path possible

- fixed deletion of crawl start point urls in crawl stack and balancer double-check
- fixed a problem with steering self-call (no resolving of localhost)
- added more logging for the crawler to supervise why crawl urls are not taken by the loader
- added a javascript onload-function to select domain restriction in all cases where a crawl is started from a file or from a url
- fixed the restrict-to-domain pattern computation, added a 'www.'-prefix and added this functionality also to a crawl start from file 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7574 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-09 12:50:39 +00:00
orbiter
e1b6916423 always try to guess the size of a StringBuilder to prevent too many memory re-allocations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-09 09:29:05 +00:00
low012
bea8137997 *) minor changes
*) fixed potential NPE in suggest.java

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-08 23:27:41 +00:00
low012
3e03963b1c *) minor changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-08 22:37:17 +00:00
low012
0da3b6489e *) added the only changes from r7557 which actualy made sense
*) caught potential exception (occured when user entered a string which did not contain digits only for the maximum number of lines)
*) use prop.putHTML to avoid potential XSS attack in case an attacker manages to cause something to end up in the logs which contains a string which was defined by the attacker

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7562 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-07 21:44:58 +00:00
orbiter
29acd2f108 reverted also changes in ViewLog from SVN 7557 because the ThreadDump submenu was not visible any more.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7560 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-07 21:19:47 +00:00
orbiter
cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7558 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-07 20:36:40 +00:00
low012
9d366ee9d7 *) removed unused code (I assume that most of the code was really dead, but if you need any of the classes, tell me and I will put it back in.)
*) minor code cleanup in ViewLog

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7557 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-07 18:55:11 +00:00
orbiter
7138f4036b less synchronization, better thread dump tool
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7556 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-07 15:29:45 +00:00
orbiter
f8d0454c53 small bug fixes and experiments with search speed enhancement
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7549 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-04 14:29:22 +00:00
orbiter
bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
This has two aspects: the user who searches may want to increase these values to get more results and more load on the remote side and the user of the server which is accessed for this search may want to restrict the load. Both sides can now be configured. The server-site maximum load parameters are defined by a network definition and the client-side search request load can be defined by each user individually but when the remote search is done the requested service is limited to the network definition.

You can find now in the network definition file:
network.unit.remotesearch.maxcount and network.unit.remotesearch.maxtime
and in the yacy.conf file:
remotesearch.maxcount and remotesearch.maxtime

There is currently no web interface to define the client-side remote search attributes, please set them manually
    

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7548 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-04 13:44:00 +00:00
orbiter
42d90664f3 - fixed a memory leak in the httpc.post method (no finish)
- patched some more memory-saving relevant code
- some more minor bug fixes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7541 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-03-01 09:03:33 +00:00
mikeworks
85f5c02deb de.lng: Updated German translation and removed old unused strings, e.g. 8080 -> 8090 and Search Portal translations
Bookmarks.html, Ranking_p.html, base.css: Fixed XHTML errors to make pages compatible again - switched div -> span inside dt and replaced css definition of id (unique) with class (class of elements)
header.template: Fixed link to berliOS changelog by replacing & -> & and adding translation for German page by refering to German berliOS UI ;-]

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7539 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-28 00:22:05 +00:00
mikeworks
073ee5525f de.lng: Updated German translation added missing line no change when no other network was selected
YaCy logos: removed outer white line that made the logo blitz on black or dark themes http://forum.yacy-websuche.de/viewtopic.php?f=15&t=1724#p21655

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7535 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-27 01:19:46 +00:00
orbiter
b1781d7aae some more performance hacks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7533 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-26 01:24:49 +00:00
orbiter
5e186e0122 continuing the fight against deadlocks during time formatting: better caching.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7531 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-25 21:11:53 +00:00
orbiter
a92d80a545 performance enhancements using an alternative to a insensitive collator (a complex string compare):
- less synchronizations
- better speed
..at most important and commonly used classes: http headers, url parsing and html parsing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7526 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-25 11:23:57 +00:00
orbiter
dec4f36700 - fix for missing favicons in search widgets
- fix for bad digest/hash computation in case of interrupts to class

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7518 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-23 23:04:05 +00:00
orbiter
24909b3006 slightly less restrictive values for DoS
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7509 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-21 15:24:09 +00:00
orbiter
311f57d360 DoS to prevent online snippet fetch: allow read from cache.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7508 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-21 15:11:03 +00:00
orbiter
83a38b16bb added link to svn log
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7507 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-21 12:11:12 +00:00
orbiter
09a98a6ae4 - disabled arrow keys in search (pages switch) again -- too disturbing when editing the query line
- fixed alt text in AccessGrid

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7504 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-21 10:30:58 +00:00
orbiter
82f262f685 - enhanced circle drawing speed
- beautified 'moving dot' feature (using smaller and correctly positioned dots)
- added moving dots to DHT transfer

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7500 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-21 00:03:11 +00:00
orbiter
29dc416ac6 more animations in graphics. See network and access picture.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7498 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-19 01:56:12 +00:00
orbiter
a80ee9a03d THE GRID is coming to YaCy .. see new animated graphics on http://localhost:8090/AccessGrid_p.html
showing incoming and outgoing connections in an animated way

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7496 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-18 23:19:35 +00:00
low012
83ae46902b *) removed link to LogStatistics
*) removed translation of LogStatistics

Once again: If you want this page back, I will put it back in, but it seems to have been non-operational for over one year without anybody complaining about it.

Does anybody know if net.yacy.kelondro.logging.LogalizerHandler still serves any purpose?

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7495 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-18 02:11:19 +00:00
low012
ce012e11aa *) deleted LogStatistics since the page did not work anymore and it seemed to be obsolete, tell me if you miss it and I will add it again
*) a few minor changes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7494 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-18 01:46:07 +00:00
low012
c5051c4020 *) fixed bug which caused entries to not be deleted when deleting by URL on IndexCreateWWWLocalQueue_p.html (I hope this did not break anything else)
*)  cleaned up code a little bit

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7493 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-18 01:25:46 +00:00
orbiter
fb8b046bb4 removed unused properties
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7491 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-17 22:47:39 +00:00
low012
b0bdf2d9ed *) Oops!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7490 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-17 20:42:35 +00:00
low012
de065e594f *) make sure that only positive values are accepted as refresh interval on Crawler Monitor page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7489 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-17 20:40:34 +00:00
orbiter
c69a4874b4 fix for non-valid json, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=3179
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7488 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-17 16:47:05 +00:00
orbiter
4473cf8c61 replaced utf-8 with UTF-8
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7485 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-16 13:51:30 +00:00
orbiter
d2eb2fe8d9 added patch from quix0r, see
http://forum.yacy-websuche.de/viewtopic.php?p=21188#p21188

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7483 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-14 23:16:47 +00:00
orbiter
abb35addb8 added
accept-charset="UTF-8"
to all forms
this applies patches from http://forum.yacy-websuche.de/viewtopic.php?p=20891#p20891

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7482 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-14 22:57:43 +00:00
orbiter
5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a muti-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
Please see new coments in yacy.network.freeworld.unit for details of the new DHT selection methods.
The number of maximum peers is now not fixed to a specific number but may increase with
- the partition exponent
- the number of redundant peers
- the robinson burst percentage
- the multiword burst percentage
The maximum can then be the number of senior peers (all visible peers).

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7479 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-13 17:37:28 +00:00
orbiter
4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
- some restructuring of the document counting and logging structures was necessary
- better abstraction of CrawlProfiles
- added deletion of logs to the index deletion option (if the index is deleted using the servlets) which is necessary to reset the domain counters for the page limitation
- more refactoring to get the LibraryProvider more clean
- some refactoring of the Condenser class

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7478 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-12 00:01:40 +00:00
lotus
9f6e3f9793 fix: same number format display before and after get/post
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7476 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-10 09:08:44 +00:00
lotus
4716aa7d3a fix display of statistics link on network.html (caused by menu option)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-09 21:32:45 +00:00
orbiter
70ca7cec8c fix for http://forum.yacy-websuche.de/viewtopic.php?p=21763#p21763
and another fix for non-working global search when search options are switched off

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7467 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-03 10:43:09 +00:00
orbiter
fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
Administration can be done at ConfigPortal.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7466 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 15:54:13 +00:00
orbiter
0dfb1ef7d2 added custom targets for media search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7463 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 00:50:06 +00:00
orbiter
0cdfb82963 replaced more appearance of double values by float values
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7461 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 00:06:29 +00:00
orbiter
eb12e15738 moved all Double values to Float values because of
http://www.exploringbinary.com/java-hangs-when-converting-2-2250738585072012e-308/
YaCy does not really need double-precision floating point computation anywhere, so this should not affect any feature

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-01 23:49:11 +00:00
orbiter
fe948abd84 added link on ViewProfile.html to EditProfile_p.html to make it easier to customize the peer
the link appears only if the user has admin rights

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7459 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-31 19:55:00 +00:00
low012
77715c2b16 *) less HTML errors, but still several left
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7457 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 17:26:47 +00:00
low012
ae10ed5613 *) added a Set to which filter elements are written before mustmatch-filter is created to avoid huge lists of double elements in mustmatch-filter when starting a crawl from a "Link-List of URL" on CrawlStartSite_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7456 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 16:24:33 +00:00