7144 Commits

Author SHA1 Message Date
orbiter
4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
- some restructuring of the document counting and logging structures was necessary
- better abstraction of CrawlProfiles
- added deletion of logs to the index deletion option (if the index is deleted using the servlets) which is necessary to reset the domain counters for the page limitation
- more refactoring to get the LibraryProvider more clean
- some refactoring of the Condenser class

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7478 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-12 00:01:40 +00:00
low012
64f32e8f00 *) replaced all IPs in IP filters for proxy with the proper regular expression
*) some cleanup

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7477 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-11 23:37:13 +00:00
lotus
9f6e3f9793 fix: same number format display before and after get/post
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7476 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-10 09:08:44 +00:00
lotus
4716aa7d3a fix display of statistics link on network.html (caused by menu option)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-09 21:32:45 +00:00
sixcooler
3e8b72be50 update to httpclient-4.1 - sorry forgot some
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7474 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-08 21:53:20 +00:00
sixcooler
45dcfa3460 update to httpclient-4.1
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-08 21:46:24 +00:00
f1ori
d38c739b31 * add ant-target deleteData, to simulate fresh yacy installation
* useful in IDEs like eclipse

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7472 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-08 09:12:48 +00:00
low012
74b22dfa24 *) fixed bug which affected blacklist entries which consisted of domain _and_ path parts
*) minor cleanup

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7471 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-06 18:07:08 +00:00
low012
4843f04247 *) more input from Moubai
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7470 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-06 16:00:45 +00:00
low012
b9990e722f *) updated French language file with input from Moubai (http://yacy-forum.org/viewtopic.php?p=1646)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7469 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-05 11:20:25 +00:00
orbiter
93732d6773 increased number of target peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7468 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-04 13:25:28 +00:00
orbiter
70ca7cec8c fix for http://forum.yacy-websuche.de/viewtopic.php?p=21763#p21763
and another fix for non-working global search when search options are switched off

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7467 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-03 10:43:09 +00:00
orbiter
fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
Administration can be done at ConfigPortal.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7466 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 15:54:13 +00:00
f1ori
01cb3bbaec * fix patchCharsetEncoding-test (patchCharsetEncoding now returns null on input null)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7465 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 15:28:41 +00:00
orbiter
431f780f41 patch for bad data in url metadata
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 01:19:25 +00:00
orbiter
0dfb1ef7d2 added custom targets for media search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7463 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 00:50:06 +00:00
orbiter
5905f912c5 replaced more double types with float
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7462 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 00:22:00 +00:00
orbiter
0cdfb82963 replaced more occurrences of double values with float values
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7461 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-02 00:06:29 +00:00
orbiter
eb12e15738 moved all Double values to Float values because of
http://www.exploringbinary.com/java-hangs-when-converting-2-2250738585072012e-308/
YaCy does not really need double-precision floating point computation anywhere, so this should not affect any feature

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-02-01 23:49:11 +00:00
orbiter
fe948abd84 added link on ViewProfile.html to EditProfile_p.html to make it easier to customize the peer
the link appears only if the user has admin rights

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7459 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-31 19:55:00 +00:00
f1ori
982aa689ef * fix StringIndexOutOfBoundsException in WebStructureGraph
* add better escaping to saveMap and loadMap

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7458 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-31 14:25:09 +00:00
low012
77715c2b16 *) fewer HTML errors, but still several left
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7457 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 17:26:47 +00:00
low012
ae10ed5613 *) added a Set to which filter elements are written before the mustmatch-filter is created, to avoid huge lists of duplicate elements in the mustmatch-filter when starting a crawl from a "Link-List of URL" on CrawlStartSite_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7456 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 16:24:33 +00:00
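The de-duplication described in commit ae10ed5613 can be illustrated with a minimal Java sketch. It is not the code from that commit: the class name `MustMatchFilterSketch`, the method `buildMustMatch`, and the `.*host.*` pattern shape are all assumptions made for illustration. It only shows why collecting filter elements in a `Set` before joining them keeps the resulting regular expression short when the link list repeats hosts.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

public class MustMatchFilterSketch {

    // Hypothetical helper: builds one alternation regex from a list of host names.
    public static String buildMustMatch(List<String> hosts) {
        // LinkedHashSet drops repeated elements and keeps first-seen order,
        // so each host contributes exactly one alternative to the filter.
        Set<String> elements = new LinkedHashSet<>();
        for (String host : hosts) {
            // Assumed pattern shape: match any URL that contains the quoted host.
            elements.add(".*" + Pattern.quote(host) + ".*");
        }
        return String.join("|", elements);
    }

    public static void main(String[] args) {
        List<String> hosts = Arrays.asList("example.org", "example.net", "example.org");
        String filter = buildMustMatch(hosts);
        // The repeated host appears only once in the printed filter.
        System.out.println(filter);
        System.out.println(Pattern.matches(filter, "http://example.net/page.html"));
    }
}
```

Without the intermediate set, a link list with many URLs from the same host would append the same alternative once per URL.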
orbiter
9a1e0158fa better servlet naming in index administration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7455 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 11:32:31 +00:00
orbiter
88773e4daa changed the default port from 8080 to 8090
see also: http://forum.yacy-websuche.de/viewtopic.php?p=21683#p21683

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7454 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 10:54:13 +00:00
orbiter
6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
- the own seed file gets the lead for storage of the peer name
- exchanged default peer name generation method with one that does not use the local ip
- default peer names are now strings starting with '_anon'
- added another switch to suppress forwarding to ConfigBasic if the name was already changed
- replaced all usages of the yacy.conf peerName with access to the local seed
- changes to the peer name are now applied directly and not after the next peer ping


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7453 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 10:12:17 +00:00
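A rough sketch of the naming scheme from commit 6c35b68f17 follows, under stated assumptions. The commit message confirms only that default peer names start with '_anon' and no longer depend on the local IP; the random base-36 suffix, the class `AnonPeerNameSketch`, and both method names are hypothetical and are not taken from the YaCy source.

```java
import java.security.SecureRandom;

public class AnonPeerNameSketch {

    // Prefix taken from the commit message; everything else here is assumed.
    public static final String PREFIX = "_anon";

    private static final SecureRandom RANDOM = new SecureRandom();

    // Hypothetical generator: prefix plus a random suffix, with no use of the local IP.
    public static String defaultPeerName() {
        long value = RANDOM.nextLong() & Long.MAX_VALUE; // clear the sign bit
        return PREFIX + "-" + Long.toString(value, 36);
    }

    // Hypothetical check that a caller could use to decide whether the user
    // has already replaced the generated name with one of their own.
    public static boolean isDefaultPeerName(String name) {
        return name != null && name.startsWith(PREFIX);
    }

    public static void main(String[] args) {
        String name = defaultPeerName();
        System.out.println(name + " default=" + isDefaultPeerName(name));
        System.out.println("mypeer default=" + isDefaultPeerName("mypeer"));
    }
}
```

A prefix check of this kind is one way a switch like the one mentioned in the commit could tell whether the name was already changed.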
mikeworks
8bc66b9e84 de.lng: Updated German translation and cleaned removed lines - no unused strings log entries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7452 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-28 04:46:42 +00:00
orbiter
786166041a - added recording of all accessed and submitted servlets
- this recording is then used to redirect from the Status.html page to BasicConfig in case that servlet was never submitted
- this acts as an addition to the new default pop-up page 'index.html' which offers an administration link to Status.html. For a first-time user this then redirects directly to the former start page BasicConfig.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7451 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-27 11:17:11 +00:00
orbiter
28f669bf0b - fixed/enhanced move to SD/16:9 images (network, web structure)
- added logging in peer ping to analyse time-consuming elements which could be cause for disappearing peers

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7450 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-27 10:16:14 +00:00
orbiter
0376f73fdb extended seed list uploader: do not only upload all active peers but also some more peers that are passive but had been active in the last 24 hours
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7449 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-26 23:21:33 +00:00
orbiter
3fe03f153d - search page becomes default start page (new users are not forced to do configuration since this is not necessary)
- adjusted top menu on search page (shows less stuff and now also the network graphics)
- adjusted the network page (looks better when showing no other navigation on top)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7448 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-26 14:58:28 +00:00
orbiter
1bda9e7ed9 option in network servlet to show the servlet with simple or embed menus
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7447 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-26 13:59:55 +00:00
orbiter
991b92f4ae enhanced network graphic
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7446 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-26 13:52:46 +00:00
mikeworks
30df013c2b de.lng: Added German translation to Network Scanner menu
CrawlStartScanner_p.html: Modified HTML code to validate HTML 4.01 Strict, switched selectAll JavaScript, added end tags to input, etc ...

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7445 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-23 18:36:44 +00:00
orbiter
0887a1c86f better table layout in Network and API Administration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7444 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-22 10:53:36 +00:00
orbiter
59d9fe1bd7 added more php mime types
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7443 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-22 09:52:36 +00:00
orbiter
3ae8f40fc8 removed yacy.network.group - this feature was never used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7442 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-22 09:50:36 +00:00
orbiter
efb4ca8fa8 modified auto-delete of search failure-words:
- words are now not deleted from the search index automatically if index receive is switched off
- a flag in the network definition defines if this feature is switched on at all
- the search filter for not-found word references is switched off for server-side remote searches

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7441 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-22 09:46:00 +00:00
low012
58346b9a76 *) added lock symbol to "API Action Steering" link in menu since the page is protected
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7440 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-22 02:11:18 +00:00
lotus
79c5b35a6a fix bug in installer in case there was a 32 bit jre installed on a 64 bit windows. now the necessary 64 bit jre gets installed.
thanks PCA42! http://forum.yacy-websuche.de/viewtopic.php?p=21631#p21631

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7439 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-21 18:09:24 +00:00
orbiter
f1f03d8c90 more logging for strange network loading bug
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7438 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-19 09:31:56 +00:00
f1ori
4e29e9712a * create cleanupjob for cached failed urls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7437 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-17 15:04:00 +00:00
f1ori
a321c7673d * adminAccountForLocalhost only for localhost
* yacy crawls local domains also, if no password is set (the interface is already protected)
* setting a password is no longer required in intranet mode

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7436 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-17 11:37:30 +00:00
lotus
c9cd03b532 adding convenient option for contacting peer on network page 4. can be used when creating a new network. this page is hidden from the interface (not linked).
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7435 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-15 18:49:51 +00:00
hermens
930cb412dd Let SHORT_MILSEC_FORMATTER make a new formatted String every millisecond
see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=3103


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7434 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-15 13:16:40 +00:00
orbiter
090c73e32e catch an OOM in HeapReader iteration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7433 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-12 12:04:18 +00:00
low012
48463c4507 *) General private License? ;-)
*) minor code changes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7432 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-12 00:28:08 +00:00
orbiter
c93f4dda72 - cleaned up yacy news
- removed unused methods
- avoid news generation in case that the peer runs in robinson mode

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7431 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-12 00:00:14 +00:00
orbiter
6c1b14c8e1 - more control in access tracker: count number of returned search results (not only info how much is in the index)
- extended query params for this
- enhanced cora

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7430 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-11 22:58:14 +00:00
f1ori
fafab7a8fe * provide option to delete cached snippet fetching failures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7429 6c8d7289-2bf4-0310-a012-ef5d649a1542
2011-01-10 21:40:56 +00:00