Commit Graph

3704 Commits

Author SHA1 Message Date
orbiter
7b904e0077 integrated robots.txt crawlDelay into the crawl balancer
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3797 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-06 07:53:56 +00:00
orbiter
52cb033f01 - slightly different painting of web structure picture:
hosts that have many own connections are painted farer away (this is not yet cato's idea, this will be implemented in another step)

- doc update

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3796 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 15:32:43 +00:00
allo
b955f29f70 more options and bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3795 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 13:23:50 +00:00
allo
ef2395c244 use form instead of javascript
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3794 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 12:52:38 +00:00
karlchenofhell
30c3d909b1 - fixed charset problem in ConfigProfil_p.html (use accept-charset="UTF-8" in forms)
- fixed wrong XML output if no peers are known in Network.xml
- simplified parsing of table properties in wikiCode and ZTableToken
- reimplemented GC heuristics. They are needed to constantly ensure that an amount of free memory is available which is higher than Java's max. limit for performing a Full GC (please use serverMemory.request(long, boolean) rather than serverMemory.available(long, boolean) to provide data for averaging over the last GCs)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3793 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-05 11:37:19 +00:00
allo
6c9df13552 more debugging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3791 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 20:30:40 +00:00
allo
089c2a7c39 firefox-like filter function.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3790 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 17:33:29 +00:00
allo
4392ee0c51 BugFix for typo and wrong include
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3789 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 16:06:58 +00:00
allo
d1e1580223 Surftips Blacklist
Blacklists List Hardcoded instead of only updated on firststart / migration.java

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3788 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 15:36:10 +00:00
theli
051a65f7af *) Snippet fetching:
Snippet are now fetched synchronous if the query parameter "fetchSnippet=" 
   is appended to the query string on the yacy search page. This is required 
   for the RSS feed.
   See: http://www.yacy-forum.de/viewtopic.php?t=4051
*) Small changes in the XSLT-stylesheet that is used to generate a html page from
   the RSS feed.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3787 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-04 05:27:46 +00:00
theli
e1a5babff1 *) Logging GUI handler: line-size is now set to max-size if max-size was exceeded
See: http://www.yacy-forum.de/viewtopic.php?p=36355

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3786 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 21:23:32 +00:00
(no author)
94cc9f05f5 *) Improvements for restart via update wrapper
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3785 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 15:25:13 +00:00
allo
54ddb3262c enter on webstrucutre
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3783 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 14:50:36 +00:00
karlchenofhell
edec6049c3 - fix for UPNP error, TODO: via ajax
- removed unused "type" argument in yacysearch.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3782 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 08:53:08 +00:00
allo
5fc00871a9 getpageinfo/sitemap bugfix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3781 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 16:03:08 +00:00
allo
e7da3d2340 fixed sitemap url in getpageinfo
added suggested tags/keywords in getpageinfo

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3780 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 14:44:46 +00:00
allo
d0f8254f95 better refresh ui
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3779 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 13:58:48 +00:00
hydrox
44bac7dea1 *) blog-comments can now be moderated
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3778 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 06:02:55 +00:00
borg-0300
2ab020445a bugfix, i think - http://www.yacy-forum.de/viewtopic.php?t=4059
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3777 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 17:03:02 +00:00
(no author)
f89517203d *) SOAP: new function to get the Performance Settings of Queues and Processes
No items left in the yadmin SOAP-TODO :-)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3776 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 15:47:58 +00:00
(no author)
92351c4dcb *) SOAP: bookmarks list now indicates if a bookmark is private (requested by KoH)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3775 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:54:56 +00:00
allo
957a25afff getRight(rightName) instead of get...Right()
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:48:20 +00:00
(no author)
e0e46d3aec *) SOAP: new function doGarbageCollection (requested by KoH)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3773 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:32:41 +00:00
(no author)
5f80b72cac *) adding peer-hash, seed-url and location to network.xml
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3772 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-31 14:22:27 +00:00
(no author)
1efe607c34 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3771 6c8d7289-2bf4-0310-a012-ef5d649a1542 2007-05-31 11:24:59 +00:00
low012
0610285c4f *) totalResults and itemsPerPage gets set again
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3768 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-29 21:15:42 +00:00
(no author)
5cc8bb075b Syntaxfehler beseitigt
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3764 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-26 19:51:05 +00:00
low012
a97a3756fe *) got rid of HTML in Java classes that led to HTML tags being displayed on monitor pages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3763 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-25 19:54:54 +00:00
low012
a0149317ac *) fixed bug where headlines were added to directory of a wiki page multiple times (http://www.yacy-forum.de/viewtopic.php?t=4034)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3762 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-25 16:36:09 +00:00
(no author)
ef24bed406 Sorry...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3760 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:25:07 +00:00
(no author)
a29cb2e1af blupp
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3759 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:14:46 +00:00
theli
651b05ba43 *) wsdl file updated (requested by KoH)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3758 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:07:58 +00:00
orbiter
0ae6664ad8 enhanced web structure picture
- hand-over of get properties from web front-end to graphics generation
- added depth-control buttons
- added marking of anchor-points to highlight relation order
- enhanced ymage graphics library

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3757 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 15:28:03 +00:00
orbiter
2f34f32ce3 added .classpath
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3756 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 07:30:43 +00:00
orbiter
54ca71535e added search protection also to detailed search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3755 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 19:17:36 +00:00
orbiter
b3d2fafcfb added option to block the search function against unauthorized users
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3754 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 19:09:19 +00:00
rramthun
dee29faf15 *) System independent restart is now possible if using the updater/wrapper. - Part 1/2 (YaCy side)
The old Windows-only method has not been removed, so Windows-users without updater are still able to restart without updater


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3752 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 18:42:31 +00:00
orbiter
3562fe1706 should not be there
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3751 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 18:14:07 +00:00
orbiter
c321ddff7a redesigned access control to surftips:
- access is now granted to authorized admin, even if it is blocked to the public
- generalized naming
- changed links to buttons

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3750 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 17:49:16 +00:00
daburna
8b393744f1 #first version of french language file by gDupont
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3749 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-23 10:43:54 +00:00
(no author)
b4b56a5475 *) Preparations for system-independent restart using the wrapper. Not functional yet.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3748 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-22 20:41:15 +00:00
orbiter
a585b4d41b added web structure image
see http://localhost:8080/WatchWebStructure_p.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3747 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-22 15:20:50 +00:00
orbiter
33ad0c8246 added a web structure computation and logging:
- all web page parsing operations will now increase a web structure file
- the file is computed in memory and dumped at shutdown-time to PLASMASB/webStructure.map in readable form (not a database)
- the file can be used externally to analyse the link structure of the crawled pages
- the web structure can also be retrieved using a xml-interface at http://localhost:8080/xml/webstructure.xml
- the short-term purpose is the computation of a link-graph image (before linuxtag!)
- a long-term purpose could be a decentralized computation of the citation rank



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3746 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-22 08:13:48 +00:00
auron_x
cd3494a04c *) hint for idle indexer only appears if indexer is idle now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3744 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-21 11:57:57 +00:00
karlchenofhell
7904175338 - sorry for typos
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3743 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 16:22:46 +00:00
karlchenofhell
baa9402b97 - wiki-parser is now configurable via the config setting wikiParser.class which holds the class-name for the parser to use
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3742 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 16:19:25 +00:00
karlchenofhell
601fc7d1c5 - added source to J7Zip-modifed.jar and it's license (changelog is still to come)
- moved HTML-*replace-methods from wikiCode to de.anomic.data.htmlTools
- prepared use of different wiki parsers as suggested here: http://www.yacy-forum.de/viewtopic.php?p=34444#34444

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3741 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-20 13:29:12 +00:00
karlchenofhell
0a64047081 - plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method)
- added 7zip parser
- added 'text/sgml' to realtime parseable mimetypes (sometimes returned by the mime type parser)
- added new cached output stream class, very suitable for parsers because of limited memory

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3740 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 23:13:44 +00:00
theli
b1680ab71f *) bugfix for ArrayIndexOutOfBoundsException in robots-parser (thanks to low012)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 13:39:08 +00:00
theli
b30e64daab *) passing homepath to serverLog.configureLogging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3738 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-18 13:04:26 +00:00