allo
bb51efbb49
"Bugfix" for Tagdisplay
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3493 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-19 13:00:33 +00:00
rramthun
43114af6d7
*) Translated robots.txt-config-page
...
*) Simplified some sentences
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3492 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-19 12:42:07 +00:00
orbiter
6b9eea3932
- removed the distinction between longTitle and shortTitle; it cannot be used for search results,
...
and it is difficult to get both types from all document types
- added some author parsing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3489 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-18 12:33:19 +00:00
orbiter
a738b57b31
added author tag to indexing content
...
enhanced composition of title tag
TODO: insert author information for external parsers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3488 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-17 01:18:34 +00:00
theli
91c2a042a7
*) bugfix for wrong proxy traffic accounting
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3484 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 13:52:48 +00:00
orbiter
861f41e67e
redesigned NURL-handling:
...
- the general NURL-index for all crawl stack types was split into separate indexes for these stacks
- the new NURL-index is managed by the crawl balancer
- the crawl balancer does not need an internal index any more, it is replaced by the NURL-index
- the NURL.Entry was generalized and is now a new class plasmaCrawlEntry
- the new class plasmaCrawlEntry also replaces the preNURL.Entry class, and will also replace the switchboardEntry class in the future
- the new class plasmaCrawlEntry is more accurate for date entries (holds milliseconds) and can contain larger 'name' entries (anchor tag names)
- the EURL object was replaced by a new ZURL object, which is a container for the plasmaCrawlEntry and some tracking information
- the EURL index is now filled with ZURL objects
- a new index delegatedURL holds ZURL objects about plasmaCrawlEntry objects to track which URL is handed over to other peers
- redesigned the plasmaCrawlEntry handover, because there is no longer any need to convert one entry object into another
- found and fixed numerous bugs in the context of crawl state handling
- fixed a serious bug in kelondroCache which prevented entries from being removed
- fixed some bugs in the online interface and adapted monitor output to the new entry objects
- adapted the yacy protocol to handle new delegatedURL entries
all old crawl queues will disappear after this update!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3483 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-16 13:25:56 +00:00
hydrox
9b5fb3908d
*) a peer message is now created when a blog comment is written
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3480 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-15 12:58:17 +00:00
orbiter
9f929b5438
better snippet handling when loading a snippet fails
...
see also http://www.yacy-forum.de/viewtopic.php?p=31096#31096
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-13 22:18:36 +00:00
auron_x
d451ad48d3
*) improved peerloadgraphic:
...
- unnecessary (0 %) pieces are removed
- percent-values of each thread displayed in legend
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3474 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-12 19:08:17 +00:00
orbiter
a5d668c0c6
added speed-buttons for easy performance setting
...
they appear on the crawl start page and on the indexing monitor page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-12 16:24:28 +00:00
karlchenofhell
e2ac5f62bd
- Prettified code [from NN's TODO]
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3471 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-11 19:53:14 +00:00
auron_x
22fe14f292
*) first version of Peerload-graphic
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3469 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-11 17:04:11 +00:00
orbiter
909d7a8ae9
fixed wrongly implemented row iterator in kelondroFlexSplitTables
...
this has no further effect; until now this iterator was only used on
the Index Administration page.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 13:55:26 +00:00
orbiter
b3ca177a5d
fix for http://www.yacy-forum.de/viewtopic.php?p=32797#32797
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3459 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-09 11:49:56 +00:00
orbiter
38b93f8cb8
bugfix for my last commit:
...
iterator did not consider secondary start point in case of rotation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3456 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 22:07:17 +00:00
karlchenofhell
264a82eec8
- fix for http://www.yacy-forum.de/viewtopic.php?t=3657
...
- fix for http://www.yacy-forum.de/viewtopic.php?p=32758#32758
- Diff takes any objects now, not only strings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3455 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 22:04:15 +00:00
rramthun
045d758537
Avoid stopwords as topwords, configurable
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3454 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 20:50:27 +00:00
orbiter
d755a8026d
- better OOM protection
...
- better memory allocation for FlexTable indexes
- splitting between static index and dynamic index (only the dynamic part must grow)
- to enable a merge iteration over the new split index, a large number of classes needed to be adapted to the new iterator classes
- added new iterator classes that support cloneable iterators
- adapted all iterator classes to implement cloneable iterators
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3453 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 16:15:40 +00:00
karlchenofhell
2be405e1e1
- fix for last two commits
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3452 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 14:00:04 +00:00
karlchenofhell
de1b4a1731
- don't publish news if empty or equal page is submitted in wiki
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3451 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 13:50:24 +00:00
karlchenofhell
dcc13abd59
- fixed small bug at home page, button "peer's console"
...
- fixed <fieldset><dl> for safari on many pages
- added Blog-link to Network page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3450 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 13:39:09 +00:00
theli
6596167277
*) bugfix for wrong RSS feed pubDate formats
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3449 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 08:37:47 +00:00
theli
0d178d00a5
*) adding RSS feed for peer messages
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3448 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-08 08:10:36 +00:00
karlchenofhell
4f2e6ef47b
- WatchCrawler_p shows max. 80 characters of URLs now (maybe dynamically adjustable based on browser width?)
...
- typo in BlacklistCleaner
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3445 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 23:16:25 +00:00
orbiter
70cd391ea1
fix for dl/fieldset problem in Safari
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3444 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 22:49:32 +00:00
orbiter
5741701b59
moved crawl start up, personal web pages down in main menu
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3443 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 16:08:13 +00:00
orbiter
b627c77df6
- workaround for safari bug with definition lists inside fieldsets in ConfigBasic
...
- alternative can be seen in PerformanceMemory, where a dl is simulated with a table layout
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3442 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 15:53:04 +00:00
karlchenofhell
03c5906ae7
- minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646
...
- PerformanceMemory_p.html is valid XHTML again
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3440 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 11:50:03 +00:00
theli
8e9bee12fc
*) adding guid to yacysearch.rss
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3435 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 05:58:14 +00:00
orbiter
1cba31de43
redesigned RAM organization for database caches
...
- each cache can now allocate as much memory as is available
- no more fixed limits
- replaced old performance memory monitor by new one
- added supervision methods as static functions into the classes that provide cache functionality
- RAM allocation is steered by two simple limits that are relative to the available RAM
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3434 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-06 22:43:32 +00:00
theli
e934c5b09b
*) fixed wrong blog RSS feed title
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3433 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-06 17:37:21 +00:00
theli
ceed0364e2
*) Blog RSS: Image added
...
*) RSS Feed for YaCy Bookmarks added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3432 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-06 17:35:24 +00:00
low012
ce360ef43e
*) no more HTML in plasmaCrawlProfile.java
...
*) <br> is no longer displayed in items in Auto Filter Content on WatchCrawler_p.html
*) removed unnecessary replaceHTML()
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3425 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-02 21:09:28 +00:00
karlchenofhell
93e1ad2bca
- fix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3424 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-02 01:50:21 +00:00
karlchenofhell
88245e44d8
- improved version of robots.txt (delete your old htroot/robots.txt before updating):
...
- robots.txt is a servlet now
- no need to rewrite the whole file each time a section is added or removed
- user-defined disallows, added manually, won't be overwritten anymore
- new config-setting: httpd.robots.txt, holding names of the disallowed sections
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3423 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-02 01:19:38 +00:00
karlchenofhell
9623bf7bbe
- removed call of java 1.5 method
...
- added config servlet for local robots.txt
- removed YPStats_p as it is of no use anymore
- supertemplates use XHTML now
- quick-fix for http://www.yacy-forum.de/viewtopic.php?p=32296#32296
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3422 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-01 13:54:14 +00:00
daburna
f4c13b422c
*updated translation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3421 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-01 09:36:59 +00:00
karlchenofhell
1fe505f0b0
- adapted User_p to general web-interface style (and removed status-only page on changes)
...
- beautified WikiHelp.html + typos
- fixed: IP was not set correctly in Blog.xml
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3418 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-28 09:22:31 +00:00
karlchenofhell
92b6bc0ad2
- fixed wrongly applied replacement of "<" and ">" in Blog and simplified the code a bit
...
- added a check whether the active blacklist engine is supported by the blacklist cleaner
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3417 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-28 00:04:32 +00:00
karlchenofhell
a1d68fe092
- use .class rather than Class.forName for classes in class-path
...
- added Bost's patch for Diff.findDiagonale() from: http://www.yacy-forum.de//files/patch_685.txt
- fixed minor bugs in Blog
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3416 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-27 22:52:22 +00:00
orbiter
f4cfd19835
second generation of collection R/W head path optimization:
...
- permanent cache flush is switched off. The optimized cache flush
works better when a large number of collections is flushed
together
- the flush size can be configured instead of the flush divisor. There is
only one size for all flushes
- collection records that shall be removed during collection transition
(jump from one collection file to another) are no longer actually removed
but only marked in RAM. Add operations on the collection reuse these
marked collection spaces
- index bulk write operations are now separated for each file of a kelondroFlex
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3414 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-27 13:01:22 +00:00
hydrox
e92e8b2ae3
*) added RSS-Feed for blog
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3413 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-27 10:05:46 +00:00
hydrox
a107961099
*) fixed: deleting blog comments without admin rights is no longer possible
...
*) fixed: empty blog comments are no longer possible
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3412 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-27 08:37:47 +00:00
(no author)
cf47075855
CSS corrections
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3410 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-26 23:03:10 +00:00
hydrox
116fc016d0
*) fix for Blogcomment-Preview
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3408 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-26 16:18:43 +00:00
hydrox
54fef3574f
*) missing files for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3406 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-26 14:38:34 +00:00
hydrox
cb89c74d52
*) added blog-comments
...
*) removed debug-output when deleting news
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3405 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-26 14:36:01 +00:00
karlchenofhell
6fbe31425a
- some code-cleanup (no more syntax-warnings here)
...
- URLs to be blacklisted in IndexControl_p are now also deleted from loadedURLs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3404 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-26 12:56:50 +00:00
karlchenofhell
31ad42535a
- added buttons to IndexControl_p for adding a complete domain or a single URL to the blacklist
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3400 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-25 23:14:45 +00:00
karlchenofhell
e0decf4653
- added support for changing invalid entries in blacklist cleaner
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3397 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-25 19:36:05 +00:00