Commit Graph

6281 Commits

Author SHA1 Message Date
orbiter
db19a941cf added new image index storage classes (not integrated yet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6608 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 22:12:05 +00:00
orbiter
c8aece34a4 update to yacy/ai (just more testing)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6607 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 22:07:32 +00:00
orbiter
8ce936bcdd added an api recording function: it shall be possible to record
all operations on YaCy in a database that should make it possible
1) to re-create a setting on fresh peers
2) to transmit a setting from one peer to another
3) to re-create crawl starts after a complete deletion of the index
This functionality will also support
4) scheduled re-crawls (new implementation)
To implement this, a new database structure has been crated that stores maps into blob heaps. to encode maps the b-encoding technique was used (this is the same encoding that torrent files use)
- added a b-encoder
- enhanced the b-decoder
- added a b-encoded map heap data structure
- added a table organisation based on b-encoded heaps
- added a servlet to maintain such tables (see Tables_p.html)
- integrated the servlet into the Advanced Settings menu
- added an api recording based on the new tables

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 22:06:03 +00:00
orbiter
8df1694288 - added options to switch on/off search domains (text, image, audio, video, app)
- more memory by default


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6605 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 22:03:02 +00:00
mikeworks
19ff7da2b4 German language file change:
- Updated pages are Blog, Bookmarks, CookieMonitor*, CrawlResults, ProxyIndexingMonitor_p.htm, Wiki.html, yacyinteractive.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6604 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 01:44:29 +00:00
mikeworks
d4579377f8 Updated German language file
- Fixed some end tags (#-----------------------------) for syntax highlighting to properly detect the end
- Updated translation on pages in Search & Browse category

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6603 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-21 00:44:10 +00:00
mikeworks
3597dee3e3 Updated syntax highlighting support for YaCy lng (language) files in Notepad++
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6602 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-20 20:34:40 +00:00
mikeworks
c5322dd3a7 Updated German language file
- Updated changed lines in CrawlProfileEditor_p.html and WikiHelp.html
- Updated new navigation menu structure and some submenus

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6601 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-20 07:37:03 +00:00
mikeworks
ee52634daa Updated french language file due to changes to CrawlProfileEditor_p.html and WikiHelp.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6600 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-20 07:34:54 +00:00
mikeworks
eac3b696ab Fixed typos in WikiHelp.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6599 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-20 07:34:43 +00:00
mikeworks
03fb4a1980 Fixed typos CrawlProfileEditor_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6598 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-20 07:33:10 +00:00
orbiter
56e0d9bd01 - testings with image parser
- added image size as part of parsed text in images
- avoid unnecessary error messages if parsing of documents failed but one succeeded


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6597 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-19 14:59:58 +00:00
orbiter
e80e060ca6 - increased thread priority for server threads
- decreased thread priority for crawler threads

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6596 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-19 11:29:22 +00:00
orbiter
78daea95c3 show more navigation pages if there are too less locally
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6595 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-19 00:21:55 +00:00
mikeworks
25c545c4b0 first commit fixing two typos
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-18 07:25:41 +00:00
orbiter
234f733a3d - relocation of seed db is better for network switch than re-initialization because of the embedding of the peers object in other objects
- small refactoring of blacklist interface code to remove PMD warnings


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6593 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-18 00:07:20 +00:00
orbiter
473b11033d fixed network switch process - crawling did not work after a switch before this fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6592 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-17 23:33:15 +00:00
orbiter
fd7b348973 some fixes for the network switch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6591 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-17 22:07:08 +00:00
orbiter
7d400b17d0 html parser support for .cfm files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6590 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-17 16:29:49 +00:00
orbiter
f6731c6240 more logging etc.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6589 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-17 00:41:50 +00:00
orbiter
007f8297de added php3 as extension type for html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6588 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-16 15:53:18 +00:00
orbiter
4f1f4863c4 fix for deadlock when initializing a SplitTable with a file of size 0, see also:
http://forum.yacy-websuche.de/viewtopic.php?p=18594#p18594

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-14 23:03:48 +00:00
orbiter
763e0d447c removed unnecessary cast
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6586 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-14 14:23:26 +00:00
orbiter
cc5dcf69ff missing change for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6585 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-14 14:20:18 +00:00
orbiter
ca1ef9a079 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18584#p18584
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6584 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-14 13:38:14 +00:00
orbiter
d9169cc6c3 increased proxy load time-out from 30000 to 60000 milliseconds
according to http://forum.yacy-websuche.de/viewtopic.php?p=17782#p17782

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6583 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-14 10:42:05 +00:00
orbiter
34354cf9b2 added a servlet that has been removed in SVN 4881; this servlet is now splitted and will be used for a simple crawl start and a remote crawl monitor (not yet integrated into the interface)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6582 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 23:19:45 +00:00
orbiter
938e806182 tried to fix date problem that may have prevented that foreign peers stay in the network
- removed unused code
- removed possibly wrong utc difference correction

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6581 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 20:01:46 +00:00
orbiter
bd05e57d3b fix for http://forum.yacy-websuche.de/viewtopic.php?p=18563#p18563
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6580 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 18:41:33 +00:00
orbiter
5df628a2a4 - added BEncoder class
- added BEncodedHeap class that encodes B data structures and stores that to a heap
- refactoring of MapView, this is now named MapHeap to fit into the naming scheme of the BEncodedHeap

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6579 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 16:21:37 +00:00
orbiter
81035e7080 moved a sub-menu entry
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6578 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 16:18:23 +00:00
suessthomas
5f1565cc95 Fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2540&sid=507e4dd9e234af001ba60c0d8adf45be, and small Layout Changes.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6577 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 07:57:36 +00:00
orbiter
82f57f79e5 more PMD enhancements
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6576 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 00:23:07 +00:00
orbiter
5d930c96f0 more fixes to search result page navigation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6575 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-13 00:04:37 +00:00
suessthomas
1375d0214d minor changes on the de language File - Thanks Copro
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6574 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 21:28:16 +00:00
orbiter
8c520f128d reverted a change in ranking process committed this afternoon
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6573 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 20:56:37 +00:00
orbiter
a06f7ddb33 more PMD recommendations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 20:53:19 +00:00
orbiter
eb79ceb3ff update to kelondro data structures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 15:37:34 +00:00
orbiter
9bbd546e64 in live search, show at least 20 entries instead of only 10
this is a work-around for the problem that the search widget
does not load a second page if the first page did not fill up
the window with enough lines such that a scrollbar ist visible.
Because the scrollbar triggers loading of following pages, this
must be enforced with the trick that more result lines must be
shown after the first search.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6570 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 15:23:01 +00:00
orbiter
18172451a0 better search computation:
- increased sort limit, now 3000 entries, before: 1000
  this should cause that more results can be shown in case
  of strong limitating constraints, like domain navigation
- enhanced the sort process
- check against domain navigator bugs
- fix in sort stack
- showing now all naviagtion pages at first search (not only next page)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6569 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 15:01:44 +00:00
orbiter
d126d6c1b5 renamed the servlet WatchCrawler_p to Crawler_p
this was done because that servlet may be used for wget/cronjob
triggered crawl starts and it appears to be confusing that the
name of the crawl start servlet looks like a pure monitoring tool.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6568 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 10:05:28 +00:00
orbiter
66c0a8e849 more PMD recommendations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6567 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-11 22:18:38 +00:00
orbiter
909a4f91c7 added a logging output for crawl starts that shows the URL that can be used to start the crawl again
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6566 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-11 18:10:39 +00:00
orbiter
bc96d74813 - clean-up of robots.txt parser
- added 'yacybot' as key to recognize robots.txt entries for YaCy
- removed unused method to get robots.txt from database

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6565 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-11 16:36:30 +00:00
orbiter
2113fcd7e5 - fixed usage of isEmpty() which is not available in java 1.5
- increased visibility of some methods

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6564 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-11 12:33:40 +00:00
orbiter
dd459281c8 applied code changes that are recommended by PMD
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6563 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-10 23:09:48 +00:00
lotus
eac2daf2e8 * reenable DHT if yet enough memory is available
* reset treshold on reconfiguratoin
(thanks to sixcooler)

* display status message in web interface

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6562 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-10 19:04:43 +00:00
lotus
0752634b8b log YaCy version on startup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6561 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-10 16:10:11 +00:00
orbiter
d77a8f3b3e added some modifications recommended by PMD for better performance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6560 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-10 01:40:26 +00:00
orbiter
d1973bae2a code cleanup: removed unused code and unused methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6559 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-10 00:42:49 +00:00