Commit Graph

1265 Commits

Author SHA1 Message Date
orbiter
8e7215475b - extended ViewFile to use is as debugging-tool: you can now use the
post-parameter url to submit an url directly
- fixed some bugs in text parser (not all parts had been analysed)
- fixed a bug in remote search interface (could not handle constraints)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3001 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-23 15:47:19 +00:00
orbiter
30888e7a2f implementation of search constraints
Such constraints may formulate specific restrictions to web searches
This is implemented by scraping information for constraints from a web
page during parsing, and storing flags to the pages within the web index.

In this first step, only information for index pages ("index of", directory listings)
are scraped and stored in flags
- added new flag class kelondroBitfield
- added scraper method in condenser
- added bitfield structure for all scrape types (see also condenser)
- added bitfield structure for appearance locations (see RWIEntry)
- added handover protocol for remote search and index distribution
- extended kelondroColumn class to hold bitfield types
- added another search attribute on search page (index.html)
- extended search-filter to enable filtering of non-matching constraints
- set all new database types to be default
- refactoring: moved word hash generation to condenser class

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2999 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-23 02:16:30 +00:00
orbiter
49a83f99d9 - fix for wrong DHT ordering in DHT selection
- fix for http://www.yacy-forum.de/viewtopic.php?t=3112&highlight=

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2995 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-21 00:36:41 +00:00
low012
75915502ec *) Cursor will jump to textfield on http://localhost:8080/yacysearch.html when page is loaded if JavaScript is enabled. (No changed behavior if JavaScript is diabled.)
*) If text is entered in textbox on http://localhost:8000/yacysearch.html and user clicks on "Web Search" in top menu, text will appear in textfield on http://localhost:8080/index.html if JavaScript is enabled. (No changed behavior if JavaScript is diabled.)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2994 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-21 00:00:18 +00:00
orbiter
e55ef0df28 - automatic migration of old RWI entries to new format during remote search
if new collections are activated
- one more assert in RowSet, control of removeMarker

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2993 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-20 22:55:27 +00:00
auron_x
6cca0ee98d *) used regexp to recognize invalid peer-names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2992 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-20 20:03:05 +00:00
orbiter
d66dbd0d65 bugfix for received number in transferRWI
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2989 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-20 15:38:11 +00:00
orbiter
f4b547dc13 limited index transfer to peer with version 0.486
this protects peers with version below 0.486 from new RWI objects
(which they cannot handle)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2988 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-20 02:46:53 +00:00
orbiter
e3d75f42bd final version of collection entry type definition
- the test phase of the new collection data structure is finished
- test data that had been generated is void. There will be no migration
- the new collection files are located in DATA/INDEX/PUBLIC/TEXT/RICOLLECTION
- the index dump is void. There will be no migration
- the new index dump is in DATA/INDEX/PUBLIC/TEXT/RICACHE

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2983 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-19 20:05:25 +00:00
theli
5c0669429e *) soap: adding function to query the peer list
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2968 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-17 19:26:55 +00:00
orbiter
c9364246cc introduced new RWI-Object.
This will be used for the final version of the collections.
The new object is not yet used.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2966 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-17 14:17:20 +00:00
rramthun
fd1dcf0d9b - patch for dropdown-box on IndexCreate_p.html by Philipp Redeker
- YaCy updater v0.2: Runs not interactively anymore, but automatically. Some documentation on usage will be added later.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2964 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-16 20:03:07 +00:00
auron_x
400c86f2ad *) fixed possible div by zero
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2960 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-15 16:58:04 +00:00
orbiter
d34f10c63d some tests with reverse dns lookup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2954 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-12 00:28:10 +00:00
orbiter
497428c8ec refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2949 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-10 01:13:33 +00:00
(no author)
a51417d86b Bugfix: language of ConfigLanguage_p.html was not changed properly when a different language was choosen here
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2948 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-09 22:18:16 +00:00
orbiter
76fceb9997 refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2945 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-09 16:32:34 +00:00
daburna
794105f04a -replaced tags and updated translation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2940 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-08 22:08:55 +00:00
orbiter
bb7d4b5d5e refactoring to prepare new RWI entry object
- moved all url and index(RWI) entries to index package
- better naming to distinguish RWI entries and URL entries


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2937 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-08 16:17:47 +00:00
auron_x
de6295d0ae *) added legend to "Connects" on Status.html via span-titles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2929 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-06 21:29:55 +00:00
daburna
2788e7269d -translated WikiHelp
-small other changes
-please use <br /> instead of <br>!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2926 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-06 11:33:23 +00:00
orbiter
114a76a86e - added flag to urlhash that shows that domain is a local domain
- enhanced local domain detection
- bugfixing for memory assignment in kelondroFlexSplit
- automatic memory assignment to caches according to available RAM
- bugfixes for details during search process

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2924 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-06 02:05:39 +00:00
theli
bd3710a974 *) new xml template to view peer profile as xml
*) bugfix for wrong profile display (some fields where displayed twice)
*) new soap functions to get and set peer profile

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2919 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-05 16:26:25 +00:00
theli
d1afe1ce6b *) adding xml template to get the message list as xml
*) Bugfix in client stub jar generation (too many files where added)
*) new soap service to manage peer messages

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2918 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-05 15:18:33 +00:00
orbiter
8fdefd5c68 generalization of payload definition of index storage
this is one step forward to the migration to a new collection data format

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2912 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-05 02:10:40 +00:00
allo
a75f895884 memory and traffic informations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2904 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-03 19:44:31 +00:00
daburna
78d65e128e -small changes to German translation
-changed <i> tags to <em>

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2897 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-02 20:45:19 +00:00
auron_x
f80a1304c2 *) check if xmx is greater or equal xms did not work in every case
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2892 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-01 18:42:48 +00:00
auron_x
62ad1476ac *) fix for missing picture-file, may be improved to support skins
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2880 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-29 17:21:04 +00:00
daburna
96604218eb -removed unused strings as suggested by Rostislav Svoboda (not all, but many)
-updated language file
-replaced <b> tags with <strong> tags

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2879 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-29 14:28:17 +00:00
orbiter
d3431433b0 more anonymization in logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2876 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-28 22:28:33 +00:00
(no author)
5141fa5942 combinedVersionString2PrettyString(..) renamd to combined2prettyVersion(..), new parameter "computerName" added to indentify the source of problems
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2871 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-28 11:33:05 +00:00
(no author)
8c6e1531e6 a bugfix against empty paramter "user"
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2870 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-28 01:20:10 +00:00
orbiter
78b7f6f7fd bugfix for index remove bug,
appeared after search where snippet-loading triggered word removal

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2869 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-28 00:22:10 +00:00
orbiter
147d88cf23 re-design of database caching
this should reduce IO a lot, because write caches are now actived for all databases
- added new caching class that combines a read- and write-cache.
- removed old read and write cache classes
- removed superfluous RAM index (can be replaced by kelonodroRowSet)
- addoped all current classes that used the old caching methods
- more asserts, more bugfixes


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2865 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-26 13:50:50 +00:00
allo
36f8b2c05a missing space :-(
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2864 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-26 10:30:03 +00:00
allo
63a2616eb7 -If you click on "Administration", you can log-in.
-better Linktext on Status.html


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2863 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-26 10:11:14 +00:00
orbiter
2a9d868f6d - removed object cache from kelondroTree
- generalized object caching and added new object caching class
- added object caching wherever kelondroTree was used
- added object caching also to usage of kelondroFlex
- added object buffering (a write cache) to NURLs
- added many assert statements; fixed bugs here and there
- added missing close methods to latest added classes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2858 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-24 13:48:16 +00:00
theli
777e39cea0 *) new template to display the dir-listing in xml format.
This can e.g. be done by using the url http://localhost:8080/share/?format=xml

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2856 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-24 12:13:46 +00:00
allo
2ba56f70a8 XML-safe put.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2848 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-23 10:35:48 +00:00
orbiter
6a73d858de voting of surftipps can now be done with comments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2846 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-23 01:10:29 +00:00
allo
5a6488256d catch the "username too short" exception
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2844 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-22 21:40:16 +00:00
theli
ac13fa763a *) bugfix for blacklist remove (blacklist was not informed about remove)
*) adding new soap service class for blacklist management
*) new junit class to test soap blacklist service

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2841 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-22 08:32:55 +00:00
daburna
6251944012 -updated languagefile
-corrected little mistakes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2837 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-21 15:44:41 +00:00
allo
a831c83025 create servletProperties, with the servlet specific funktions from serverObjects
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2835 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-21 15:01:53 +00:00
allo
a17c43779f removed wrong part of template
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2830 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-20 14:50:57 +00:00
allo
27f9e0b1c6 xml interface for blacklists
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2829 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-20 14:47:18 +00:00
orbiter
06854988da - full integration of new LURL database in INDEX
- added migration method for urlHash.db into INDEX

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2819 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-19 21:14:37 +00:00
theli
52466067d8 *) Bugfix for ArrayIndexOutOfBoundsExceptions which occure because SimpleDateFormat is not thread-safe
See: http://www.yacy-forum.de/viewtopic.php?t=2995

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2810 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-19 08:33:53 +00:00
orbiter
b79e06615d - added new LURL.Entry class for next database migration
- refactoring of affected classes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2802 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-18 22:25:07 +00:00
daburna
c97984bbac -corrected link and updated language file for simpleheader.template
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2799 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-18 18:41:46 +00:00
karlchenofhell
b14a500b88 - removed debug output from PerformanceMemory_p
- added URL escaping (tested, nevertheless watch out for possibly broken URLs)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2797 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-18 14:51:37 +00:00
karlchenofhell
ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2796 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-18 12:07:17 +00:00
karlchenofhell
98a84ddb12 - reverted last change partly, can't handle the template system
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2793 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-17 22:22:36 +00:00
karlchenofhell
b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2792 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-17 21:01:35 +00:00
daburna
6d1db21d0b -updated German language file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2790 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-17 11:08:06 +00:00
orbiter
77a59a115d refactoring of indexing methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2787 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-16 15:04:16 +00:00
orbiter
688cbfb776 - bugfixing for flextable bug
- bugfixing for collection index bug
- several other bugfixes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2785 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-16 00:27:25 +00:00
allo
74f09a0510 some more xml-backend files.
ConfigAdvanced_p.java: list settings after changing.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2784 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-15 21:41:47 +00:00
allo
a29b4d4fb5 extended Supertemplates for Headerincludes.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2780 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-15 13:43:46 +00:00
theli
3bebe72544 *) Default Rex.Exp. changed back to .*.*
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2778 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-15 08:37:19 +00:00
daburna
ea9411f9d2 -surftips now working correct
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2775 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-14 14:13:12 +00:00
theli
5b114249ce *) Bugfix for ViewLog problem with multiline logging messages
See: http://www.yacy-forum.de/viewtopic.php?t=2972

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-14 13:21:07 +00:00
daburna
a1736675ca -ups
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2769 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-14 11:19:43 +00:00
daburna
2de939f544 -updated translation
-removed wrong spelling; there is only 1 p in the English tip. I think the surftipps.java have to be updated.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2768 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-14 11:17:14 +00:00
auron_x
c628df43a4 *) removed unused image-file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2762 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-13 09:23:55 +00:00
orbiter
50f2578c55 - some bugfixing and code cleanup
- now assortments can completely left out if they do not exist
  before startup and collection index is selected.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2757 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-13 01:19:26 +00:00
orbiter
a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
- added test migration method to migrate the old LURL to a new LURL
the new LURL will be splitted into different tables for each month
this solves several problems:
- the biggest table in YaCy is splitted in different parts and can
  also be managed in filesystems that are limited to 2GB
- the oldest entries can easily be identified, used for re-crawl und
  deleted
- The complete database can be limited to a specific size (as wanted many times)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2755 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-12 23:14:41 +00:00
auron_x
e126598a0f *) small enhancement to webinterface, progressbars are now not stretched images, but <div>'s with colored background
-> all skin files were set to use green progressbars (should be changed to colors fitting the skins appearence)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2751 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-12 17:10:28 +00:00
rramthun
581dd2ec72 *)Proper arrow-function on Network.html, but ordering is still broken. Perhaps someone could fix that?
*)Removed double creation of DATA directory. New warning message in case of insufficient rights.
*) Removed roland-ramthun.de-seedlist temporarily, because of server changes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2747 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-11 18:27:38 +00:00
orbiter
918b59dc5e - bugfix for snippet profile (no delete button)
- bugfix for search process (avoided null pointer exception in case other peer does not respond)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2742 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-10 20:16:20 +00:00
orbiter
2bb529cedb added peer tags for peers in robinson mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2741 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-10 20:09:26 +00:00
low012
f7447894f1 *) fixed link to WatchCrawler_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2740 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-10 12:39:29 +00:00
orbiter
afbb547f3d extended options for abstracts generation in remote search interface
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-10 12:22:16 +00:00
allo
3730ec3440 moving to a _p page.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2738 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-10 10:31:21 +00:00
orbiter
c8f3a7d363 added snippet-url re-indexing
- snippets will generate an entry in responseHeader.db
- there is now another default profile for snippet loading
- pages from snippet-loading will be indexed, indexing depth = 0
- better organization of default profiles

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2733 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-09 23:07:10 +00:00
orbiter
2e4aa6a170 refactoring of Advanced Config:
- removed settings that are in Basic Settings
- joined pages that belong together
- moved include pages from yacy/ to /

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2726 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-09 10:24:54 +00:00
orbiter
0f10bdde22 more generic cache methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2721 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-09 02:13:13 +00:00
hermens
440c6ee657 Implement alternative htcache layout
mostly according to: http://www.yacy-forum.de/viewtopic.php?p=26205#26205



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2718 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-08 17:25:19 +00:00
allo
226f2c5b2c first version, of the Serverlet Debugger
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2717 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-08 14:25:54 +00:00
allo
e25172853a fixed license notice
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2714 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-07 22:25:05 +00:00
allo
1d0c0edda3 first version of posts/get from the del.icio.us api
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2713 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-07 22:16:09 +00:00
orbiter
1969522dc1 removed lowercase of snippets (and other things):
- added new sentence parser to condenser
- sentence parsing can now handle charsets

to do: charsets must be handed over to new sentence parser

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2712 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-07 00:06:09 +00:00
low012
07155ef3b0 *) added a few constraints to prevent exceptions when clicking on stop or pause on IndexCleaner_p.html when no thread is started
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2710 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-05 21:32:07 +00:00
orbiter
db294687ea enhanced logging
- more logging output
- fix in log line preparation
- added filter to log page
- some small bugfixes

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2707 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-03 22:55:59 +00:00
theli
f17ce28b6d *) plasmaHTCache:
- method loadResourceContent defined as deprecated. 
     Please do not use this function to avoid OutOfMemory Exceptions 
     when loading large files
   - new function getResourceContentStream to get an inputstream of a cache file
   - new function getResourceContentLength to get the size of a cached file
*) httpc.java:
   - Bugfix: resource content was loaded into memory even if this was not requested
*) Crawler:
   - new option to hold loaded resource content in memory
   - adding option to use the worker class without the worker pool 
     (needed by the snippet fetcher)
*) plasmaSnippetCache
   - snippet loader does not use a crawl-worker from pool but uses
     a newly created instance to avoid blocking by normal crawling
     activity.
   - now operates on streams instead of byte arrays to avoid OutOfMemory 
     Exceptions when operating on large files 
   - snippet loader now forces the crawl-worker to keep the loaded
     resource in memory to avoid IO 
*) plasmaCondenser: adding new function getWords that can directly operate on input streams
*) Parsers
   - keep resource in memory whenever possible (to avoid IO)
   - when parsing from stream the content length must be passed to the parser function now.
     this length value is needed by the parsers to decide if the parsed resource content is to large
     to hold it in memory and must be stored to file 
   - AbstractParser.java: new function to pass the contentLength of a resource to the parsers
   


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2701 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-03 11:05:48 +00:00
orbiter
bcf2b800b4 applied UTF-8 encoding parameter to yacy-internal protocol communication
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2694 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 13:35:38 +00:00
orbiter
5a40ea7866 refactoring of wget string list generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2692 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 09:59:20 +00:00
orbiter
dbc2e039bb added time-out option parameter to call hierarchy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2691 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 09:40:18 +00:00
orbiter
b59d4576af increased version number to emphasise that the snippet fix
_dramatically_ increased search speed

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2690 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 01:50:57 +00:00
orbiter
d4c239e4be - fixed problem in collection index with deletion of single url references
- added automatic deletion of not-found snippets after search

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2689 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 01:40:52 +00:00
orbiter
00746ca232 identified and fixed search performance problem caused by
snippet loading. Some access to header-db had been twice and even
more times in some cases. Snippet resource loading fixed.
Furthermore the snippet loading during remote search within the
remote peer has been disabled, but can be switched on remotely by
new flag 'includesnippet=true'

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2688 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 01:15:02 +00:00
orbiter
4d9e1b43dd surftipps appearance update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2687 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-10-02 00:13:59 +00:00
orbiter
310f1c41cd added option to see ranking scores in surftipps
and some cleanups

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2684 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 23:28:03 +00:00
orbiter
7c0e6de366 bugfix for surftipps votes (wrong page)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2683 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 23:06:38 +00:00
orbiter
3ad0709b53 added a delete button to crawl profile list.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2682 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 22:35:59 +00:00
theli
a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2680 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 10:09:01 +00:00
theli
cd5f349666 *) Better handling of large files during parsing
Extracted text of files that are larger than 5MB is stored in a temp file instead of keeping it in memory
*) plasmaParserDocument.java; getText now returnes an inputStream instead of a byte array
*) plasmaParserDocument.java: new function getTextBytes returns the parsed content as byte array
   Attention: the caller of this function has to ensure that enough memory is available to do this 
   to avoid OutOfMemory Exceptions
*) httpd.java: better error handling if the soaphander is not installed
*) pdfParser.java: 
   - better handling of documents with exotic charsets
   - better handling of large documents
   - better error logging of encrypted documents
*) rtfParser.java: Bugfix for UTF-8 support
*) tarParser.java: better handling of large documents
*) zipParser.java: better handling of large documents
*) plasmaCrawlEURL.java: new errorcode for encrypted documents
*) plasmaParserDocument.java: the extracted text can now be passed
   to this object as byte array or temp file   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2679 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 09:31:53 +00:00
theli
8b2ceddb91 *) Displaying servere and warning logging messages in different colors on ViewLog_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2678 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-30 08:12:22 +00:00
orbiter
df1629b05a - code cleanup
- version 0.471
- moved surftipps to own web page


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2676 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-29 22:27:20 +00:00
theli
ed8227d222 *) Bugfix for NullpoinerException in IndexCreateIndexingQueue_p.java
See: http://www.yacy-forum.de/viewtopic.php?p=25874

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2667 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-27 04:35:02 +00:00
theli
c0f7a4124c *) Bugfix for soap templates
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2666 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-27 04:24:32 +00:00
theli
3433dfb5e2 *) Bugfix for soap search template: correction for resultCount tags, cdata for snippet tag
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2664 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-26 16:18:04 +00:00
theli
d42dcead1d *) Bugfix renaming snippet tag in soap search template
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2663 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-26 16:11:38 +00:00
theli
97615af406 *) Restructuring of YaCy SOAP services
- general functions moved to abstract service class
   - service class splitted into SearchService, CrawlService, StatusService
*) Bugfix for SOAP search services
   - Attention: some xml tages where renamed
   See: http://www.yacy-forum.de/viewtopic.php?p=25877
*) New SOAP service function urlInfo to view the parsed content of an URL
   See: http://www.yacy-forum.de/viewtopic.php?p=25869

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2660 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-26 14:47:44 +00:00
rramthun
b0cab1e731 *)Adapted surftipps to use common 0/1 parameters
*)Added translation of WatchCrawler.html
*)Changed format of German translation. Formal description will probably follow.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2657 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-24 20:15:28 +00:00
allo
4922ab8920 try to fix a nullpointer on snippet generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2653 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-22 22:51:44 +00:00
hermens
d8fde14c3a Make maximum number of words in DHT-In cache configurable at runtime
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2652 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-22 12:44:58 +00:00
hydrox
740696f6c3 *) few fixes XHTML-validation ( there is still much to do)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2648 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-22 08:04:24 +00:00
theli
57415b6889 *) Bugfix for surftipps UTF-8 problem
See: http://www.yacy-forum.de/viewtopic.php?t=2864

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2647 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-22 05:40:29 +00:00
theli
706572f18d *) Bugfix for ArithmeticException caused by setting max crawling thread count was to 0
See: http://www.yacy-forum.de/viewtopic.php?t=2862

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2646 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-22 04:29:31 +00:00
orbiter
2d3b96eeba bugfixes for surftipps
- added missing authorization check for votes
- second vote on same entry was possible after complete publishing of current vote

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2645 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-21 21:37:40 +00:00
hydrox
85f3617835 *) moved HTML from class-file to template-file (please check if it is valid HTML)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2644 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-21 20:36:46 +00:00
hydrox
9434dba8f2 *) corrected title of IndexCleaner_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2643 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-21 07:26:41 +00:00
theli
b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
*) better logging of parser failures
*) simplified usage of plasmaparser through switchboard
*) restructuring of crawler
   - crawler now returns an error message if it is used in sync mode (e.g. by snippet fetcher)
*) snippet-fetcher: more verbose error messages
*) serverByteBuffer.java: adding new function append(String,encoding)
*) serverFileUtils.java: adding functions to copy only a given number of bytes between streams


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2641 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-20 12:25:07 +00:00
orbiter
aa38721cf6 new features for surftipps
- new generation with less memory
- removal of doubles
- positive votes can generate entries without original news (so they can live on)
- link deletion on search results are now also negative votes for surftipps (but they may rarely hit any news)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2640 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-20 12:01:51 +00:00
orbiter
e03427871e enhanced surftipps:
- added switchh to show or hide surftipps
- more news contribute to surftipps
- added voting system for surftipps

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2638 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-20 07:17:41 +00:00
theli
e745b63c77 *) Bugfix for different behavior of indexDistributeWhileCrawling to other checkboxes on IndexControl_p.html
See: http://www.yacy-forum.de/viewtopic.php?t=2849

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2637 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-20 04:44:56 +00:00
borg-0300
42173462f5 rename cutUrlText to shortenURLString;
other little things;

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2635 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-19 20:47:45 +00:00
borg-0300
16ba5d1b46 topwords: only [a-z] words, quality is better;
blank removes; 
properties added;


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2632 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-19 10:44:45 +00:00
orbiter
ec031eb993 first version of surftipps
see http://localhost:8080/index.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2627 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-18 20:14:21 +00:00
orbiter
f453c14b5d removed unreacheable catch blocks and unused imports
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2619 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-18 11:23:58 +00:00
theli
97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
- serverFileUtils.java: 
   -- adding methods to copy from stream to writer and readers to writers
   -- moving httpc writeX methods into serverFileUtils class
   - serverCharBuffer.java: removing inheritance from Writer class
   - replacing htmlFilterOutputStream by htmlFilterWriter class which handles
     content as char stream
   - htmlFilterContentTransformer.java: deactivating getText mode 
    (still needs to be migrated to use char streams instead of byte streams)
   - changes in several classes to use htmlFilterWriter instead of htmlFilterOutputStream
   - changes in Scraper and Transformer classes to operate on chars instead of bytes
   - httpdProxyHandler.java: bugfix. clientTimeout setting was missing in config file

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2617 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-18 10:12:11 +00:00
orbiter
3aac5b26da - added automatic tag generation when a web page from the search results is added
- added new image 'B' in front of search results for bookmark generation
- added news generation when a public bookmark is added
- the '+' in front of search results has new meaning: positive rating for that result
- added news generation when a '+' is hit

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2613 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-18 00:37:02 +00:00
michitux
2d9496577f Removed double labels for forms in Blacklist_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2610 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-17 08:07:21 +00:00
michitux
aa46269eff Less margin/padding for dls (e.g. in Messages)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2609 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-17 07:42:07 +00:00
michitux
567c40f5f0 Bookmark/delete-links now visible when mouse is over the searchresult, in standard-compliant browsers with css, in Microsoft Internet Explorer via JavaScript
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2608 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-16 16:56:22 +00:00
rramthun
20e1754379 Various fixes for the languages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2600 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 16:02:37 +00:00
orbiter
d54144a4e3 fixed bad snippet behavior (hopefully)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2596 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 14:17:18 +00:00
orbiter
5015e780c2 - simplified watchCrawler code
- changed display of watchCrawler slightly

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 13:54:10 +00:00
michitux
31d6cdea53 WatchCrawler.html now valid xhtml, added the class TableCellActive to default skin, please update your skins (sorry, I removed it before because I hadn't seen it in any html-file)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2590 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 11:50:25 +00:00
allo
9bed90f8dc bugfix in js
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 06:33:22 +00:00
orbiter
c42b011648 added watch crawler to menu
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2584 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 01:09:34 +00:00
michitux
92157febcd Bugfix for Blacklist_p.html: Adding of new patterns possible again
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2582 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-14 15:20:32 +00:00
low012
f4af607b79 *) just some typos
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2576 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-14 01:11:49 +00:00
orbiter
e03740c306 small fix for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2575 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-14 00:57:41 +00:00
orbiter
c89d8142bb replaced old 'kCache' by a full-controlled cache
there are now two full-controlled caches for incoming indexes:
- dhtIn
- dhtOut
during indexing, all indexes that shall not be transported to remote peers
because they belong to the own peer are stored to dhtIn. It is furthermore
ensured that received indexes are not again transmitted to other peers
directly. They may, however be transmitted later if the network grows.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2574 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-14 00:51:02 +00:00
orbiter
6e2907135a bugfixes for remote search server part
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2573 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 22:19:34 +00:00
rramthun
2c6f2a1f74 First language fixes for new XHTML-layout
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 20:03:15 +00:00
orbiter
cf9884e22b first attempt to implement a secondary search
this is a set of search processes that shall enrich search results
with specialized requests to realize a combination of search results
from different peers.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 17:13:28 +00:00
allo
13d0cff257 right dhtml.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2568 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 14:02:34 +00:00
orbiter
dfc0ac1958 syntax error fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2565 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 12:02:02 +00:00
allo
6e03f61daa fix for highlighting searched words in snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2564 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 11:26:26 +00:00
michitux
3bbe6a77da Smaller font-size for tables in PerformanceQueues_p.html and PerformanceMemory_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2560 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 21:41:16 +00:00
michitux
d6204fd956 Forms in Blacklist_p.html splitted as suggested by KoH to avoid wrong submits when pressing [enter]
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2559 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 21:34:02 +00:00
michitux
b573f5b4c2 New layout in Blacklist_p.html, more padding and margin for fieldsets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2558 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 20:31:42 +00:00
michitux
48d8da44d1 Design changes: less margin in fieldsets, search form in yacysearch.html not centered, smaller rows in IndexMonitor.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2557 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 16:15:02 +00:00
michitux
140c3e1db9 Some bugfixes: updated ids for labels in DetailedSearch.html, fixed a template-bug in Network.html and added a workaround for a bug in the template engine in IndexTransfer_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2556 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 14:16:36 +00:00
orbiter
75b198bc02 - updated references to indexContainer
- more bugfixes and debugging for indexAbstract processing

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2555 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 11:13:27 +00:00
orbiter
4f9e42d5ed more changes towards better join-search
- fixed problems with index-abstract generation
- added analysis output for index abstract receive

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2551 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-12 00:42:42 +00:00
orbiter
8219ce6c67 bugfix inn DetailedSearch form names
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2550 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 23:07:28 +00:00
orbiter
462c64a935 removed superfluous file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2549 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 22:13:41 +00:00
orbiter
1137605edf - small change to DetailedSearch layout
- version 0.463 for new xhtml interface

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2548 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 22:11:05 +00:00
michitux
31393312d0 New XHTML-template for a large part of the frontend, for details see http://yacy-websuche.de/wiki/index.php/Dev:XHTML
If you don't use the default skin, the style will be broken or at least not complete.
YaCy now has two css-files: base.css in htroot/env and the skin. In base.css the layout and black/white text-formating-rules are defined. Colors are only defined in the skin.
The skin is now very easy to read and to change. If you want to make more changes than the colors you see in the default-skin, feel free to use the full power of css, but you are warned: The code is still not ready and may change, but we try to avoid changes which affect anything in the default-style.
Translation will be broken too because the language-files contain HTML-Code which has changed.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2547 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 18:18:12 +00:00
orbiter
82a6054275 - fixed bug with new indexAbstract generation
- added partly evaluation of indexAbstracts during remote searches

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2544 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 10:39:25 +00:00
theli
fded1f4a5d *) better handling of maximum file size limit in crawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2543 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 08:26:39 +00:00
theli
7b1881754d *) Corrections for FF search plugin
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2540 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-11 06:30:34 +00:00
orbiter
74d1dea30b changes towards better join-search
- added generation of a compressed index within remote peers during global search
- added selection of specific urls within remote peers during secondary global search


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2539 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-10 22:36:47 +00:00
auron_x
4a494464af *) splitted form to avoid unwanted remove of selected entries as suggested by KoH
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2538 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-10 21:03:34 +00:00
theli
63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
*) adding first version of maximum filesize check for the crawler

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2534 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-09 15:06:49 +00:00
theli
ecbce206b7 *) Bugfix for UNRESOLVED_PATTERN in threadpool configuration
See: http://www.yacy-forum.de/viewtopic.php?p=25377#25377

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2533 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-09 13:07:40 +00:00
theli
92e986bb91 *) adding missing return prop (requested by allo)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2532 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-09 12:34:24 +00:00
allo
f0529fe53e update for ftp urls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2531 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-09 12:28:28 +00:00
orbiter
c543028dd4 fixed double/missing null check for LURLs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2520 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-08 11:54:28 +00:00
orbiter
96c6e4e322 - enhancements to detailed search page
- enhancements to search ranking computation process
- removed bugs in postranking

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2516 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-08 01:26:06 +00:00
orbiter
e33eb528ba fixed DetailedSearch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2514 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-07 22:54:49 +00:00
orbiter
9340dbb501 fixed all possible problems with nullpointer exception for LURLs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2513 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-07 18:24:39 +00:00
hermens
ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2511 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-07 14:32:46 +00:00
theli
35b70f164c *) Status_p.inc: link to memory configuration added
*) startYACY.bat: comment added
See: http://www.yacy-forum.de/viewtopic.php?p=25311#25311

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2500 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-07 04:36:58 +00:00
orbiter
4866868c0e added write cache for LURLs
This was necessary to speed up the index receive process during global search


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2498 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-07 01:13:03 +00:00
orbiter
8a0e35618b enhancements to search result preparation
- added detailed count on remote search results
- enhanced search sequence during remote searches (doing local search in sequence)
- strict adherence to timout limits

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2497 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-06 17:51:28 +00:00
theli
dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542 2006-09-06 14:31:17 +00:00
theli
413e6b9855 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2489 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 15:56:49 +00:00
theli
2126c51906 *) bugfix for ViewFile.java. Wrong http header were used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2488 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 15:49:51 +00:00
theli
7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2487 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 15:36:19 +00:00
theli
3870d615e3 *) setting htCache.Entry fields to private
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2485 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 15:06:58 +00:00
theli
5847492537 *) next step of restructuring for new crawlers
- IndexCreate_p.java: correcting problems with ftp urls
   - URL.java does not cutout the userinfo anymore 
    (needed to transport authentication info in ftp urls, e.g. ftp://username:pwd@ftp.irgendwas.de)
   - plasmaCrawlLoader.java: 
   -- hack to re enable https urls
   -- adding function getSupportedProtocols

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2482 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 13:17:11 +00:00
theli
4e2a950ac9 *) next step of restructuring for new crawlers
- avoid using the http crawler class directly. Using the interface class instead

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2476 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 09:24:24 +00:00
theli
f94131c13d *) Bugfix for Blacklist_p.java
- avoid nullpointerexception

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2475 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 09:02:23 +00:00
theli
eb9b138986 *) next step of restructuring for new crawlers
- conversion of the crawler pool into a keyed object pool
   - crawlers are now loaded based on the url protocol (of course works only for http now)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2473 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 06:52:55 +00:00
theli
1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2472 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 06:09:20 +00:00
theli
857a2d76a2 *) better handling of server shutdown
See: e.g. http://www.yacy-forum.de/viewtopic.php?p=25234

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2471 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-04 05:47:17 +00:00
orbiter
7df572756a fist step+attempt so solve the snippet marking problem.
See: http://www.yacy-forum.de/viewtopic.php?p=22855#22855

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2469 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-03 23:22:23 +00:00
theli
f3ac4dbbb9 *) better handling of server shutdown
See: e.g. http://www.yacy-forum.de/viewtopic.php?t=2584

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2468 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-03 14:59:00 +00:00
auron_x
57dda1a92c *)again fixing for wrong version display, now totally working with double instead of float
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2464 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-28 17:54:07 +00:00
orbiter
18b6876860 new cache flush configuration settings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2460 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-25 22:31:21 +00:00
orbiter
27a159b401 * documentation update
* removed doc from release
* release information in doc/News.html
* release 0.46

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2442 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-23 11:36:09 +00:00
orbiter
d78b824e85 fixed problem with default path after first start-up
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2440 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-22 13:35:51 +00:00
low012
c2264962d1 *) Fixed a bug.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2438 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-21 21:34:45 +00:00
low012
b387bd47db *) Added a Wiki Code help page and links to it at every textbox Wiki Code can be used in. Maybe this will encourage people to play around with the Wiki and the Blog more.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2437 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-21 21:23:29 +00:00
rramthun
38c4248814 Some language updates
Removes the ; behind Slovenky in language list

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2430 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-20 11:52:02 +00:00
orbiter
23dd972608 fixed memory calculation in performanceMemory web page
fixed also maximum cache size computation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2429 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-20 01:20:34 +00:00
orbiter
1ce3c22761 better memory control:
- added memory monitor for preNURL-db in performanceMemory
- changed default memory assignments

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2427 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-19 13:09:04 +00:00
orbiter
39b4c26bdc more memory control:
- catchup of OutOfMemoryError in server threads
- automatic adoption of word cache size after a Short Mem Cycle

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2426 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-19 00:06:39 +00:00
orbiter
3e9d509c39 some small fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2425 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-18 22:50:05 +00:00
low012
479861a3cf *) Value entered in searchfield will not get lost anymore when link "more options..." is clicked. If Javascript is disabled or function does not work for any other reason, there should be no change to current behaviour.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2424 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-18 21:53:15 +00:00
orbiter
16993e84dd replaced non-7bit umlaut ascii
@Franz: sorry, ich hab hier ein ant das daran rumzickt. Bitte in deinem
Namen ein 'ss' statt der sz-Ligatur benutzen

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2422 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-18 12:45:19 +00:00
orbiter
0187c60010 because of a bug in the JRE 1.4.2 there was no memory protection
see http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4686462
this commit fixes the bug by using a memory-computation patch.
All uses of Runtime.maxMemory had been replaced by serverMemory.max
The bug is not present any more in Java 1.5

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2419 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-18 01:33:54 +00:00
auron_x
4eca0f8830 *) fixed PPM calculation for multiple indexer-threads
*) fixed totalPPM calculation and added total PPM to Network.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2418 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-17 19:15:30 +00:00
orbiter
6ad471ef96 * applied many compiler warning recommendations
* cleaned up code
* added unit test code
* migrated ranking RCI computation to kelondroFlex and kelondroCollectionIndex


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2414 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-08-16 19:49:31 +00:00