Commit Graph

4443 Commits

Author SHA1 Message Date
orbiter
a3df23659c re-implementation of charset checking
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4750 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-30 13:23:05 +00:00
orbiter
75a1702133 - fix for ConcurrentModificationException during shutdown
- fix for Ranking distribution problem (suma-lab peer does not exist any more)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4749 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-30 11:19:52 +00:00
orbiter
27ab0a5f89 fixed XSS problem in ConfigProperties
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4748 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-29 22:47:00 +00:00
orbiter
32b5b057b9 - modified, simplified old kelondroHTCache object; I believe it should be replaced by something completely new
- removed tree data type in kelondroHTCache
- added new class kelondroHeap; may be the core for a storage object that will once replace the many-files strategy of kelondroHTCache
- removed compatibility mode in indexRAMRI


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4747 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-29 22:31:05 +00:00
orbiter
d3715e02ae removed double/redundant servlet Config_p
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4746 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-29 19:19:14 +00:00
danielr
ec84a52adb change for problem with NPE (seen as "PROXY Unknown Error while processing request")
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4745 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-29 16:06:54 +00:00
orbiter
5813cc149f fix for bad rssTerminal behavior
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4744 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-28 20:34:37 +00:00
orbiter
88216c1f1f fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1103&hilit=&p=7362#p7362
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4743 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 22:59:20 +00:00
orbiter
d0b893523e - protection against RAM overflow caused by new peer rss news
- more XSS protection

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4742 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 22:53:04 +00:00
orbiter
685794e7e7 fix for parser/encoding Exception
see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1111&hilit=&sid=55a320b54e1e3bda9410e7c50b5147f1&p=7431#p7431

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4741 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 22:14:45 +00:00
orbiter
cf042e6957 reverted change by mistake in yacyVersion
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4740 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 01:08:59 +00:00
orbiter
9935e83c86 added new news window into the status page. At this moment it is just a test.
The news inside the window are about peer arrivals and departures, remote search accesses and crawls

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 01:00:10 +00:00
orbiter
bac38cfa18 added very rudimentary peer news as rss feed. An example can be retrieved with
http://localhost:8080/xml/feed.rss?channel=PEERNEWS
to be extended and integrated in interface ...

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4738 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 23:30:13 +00:00
orbiter
6495227ad6 the class rssReader is replaced by RSSReader, RSSFeed and RSSMessage
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4737 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 21:45:43 +00:00
orbiter
724bbdf9b2 refactoring of RSS reader
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4736 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 21:31:07 +00:00
orbiter
b9a2a2d287 more search performance hacks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4735 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 15:09:06 +00:00
orbiter
ff755fb858 small corrections and enhancements after search timing profiling
search should be a little bit faster now

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4734 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 13:31:55 +00:00
orbiter
0702dd2507 added a profiling script to analyse search process timing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4733 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 13:28:18 +00:00
orbiter
d0e2830e01 enhanced the thread dump to make it usable for scripted remote-debugging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4732 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 13:25:38 +00:00
orbiter
e024e3b9cf added new default profiles to distinguish snippet fetch for local and global search
the difference is, that a local search will no not cause a re-indexing of loaded pages

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4731 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-24 08:42:08 +00:00
orbiter
2c0c8f0f0c SRU compliance according to
http://www.loc.gov/standards/sru/specs/search-retrieve.html
The example given on this page can be used to retrieve opensearch-compatible rss pages with YaCy

Try it:
The transcription to YaCys search servlet address is
http://localhost:8080/yacysearch.rss?version=1.1&operation=searchRetrieve&query=dinosaur&maximumRecords=1&recordSchema=dc

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4730 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-23 16:16:41 +00:00
danielr
9b03310f8a bin jetzt wach :/
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4729 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-23 07:50:21 +00:00
danielr
7bd8601f04 delete old releases compatible with java 1.5 ;)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4728 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-23 07:22:20 +00:00
orbiter
e90282da1c added experimental javascript terminal for rss feeds (not used yet anywhere yet, expect the worst)
.. possibly to be used as content for iframes within monitoring pages
not ready yet!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4727 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 23:09:24 +00:00
danielr
da386a1924 fixed deleteOldDownloads if there are no downloads
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4726 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 21:36:52 +00:00
danielr
21418a22a3 removed DEBUG output
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4725 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 17:14:34 +00:00
danielr
79a3edeeef deleting downloaded releases after x days (default 30)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4724 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 16:53:53 +00:00
danielr
763f9d4f5d serverCore: setting timeout for new connection before SSLDetect
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4723 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 09:03:16 +00:00
orbiter
1995faef8d - refactoring of Colage back-end: move to plasma package
- renamed also the plasmaCrawlResults to have a consistent naming for url and image queues
- added a double-check for the images
- added additional queues for the images: all worse-quality images go there, so the queue can be used also if no sizes are given; no image is lost
- added a cleanup for the stacks so they cannot flood the memory

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4722 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-21 22:42:49 +00:00
orbiter
d7e89c2aca fixed near-deadlock situation when deleting crawl profiles
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4721 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-20 22:10:26 +00:00
orbiter
5e3ce46339 - better logging when rejecting a url because it is not in declared domain
- more XSS attack protection

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4720 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-20 21:36:25 +00:00
orbiter
6d1be66822 - longer refresh rate for reload of WatchCrawler page forwarding to indexing start (does not work in IE)
- better names for search pages
- Release 0.58

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4719 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-20 08:10:52 +00:00
apfelmaennchen
2149728227 - major rework on YaCy-UI
- search results are retrieved from rss/xml, no other servlet needed
- added double accordion sidebar menus

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4718 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 11:31:41 +00:00
orbiter
c270d02176 Reverting SVN 4716
ein 1.6er Target versehentlich zu setzen bedeutet bei automatischer Release-Erzeugung und updates bei Usern,
die nur Java 5 installiert haben (was bei allen Mac-Usern so ist), dass große Teile des Netzes crashen können und manuell
wieder hoch gezogen werden müssen.
Neu-User die mit einem dev-Release Beginnen können mit der intranet-Einstellung gar keine Websuche starten.
Bitte nach einem Commit immer kontrollieren, was man da eingecheckt hat.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4717 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 09:58:36 +00:00
danielr
48ffd61e6a changed "patched wrong" to warning, so it goes to the logfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4716 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 07:54:44 +00:00
orbiter
2f629d20a7 - tried to fix the '4217666-problem'
- removed more unused code

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4715 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 04:24:29 +00:00
orbiter
512f48e7d6 - removed unused methods
- fixed xss attack on peer list in CrawlStartSimple

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4714 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 03:33:07 +00:00
orbiter
14384e7a45 deactivated unnecessary and very CPU-intensive deletion check for blacklisted URLs in index receive
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4713 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-19 03:02:44 +00:00
f1ori
701f769c66 * removed comma, which caused invalid xml
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4711 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-18 15:07:36 +00:00
orbiter
3c76342619 - added servlet to configure the search page greeting line
- added information output about the current network definition in the network servlet
- better description and usage of profile entries in User Profile servlet regarding FOAF format
- reformatting of menues at status page

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4710 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-18 13:58:56 +00:00
f1ori
b9602e891a * added CrawlProfileEditor_p.xml for monitoring in yacybar
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4708 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-18 09:13:02 +00:00
orbiter
d03940f2ec - included patch from http://forum.yacy-websuche.de/viewtopic.php?p=7193#p7193
- fixed problem with crawl profile editor after deletion of a crawl profile

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4706 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-17 22:21:03 +00:00
danielr
d1ee231866 HTTPC close more unused connections
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4702 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-15 16:37:51 +00:00
danielr
181796cffb - HTTPC ConnectionInfo entfernen bei Exceptions, unnötigen Code entfernt
- FTPC (GET-)connections bei Fehlern auf jeden Fall schliessen


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4701 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-15 15:27:32 +00:00
orbiter
04c1226c80 added/fixed missing integrity-test else-case during deploy in case that we update with a tar file
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4700 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-15 15:20:35 +00:00
orbiter
6155f0e634 last small changes until main release
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4699 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-14 07:26:33 +00:00
orbiter
45ae3da7e7 another patch to prevent NPE in EcoTable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4698 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-14 05:33:32 +00:00
orbiter
cb93ded5c6 applied configuration path patch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4697 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-14 04:10:51 +00:00
danielr
96e39b297a reduced StackTraces (by connect timed out)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4696 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-14 03:50:49 +00:00
orbiter
93376acdca fixed a bad chunkcache limit check which could have caused ArrayIndexOutOfBoundsExceptions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4695 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-14 03:49:02 +00:00