Commit Graph

7099 Commits

Author SHA1 Message Date
f1ori
fd74bc388c * fix small bug in sessionid-removal
* add testcase for seesionid-removal

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7333 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 23:55:40 +00:00
f1ori
dca9e16f51 * don't index pages, which redirect, twice
* there fore auto-redirection of HTTPClient for crawling is disabled and the old code is reactivated

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7332 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 22:46:12 +00:00
low012
eb79b952ef *) cleaner code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7331 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 03:39:53 +00:00
low012
38fdf43587 *) renamed classes according to standard Java coding conventions
*) String.isEmpty() was introduced in Java 1.6, but we still use Java 1.5

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7330 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 01:29:32 +00:00
low012
8281d12305 *) Ooops!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7329 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 00:41:45 +00:00
low012
025e3f4790 *) renamed classes according to standard Java coding conventions
*) removed unsused code

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7328 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 00:39:21 +00:00
low012
3b9aa0504e *) removed unsused code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7327 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 00:28:32 +00:00
low012
db3db0fdb9 *) trying to make this class less confusing (probably failing)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7326 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-21 00:13:08 +00:00
apfelmaennchen
54e63b556e intermediate step for a YMark auto-tagging function based on word frequencies.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7325 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-17 15:17:29 +00:00
apfelmaennchen
403ee9c014 added a drill-down for metadata and word count to /api/ymarks/test_treeview.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7324 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-16 00:48:38 +00:00
f1ori
a025b1da89 * fix bug when browsing local filesystem (e. g. repository) with yacy
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7323 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-15 14:47:16 +00:00
mikeworks
28a290336d de.lng: Started translation of YMark feature here http://localhost:8080/Table_YMark_p.html
Table_YMark_p.html: removed some labels because columns is not filled yet and added ids for other label references and added full enabled="enabled" tags because SHORTTAG is not specified in XHTML 1.0 Strict

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7322 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-15 01:48:54 +00:00
lotus
25426c6548 change language immediately
there is a bad behaviour:
if language is changed, servlets are translated but the returning servlet is yet the old one. therefore the page is loaded again after language changed.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7321 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-13 14:27:30 +00:00
apfelmaennchen
11ae5b108e enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7320 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-13 13:02:56 +00:00
apfelmaennchen
f147a022f8 enabled YMark Import for /Table_YMark_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7319 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-13 10:32:37 +00:00
apfelmaennchen
2726606fc8 slightly enhanced interface for /Table_YMark_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7318 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-12 21:13:27 +00:00
sixcooler
b87bf88ac8 using less memory on merging and rewriting blobs
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7317 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-12 16:02:20 +00:00
apfelmaennchen
94a9be18a4 added a ymark table administration: /Table_YMark_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7316 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-10 22:53:27 +00:00
apfelmaennchen
25339f93c7 more updates to ymarks
- working xbel import/export
- exported xbel includes yacy specific metadata but still validates against PUBLIC DTD


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7315 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-09 17:01:31 +00:00
f1ori
d62e449a11 * fix FilterEngine, forgot comparision-operator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7314 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-08 09:37:44 +00:00
apfelmaennchen
cdd65aca71 update to ymarks
- get_xbel.xml is almost working
- startet ymark api documentation info.html

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7313 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-07 20:03:01 +00:00
apfelmaennchen
808edffaf6 ymarks
- some refactoring
- working xbel and html import (/api/ymarks/test_import.html)
- working treeview (/api/ymarks/test_treeview.html)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7312 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-06 20:26:13 +00:00
f1ori
2c539b514a * add domaincheck (local/global/domainlist) to urlcleaner
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7311 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-06 16:50:33 +00:00
f1ori
442bebca2b * %0 does not belong to the IPv6-Address -> entry does not work on some systems
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7310 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-06 15:09:28 +00:00
orbiter
9fc940aa35 release 0.99
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7309 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 13:20:51 +00:00
orbiter
117fc86b3d fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7308 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 13:19:37 +00:00
orbiter
441fbc26e2 security patch for WeakPriorityBlockingQueue (produced a deadlock)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7307 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 09:38:31 +00:00
orbiter
5dcb838293 - removed thread overhead when calling dns services
- fixed localsearch (changed it by accident)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7306 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-05 00:29:32 +00:00
orbiter
4c50d3428e smaller file size for array stacks to support smaller deletion sizes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7305 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 13:29:19 +00:00
orbiter
09badc697b - low-memory patch for crawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7304 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 13:26:27 +00:00
f1ori
6ac4f8142e * allow proxy requests from localhost via ipv6
(%0 does not belong to the address)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7303 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 10:52:54 +00:00
mikeworks
274d5b3a87 de.lng: Added missed translation string in SVn 7301
ConfigHTCache_p.html: Added missing id for label pointing to actualCacheSize in span tag for XHTML 1.0 Strictness

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7302 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 07:30:00 +00:00
mikeworks
9239ac1e56 de.lng: Added translation for new page http://localhost:8080/ConfigHTCache_p.html and old one http://localhost:8080/IndexControlRWIs_p.html
ConfigHTCache_p.html: Removed additional </form> and changed title text

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7301 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 00:53:25 +00:00
orbiter
becc463d8a enhanced did-you-mean
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-04 00:25:19 +00:00
apfelmaennchen
43586a2ace a update to ymarks (please test if you wish):
- import HTML (e.g. FF export) via /api/ymarks/import.html
- view your import via /api/ymarks/test.html
- get a xml list via /api/ymarks/get_ymark_list.xml?tags=&folders=
- delete bookmark tables via standard interface /Tables_p.html
it is still very experimental!! 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7299 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 22:52:03 +00:00
orbiter
93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113
fixed a concurrent modification exception during search and a time-out problem

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7298 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 20:58:50 +00:00
orbiter
04932dc268 added rdf data structure for rss feeds
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7297 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 20:06:23 +00:00
orbiter
84f2953cd8 fix for rss loader / rss type recognition
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7296 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 19:58:01 +00:00
orbiter
4c72885cba added a sitemap entry parser and loader for sitemaps
(a recursion if a sitemap refers to another sitemap)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7295 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 19:48:33 +00:00
orbiter
790e0b1894 - enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded
- added option for details of deletion
- added deletion to new ConfigHTCache_p servlet

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7294 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 18:31:36 +00:00
orbiter
3e303db1fa added search suggestions in XML format. This also supports CORS
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7293 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 09:20:36 +00:00
apfelmaennchen
f5324b27f2 more updates to the new bookmarks (ymarks)....
- split YMarkTables and YMarkIndex in two different classes
- HTML import is working properly
- XBEL import is still broken


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7292 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-03 06:47:02 +00:00
orbiter
445619f3ec added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7291 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-02 23:57:11 +00:00
sixcooler
85c65475fa smal but important correction of last commit @ HTTPClient
(if there is a response it really should be taken to its end)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7290 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-02 21:18:44 +00:00
f1ori
acd93b1b31 * add failsafe mechanisme to domainlist retrieval
domainlist is saved locally, if none of the given urls in network.unit.domainlist
  could be retrieved, the file from the last boot is used instead

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7289 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-02 17:57:48 +00:00
orbiter
70c95608d4 Added CORS Access header for yacysearch.rss output
used some of the recommendations from Copro:
http://forum.yacy-websuche.de/viewtopic.php?p=21015#p21015
Original Request:
http://forum.yacy-websuche.de/viewtopic.php?p=20829#p20829

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7288 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-02 16:28:40 +00:00
orbiter
917d715374 lulabad found his signature
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7287 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-01 23:32:20 +00:00
lotus
18729351e7 upnp: hint for wrongly detected local ip address
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7286 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-31 20:05:59 +00:00
f1ori
def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists)
* crawler and search allow only urls matching one in domainlist (if list is provided)
* this may be useful to prevent dedicated networks from being "polluted"
* FilterEngine is improved Backlist-object, Blacklist may inherit from FilterEngine in the future

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7285 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-30 14:44:33 +00:00
lotus
2db6ca8104 - do not allow user to disable java installation
- java version update
- build cleanup

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7284 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-30 08:49:39 +00:00