Commit Graph

223 Commits

Author SHA1 Message Date
orbiter
58b59f9bc8 - a collection of bug fixes and some redesign of the Scanner class
- fixed smb crawling
- added smbget to download script generation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7381 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-16 23:37:21 +00:00
orbiter
c36da90261 added a very fast ftp file list generator to site crawler:
- when a site-crawl for ftp sites is now started, then a special directory-tree harvester gets the complete directory structure of a ftp server at once
- the harvester runs concurrently and feeds into the normal crawl queue

also in this:
- fixed the 'start from file' crawl function
- added a link detector for the html parser. The html parser can now also extract links that are not included in <a> tags.
- this causes that a crawl start is now also possible from clear text link files

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7367 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-09 17:17:25 +00:00
orbiter
4565b2f2c0 removed the display option from index.html, yacysearch.html and yacyinteractive.html
instead, a setting at ConfigPortal.html can be made to define if the topmenu shall be shown at these pages or if there is no naviagtion at all. 

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7366 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-08 10:50:23 +00:00
orbiter
18d33b5c6d fixed several search result navigation bugs
fixed bad behaviours during search result collection

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7362 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-05 23:54:00 +00:00
orbiter
49b5a206cd - better caclculation of search result size
- predefined search recommendations

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7361 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-02 12:19:59 +00:00
orbiter
f0651e5f2f added image search to yacyinteractive.html
this causes that the search result view switches from list format to image preview format when a search is restricted to png, gif or jpg documents

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7358 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-01 18:48:21 +00:00
orbiter
4e771e2063 enhanced interactive search:
- better table design
- less enumeration of same table structure (prepared now for streaming)
- added a 'remove filetype' link

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7356 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-12-01 14:43:07 +00:00
orbiter
b769cce433 - added a catch-all parser for all documents that cannot be parsed: they will contributed with their document url for the search index only
- enhanced the pdf and torrent parser: better documents titles
- enhanced the ftp client: more time-out time
- fixed bugs in json for search results
- enhanced yacyinteractive.html: added a file type navigator and a download-script generator for search result files

Please have a look at yacyinteractive.html: this will become the hacker-download tool for 27c3!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7355 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-11-30 16:13:55 +00:00
orbiter
45b1ab3d07 custom + generic skins:
- added a generic skin which is filled with actual color assignment using a servlet
- enabled css servlets
- added a generic color scheme in configuration file
- added configuration input in Customization/Appearance servlet
- added a jquery color picker widget
- placed color picked widget to input field of generic colour definition input fields

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7235 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-11 00:00:10 +00:00
mikeworks
70576e88d2 de.lng: Added some more untranslated strings I found and uncommented old ones that were removed
terminal_p.html: Put back the old ID which was really easy to find
IndexCreate.js: Because XHTML 1.0 Strict does not allow name tags for some elements rewrote most element access functions to use getElementById
Table_API_p.html and all other html pages: Some XHTMl 1.0 Strict fixes, changed checkAll javascript, marked the first row with checkboxes as unsortable where applicable
Table_API_p.java and all other java pages: URLencoded lines with possible ampersands & -> &amp; for validation XHTML 1.0 Strict sourcecode
--> All Index Create pages should validate now. Hope I did not break anything else (too much :-)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7225 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-06 00:00:23 +00:00
orbiter
099def2a04 small changes in search widget appearance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7221 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-10-04 23:43:33 +00:00
orbiter
2c549ae341 fixed a number of small bugs:
- better crawl star for files paths and smb paths
- added time-out wrapper for dns resolving and reverse resolving to prevent blockings
- fixed intranet scanner result list check boxes
- prevented htcache usage in case of file and smb crawling (not necessary, documents are locally available)
- fixed rss feed loader
- fixes sitemap loader which had not been restricted to single files (crawl-depth must be zero)
- clearing of crawl result lists when a network switch was done
- higher maximum file size for crawler

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7214 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-30 23:57:58 +00:00
orbiter
f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
- nobody understand the auto-dom filter without a lenghtly introduction about the function of a crawler
- nobody ever used the auto-dom filter other than with a crawl depth of 1
- the auto-dom filter was buggy since the filter did not survive a restart and then a search index contained waste
- the function of the auto-dom filter was in fact to just load a link list from the given start url and then start separate crawls for all these urls restricted by their domain
- the new Site Link-List option shows the target urls in real-time during input of the start url (like the robots check) and gives a transparent feed-back what it does before it can be used
- the new option also fits into the easy site-crawl start menu

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7213 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-30 12:50:34 +00:00
mikeworks
421aa6a8bb ConfigLiveSearch.html: Fixed some HTML problems to validate at least XHTML 1.0 Transitional - strict is not possible because iframes are used. Replacing iframes with embedded object tag does not work in IE
ConfigPortal.html: Fixed some HTML problems to validate at least XHTML 1.0 Transitional - for strict the target attribute of the a link has to be removed
yacyinteractive.html: Moved all JS code to an external yacyinteractive.js file in JS folder
yacysearch.html: Removed embedded scripts from in between the body tags - now everything is loaded in the header
de.lng: Just in case JS files will be parsed at some point added translation for yacyinteractive.html result counter

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7208 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-29 20:06:39 +00:00
mikeworks
190de644dd de.lng: Added German translations for some missing table content on Network view
WatchWebStructure_p.html: Added JS verification of RGB color codes (currently only RGB value is checked but this could be enhanced to also check for websafe colors)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7195 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-09-26 22:28:32 +00:00
low012
4c6dc396d8 *) more beautyful (IMO) code, no functional changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6764 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-23 21:05:26 +00:00
orbiter
0465f28f7f applied 'null in rss2.js' fix from Quix0r, see
http://forum.yacy-websuche.de/viewtopic.php?p=19612#p19612

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6752 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-20 09:58:05 +00:00
orbiter
c77fbd0390 added sorttable (http://www.kryogenix.org/code/browser/sorttable/)
javascript library to make tables sortable

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6736 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-03-09 23:40:16 +00:00
orbiter
d126d6c1b5 renamed the servlet WatchCrawler_p to Crawler_p
this was done because that servlet may be used for wget/cronjob
triggered crawl starts and it appears to be confusing that the
name of the crawl start servlet looks like a pure monitoring tool.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6568 6c8d7289-2bf4-0310-a012-ef5d649a1542
2010-01-12 10:05:28 +00:00
orbiter
c6c97f23ad - added cache usage properties to crawl start
- added special rule to balancer to omit forced delays if cache is used exclusively
- extended the htCache size by default to 32GB

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6241 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-24 11:54:04 +00:00
orbiter
5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency.
- The indexing queue was a historic data structure that was introduced at the very beginning at the project as a part of the switchboard organisation object structure. Without the indexing queue the switchboard queue becomes also superfluous. It has been removed as well.
- Removing the switchboard queue requires that all servlets are called without a opaque generic ('<?>'). That caused that all serlets had to be modified.
- Many servlets displayed the indexing queue or the size of that queue. In the past months the indexer was so fast that mostly the indexing queue appeared empty, so there was no use of it any more. Because the queue has been removed, the display in the servlets had also to be removed.
- The surrogate work task had been a part of the indexing queue control structure. Without the indexing queue the surrogates needed its own task management. That has been integrated here.
- Because the indexing queue had a special queue entry object and properties attached to this object, the propterties had to be moved to the queue entry object which is part of the new indexing queue withing the blocking queue, the Response Object. That object has now also the new properties of the removed indexing queue entry object.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6225 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-07-17 13:59:21 +00:00
lotus
187ee4d06e another IE fix (also same names in html and js)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6116 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:24:01 +00:00
lotus
37c162053d fix for IE & WatchCrawler (were same names in js and html)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6114 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-06-22 11:03:44 +00:00
lotus
fbca4f8354 more stability on watchcrawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5861 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-04-23 18:42:15 +00:00
orbiter
61f9dbf0cc - fixed a display problem in watch crawler
- another small enhancement in balancer

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5729 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-03-18 21:25:52 +00:00
orbiter
98f36a801a - small update to search result layout
- some more mime types

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5669 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-03-06 10:16:14 +00:00
orbiter
dedfc7df7f removed distinction between DHT-in and DHT-out. This is necessary to make room for the new cell data structure, which cannot use this this distinction in the first place, but will enable the same meaning with different mechanisms (segments, later)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5511 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-22 00:03:54 +00:00
orbiter
6663365720 adopted many calls to new api path
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5498 6c8d7289-2bf4-0310-a012-ef5d649a1542
2009-01-16 00:02:55 +00:00
lotus
b1e211b258 no error-alert: http://forum.yacy-websuche.de/viewtopic.php?t=1639
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5391 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-12-13 12:04:08 +00:00
lotus
1545e5440a * index deletion: checkbox-confirmation
* watch crawler: less load on exhausted peers; wait for data before reloading again

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5359 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-11-23 12:02:58 +00:00
apfelmaennchen
8d1bedfc3a - added bookmarkTitle to CrawlStart_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5068 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-08-21 21:07:21 +00:00
lotus
f9fb2d0fee hide memory tables by default
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5020 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-07-27 06:59:24 +00:00
orbiter
01b3e9431a - fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1140&p=7626#p7626
- less dots for ppm bar in watchcrawler (one dot for each 10 ppm)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4846 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-05-24 11:30:16 +00:00
orbiter
9935e83c86 added new news window into the status page. At this moment it is just a test.
The news inside the window are about peer arrivals and departures, remote search accesses and crawls

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-26 01:00:10 +00:00
orbiter
e90282da1c added experimental javascript terminal for rss feeds (not used yet anywhere yet, expect the worst)
.. possibly to be used as content for iframes within monitoring pages
not ready yet!

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4727 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-04-22 23:09:24 +00:00
f1ori
76eac114ed * define global javascript-variable with var to get rid of warnings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4624 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-03-30 19:51:19 +00:00
orbiter
f4c73d8c68 - fixed highslide usage
- some enhancement to index management, better types

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4497 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-02-19 14:13:35 +00:00
orbiter
3441ec3928 - some small changes to highslide integration to get it working... (does not work yet)
- performance enhancement for url list parser

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4495 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-02-18 23:49:03 +00:00
orbiter
6c3cd2b4f2 - added new way to watch images from the image search:
they appear as separate, floating window above the search results,
  not in a new window
- added highslide javascript library for feature mentioned above
- removed dir servlet. This thing was not used as it was supposed to be (as an example applet)
  and was a major problem for intranet-indexing when files are hosted on the same peer.
- added yacy-httpd-internal directory listing. Because YaCy is a search engine,
  directory listings are similar to search result listings. Intranet indexing from the same peer
  will get nice index pages for document collections.
- removed unused test applet

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4494 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-02-18 16:38:06 +00:00
orbiter
a8a5df4a51 - more dublin core naming of page metadata
- better presentation of result counters in search results

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4420 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-01-30 21:58:30 +00:00
apfelmaennchen
e68b133b35 added JavaScript for folder tree view
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4355 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-01-22 19:05:36 +00:00
orbiter
89b9b2b02a redesigned remote crawl process:
- instead of pushing urls to other peers, the urls are actively pulled
  by the peer that wants to do a remote crawl
- the remote crawl push process had been removed
- a process that adds urls from remote peers had been added
- the server-side interface for providing 'limit'-urls exists since 0.55 and works with this version
- the list-interface had been removed
- servlets using the list-interface had been removed (this implementation did not properly manage double-check)
- changes in configuration file to support new pull-process
- fixed a bug in crawl balancer (status was not saved/closed properly)
- the yacy/urls-protocol was extended to support different networks/clusters
- many interface-adoptions to new stack counters

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4232 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-11-29 02:07:37 +00:00
fuchsi
0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
- put(key, value) methods are now used if a value added to the map should be kept as it is. Numbers are transformed (but not formatted) to an equivalent String representation.
- putASIS(...) have been removed, now done with simple put(...) (see above).
- puNum(...) can be used for number values which should be stored in a formatted way, either depending on the current locale setting for yacy (default) or in a "none" locale (see javadocs and setLocalize()).
- putHTML(...) escapes special characters into corresponding HTML enities ('<' => '&lt;') which was done with put(...) before and so was called too often, becauses it is necessary only for very few cases. Additionally there is a "forXML" mode which only replaces < > & ".
In short: Use put(...) for almost everything, use putXY(...) if you need some special transformation of the value.
A few bugs have been fixed as well, and there should be a small performance improvement for complex pages with a lot of values.

* added additional Sum/Avg rows to access tracker pages, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=456
* removed duplicate code (mostly related to the big changes above).

TODO:
- make sure, number formats work as expected _everywhere_, report overseen stuff http://forum.yacy-websuche.de/viewtopic.php?f=5&t=437
- probably a good idea to add special putDate() methods as they are used in many pages and create duplicated formatting code + maybe some centralized handling for memory value formatting.
- further improve the speed of page creation for the WatchCrawler.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4178 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-10-24 21:38:19 +00:00
fuchsi
9524b9c16a second try of rev 4100 :). Tested in Iceweasel/Firefox 2.0.6, Konqueror 3.5.7, Opera 9.23 (all linux) and IE6-SP1 (wine)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4102 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-17 19:39:15 +00:00
fuchsi
6b8faaadb6 undo last commit for further evaluation, a progressbar element is used on other pages as well...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4101 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-17 03:36:35 +00:00
fuchsi
1880bba420 A few changes to the progress bar and search result statistics layout influenced by the discussion in <http://forum.yacy-websuche.de/viewtopic.php?f=5&t=268> with the idea of saving vertical space. Please check in every available browser and comment wether it's better than before. ;)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4100 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-16 14:30:53 +00:00
fuchsi
e78098be9b According to HTML-Specs "name" and "id" attributes share the same namespace. So we can't have one element with name="offset" and another one with id="offset". Additionally IE6's getElementById() returns elements with matching names as well and Opera is mimicing this behaviour.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4094 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-11 16:21:14 +00:00
orbiter
6c3bcadc1c - re-implemented image search
- generalized search result status bar, is now also visible during text search


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4077 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-06 13:26:38 +00:00
orbiter
4779f314fe first version of next-generation search interface:
- snippets are not fetched by browser using ajax, they are now fetched internally
- YaCy-internat threads control existence of snippets and sort out bad results
- search results are prepared using SSI includes
- the search result page is visible right after the search request, the results drop in when they are detected
- no more time-out strategy during search processes, results are shifted within queues when they arrive from remote peers
- added result page switching! after the first 10 results, the next page can be retrieved
- number of remote results is updated online on the result page as they drop in
- removed old snippet servelet (which had been also a security leak btw)
- media search is broken now, will be redesigned and fixed in another step


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4071 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-09-03 23:43:55 +00:00
orbiter
71e5d24f4a fix for watch crawler, see http://forum.yacy-websuche.de/viewtopic.php?p=1771#p1771
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4064 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-08-28 12:20:19 +00:00
orbiter
e332b844b2 - enhanced remote search: during waiting time for remote crawls
some urls are fetched so the url cache can be filled with these urls
- the url-prefetch is used to sort out some unresolved urls
- the snippet-fetcher is triggered with the search event id. This is used
  to remove missing snippets from the search cache so they will not be displayed again


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4060 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-08-26 18:18:35 +00:00
michitux
5cf634a4a4 New media-search ui:
- uses the progressbar introduced in the image-search
- results are displayed using the same layout as the text-search
- results are displayed in the order they arrive


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4041 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-08-11 22:20:01 +00:00
orbiter
62347b50f4 added security layer for ViewImage:
- images may be requested by localhost and authorized users only, if the request is done using a clear-text URL
- the image may be requested also using a code that can be a license to retrieve a URL for everyone
- some servelets produce URL licenses for ViewImage, like image search results


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4027 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-08-03 23:06:53 +00:00
michitux
8ebfd732ce - Fix for the redisplay of hidden results in Opera, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=140 for details.
- Now the message that there are hidden results is hidden when all results are displayed again.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3994 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-07-19 19:49:48 +00:00
orbiter
89e1848db6 fixed problem with favicons:
target servers had been able to see search words from the referrer of the favicon fetch.
This has been removed by using the getImage - servlet for favicon fetch.
Since java does not support loading of bmp and ico-Images, such parsers had been added.
The image parser had been coded from their original microsoft documentation.
This influences also the image-search functionality: there can now be a preview
of found bmp-images. Another benefit: favicons for search results are now cached with the HTCACHE.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3965 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-07-15 01:34:01 +00:00
michitux
110a1a2b16 - fixed the handover of the searchterm and -type on index.html when the user clicks on "more options..."
- some small changes to make index.html and the menu valid XHTML 1.0 strict
- changed the inconsistent eol - characters in index.html to unix-ones


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3940 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-29 19:23:42 +00:00
orbiter
1d0cce8f3a documentation update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3911 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-17 22:35:16 +00:00
michitux
25529290ca - 2 small changes in documentation
- hopefully fixed logging of GCs (in order to avoid things like "performed necessary GC, freed 18014398509481565 KB (requested/available/average: 4096 / 1631 / 2957 KB)") with the help of KoH


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3909 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-17 19:32:38 +00:00
michitux
184ba22ce9 New image - search HTML/JavaScript - frontend:
- <noscript> - area for non-JS - Browsers
- progressbar for the loading - process (may be used in other searches too)
- the image that is available first ist displayed first, so the images aren't moved around when new results arrive
- the correct number of results is displayed
- successfully tested in IE 5.5 and 6, Opera, Firefox and Konqueror (recent versions)



git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3904 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-16 21:55:15 +00:00
theli
339153d40e *) favicons that are specified in the document content via html link-tags
are now detected and displayed on the search page (requested by allo).

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3845 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-09 15:22:37 +00:00
allo
38c180b28b hide results with wrong("red") snippet.
(maybe not as default? But it works pretty good for me)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3842 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-08 23:59:18 +00:00
allo
559d1c447f Bookmarks tag suggestion
AJAX fix for configadvanced
empty bugs are not a interface bug, but a scraper bug.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3821 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-07 22:49:31 +00:00
theli
e75ca857c3 *) Bugfix for problem with ajax graphic
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3815 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-07 07:40:32 +00:00
allo
54ddb3262c enter on webstrucutre
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3783 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-02 14:50:36 +00:00
allo
d0f8254f95 better refresh ui
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3779 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-06-01 13:58:48 +00:00
(no author)
5cc8bb075b Syntaxfehler beseitigt
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3764 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-26 19:51:05 +00:00
(no author)
ef24bed406 Sorry...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3760 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:25:07 +00:00
(no author)
a29cb2e1af blupp
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3759 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-24 16:14:46 +00:00
orbiter
a3ecfe0a45 replaced failed-icon by new 'bad'-icon
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3680 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-07 14:05:49 +00:00
theli
6f46245a51 *) Bookmarks: Ajax icon is displayed while loading title
*) First version of a sitemap parser added
   - currently only autodetection of sitemap files is supported
*) DB-Import restructured
   - pause/resume should work again now


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3666 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-05-06 09:52:04 +00:00
michitux
56b30d6593 * fixed a bug in ie: class-names for image-snippets were set, but had no influence on ie, they have to be set with className =
* fixed a bug in safari (hopefully, sorry that I removed the old fix, the divs): yacy-logo is now above the fieldset, the fieldset clears and has a margin-left set
 * fixed a bug with the dls: for example in ViewProfile.html the dt's (the terms/keys) had not the same height as the dd's, so the dt's were not in the same row with the coresponding dd's towards the bottom
 * moved my new css-classes to the right place in base.css

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3572 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-13 23:14:12 +00:00
michitux
e917bfcae3 * Bugfix: changed handling of the query-string to be independent from input-elements
* removed unnecessary divs

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3571 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-13 18:01:02 +00:00
michitux
4990909178 Some bugfixes, new layout/style for image search results:
* removed divide by zero bug when 20_dhtdistribution_busysleep is 0
 * replaced German comment with wrong charset in source/de/anomic/plasma/plasmaCrawlBalancer.java by an English one
 * replaced the table-fix for floating behind snipped images by a br with clear
 * removed unnecessary old xhtml-files (were not in use, they were created when we weren't having xhtml for testing)
 * new layout for image-search results: replaced the old one with spans and tables inside (not valid) with new divs, now each image snippet container has the same size
TODO:
 * the ids of the snippetLoading-divs aren't valid because ids must start with an alphabetic letter or an underscore, they have to be prefixed
 * in the returned snippet-xml is an unresolved pattern for status (the status is only set for text snippets)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3566 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-04-12 18:21:17 +00:00
karlchenofhell
4f2e6ef47b - WatchCrawler_p shows max. 80 characters of URLs now (maybe dynamically adjustable based on browser width?)
- typo in BlacklistCleaner

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3445 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-03-07 23:16:25 +00:00
karlchenofhell
bf7a69197d - fix for possible NPE in queues_p
- WatchCrawler_p:
  - display crawler traffic
  - pause/resume local- and global crawler


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3389 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-02-22 22:26:11 +00:00
karlchenofhell
b873ad51ab - fix for http://www.yacy-forum.de/viewtopic.php?t=3369
- merged netBude's alternative for tables in yacysearch.html & search results valid
- added statistic info to index.html as proposed here: http://www.yacy-forum.de/viewtopic.php?p=29762#29762
- fixed error-log in httpTemplate

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3189 6c8d7289-2bf4-0310-a012-ef5d649a1542
2007-01-12 00:52:38 +00:00
orbiter
1d2d1854b9 added size of rwi and urls to WatchCrawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3112 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-21 21:33:35 +00:00
orbiter
61798f0ae6 added option to distinguish between text crawl and media crawl
- for each crawl start, there is now a flag for text and media
- the localCrawl flag is superfluous
- added new crawl profiles
- if an image search is done, only media links are crawled for the snippets


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3100 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-19 03:10:46 +00:00
orbiter
febe6b114a design update of crawler monitor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3094 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-18 01:18:28 +00:00
orbiter
40049e0635 fixed media search snippet flow
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3092 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-17 22:52:58 +00:00
orbiter
7ff86d6ba6 - image search now shows thumbnails (in bad order, but it works)
- repaired DHT selection

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3081 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-14 02:48:37 +00:00
orbiter
28971da91c fix for last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3074 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-12 02:51:59 +00:00
orbiter
e4570bffaf -implemented a specialized snippet-fetch for media content
-changed search result preparation for media search presentation

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3073 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-12 02:09:25 +00:00
orbiter
1377c53aa3 extraction of media links from search results
these links are mixed to the snippets for testing purpose
(a final version will handle this differently)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3069 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-11 01:31:23 +00:00
orbiter
fb9e0f0284 preparations for media snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3064 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-12-09 23:15:58 +00:00
orbiter
b5a29e9651 - fix for snippets that are too short
- added keyword to snippet fetch to suppres removal of not-found snippet words (for debugging)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3009 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-11-25 00:38:09 +00:00
michitux
567c40f5f0 Bookmark/delete-links now visible when mouse is over the searchresult, in standard-compliant browsers with css, in Microsoft Internet Explorer via JavaScript
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2608 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-16 16:56:22 +00:00
orbiter
d54144a4e3 fixed bad snippet behavior (hopefully)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2596 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 14:17:18 +00:00
orbiter
5015e780c2 - simplified watchCrawler code
- changed display of watchCrawler slightly

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2594 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 13:54:10 +00:00
allo
9bed90f8dc bugfix in js
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2587 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-15 06:33:22 +00:00
allo
13d0cff257 right dhtml.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2568 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 14:02:34 +00:00
orbiter
dfc0ac1958 syntax error fixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2565 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 12:02:02 +00:00
allo
6e03f61daa fix for highlighting searched words in snippets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2564 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-09-13 11:26:26 +00:00
allo
251b3b636e some refactoring for js
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2191 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-06-09 17:20:17 +00:00
allo
1a13c8b78e right wordCachesize after orbiters commit.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1882 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-03-13 18:09:09 +00:00
allo
6b056610e3 updated watchcrawler for the recent changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1881 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-03-13 16:33:47 +00:00
orbiter
a74687d3c1 fixed problem with snippet fetch
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1818 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-03-04 19:54:03 +00:00
allo
aca1a3dc8f avoid confusion with green_bar.gif and red_bar.gif
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1780 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-28 09:17:56 +00:00
allo
8f471c258c ajax fix
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1774 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-27 10:24:41 +00:00
allo
88c0e1da1e prevent the browsing from overwriting the requestobject, before the answer is received
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1764 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-24 20:36:56 +00:00
allo
bb22bfed89 xml functions in xml.js
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1751 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-23 14:33:56 +00:00
theli
05be9264a3 *) bugfix for optra problem with span node
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1750 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-23 14:27:00 +00:00
theli
dc9174c809 *) Implementing snippet fetching via ajax
Snippets that are not available on page load time will be fetched using ajax requests.
   see: http://www.yacy-forum.de/viewtopic.php?p=16479

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1748 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-23 13:29:07 +00:00
allo
7e7a72b108 display wordcaches number on WatchCrawler.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1746 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-22 16:25:13 +00:00
allo
974d8c1722 typo
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1741 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-22 12:39:47 +00:00
allo
f2908d0208 another bar. shows your current ppm on WatchCrawler.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1739 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-22 11:52:08 +00:00
allo
3fd1641893 queuesizes in queues_p.xml
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1714 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-20 22:48:39 +00:00
allo
6b20ccaa0f count to the next update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1699 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-18 20:05:55 +00:00
allo
91b78d9f04 missing File for IndexCreate
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1694 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-18 12:01:52 +00:00
allo
62664d7252 AJAX Check for robots.txt before crawling.
Icons from herrlich
TODO: Style it nicely ;-)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1689 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-17 20:55:31 +00:00
allo
a4b3981f33 all queues on WatchCrawler.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1675 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-16 21:08:34 +00:00
allo
127396436f more queues in the xml backend
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1674 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-16 20:26:10 +00:00
allo
5e04b13819 status_p.xml values are working in konqueror now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1614 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-11 19:13:40 +00:00
allo
e58b558151 The Queue works now in Konqueror.
TODO: status_p.xml values, why is anchor not working?


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1613 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-11 18:05:18 +00:00
allo
9b056617b5 better XML Parsing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1611 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-11 16:58:01 +00:00
allo
5a8b06a851 fixing JS-Errors
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1606 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-11 12:52:49 +00:00
allo
a1e1aa039c More links
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1603 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-10 19:28:28 +00:00
allo
62a0bb475a More values displayed on WatchCrawler.html
status_p.xml: to be extended.

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1561 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-06 19:47:59 +00:00
allo
2f188d3fa8 more stable layout and String Constant (for translation)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1560 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-06 17:01:39 +00:00
allo
fd7af851bf WatchCrawler is working NOW!
Thx to theli.


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1557 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-06 15:12:03 +00:00
allo
2e2fa99501 bookmarksManager:
-gettitle_p.xml and AJAX to use it
-classpath change
httpc:
-simple wget function

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1267 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-12-29 17:45:50 +00:00
allo
749f44d144 LiveUpdate.
(this does not work correctly at the Moment :-/)


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1003 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-29 10:52:21 +00:00
allo
5526d2ff73 Basic WatchCrawler.html, which loades the Queues using AJAX.
ajax.js: generic Ajax-Functions
xml/queues/indexing_p*: the Backend


git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1002 6c8d7289-2bf4-0310-a012-ef5d649a1542
2005-10-29 09:47:30 +00:00