yacy_search_server/htroot/ProxyIndexingMonitor_p.html
theli fbbbf5f411 *) remote trigger for proxy-crawl
- remote crawling can now be enabled for the proxy crawling profile
   See: http://www.yacy-forum.de/viewtopic.php?p=17753#17753
   

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1758 6c8d7289-2bf4-0310-a012-ef5d649a1542
2006-02-24 09:35:54 +00:00

101 lines
4.5 KiB
HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
<html>
<head>
<title>YaCy '#[clientname]#': Proxy Index Monitor</title>
#%env/templates/metas.template%#
</head>
<body marginheight="0" marginwidth="0" leftmargin="0" topmargin="0">
#%env/templates/header.template%#
<br><br>
<h2>Index Monitor for Proxy Indexing</h2>
<p>
This is the control page for web pages that your peer has indexed during the current application run-time
as result of proxy fetch/prefetch.
<b>No personal or protected page is indexed</b>;
those pages are detected by properties in the HTTP header (like Cookie-Use, or HTTP Authorization)
or by POST-Parameters (either in URL or as HTTP protocol)
and automatically excluded from indexing.
</p>
<p><form action="ProxyIndexingMonitor_p.html" method="post" enctype="multipart/form-data">
<table border="0" cellpadding="5" cellspacing="1" width="100%">
<tr class="TableCellDark">
<td colspan="3"><div class=small><b>Proxy pre-fetch setting:</b>
this is an automated html page loading procedure that takes actual proxy-requested
URLs as crawling start points for crawling.</div></td>
</tr>
<tr valign="top" class="TableCellLight">
<td class=small>Prefetch Depth</td>
<td class=small><input name="proxyPrefetchDepth" type="text" size="2" maxlength="2" value="#[proxyPrefetchDepth]#"></td>
<td class=small>
A prefetch of 0 means no prefetch; a prefetch of 1 means to prefetch all
embedded URLs, but since embedded image links are loaded by the browser
this means that only embedded href-anchors are prefetched additionally.</td>
</tr>
<tr valign="top" class="TableCellLight">
<td class=small>Store to Cache</td>
<td class=small><input type="checkbox" name="proxyStoreHTCache" align="top" #(proxyStoreHTCacheChecked)#::checked#(/proxyStoreHTCacheChecked)#></td>
<td class=small>It is almost always recommended to set this on. The only exception is that you have another caching proxy running as secondary proxy and YaCy is configured to used that proxy in proxy-proxy - mode.</td>
</tr>
<tr valign="top" class="TableCellLight">
<td class=small>Do Remote Indexing</td>
<td class=small><input type="checkbox" name="proxyCrawlOrder" align="top" #(proxyCrawlOrder)#::checked#(/proxyCrawlOrder)#></td>
<td class=small>If checked, the crawler will contact other peers and use them as remote indexers for your crawl.
If you need your crawling results locally, you should switch this off.
Only senior and principal peers can initiate or receive remote crawls.
Please note that this setting only take effect for a prefetch depth greater than 0.</td>
</tr>
<tr class="TableCellDark">
<td colspan="3"><div class=small><b>Proxy generally</b></div></td>
</tr>
<tr valign="top" class="TableCellLight">
<td class=small>Path</td>
<td class=small><input name="proxyCache" type="text" size="20" maxlength="300" value="#[proxyCache]#"></td>
<td class=small>The path where the pages are stored (max. length 300)</td>
</tr>
<tr valign="top" class="TableCellLight">
<td class=small>Size</td>
<td class=small><input name="proxyCacheSize" type="text" size="8" maxlength="24" value="#[proxyCacheSize]#"></td>
<td class=small>The size in MB of the cache.</td>
</tr>
<tr valign="top" class="TableCellDark">
<td class=small colspan="1">&nbsp;</td>
<td class=small colspan="2"><input type="submit" name="proxyprofileset" value="Set proxy profile"></td>
</tr>
</table>
</form></p>
<p>
#(info)#
<!-- info 0 -->
::
<!-- info 1 -->
<b>The file DATA/PLASMADB/crawlProfiles0.db is missing or corrupted.
Please delete that file and restart.</b><br>
::
<!-- info 2 -->
<b>Pre-fetch is now set to depth-#[message]#.</b><br>
<b>Caching is now #(caching)#off::on#(/caching)#.</b><br>
<b>Remote Indexing is now #(crawlOrder)#off::on#(/crawlOrder)#.</b><br>
#(path)#::<b>Cachepath is now set to '#[return]#'.</b> Please move the old data in the new directory.<br>#(/path)#
#(size)#::<b>Cachesize is now set to #[return]#MB.</b><br>#(/size)#
#(restart)#::<br><font color="red"><b>Changes will take effect after restart only.</b></font><br>#(/restart)#
::
<!-- info 3 -->
<b>An error has occurred: #[error]#.</b><br>
#(/info)#
</p>
<p>You can see a snapshot of recently indexed pages
on the <a href="/IndexMonitor.html?process=4">Proxy Index Monitor</a> Page.
</p>
#%env/templates/footer.template%#
</body>
</html>