mirror of
https://github.com/yacy/yacy_search_server.git
synced 2024-09-21 00:00:13 +02:00
40777556c5
- adding automatic refresh - accepts new parameter nameLookup which can be used to deactivate yacy-peer name lookup (because we have problems with this on large seed-dbs) *) ViewFile New page that can be used to view - original content - plain text content - parsed content - parsed sentences of a webpage specified by there url hash Mainly for debugging purpose at the moment *) Robots.txt Bugfix for if-modified-since usage TODO: synchronization of downloads to avoid loading the same robots-file multiple times in parallel by different threads *) Shutdown Better abortion of transferRWI and transferURL sessions on server shutdown *) Status Page Adding icon to start/stop crawling via status page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@950 6c8d7289-2bf4-0310-a012-ef5d649a1542
85 lines
2.0 KiB
HTML
85 lines
2.0 KiB
HTML
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
|
|
<html>
|
|
<head>
|
|
<title>YaCy '#[clientname]#': View URL Content</title>
|
|
#[metas]#
|
|
</head>
|
|
<body marginheight="0" marginwidth="0" leftmargin="0" topmargin="0">
|
|
#[header]#
|
|
<br><br>
|
|
<h2>View URL Content</h2>
|
|
|
|
<p><font color="red">
|
|
#(error)#
|
|
<table border="0" cellpadding="2" cellspacing="1">
|
|
<tr class="TableHeader">
|
|
<td>URL</td>
|
|
<td><a href="#[url]#">#[url]#</a></td>
|
|
</tr>
|
|
<tr class="TableCellDark">
|
|
<td>Hash</td>
|
|
<td><tt>#[hash]#</tt></td>
|
|
</tr>
|
|
<tr class="TableCellLight">
|
|
<td>Word Count</td>
|
|
<td><tt>#[wordCount]#</tt></td>
|
|
</tr>
|
|
<tr class="TableCellDark">
|
|
<td>Description</td>
|
|
<td><tt>#[desc]#</tt></td>
|
|
</tr>
|
|
<tr class="TableCellLight">
|
|
<td>Size</td>
|
|
<td><tt>#[size]#</tt></td>
|
|
</tr>
|
|
<tr class="TableCellDark">
|
|
<td>View as:</td>
|
|
<td>
|
|
<a href="?urlHash=#[hash]#&viewMode=iframe">Original</a> |
|
|
<a href="?urlHash=#[hash]#&viewMode=plain">Plain Text</a> |
|
|
<a href="?urlHash=#[hash]#&viewMode=parsed">Parsed Text</a> |
|
|
<a href="?urlHash=#[hash]#&viewMode=sentences">Parsed Sentences</a>
|
|
</td>
|
|
</tr>
|
|
</table>
|
|
:: <!-- 1 -->
|
|
No URL hash submitted.
|
|
:: <!-- 2 -->
|
|
Unable to find URL Entry in DB
|
|
:: <!-- 3 -->
|
|
Invalid URL
|
|
:: <!-- 4 -->
|
|
Unable to download resource content.
|
|
:: <!-- 5 -->
|
|
Unable to parse resource content.
|
|
#(/error)#
|
|
</font>
|
|
</p>
|
|
<p>
|
|
#(viewMode)#
|
|
:: <!-- 1 -->
|
|
<h3>Plain Resource Content</h3><br>
|
|
<tt>#[plainText]#</tt>
|
|
:: <!-- 2 -->
|
|
<h3>Parsed Resource Content</h3><br>
|
|
<tt>#[parsedText]#</tt>
|
|
:: <!-- 3 -->
|
|
<h3>Parsed Resource Sentences</h3><br>
|
|
<table border="0" cellpadding="2" cellspacing="1">
|
|
#{sentences}#
|
|
<tr class="TableCell#(dark)#Light::Dark::Summary#(/dark)#" title="#[sessionName]#">
|
|
<td>#[nr]#</td>
|
|
<td><tt>#[text]#</tt></td>
|
|
</tr>
|
|
#{/sentences}#
|
|
</table>
|
|
:: <!-- 4 -->
|
|
<h3>Original Resource Content</h3><br>
|
|
<iframe src="#[url]#" width="800" height="400">
|
|
</iframe>
|
|
#(/viewMode)#
|
|
</p>
|
|
|
|
#[footer]#
|
|
</body>
|
|
</html> |