129 Commits

Author SHA1 Message Date
reger
ec24a0c85a add test case for optimized toTokens() 2016-03-24 19:26:38 +01:00
reger
84c970eaec move test classes to test/java (subdirectory as in Maven standard subdir layout)
because ViewImage*Test.java breaks test run
2016-01-16 19:22:27 +01:00
luc
571bc55937 Refactoring : use StandardCharsets constants instead of hard-coded
charset names.
2016-01-05 23:37:05 +01:00
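
For illustration only, a minimal sketch of the kind of change this refactoring describes. The class and method names are hypothetical and are not taken from the YaCy code base. Looking a charset up by name forces handling of a checked exception that cannot occur for UTF-8, while the `StandardCharsets` constant does not.

```java
import java.io.UnsupportedEncodingException;
import java.nio.charset.StandardCharsets;

/** Hypothetical example class; not part of the YaCy sources. */
final class CharsetExample {

    /** Before: charset given as a hard-coded name, which needs a checked-exception handler. */
    static byte[] encodeByName(String s) {
        try {
            return s.getBytes("UTF-8");
        } catch (UnsupportedEncodingException e) {
            // Cannot happen for UTF-8, but the compiler still demands the handler.
            throw new IllegalStateException(e);
        }
    }

    /** After: charset given as a constant, no lookup by name and no checked exception. */
    static byte[] encodeByConstant(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }
}
```

Both methods return the same bytes; the second removes the string lookup and the dead catch block.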
reger
1af0e9ef74 remove workaround for Solr bug regarding multivalued date fields
fixed in 5.4.0
http://issues.apache.org/jira/browse/SOLR-8050
2016-01-03 01:11:27 +01:00
reger
4d2b934487 prevent mailto links getting into parser result document's in/outbound link collection
by checking mailto scheme early.
- fix upper case mailto protocol assignment
- add test case for getProtocol
2015-12-16 03:01:17 +01:00
reger
288acceac3 fix test htmlParserTest, charset parameter
+ upd maven templating-plugin version
2015-12-15 02:09:43 +01:00
luc
f01d49c37a Process large or local file images dealing directly with content
InputStream.
2015-11-18 10:15:38 +01:00
luc
0de6988604 Added links to more image test suites. 2015-11-12 08:21:37 +01:00
luc
745e97a575 Merge branch 'master' of https://github.com/yacy/yacy_search_server 2015-11-02 08:10:11 +01:00
luc
2895ab552a Made ViewImagePerfTest extend ViewImageTest to ease automated image
render tests
2015-10-30 04:19:56 +01:00
luc
4a03cf06e1 Corrected encoding extension arg parsing 2015-10-29 02:24:17 +01:00
reger
d223cf0ae4 adjust MediaWiki importer geo coordinate calculation
- allow lat/long 0.xxx
- south / west assignment
include test class
2015-10-26 21:19:35 +01:00
luc
8da20718aa Created a class to test ViewImage rendering against multiple image
files.
2015-10-23 15:49:07 +02:00
luc
ec04d27473 Corrected APNG test suite link name. 2015-10-23 14:12:00 +02:00
luc
cbb84ba073 Detailed javadoc. 2015-10-23 13:57:24 +02:00
luc
70111876d2 Filled ViewImageTest.html with all remaining IANA image file formats.
Added some links to test suites and specifications.
2015-10-23 12:27:52 +02:00
luc
e093fb228d Created a generic ViewImage performance render test. 2015-10-15 09:18:24 +02:00
luc
3ad564e2e4 Created a ViewImage rendering performance measurement test. 2015-10-14 10:17:09 +02:00
luc
b3f044072e Updated table headers and SVG file url for case sensitive OS. 2015-10-14 10:13:37 +02:00
luc
f5746b5490 Added ico and bmp sample pictures 2015-10-06 20:48:09 +02:00
luc
baede48161 Added JPEG 2000 and FITS samples 2015-10-06 09:53:09 +02:00
luc
7c9d80c5d0 Added image formats and information for each format. 2015-10-06 09:51:47 +02:00
luc
0ae9297ca5 Created a html test page to check ViewImage rendering with different
file formats.
2015-10-02 12:41:30 +02:00
reger
bad34804fe optimize parseInt for <img> tag attribute parsing
Performs better than using NumberFormat.parse or parseInt(substring())
2015-09-26 15:42:23 +02:00
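
A rough sketch of how an attribute value such as `120px` could be parsed without creating a substring or a NumberFormat instance. This is an assumption about the general technique, not the code from this commit; `LeadingIntParser` and `parseLeadingInt` are hypothetical names.

```java
/** Hypothetical helper; not the implementation from the commit above. */
final class LeadingIntParser {

    /**
     * Reads the leading decimal digits of s (after optional blanks) and
     * returns them as an int, or defaultValue if s does not start with a digit.
     * No overflow check is done; the sketch assumes short values like "120px".
     */
    static int parseLeadingInt(CharSequence s, int defaultValue) {
        int i = 0;
        final int n = s.length();
        while (i < n && s.charAt(i) == ' ') {
            i++; // skip leading blanks
        }
        final int start = i;
        int value = 0;
        while (i < n) {
            char c = s.charAt(i);
            if (c < '0' || c > '9') {
                break; // stop at the first non-digit, e.g. the 'p' in "120px"
            }
            value = value * 10 + (c - '0');
            i++;
        }
        return i > start ? value : defaultValue;
    }
}
```

The loop walks the characters once and allocates nothing, which is the likely source of any speed-up over `Integer.parseInt(s.substring(...))`.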
reger
d2cc11ea8f fix html parser taking <style> content as text.
Noticed some result description contain css content from style tag.
Added <style> to the tag list so its content is not scraped as text
+ test case included
2015-09-19 05:30:55 +02:00
reger
e594130aec add test case for partial update - to discover effect on YaCy for update of documents with multivalued date fields (like dates_in_content_dts)
current result: loss of fields/information in index document, see EmbeddedSolrConnectorTest.testUdate_withMultivaluedDateField()
2015-09-13 06:02:07 +02:00
reger
d5da9e5a38 fix test method (add throw for URIMetadataNode) 2015-09-12 20:07:43 +02:00
reger
4cf875336c complete TODO: getFileExtension handle dot in query part
+ testcase
2015-08-31 23:28:03 +02:00
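
One way the dot-in-query case can be handled, sketched under the assumption that the input is the file part of a URL (path plus optional query and fragment). The class and method names are hypothetical; the actual getFileExtension in YaCy may work differently.

```java
import java.util.Locale;

/** Hypothetical sketch; not YaCy's getFileExtension. */
final class FileExtensionSketch {

    /** Returns the lower-cased extension of the last path segment, or "" if there is none. */
    static String fileExtension(String file) {
        // Cut off query and fragment first, so a dot in "?file=x.pdf" is ignored.
        int end = file.length();
        int q = file.indexOf('?');
        if (q >= 0) {
            end = q;
        }
        int h = file.indexOf('#');
        if (h >= 0 && h < end) {
            end = h;
        }
        String path = file.substring(0, end);

        int slash = path.lastIndexOf('/');
        int dot = path.lastIndexOf('.');
        if (dot < 0 || dot < slash) {
            return ""; // no dot at all, or the dot belongs to a directory name
        }
        return path.substring(dot + 1).toLowerCase(Locale.ROOT);
    }
}
```

With this approach `/a/b.html?file=x.pdf` yields `html` rather than `pdf`, because the query is removed before the last dot is searched.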
reger
c37dda8849 fix NPE on MultiProtocolURL on url with parameter value and '='
in getAttribute
- added test case for it
2015-05-12 01:09:10 +02:00
reger
71bf95af8a upd parser calls in test cases 2015-04-25 03:24:28 +02:00
reger
f63fff9008 fix snippet containing number with comma as decimal point http://mantis.tokeek.de/view.php?id=344
to keep it as one word (by altering the split regex)
- added snippet test case with number
- regex for word split to match multiple split chars
2015-03-16 02:03:40 +01:00
reger
2ef8ffdb60 apply UTF-8 encoding
copied from escape()
2015-03-15 06:02:45 +01:00
reger
7120ea42f1 fix for path with char code > 255
(causing index out of bound exception)
+ test case for it
2015-03-15 03:37:32 +01:00
reger
1d81bd0687 fix url encoding for path see http://mantis.tokeek.de/view.php?id=559
So far we used same escape procedure for all parts of the url (which includes x-www-form-urlencoded for all url components)
Added capability to use different encoding rules for the different url components (through specific bitset for each component).
(this is inspired by org.apache.http.client and java.net.uri implementation).
- Added test case for  http://mantis.tokeek.de/view.php?id=559
2015-03-15 00:46:07 +01:00
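
The per-component idea described in this commit message can be sketched as follows. This is an illustration under stated assumptions, not the code from MultiProtocolURL: the class name, the two bitsets and their exact contents are hypothetical. The path set follows the `pchar` rule of RFC 3986, and the query set is deliberately stricter.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

/** Hypothetical sketch of component-specific escaping; not YaCy's implementation. */
final class ComponentEscaper {

    /** Characters left as-is in a path segment: unreserved, sub-delims, ':' and '@'. */
    static final BitSet PATH_SAFE = new BitSet(128);
    /** Characters left as-is in a query value: unreserved only, since '&', '=' and '+' have meaning there. */
    static final BitSet QUERY_SAFE = new BitSet(128);

    static {
        for (char c = 'a'; c <= 'z'; c++) { PATH_SAFE.set(c); QUERY_SAFE.set(c); }
        for (char c = 'A'; c <= 'Z'; c++) { PATH_SAFE.set(c); QUERY_SAFE.set(c); }
        for (char c = '0'; c <= '9'; c++) { PATH_SAFE.set(c); QUERY_SAFE.set(c); }
        for (char c : "-._~".toCharArray()) { PATH_SAFE.set(c); QUERY_SAFE.set(c); }
        for (char c : "!$&'()*+,;=:@".toCharArray()) { PATH_SAFE.set(c); }
    }

    /** Percent-encodes every UTF-8 byte of s that is not marked in the given bitset. */
    static String escape(String s, BitSet safe) {
        StringBuilder sb = new StringBuilder(s.length() + 16);
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            int v = b & 0xFF;
            if (safe.get(v)) {
                sb.append((char) v);
            } else {
                sb.append('%');
                sb.append(Character.toUpperCase(Character.forDigit(v >> 4, 16)));
                sb.append(Character.toUpperCase(Character.forDigit(v & 0xF, 16)));
            }
        }
        return sb.toString();
    }
}
```

The same `escape` routine serves every component; only the bitset passed in changes. For the input `a b+c`, the path set keeps the `+` while the query set encodes it as `%2B`.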
reger
f94e34058c fix url (path) %-decoding http://mantis.tokeek.de/view.php?id=519
- add test case for this
2015-03-11 01:05:14 +01:00
reger
16bc267a32 add test case for snippet html encoding check 2015-03-01 23:50:17 +01:00
reger
77851fa53c fix parser test cases
(Vocabulary parameter)
2015-02-11 01:43:02 +01:00
reger
df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
After several tests, found that eom is not prevented. The major reason in testing was the assumption that a future GC will free the average of the last 5 GCs.
Disabling this check improved eom exceptions.

Added simplest testcase used for verification
2015-02-11 01:42:01 +01:00
Michael Peter Christen
68c605d637 replace with CommonPattern.SPACE for split 2015-01-29 02:28:03 +01:00
reger
9edc7308aa update to metadata-extractor-2.7.0.jar
add 2 simple JUnit test cases for jpeg and tif parsing
2014-12-15 20:45:05 +01:00
reger
5d67e165d9 remove redundant null check in ResponseHeader.lastModified
added a JUnit testcase for ResponseHeader dates (using age()),
adjusted age() to pass all tests
2014-12-09 00:58:08 +01:00
reger
ea633a794c including small junit test case for WordTokenizer 2014-11-29 22:13:24 +01:00
reger
aa2e15d846 allow url parameter in worktable apicall
allow url=wwwl?param=a&param=b (with ?, & encoded)
fix:  http://mantis.tokeek.de/view.php?id=100

fix double adding of  '&' in MultiProtocolURL.escape()
2014-10-05 20:05:03 +02:00
reger
e88537522d allow single quote " ' " in query
see http://mantis.tokeek.de/view.php?id=379
-add QueryGoal test case for this
2014-08-16 14:29:52 +02:00
reger
e50b2b4d04 fix test case MultiProtocolURL.toString()
(only allowed on AnchorURL)
2014-08-11 04:29:43 +02:00
reger
b510b182d8 - update Maven pom
- add ppt parser test case
2014-08-01 01:47:53 +02:00
Michael Peter Christen
2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
attribute in the <a> tag for each crawl. This introduces a lot of
changes because it extends the usage of the AnchorURL Object type which
now also has a different toString method than the underlying
DigestURL.toString. It is therefore not advised to use .toString at all
for urls, just use toNormalform(false) instead.
2014-07-18 12:43:01 +02:00
reger
1f2eba977d add test case for Records (used in HostBalancer)
- simulating seek error (http://mantis.tokeek.de/view.php?id=411)
2014-07-06 20:41:26 +02:00
reger
e94efd4d7c update to JUnit 4.11
- fix build.xml -> parserTest error on Windows due to javac encoding
2014-07-06 05:38:32 +02:00
reger
3b77e41f1a adding test for HostQueue crawl stack
- simulating problem with zero length stack file (but not fixing it)
- adding test data clean to maven pom
2014-07-06 00:38:16 +02:00