Commit Graph

6 Commits

Author SHA1 Message Date
Michael Peter Christen
e101c2e0e2 added changes from copperdust (submitted by email):
1. Improved and fixed language detection:
	1.1 Identificator.java - recognition fix (improved)
	1.2 DCEntry.java - fix (changed detection order due to detection from
tld in many cases is incorrect)
	1.3 MultiProtocolURI.java - fixed and enhanced language from tld
detection (all currently used top-level domains; ccTLD added but not
tested).
2. Ukrainian language update.
3. Main Slavic languages langstats (tested and works fine).
2012-02-22 12:21:27 +01:00
low012
fe6142a6ab *) ...and even more languages.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5252 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-10-05 20:20:21 +00:00
low012
9f0cc3afdd *) more languages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5250 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-10-05 12:28:40 +00:00
low012
e96a3d0472 *) added statistics for a few languages: Czech, Esperanto, Irish, Turkish
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5249 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-10-05 10:07:22 +00:00
low012
b6cf4abc5e *) added a few language statistic files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4840 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-05-23 16:55:50 +00:00
low012
a7dadf7f2f *) first version of a way to determine the language a text is written in (not perfect, but it works)
*) statistical data of languages can be found in the *.lng files in the new directory called "langfiles"

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4824 6c8d7289-2bf4-0310-a012-ef5d649a1542
2008-05-18 21:24:05 +00:00