"be consistent" | |
PerlMonks |
Re: How to Identify a languageby florg (Friar) |
on Sep 19, 2006 at 06:11 UTC ( [id://573640]=note: print w/replies, xml ) | Need Help?? |
If you just want a program to classify text you might also be interested in: TextCat. It's a Perl script that uses "N-Gram-Based Text Categorization" and has worked for me in the past. Though I did not need to classify Asian languages, it's supposed to support CJK. A list of languages and an article discussing the approach can be found on the page as well.
In Section
Seekers of Perl Wisdom
|
|