We don't bite newbies here... much | |
PerlMonks |
Idiom guessing scriptby Andre_br (Pilgrim) |
on Nov 21, 2005 at 03:13 UTC ( [id://510346]=perlquestion: print w/replies, xml ) | Need Help?? |
Andre_br has asked for the wisdom of the Perl Monks concerning the following question:
Hello folks
I need to develop a more trustable way to guess the language of strings as short as book titles. I've just tried Text::Language::Guess but it´s results are quite unreliable on tests with short strings. I´ve noticed this module considers mostly the articles to guess. But I´d need to provide Perl full dictionary recognition for the six idioms involved. (fr,it,es,en,de,pt)
So, I have two major issues to overcome:
2) Once I have these pure words loaded in distinct .txts, how to do the matching approach? Text::Language::Guess's article based guessing is not enough because you can have titles like 'Cutting Edges' that don´t happen to have any articles or pronoums. You just have to know that 'cutting' and 'edge(s)' is english and that´s all. I wait for thy help then! Thanks André
Back to
Seekers of Perl Wisdom
|
|