PerlMonks  

Re: (OT) How to deal with non-ascii names

by jonadab (Parson)
on Aug 13, 2010 at 17:34 UTC (#854956)


in reply to (OT) How to deal with non-ascii names

Is there any Perl module that helps translating things into ASCII in a reliable way?

HTML::Entities. HTH. HAND.
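For anyone who wants to see what that tongue-in-cheek answer would actually produce, here is a minimal sketch. It assumes HTML::Entities is installed from CPAN (it is not a core module), and the sample name is made up for illustration.

```perl
use strict;
use warnings;
use utf8;                                  # the source below contains literal non-ASCII characters
use HTML::Entities qw(encode_entities);

# Made-up sample name, for illustration only.
my $name  = "José Müller";

# With no second argument, encode_entities replaces control characters,
# high-bit characters, and the HTML-special characters with entities.
my $ascii = encode_entities($name);

print "$ascii\n";                          # Jos&eacute; M&uuml;ller
```

The result is pure ASCII, but it is an HTML encoding of the name and not a romanization, so nobody will want to read it or type it.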

In all seriousness, I agree to a large extent with what the others have said. If you have to do this automatically (without getting a romanized version from the user), the transliteration method is going to need to be language-specific.
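To illustrate why the method has to be language-specific, here is a rough sketch using two tiny hand-written tables. The tables, the `transliterate` helper, and the sample names are all hypothetical; they only assume the common conventions that German "ü" is written "ue" while Turkish "ü" is usually reduced to "u". Real rule sets are much larger, and this one ignores uppercase letters entirely.

```perl
use strict;
use warnings;
use utf8;                                  # literal non-ASCII characters in the source

# Hypothetical per-language tables, deliberately incomplete.
my %table = (
    de => { 'ü' => 'ue', 'ö' => 'oe', 'ä' => 'ae', 'ß' => 'ss' },
    tr => { 'ü' => 'u',  'ö' => 'o',  'ş' => 's',  'ç' => 'c', 'ı' => 'i', 'ğ' => 'g' },
);

# Replace each character that has an entry in the chosen language's table;
# leave everything else untouched.
sub transliterate {
    my ($name, $lang) = @_;
    my $map = $table{$lang} or die "no table for language '$lang'";
    $name =~ s/(.)/exists $map->{$1} ? $map->{$1} : $1/ge;
    return $name;
}

print transliterate("Müller", "de"), "\n"; # Mueller
print transliterate("Gül",    "tr"), "\n"; # Gul
```

The same input character comes out differently depending on the language, which is why a single language-agnostic table cannot get every name right.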

For instance, for Japanese you might check out Lingua::JA::Hepburn::Passport. It doesn't appear to support kanji, but I'm not sure it's possible to automatically romanize kanji, since most of them have at least half a dozen different readings. The same character might romanize to "mei" in one name, "myo" or "myou" in another name, "min" in another, "a" in another, "aka" in another, "aki" in another (this is a real example). If you can't get furigana (pronunciation guide characters, usually kana) from the user, names are going to get romanized very incorrectly.
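The ambiguity can be sketched as a lookup that returns a list instead of a single answer. The post does not name the character, so this assumes 明 (U+660E), which fits the readings listed above; the `%readings` table and the `romanize_candidates` helper are hypothetical and not part of any module.

```perl
use strict;
use warnings;

# Hypothetical table: one kanji, several possible romanizations.
# U+660E is assumed to be the character the example refers to.
my %readings = (
    "\x{660E}" => [qw(mei myou min a aka aki)],
);

# Return every known reading for a character, or an empty list.
sub romanize_candidates {
    my ($char) = @_;
    return @{ $readings{$char} // [] };
}

my @candidates = romanize_candidates("\x{660E}");
printf "%d candidate romanizations: %s\n",
    scalar @candidates, join(", ", @candidates);
# 6 candidate romanizations: mei, myou, min, a, aka, aki
```

Nothing in the character itself says which of the six is right for a given name, so without furigana from the user the code can only guess.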

