|Perl: the Markov chain saw|
Sorting Vietnamese textby pdenisowski (Acolyte)
|on Dec 22, 2013 at 17:08 UTC||Need Help??|
pdenisowski has asked for the
wisdom of the Perl Monks concerning the following question:
I have a very large UTF8 Vietnamese text file that I would like to sort in alphabetical order. The problem is that it seems almost every word processor, utility, etc. out there does not use what I would consider to be the "normal" Vietnamese alphabetical order, usually because it ignores the tone marks (dấu) or puts them in the wrong/random order.For example, for the first 3 letters of the Vietnamese alphabet I would like to use this sort order:
I've looked at all the different modules, etc. but none of them seem to do this "correctly" (the way most printed dictionaries do). I've also looked at dozens of web pages and can't make any of those examples work properly either.
Any ideas? I've struggled with this for years and would be eternally grateful to anyone who could figure this out.
(Here is the complete list of letters in the order in which I wish to order them)