PerlMonks
Unicode normalization layer
by DrWhy (Chaplain) on Sep 15, 2009 at 20:49 UTC ( [id://795472]=perlquestion )
DrWhy has asked for the wisdom of the Perl Monks concerning the following question:
Greetings brothers (of all genders),
I wonder if any of you know where I could get my hands on a PerlIO layer that does Unicode normalization. I have data that is (supposed to be) in UTF-8. I am writing code that uses the :encoding(utf8) layer to validate that it is in fact good UTF-8/Unicode data, but to work with the data I'd like to have it in normalized form (NFKC, to be specific).

I'd really like to have that done in a layer on top of :encoding(utf8) so that I can read the data in blocks and not have to worry about the block boundaries falling between a base character and following combining characters.

Thanks,

--DrWhy

"If God had meant for us to think for ourselves he would have given us brains. Oh, wait..."
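To make the block-boundary problem concrete, here is a minimal sketch of the buffering such a layer would need to do. It is not a PerlIO layer, only a plain subroutine. It assumes a Unicode::Normalize recent enough to export NFKC_partial on request, and the name read_nfkc and the tiny block size are made up for illustration. The idea is to keep the unstable tail of each block in a pending buffer until the next block shows whether more combining characters follow.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFKC NFKC_partial);

# Hypothetical helper (not part of any module): read an already-decoded
# filehandle in blocks of $block_size characters and return the NFKC form
# of the whole stream.
sub read_nfkc {
    my ($fh, $block_size) = @_;
    my $pending = '';    # decoded characters not yet safe to normalize
    my $out     = '';
    while (read($fh, my $block, $block_size)) {
        $pending .= $block;
        # NFKC_partial normalizes the stable prefix and, as a side effect,
        # leaves the unprocessed tail in $pending for the next round.
        $out .= NFKC_partial($pending);
    }
    # At end of input nothing more can follow, so normalize what is left.
    $out .= NFKC($pending);
    return $out;
}

# Demo: five "e + combining acute" pairs followed by the "fi" ligature,
# read three characters at a time so blocks split base/combining pairs.
my $text  = "e\x{301}" x 5 . "\x{FB01}";
my $bytes = $text;
utf8::encode($bytes);    # UTF-8 octets, as they would appear in a file

# :encoding(UTF-8) is the strict decoding layer; read() then counts characters.
open my $fh, '<:encoding(UTF-8)', \$bytes or die "open: $!";
my $result = read_nfkc($fh, 3);
close $fh;

print $result eq NFKC($text) ? "same as whole-string NFKC\n" : "differs\n";
```

In real use the normalized prefix would be handed to the caller block by block rather than accumulated in $out. Wrapping the same buffering in an actual layer (for example via PerlIO::via) is left open here.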