Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number

Re^2: Creating (and using) a custom encoding. (SOLUTION)

by graff (Chancellor)
on Jun 01, 2013 at 02:38 UTC ( #1036371=note: print w/replies, xml ) Need Help??

in reply to Re: Creating (and using) a custom encoding. (SOLUTION)
in thread Creating (and using) a custom encoding.

In case you haven't gotten all the way yet with enc2xs, the only "hard" part is to build the appropriate "Unicode Character Map" (ucm) file to describe the relationship between Unicode and your specialized character encoding.

In case it helps, you might want to look at Encode::Buckwalter, which includes a ucm file to define a specialized ASCII "alphabet" for transliterating Arabic characters. It's fairly simple, except that some character relations only work in one direction (e.g. when going from Unicode to "Buckwalter Transliteration", U+0030 and U+0660 will both map to ASCII "0", but when going from transliteration to Unicode, ASCII "0" will only map to U+0030, and likewise for other digits).

  • Comment on Re^2: Creating (and using) a custom encoding. (SOLUTION)

Replies are listed 'Best First'.
Re^3: Creating (and using) a custom encoding. (SOLUTION)
by davido (Archbishop) on Jun 01, 2013 at 04:03 UTC

    Thanks. I appreciate the links.

    What first motivated this investigation was a quest for alternatives to automatically apply fold case (fc) to an incoming file. I'm well aware that this is a road less traveled. Certainly it violates "the principle of least surprise", and as such I wouldn't consider it for production code. But it's been an interesting investigation so far. :)


Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1036371]
and monks are getting baked in the sun...

How do I use this? | Other CB clients
Other Users?
Others studying the Monastery: (6)
As of 2018-05-25 10:03 GMT
Find Nodes?
    Voting Booth?