Hi monks,
I want to detect Cyrillic characters in a file. Cyrillic characters range from U+0400 to U+04FF. The XML file contains the tag <cd></cd>. The script should validate that each cd element contains only Cyrillic characters. If it contains characters from any other character set, it should report an error. I've tried the following code.
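use Encode;                  # module names are case-sensitive; exports decode_utf8
use Encode::Unicode;
binmode STDOUT, ":utf8";     # set the output layer once, before the loop
# $val holds the raw UTF-8 bytes read from the XML file
while ($val =~ /<cd>(.*?)<\/cd>/gsi) {
    my $bytes = $1;                  # copy the capture before decoding
    my $no    = decode_utf8($bytes); # turn the bytes into Perl characters
}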
But I do not know how to find the Unicode value of each character. Can anyone shed some light on this?
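As far as I can tell, ord() returns the code point of a character once the text has been decoded, so the check might look something like the rough sketch below. It is only a sketch under assumptions: the helper name non_cyrillic_chars and the sample data in $val are made up for illustration, and it treats everything outside U+0400 to U+04FF (including spaces and punctuation) as an error.

```perl
use strict;
use warnings;
use Encode qw(decode_utf8);

# Hypothetical helper: returns the characters of $text that fall
# outside the Cyrillic block U+0400-U+04FF.
sub non_cyrillic_chars {
    my ($text) = @_;
    return grep { ord($_) < 0x0400 || ord($_) > 0x04FF } split //, $text;
}

# Made-up sample input, written as raw UTF-8 bytes as if read from a file.
# The first element is all Cyrillic; the second mixes in Latin letters.
my $val = "<cd>\xD0\x9F\xD1\x80\xD0\xB8\xD0\xB2\xD0\xB5\xD1\x82</cd>"
        . "<cd>\xD0\x9F\xD1\x80ivet</cd>";

binmode STDOUT, ":utf8";
while ($val =~ /<cd>(.*?)<\/cd>/gsi) {
    my $bytes = $1;                   # copy the capture before decoding
    my $text  = decode_utf8($bytes);  # bytes -> Perl characters
    my @bad   = non_cyrillic_chars($text);
    if (@bad) {
        # Report the code point of every offending character
        printf "Error: non-Cyrillic character(s) in <cd>: %s\n",
            join ", ", map { sprintf "U+%04X", ord($_) } @bad;
    }
    else {
        print "OK: $text\n";
    }
}
```

If whitespace or punctuation should be allowed inside cd, I suppose the condition in the grep would need to be loosened accordingly.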
Thanks in advance.
--c