|Think about Loose Coupling|
trouble with umlautsby nefertari (Chaplain)
|on Mar 19, 2002 at 16:15 UTC||Need Help??|
nefertari has asked for the wisdom of the Perl Monks concerning the following question:
Dear monks, perhaps one of you can help me:
Here at our university i am the author of a perl-CGI-script, that searches for preprints on a preprintserver. Until last week it worked (with the exception of upgrades on the server and not getting the privilege to run CGIs). Since last week there is trouble with umlauts. It seems to be due to an update of the server from Debian Potato to Woody (i don't know any of these).
Today in the morning i changed it to die if the input didn't match /^((?:[a-zA-ZäöüÄÖÜß\s])*)$/, so that i could see, where our problems are. Now the funny part: "ö" matches every second time! (But in this case no matches with the preprint list are found, although we have two authors with ö in their names.)
To make it possible to search for people without umlauts on their keyboard we decided that oe should match ö in the search. So if i search for oe i again get no matches.
Does anybody of you have an idea what could be wrong?
Our data are in a XML-file, and umlauts are in a very ugly way encoded: <UL>o</UL> stands for ö. (I didn't design this part.) Then i parse them via XML::Parser and store only the matching preprints. One problem could be, that i write ö and the other umlauts directly in the perlscript. But i don't know another way to do this. If you know, i would be glad, if you could tell me how to achieve this.