Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Re^2: How to remove other language character from a string

by Anonymous Monk
on Nov 26, 2012 at 05:36 UTC ( #1005560=note: print w/ replies, xml ) Need Help??


in reply to Re: How to remove other language character from a string
in thread How to remove other language character from a string

Thanks moritz, but when I tried this I got the output like this:

α╕α╕α╕▒α╕α&# +9557;α╕α╕α╣α╣α& +#9557;α╕α╕α╣α╕α +╕┤α╕α╣α╕α╣ +α╕α╕▓α╕ Croissant Egg Sandwich α╕α╕α╕▒α&#9557 +;α╕α╕α╕α╣α&#957 +1;α╕α╕α╕α╣α&#95 +57;α╕┤α╕α╣α╕&#9 +45;╣α╕α╕▓α╕


Comment on Re^2: How to remove other language character from a string
Download Code
Re^3: How to remove other language character from a string
by frozenwithjoy (Curate) on Nov 26, 2012 at 05:51 UTC

    That's because it wasn't formatted correctly due to missing code tags (which were presumably left out so that the input text would be shown properly). When I first ran moritz's code, I just got the original string, but when I substituted:

    $str =~ s/[^\p{Latin}\s]//g;

    for this:

    $str =~ s/^\p{Latin}\s//g;

    it worked.

    EDIT: If you have lots of extra spaces in your output, you could run it through $str =~ s/ {2,}/ /g;, too. Something to keep in mind is that moritz's approach (as is) will remove punctuation.

      It worked smoothly. Thanks Frozenwithjoy and moritz.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1005560]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (5)
As of 2014-09-03 07:57 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My favorite cookbook is:










    Results (35 votes), past polls