Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris
 
PerlMonks  

Re^2: How to remove other language character from a string

by Anonymous Monk
on Nov 26, 2012 at 05:36 UTC ( #1005560=note: print w/ replies, xml ) Need Help??


in reply to Re: How to remove other language character from a string
in thread How to remove other language character from a string

Thanks moritz, but when I tried this I got the output like this:

α╕α╕α╕▒α╕α&# +9557;α╕α╕α╣α╣α& +#9557;α╕α╕α╣α╕α +╕┤α╕α╣α╕α╣ +α╕α╕▓α╕ Croissant Egg Sandwich α╕α╕α╕▒α&#9557 +;α╕α╕α╕α╣α&#957 +1;α╕α╕α╕α╣α&#95 +57;α╕┤α╕α╣α╕&#9 +45;╣α╕α╕▓α╕


Comment on Re^2: How to remove other language character from a string
Download Code
Replies are listed 'Best First'.
Re^3: How to remove other language character from a string
by frozenwithjoy (Curate) on Nov 26, 2012 at 05:51 UTC

    That's because it wasn't formatted correctly due to missing code tags (which were presumably left out so that the input text would be shown properly). When I first ran moritz's code, I just got the original string, but when I substituted:

    $str =~ s/[^\p{Latin}\s]//g;

    for this:

    $str =~ s/^\p{Latin}\s//g;

    it worked.

    EDIT: If you have lots of extra spaces in your output, you could run it through $str =~ s/ {2,}/ /g;, too. Something to keep in mind is that moritz's approach (as is) will remove punctuation.

      It worked smoothly. Thanks Frozenwithjoy and moritz.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1005560]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others cooling their heels in the Monastery: (14)
As of 2015-07-30 19:30 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (273 votes), past polls