Beefy Boxes and Bandwidth Generously Provided by pair Networks
Think about Loose Coupling
 
PerlMonks  

Re: Regex help

by sauoq (Abbot)
on Oct 01, 2002 at 21:32 UTC ( #202121=note: print w/ replies, xml ) Need Help??


in reply to Regex help

If you are using this to strip potentially malicious code, you should be more liberal in what you match.

1. Use /i because tags can be upper, lower, or mixed cases.

<script>CODE</script> <SCRIPT>CODE</SCRIPT> <ScRiPt>CODE</ScRiPt>

2. Be careful of whitespace in tags.

<script >CODE</script> <script>CODE</script >

3. Be careful of what gets left behind after you strip it. (The following example is a good reason not to use a non-greedy match.)

<<script></script>script>CODE</script>

I'd use something like jeffa's and eliminate as much as possible. I don't see any immediate problems with this: s#<script.*script\s*>##gis; but I didn't test it very thoroughly and there may be some. You might consider substituting repeatedly until nothing matches in order be sure you've avoided the 3rd issue above but that may well be overkill.

-sauoq
"My two cents aren't worth a dime.";


Comment on Re: Regex help
Select or Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://202121]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others romping around the Monastery: (11)
As of 2014-11-26 11:40 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My preferred Perl binaries come from:














    Results (167 votes), past polls