Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl: the Markov chain saw

Re: Removing Javascript

by domm (Chaplain)
on Jan 04, 2003 at 13:00 UTC ( #224247=note: print w/replies, xml ) Need Help??

in reply to Removing Javascript

Jet another way, using HTML::Tree:

Please note that this doesn't handle the unescaped closing script tag in document.write. I'd suggest running tidy on the input before passing it to the parser.

#!/usr/bin/perl -w use strict; use HTML::Tree; my $doc = <<EOF; <html> <head> <script language="Javascript"> document.write("Don't forget your &lt;/script&gt; tag! It's important! +"); </script> </head> <body> This is just some text. </body> </html> EOF my $root=HTML::TreeBuilder->new(); $root->parse($doc); $root->eof; foreach my $n ($root->descendants) { next unless $n->tag; # skip text nodes $n->delete if $n->tag eq 'script'; } print $root->dump; # prints structure print $root->as_HTML # prints as HTML
-- #!/usr/bin/perl for(ref bless{},just'another'perl'hacker){s-:+-$"-g&&print$_.$/}

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://224247]
and the sunlight beams...

How do I use this? | Other CB clients
Other Users?
Others having an uproarious good time at the Monastery: (3)
As of 2017-07-22 12:45 GMT
Find Nodes?
    Voting Booth?
    I came, I saw, I ...

    Results (338 votes). Check out past polls.