Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic

Re: how to extract iframes from text

by moritz (Cardinal)
on Apr 30, 2013 at 17:55 UTC ( #1031442=note: print w/replies, xml ) Need Help??

in reply to how to extract iframes from text

Regex is an option, but it usally doesn't work well for nexted delimiters (which can happen with iframes).

I personally like Mojo::DOM for this kind of task:

use Mojo::DOM; say Mojo::DOM->new($yourstring)->at('iframe');

Replies are listed 'Best First'.
Re^2: how to extract iframes from text
by Anonymous Monk on Apr 30, 2013 at 18:32 UTC

    thanks for the quick answer.

    Here is the line of code: print STDERR (Mojo::DOM->new($args->{$t})->at('iframe'));

    this is what i get :

    $VAR1 = '<p>No one\'s telling the truth anymore, and that makes the nu +mbers suspect.</p> <p>***<iframe width="480" height="360" src="http://localhost:8000/embe +d/static/clips/2012/12/17/28210/test-rush" allowfullscreen="" framebo +rder="0" scrolling="no"></iframe>***</p> <p>\\n</p> <p>Instead of addressing the fact that some text</p> <p>\\n</p> <p>***<iframe width="480" height="360" src="http://localhost:8000/embe +d//static/video/2012/09/07/fnc-ff-20120907-doocytaxes" allowfullscree +n="" frameborder="0" scrolling="\\"no\\""></iframe>***</p> <p>\\n</p> <p>The very first example AP cites was already corrected.some text ... +.Reacting to recent <a href="/blog/2013/04/17/major-errors-undermine- +key-argument-for-austeri">research</a> that has questions.</p> <p>\\n</p> <p>***<iframe width="480" height="360" src="http://localhost:8000/embe +d/static/clips/2013/04/29/29939/fnc-an-20130429-hemmermooredebtgdp" a +llowfullscreen="" frameborder="0" scrolling="no"></iframe>***</p> <p>\\n Arriving at such a conclusion requires not only obscuring the i +mportance in pushing global austerity <a href="/static/images/item/gd +p-components.jpg">strong measures</a> of too little spending.</p>'; <iframe allowfullscreen="" frameborder="0" height="360" scrolling="no" + src="http://localhost:8000/embed/static/clips/2012/12/17/28210/test- +rush" width="480"></iframe>

    it seems to extract some of the non iframe stuff too....

    how to get rid of the non iframe part or put just the iframe part in an array
        You are right. that was the dumper(line before). Is there a way to put it in an array?
        You are right. that was the dumper(line before). But it is printing just the first iframe not the others. Is there a way to put them all in an array?

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1031442]
[choroba]: exceptions to this rule are rare

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (10)
As of 2018-03-20 10:47 GMT
Find Nodes?
    Voting Booth?
    When I think of a mole I think of:

    Results (250 votes). Check out past polls.