Re: How can I find the contents of an HTML tag?


Problems? Is your data what you think it is?
	PerlMonks

Re: How can I find the contents of an HTML tag?

by mikfire (Deacon)

on May 09, 2000 at 18:54 UTC ( [id://10761]=note: print w/replies, xml )

Need Help??

in reply to How can I find the contents of an HTML tag?

Assuming that the distinguishing characteristic of the TD entry you want to extract is the leading and following space, I'd suggested a regex something like this:

my( $var ) = $html =~ m#<TD> (.*?) </TD>#;

print "We found it: $var\n" if defined $var;
[download]

The part inside the capturing parens (.*?) says to save any characters found, possibly none. It says to take the fewest possible characters to complete the match — i.e., be non-greedy.

The only way you will be able to know if the match succeeded is to test for definedness. Testing for true/false will fail on the empty case because perl treats the empty string as false.

If the <TD>Foo:</TD> part will always occur immediately in front of the <TD> instances you're interested in, we can make the regex more robust:

m#<TD>Foo:</TD><TD> (.*?) </TD>#
[download]

Comment on Re: How can I find the contents of an HTML tag? Select or Download Code

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://10761]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others taking refuge in the Monastery: (3)

As of 2024-04-19 22:42 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found