Beefy Boxes and Bandwidth Generously Provided by pair Networks
more useful options
 
PerlMonks  

Try HTML::Parser

by Kozz (Friar)
on Feb 18, 2002 at 17:36 UTC ( [id://146186]=note: print w/replies, xml ) Need Help??


in reply to shameful reg expression

You should give HTML::Parser module a try. Otherwise, I don't know what your regex looks like (where you're capturing the text), but you might try placing into your existing regex code
([^><]+)
which captures a string of characters that do NOT match either > or <. Keep in mind, however, that this should be placed correctly in the regex, otherwise it could also give you matches like "td" and "b".

But again, this may not necessarily work in all situations, even if you write a damned good regex. For best results, look into HTML::Parser.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://146186]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others imbibing at the Monastery: (4)
As of 2024-04-20 02:01 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found