PerlMonks  

Re: Extracting information

by fuzzysteve (Beadle)
on Jan 09, 2002 at 20:03 UTC


in reply to Extracting information

My regexes aren't what they could be, but after some experimentation with the code, the problem arises when there is a < before the </a>.
The problem appears to be in your first regex (the while loop check).
Looking at the regexp, you've written it to exclude any data that has a < before the </a>. The culprit is the ([^<]+): you've specifically excluded any data with tags between the anchor tags.
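
To show what I mean, here is a minimal sketch. I don't have your exact regex or data in front of me, so the sample HTML and both patterns below are my own assumptions, not your code. The first loop uses a [^<]+ capture and skips any anchor with a tag inside it; the second swaps in a non-greedy .*? (one possible fix, not the only one), which runs up to the first </a> and so keeps nested tags.

```perl
use strict;
use warnings;

# Hypothetical input: the second link has a <b> tag inside the anchor.
my $html = '<a href="one.html">Plain</a> '
         . '<a href="two.html"><b>Bold</b> text</a>';

# Assumed shape of the original pattern: [^<]+ cannot cross a '<',
# so an anchor whose text contains another tag never matches.
my @strict;
while ($html =~ m{<a\s+href="([^"]*)">([^<]+)</a>}gi) {
    push @strict, $2;
}

# Non-greedy alternative: .*? stops at the first </a>, nested tags allowed.
# The /s flag lets '.' match newlines in case the anchor spans lines.
my @lazy;
while ($html =~ m{<a\s+href="([^"]*)">(.*?)</a>}gis) {
    push @lazy, $2;
}

print "strict matched: ", scalar(@strict), "\n";
print "lazy matched:   ", scalar(@lazy), "\n";
print "  $_\n" for @lazy;
```

With this sample, the strict loop only picks up the first link, while the non-greedy one picks up both, tags and all. You would still need to strip the inner tags from the captured text if you only want the words.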
