PerlMonks  

Re: Better way?

by chromatic (Archbishop)
on Jun 16, 2000 at 22:47 UTC (#18528)


in reply to Better way?

Sounds like you want an HTML Parser. Try HTML::Parser or something similar on CPAN.
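
As a rough illustration of that suggestion, here is a minimal sketch of pulling table cell text out of a page with HTML::Parser, so the data can be worked on without touching the markup. It assumes the version 3 handler interface of HTML::Parser and reasonably well-formed tables (closing tags present, no nesting). The helper name extract_rows and the sample HTML are made up for this example and do not come from the original question.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper (not part of HTML::Parser): returns a list of
# array refs, one per table row, each holding the text of its cells.
# Assumes every <td>/<th>/<tr> is closed and tables are not nested.
sub extract_rows {
    my ($html) = @_;
    my (@rows, $row, $cell);

    my $parser = HTML::Parser->new(
        api_version => 3,
        # Opening tags: start a new row or a new cell buffer.
        start_h => [ sub {
            my ($tag) = @_;
            if    ($tag eq 'tr')                 { $row  = [] }
            elsif ($tag eq 'td' || $tag eq 'th') { $cell = '' }
        }, 'tagname' ],
        # Text: append to the current cell, ignore text outside cells.
        text_h => [ sub {
            my ($text) = @_;
            $cell .= $text if defined $cell;
        }, 'dtext' ],
        # Closing tags: store the finished cell or the finished row.
        end_h => [ sub {
            my ($tag) = @_;
            if (($tag eq 'td' || $tag eq 'th') && defined $cell) {
                $cell =~ s/^\s+|\s+$//g;    # trim surrounding whitespace
                push @$row, $cell if $row;
                undef $cell;
            }
            elsif ($tag eq 'tr' && $row) {
                push @rows, $row;
                undef $row;
            }
        }, 'tagname' ],
    );

    $parser->parse($html);
    $parser->eof;
    return @rows;
}

# Sample input, invented for illustration only.
my $html = '<table><tr><th>Name</th><th>Price</th></tr>'
         . '<tr><td>Widget</td><td>9.99</td></tr></table>';

# Print each row as tab-separated values.
print join("\t", @$_), "\n" for extract_rows($html);
```

The tags are only used as delimiters here. Once the rows are in a plain Perl data structure, the sorting and matching can be done on the data itself.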


RE: Re: Better way?
by jen (Novice) on Jun 17, 2000 at 01:17 UTC
    I did, and as far as I can tell it's not helpful, because the HTML tags themselves are almost never meaningful in the pages we get back. For example, it's all well and good to be able to pick out the data between table tags, but then I still have to sort through the table data.

    (I think the problem is that, in my case, it's the data and not the HTML tags that is significant. HTML::Parser is good for cases where the tags are the significant piece. If someone has used HTML::Parser in a similar way, please let me know.)
        I wouldn't think that it will happen anytime soon, especially if it depends on the same people who make their web servers ignore HEAD requests, or kill the connection if you don't fake an "official" user-agent... Take Hotmail, for example.
