Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine

Re: Better way?

by chromatic (Archbishop)
on Jun 16, 2000 at 22:47 UTC ( #18528=note: print w/ replies, xml ) Need Help??

in reply to Better way?

Sounds like you want an HTML Parser. Try HTML::Parser or something similar on CPAN.

Comment on Re: Better way?
Replies are listed 'Best First'.
RE: Re: Better way?
by jen (Novice) on Jun 17, 2000 at 01:17 UTC
    I did, and, as far as I can tell, it's not helpful, because the HTML tags themselves are almost never meaningful in the pages we get back. For example, it's all well and good to be able to pick out the data between table tags, but then I still have to sort through the table data.

    (I think the problem is that, in my case, it's the data and not the HTML tags that are significant - HTML::Parser is good for cases where the tags are the significant piece. If someone has used HTML::Parser in a similar way, please let me know.)
        I wouldn't think that it will happen anytime soon, specially if it depends on the same people that make their web servers ignore HEAD requests, or kill the connection if you don't fake an "official" user-agent... Take Hotmail for an example.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://18528]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others contemplating the Monastery: (5)
As of 2015-11-30 07:15 GMT
Find Nodes?
    Voting Booth?

    What would be the most significant thing to happen if a rope (or wire) tied the Earth and the Moon together?

    Results (763 votes), past polls