Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid

Re: Better way?

by chromatic (Archbishop)
on Jun 16, 2000 at 22:47 UTC ( #18528=note: print w/replies, xml ) Need Help??

in reply to Better way?

Sounds like you want an HTML Parser. Try HTML::Parser or something similar on CPAN.

Replies are listed 'Best First'.
RE: Re: Better way?
by jen (Novice) on Jun 17, 2000 at 01:17 UTC
    I did, and, as far as I can tell, it's not helpful, because the HTML tags themselves are almost never meaningful in the pages we get back. For example, it's all well and good to be able to pick out the data between table tags, but then I still have to sort through the table data.

    (I think the problem is that, in my case, it's the data and not the HTML tags that are significant - HTML::Parser is good for cases where the tags are the significant piece. If someone has used HTML::Parser in a similar way, please let me know.)
        I wouldn't think that it will happen anytime soon, specially if it depends on the same people that make their web servers ignore HEAD requests, or kill the connection if you don't fake an "official" user-agent... Take Hotmail for an example.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://18528]
[erix]: ha ha
[LanX]: I don't get it

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (5)
As of 2018-03-22 20:07 GMT
Find Nodes?
    Voting Booth?
    When I think of a mole I think of:

    Results (284 votes). Check out past polls.