Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation

Re: HTTP Scripting

by ajt (Prior)
on Nov 29, 2002 at 12:30 UTC ( #216514=note: print w/replies, xml ) Need Help??

in reply to HTTP Scripting


Perl is good at this kind of thing. Perl has modules to connect to web servers (LWP), work with the cookies and passwords, and parse HTML (HTML).

Perl has several HTML/XML parsers, some are general purpose parsers, and some are dedicated, e.g. link extractors, header parsers.

You could argue that your choice is so wide that it becomes daunting!

I would suggest the following books: Perl and LWP which is all about connecting to, collecting from, and parsing of web data. I would also suggest Data Munging with Perl, it's a little older and more generic (it's for more than just web automation), but it's a fine book and has good examples of web data mining. Web Client Programming with Perl is old and out of print, but it's freely available as an OpenBook from O'Reilly, and quite useful.

I would also check out merlyn's columns as I think there are some good examples in there with good descriptions. There may also be something in's article archive.


Replies are listed 'Best First'.
Re: Re: HTTP Scripting
by marinersk (Priest) on Nov 29, 2002 at 15:24 UTC
    Thanks, ajt, excellent book references, and things which will gladly join my growing library.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://216514]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (6)
As of 2018-03-18 20:28 GMT
Find Nodes?
    Voting Booth?
    When I think of a mole I think of:

    Results (230 votes). Check out past polls.