Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??

Not in Perl but ... we do web automation to fill in forms on other sites. That is if the site wount cooperate and accept the data (job offers) in a XML/CSV/plaintext/... file, we just fake a user clicking buttons, filling in fields, selecting pulldowns and radios, clicking links, ...

We do this by creating an Internet Explorer object and controling it. It's a big can of worms, but it seems to be working fine most of the time. I agree WWW::Mechanize would be easier most of the time, the thing is the sites that do not accept the files are usualy the same ones that use crazy JavaScript (if not something worse) on their pages so we do need the browser object to allow the JavaScript to run. This way is slow, but works with almost any site.

Jenda
Always code as if the guy who ends up maintaining your code will be a violent psychopath who knows where you live.
   -- Rick Osborne

The sites do know we are doing this (I believe). We (or our clients) pay for the job ads so they have no reason to complain.

Edit by castaway: Closed small tag in signature


In reply to Re: "Web Automation" -- your input is greatly desired! by Jenda
in thread "Web Automation" -- your input is greatly desired! by Dice

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others surveying the Monastery: (6)
As of 2024-04-24 03:32 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found