Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re^3: Selenium get_page_source function

by Illuminatus (Curate)
on Aug 08, 2013 at 15:44 UTC ( #1048597=note: print w/ replies, xml ) Need Help??


in reply to Re^2: Selenium get_page_source function
in thread Selenium get_page_source function

Your code example shows it writing the return from get_page_source to the file that you're renaming to ".zip". I wasn't sure if some additional transformation was necessary to remove the html 'wrapper'. These are the things I would do (some of which you may have already done):

  1. Download the zip file manually and verify that you can unzip it
  2. If you're on a *nix system, run 'sum file.zip' and 'file file.zip' on the file and keep the results
  3. Verfiy the 'content-encoding:' is what you expect
  4. Once you use your script to download, use 'sum' and 'file' on the result(s) to look for differences

fnord


Comment on Re^3: Selenium get_page_source function

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1048597]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (10)
As of 2014-08-20 06:20 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The best computer themed movie is:











    Results (105 votes), past polls