The following script retrieves the content of an image displayed on a page:
#!perl
use strict;
use warnings;
use 5.012;
use WWW::Mechanize::Chrome;
use Log::Log4perl ':easy';
Log::Log4perl->easy_init($TRACE);
use File::Temp 'tempdir';
use Cwd;
my $tempdir = tempdir();
my $mech = WWW::Mechanize::Chrome->new(
    headless           => 1,
    data_directory     => $tempdir,
    download_directory => cwd(),
);
use Data::Dumper;
my $res = $mech->get('https://egp.rutgers.edu/cgi/wmc.pl');
say Dumper $mech->getResourceTree_future()->get;
my $link = $mech->xpath( '//a[text()="MY IMAGE"]', single => 1 );
$mech->click($link);
$mech->sleep(1);
my $resources = $mech->getResourceTree_future()->get;
my @images = grep { $_->{type} eq 'Image' } @{$resources->{resources}};
my $image = $mech->getResourceContent_future( $images[0]->{url} )->get->{content};
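# Write the raw image bytes to disk, checking for I/O errors
open my $fh, '>:raw', 'test.jpg' or die "Cannot open test.jpg: $!";
print $fh $image;
close $fh or die "Cannot close test.jpg: $!";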
Note that you will need a way to find which image is the one you want.
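One possible way to do that, as a minimal sketch: filter the resource list by URL pattern instead of taking the first image. The helper name pick_image, the sample data, and the example.com URLs are made up for illustration. The sketch assumes each resource entry has url and type keys, as the script above relies on.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of WWW::Mechanize::Chrome):
# return the first resource of type 'Image' whose URL matches $pattern.
sub pick_image {
    my ( $resources, $pattern ) = @_;
    for my $r (@$resources) {
        next unless ( $r->{type} // '' ) eq 'Image';
        return $r if $r->{url} =~ $pattern;
    }
    return;    # nothing matched
}

# Made-up sample data, shaped like the entries in $resources->{resources}
my @sample = (
    { url => 'https://example.com/style.css', type => 'Stylesheet' },
    { url => 'https://example.com/logo.png',  type => 'Image' },
    { url => 'https://example.com/photo.jpg', type => 'Image' },
);

my $hit = pick_image( \@sample, qr/photo\.jpg$/ );
print $hit ? $hit->{url} : 'no match', "\n";
```

In the script above, you would pass $resources->{resources} in place of \@sample and hand the url of the returned entry to getResourceContent_future.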