Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Re: Scraping a website - Unterminated quoted string

by Discipulus (Abbot)
on May 05, 2017 at 07:50 UTC ( #1189555=note: print w/replies, xml ) Need Help??


in reply to Scraping a website - Unterminated quoted string

hello Staralfur and welcome to the monastery and to the wonderful world of Perl

since you are a newbie, I permit to suggest something. First as already said, take the habit to use 3 args form for open using lexical filehandles: open my $fh, '<', $file_path or die "Unable to open [$file_path] $!" infact if you use $file_path as variable you can print it also in the die message, using square brackets to be sure you have no typos in it. In addition to $! you might want to print also $^E or last OS error. See them in perlvar

Now about your script: this is not scraping is.. curling ;=)

Scraping the web is a black art, and i'm still a newbie in that but besides basic tasks accomplished via LWP::UserAgent you can use App::scrape (fixed link thanks to kennethk) by our dear brother Corion or Web::Scraper by the genial author of Plack / PSGI Miyagawa.

You can read aboout perl web scraping at my homenode in the scraping link section

L*

There are no rules, there are no thumbs..
Reinvent the wheel, then learn The Wheel; may be one day you reinvent one of THE WHEELS.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1189555]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others romping around the Monastery: (1)
As of 2022-01-27 04:54 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    In 2022, my preferred method to securely store passwords is:












    Results (70 votes). Check out past polls.

    Notices?