Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??

Fetching and Storing Information from Websites

Good day everyone

I am pretty fresher on this forum as well as perl, and new to coding too. but started learning these, on seeing my colleagues do in windows box's, fortunately i am in love with linux and installed LinuxMint in my computer.Now i am trying to do a small coding to fetch data from a website and store it in database.the data available in web is .htm file and there are around 35 .htm files with different heading and the data format as follows.

A Week 1

-----------------------------------------------------------------------------------------------

DAY-1 DAY-2 DAY-3 DAY-4 DAY-5

28/01 29/01 30/01 31/01 01/02

------------------- ------------------- ------------------- ------------------- ---------------

000 000 000 000 000

030 030 030 030 030

018 019 019 019 020

002 002 001 002 002

093 096 093 093 090

053 052 056 053 053

007 007 007 007 007

140 130 110 080 080

At present i am copying these info's manually into an excel file as of now, but wanted to do through script and a cron job it. My friend suggested me to do using some scripts and another one suggested perl is the best bet for the same. Can someone guide me?


In reply to fetching and storing data from web. by nicolethomson

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others scrutinizing the Monastery: (4)
As of 2024-04-24 13:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found