http://www.perlmonks.org?node_id=1003781

perlnoobster has asked for the wisdom of the Perl Monks concerning the following question:

Hi perl monks,

I'm currently trying to extract some information from a website, this is the html segment that I am working on:

<td class="ttl"><a href=# onClick="helpW('h_status.htm');">Status</a>< +/td> <td class="nfo">Coming soon. Exp. release 2012, November 13th</td> </tr>

the code that I am using is the following to extract the Status onwards:

 my (@ASINS2)=$final_page=~m!Status</a></td>(.+?)/td>!g;

The results yield nothing, however if I were to change the code to the following:

 my (@ASINS2)=$final_page=~m!<td class="nfo">(.+?)</td>!g;

It seems to work, highlighting that the code/regex has an issue trying to grab any html that is separated on new lines, please can someone help? i'm sure its a regex issue but I cant figure it out?!

 

  (all i require is the Coming soon..... segment)

Thank you