Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked

Tokeparser extraction

by jonnyfolk (Vicar)
on Feb 20, 2010 at 14:39 UTC ( #824386=perlquestion: print w/replies, xml ) Need Help??

jonnyfolk has asked for the wisdom of the Perl Monks concerning the following question:

The following code:
my $item = qq~<tr> <td><font size=2><B><a href='javascript:void(0);' onClick="javascr'../exc_details.asp?PID=0000','Nyhet','toolbar=no,loc +ation=no,directories=no,status=no,menubar=no,scrollbars=yes,resizeabl +e=yes,width=380,height=500');" href="javascript:void(0);">EXCAVATOR</ +A></B></font></td> <td><font size=2>1 Jul 2010</font></td> <td><font size=2>10 Aug 2010</font></td> <td><font size=2><B>Shellshocked</B></font></td> </tr>~; my $p=HTML::TokeParser->new(\$item); my $data = $p->get_text(['</td>']); print "$data\n";
Creates output:
EXCAVATOR 1 Jul 2010 10 Aug 2010 Shellshocked
How can I cleanly just get the two dates fields?

Replies are listed 'Best First'.
Re: Tokeparser extraction
by Corion (Pope) on Feb 20, 2010 at 14:56 UTC

    Just filter for things that look like a date? Or consider HTML::TableExtract, if what you really want to do is parse the table?

Log In?

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: perlquestion [id://824386]
Approved by AnomalousMonk
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (3)
As of 2021-09-19 08:03 GMT
Find Nodes?
    Voting Booth?

    No recent polls found