Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic
 
PerlMonks  

Re: Lotto table extraction

by mojotoad (Monsignor)
on Jan 24, 2012 at 21:08 UTC ( #949759=note: print w/replies, xml ) Need Help??


in reply to Lotto table extraction

Those are some pretty nasty tables (50 or so of them). All sorts of empty rows and cells embedded throughout. In cases like these, it's better to extract all tables and filter based on inspecting particular cells. For example:
#!/usr/bin/perl use strict; use warnings; use LWP::Simple; use HTML::TableExtract; my $data = get('http://www.flalottery.com/exptkt/c3.htm'); my $te = HTML::TableExtract->new; $te->parse($data); for my $t ($te->tables) { my $rc = -1; my($d, $c) = $t->coords; for my $r ($t->rows) { ++$rc; @$r = map { s/^[^a-z0-9]//i; $_ } grep { /[a-z0-9]/i } grep { defined $_ } @$r; next unless @$r && $r->[0] =~ m/^\d+\/\d+\/\d+$/; print "row $d:$c:$rc: ", join(':', @$r), "\n"; } }
The grep/grep/map part eliminates empty cells and gets rid of the   entities that precede the M/E indicators. The 'next' statement afterwards eliminates empty rows and non-dated rows. This is a shotgun approach. You could easily filter each row using specific column indexes, for example.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://949759]
help
Chatterbox?
[marto]: good morning all, TCIF
[gnosti]: I'm reading from ffmpeg lengths of video clips in H:M:S and needing to add them and format them back to H:M:S for feeding back to ffmpeg
[marioroy]: kcott Thanks
[Corion]: gnosti: I would cheat and use my approach instead of dealing with durations as objects, or a module
[Corion]: You could even do the calculation directly - my $duration = $H * 3600+ $M*60 + $S; strftime '%H:%M:%S', timelocal($duratio n)
[Corion]: (and that strftime call is largely superfluous too, converting back to H:M:S isn't hard either)

How do I use this? | Other CB clients
Other Users?
Others examining the Monastery: (9)
As of 2017-08-18 09:00 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Who is your favorite scientist and why?



























    Results (297 votes). Check out past polls.

    Notices?