Depends on how complicated you HTML file is -
Check out HTML::TokeParser for parsing HTML file in a nice way.
However if your HTML file is as simple as <HTML><BODY><Tag>Stuff1=val1,Stuff2=Val2</tag></BODY> </HTML> then I would just do a simple regex and we can help you if we see the HTML page.
regarding reading 3000 files that is very simple
foreach my $file (@ARGV) {
my $fh;
open $fh, "<", $file or die "Can't open $file ($!)";
push @filehandles, $fh;
}
Once you have your handles you can loop through them. Actually you can do this whole thing in a while loop instead of storing the handbles. something like while ($i++ < 3000) { open (blah,blah); do_stuff with blah; }
there is also glob if your filenames are not counter based i.e. file1 file2 etc.
For DB part - you can use DBI. once you have the values you can instert them into a table easily. You can also create a CSV file and then just load them into the DB (most DBs support that)
cheers
SK |