in reply to Trivial HTML extractor utility
HTML::SimpleLinkExtor comes with linktractor which does the same thing as linkx. :)
If you just want TITLE, here's the one that I use:
#!/usr/bin/perl require HTML::HeadParser; local( $/ ); foreach ( @ARGV ) { open my( $fh ), "<", $_ or do { warn "$!"; next }; my $p = HTML::HeadParser->new; $p->parse( <$fh> ); print "$_: ", $p->header( 'title' ), "\n"; }
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Trivial HTML extractor utility
by Dominus (Parson) on Nov 23, 2007 at 03:06 UTC |
In Section
Meditations