Re: read HTML <title> tag


Pathologically Eclectic Rubbish Lister
	PerlMonks

Re: read HTML <title> tag

by dbwiz (Curate)

on May 31, 2005 at 14:06 UTC ( [id://462061]=note: print w/replies, xml )

Need Help??

in reply to read HTML <title> tag

HTML::Parser is not the easiest way of parsing HTML.

Besides Corion's suggestion about HTML::HeadParser, if you need to parse more than the document's title, you may want to get acquainted with HTML::TokeParser. Here is a way of finding the title:

#!/usr/bin/perl
use strict;
use warnings;
use HTML::TokeParser;

my $p = HTML::TokeParser->new('index.html') or die "can't open\n";

while (my $token = $p->get_token)
{
    if ($token->[0] eq "S" and lc $token->[1] eq 'title')
    {
        my $title =  $p->get_text() || "<NO TITLE FOUND>";
        print "$title\n";
        last;
    }
}
[download]

Comment on Re: read HTML <title> tag Download Code

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://462061]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others surveying the Monastery: (10)

As of 2024-04-23 14:51 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found