comment on

Hi All,

I was trying to parse Item tag with regEx but I am having problems because $capture->{_content} as a string is being translated to some other characterset. So, I am trying to pull out <item> tag using below method and I keep getting this error. Can someone please let me know why?

Error: XPathContext: lost current node at link_ext2.pl line 30

#!/usr/bin/perl -w
#use strict;
use warnings;
use XML::RSS::LibXML;
use XML::LibXML;
use LWP::UserAgent;
use Data::Dumper;

#my ( $htmlInfile, $htmlOutfile, $cssOutfile ) = @ARGV;

my $html_link = "http://rss.news.yahoo.com/rss/topstories";
my $parser = XML::LibXML->new;

my $client = LWP::UserAgent->new();
my $capture = $client->get("$html_link") || die"$!\n";
useLibXmlParseXmlItems($capture->{_content});

sub useLibXmlParseXmlItems
{

    my $rss = XML::RSS::LibXML->new;
    $rss->parse($_[0]) || die "Could not parse. <$!>";

    my $xp = XML::LibXML::XPathContext->new($rss);

    my @nodes = $xp->findnodes("/rss/channel/item");

    #print @nodes;
}
[download]

In reply to Parsing Item Tag from RSS feed by mr_p

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


The stupid question is the question not asked
	PerlMonks