http://www.perlmonks.org?node_id=846969


in reply to Re^5: Parsing ITEM Tag from RSS feed using XML::RSS::LibXML
in thread Parsing ITEM Tag from RSS feed using XML::RSS::LibXML

Finding Item:

There are some RSS websites that do not have links as new Items, They are embedded as headlines and stories. In which case I need to parse the Item tag right from there.

LWP::UserAgent

It is reliable for me to use modification time. Does the modification time change when page changes, meaning is it RSS feed requirement?

  • Comment on Re^6: Parsing ITEM Tag from RSS feed using XML::RSS::LibXML

Replies are listed 'Best First'.
Re^7: Parsing ITEM Tag from RSS feed using XML::RSS::LibXML
by rowdog (Curate) on Jun 29, 2010 at 00:13 UTC
    Finding Item: There are some RSS websites that do not have links as new Items, They are embedded as headlines and stories. In which case I need to parse the Item tag right from there.

    I imagine there's a better way to do this but you can look at the structure of that particular file and figure out what you need to pull out. I see $rss->{channel}->{link} as being the kind of thing you're asking about, but there's no item there, just a link (and other elements of the channel).

    LWP::UserAgent It is reliable for me to use modification time. Does the modification time change when page changes, meaning is it RSS feed requirement?

    Yes and no, like Anonymous Monk said.

Re^7: Parsing ITEM Tag from RSS feed using XML::RSS::LibXML
by Anonymous Monk on Jun 28, 2010 at 20:57 UTC
    It is reliable for me to use modification time. Does the modification time change when page changes, meaning is it RSS feed requirement?

    Yes and no. There are recommendations and requirements (see rfc), but web servers/sites frequently ignore such.