http://www.perlmonks.org?node_id=974931


in reply to Re: Parsing HTML
in thread Parsing HTML

Hi, the source code is exactly the same. I usually work with firebug and the source code simultaneosly, but firebug is cool. Putting newlines wouldn't be a solution, I think... because the output of the current script are cruise trip names with all the details, and what I need is an output that gets me the details separately (within the cruise trip).

Replies are listed 'Best First'.
Re^3: Parsing HTML
by ww (Archbishop) on Jun 07, 2012 at 13:13 UTC
    For a start, you might want to use split ( split ) to break up $trip into its elements. But your recent sample-output-desired post involves additional data (for example, "Trip Name"...) which I ignored in checking the original .html. Therefore (among other reasons), I'm not sure that
         /<\/span>\n<span>/
    is an appropriate pattern for split.

    Even if so, you'll still have to hard-code some punctuation (such as the colons in the subheads) and, perhaps, the newlines.

    The previous newline suggestion was based on the output you showed with multiple itineraries as a single line.