Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Comment on

( #3333=superdoc: print w/ replies, xml ) Need Help??

I dont know whether i understood the question correctly. As you mentioned that each line in the file is single XML statement, I used XML::Twig to check the element and attributes by processing the file line by line.

This below code will print the line number at which you find the discrepancy. You can tweak this code to accommodate the changes you need.

#!/usr/bin/perl use strict; use XML::Twig; my %elem_att = qw(cat meow dog bark); my $reg = join '|', keys %elem_att; while (<DATA>) { next unless (m/<(?:$reg)/); my $line = $_; my $line_num = $.; my $elt = parse XML::Twig::Elt($line); my $element = $elt->name; my $att = $elem_att{$element}; unless ($elt->att_exists($att)) { print "Attribute $att is not found at line number $line_num\n" +; next; } } __DATA__ <root> <a/> <b/> <cat tail='text' meow='text'/> <cat tail='text'/> <cat tail='text'/> <dog tail='text' bark='text'/> <dog tail='text'/> </root>

Regards,
Murugesan Kandasamy
use perl for(;;);


In reply to Re: XML cleanup - regex or ? by murugu
in thread XML cleanup - regex or ? by ethrbunny

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • Outside of code tags, you may need to use entities for some characters:
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.
  • Log In?
    Username:
    Password:

    What's my password?
    Create A New User
    Chatterbox?
    and the web crawler heard nothing...

    How do I use this? | Other CB clients
    Other Users?
    Others chilling in the Monastery: (8)
    As of 2014-11-29 07:30 GMT
    Sections?
    Information?
    Find Nodes?
    Leftovers?
      Voting Booth?

      My preferred Perl binaries come from:














      Results (203 votes), past polls