Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight

Comment on

( #3333=superdoc: print w/replies, xml ) Need Help??


XML::PYX, based on XML::Parser is the Perl implementation. It comes with 3 tools: pyx (non-validating) and pyxv (validating) output the Pyx version of the document and pyxw writes an XML version of a Pyx flow. See - Pyxie for a description of Pyxie

Why use PYX?

  • you don't want to know to much about XML
  • you are used to, and you like line-oriented tools
  • you are just extracting some data from the XML document
  • you are doing simple XML transformation

Why NOT use XML::PYX

  • you want to do complex transformations
  • you are more at ease with tree-processing
  • you don't like writing all those regexps with \(
  • you need some information from the XML documents that Pyx does not provide (comments, entity declarations...)

Related Module

XML::TiePYX is easier to use on a Windows system

Personal notes

Pyx is really cool to extract information from an XML file, or to perform simple transformations on simple XML files. The module is mature (it is quite simple so there shouldn't be too many bugs in it). I never actually use the module, only pyx, which I pipe to a perl -n or perl -p script.


Print all the elements used in an XML document, with the number of occurences.

pyx file.xml | perl -n -e '$nb{$1}++ if( m/\A\((.*)\n/); \
                            END { map { print "$_ used $nb{$_} time(s)\n";} sort keys %nb;}'

Warn in case of duplicate ID:

pyx file.xml | perl -n -e '($id)=( m/^Aid (.*)\n/) or next; print "duplicate id: $id\n" if($id{$id});  $id{$id}=1;'

Change a tag name (class to color):

pyx wine.xml | perl -p -e 's/^([()])class/$1color/' | pyxw

In reply to XML::PYX by mirod

Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":

  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.
  • Log In?

    What's my password?
    Create A New User
    and all is quiet...

    How do I use this? | Other CB clients
    Other Users?
    Others about the Monastery: (3)
    As of 2018-05-20 17:58 GMT
    Find Nodes?
      Voting Booth?