Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re^18: Native newline encoding

by sauoq (Abbot)
on May 29, 2012 at 11:41 UTC ( #972998=note: print w/ replies, xml ) Need Help??


in reply to Re^17: Native newline encoding
in thread Native newline encoding

Binary!? But its exactly what ascii mode was designed for.

Errr... wrong. And badly so. You might look at how you just referred to the mode yourself. Congrats on not using ambiguous words like "text."

An editor!? Not much good for automated processing.

Use tools that are encoding aware, if you prefer a more general statement. But then, most tools for processing XML are... the problems usually arise when someone opens an XML file in an editor.

The point is: don't use FTP to to transfer XML files in ASCII mode as they may or not be ASCII encoded.

-sauoq
"My two cents aren't worth a dime.";


Comment on Re^18: Native newline encoding
Re^19: Native newline encoding
by BrowserUk (Pope) on May 29, 2012 at 13:03 UTC
    The point is: don't use FTP to to transfer XML files in ASCII mode as they may or not be ASCII encoded.

    You really don't have a clue do you.

    Let's posit a scenario. There is an XML file on a z/OS system that you need to transfer to your *nix system. The only access available is ftp. You do the transfer in binary mode and what do you end up with? Utter garbage! Because it is encoded in UTF-EBCDIC and is now totally unintelligible.

    The point of ASCII mode, is that the source encodes the data into a known format: "8-bit NVT-ASCII". At the destination, that format is then converted to whatever local format is required. The point of this is that each system only needs to know how to convert from its local format and the "well-known format".

    I'll gift you a hint. Under ASCII mode, if the source file is not actually ASCII encoded, the ftp protocol requires that it be converted to ASCII -- or, in this era of unicode, an "ASCII compatible format" like UTF-8 -- for transmission.

    If the source file is in EBCDIC, the it is converted to ASCII (or an ASCII compatible) format.

    If the source file is in UTF-EBCDIC, then it is converted to ASCII (or an ASCII compatible) format.

    Seeing the pattern yet?

    Go away, and read the RFC -- I mean actually read it -- and then just stay stum.


    With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority".
    In the absence of evidence, opinion is indistinguishable from prejudice.

    The start of some sanity?

      You do the transfer in binary mode and what do you end up with? Utter garbage!

      Incorrect. Again. What you end up with is an XML file encoded in UTF-EBCDIC.

      Your supposition that your transfer tools should be able to handle all necessary conversions for you is just wrong-headed.

      Let's change this assumption...

      The only access available is ftp.

      ... and assume instead that the only available access is via a webserver.

      Since it isn't going to do any conversions for you, what are you going to do? Pull out iconv, of course. Or an equivalent tool. Or do nothing so long as your tool set handles the encoding without problems.

      Consider unpacking (on z/OS, if you like) an archive file containing thousands of different XML files with different encodings, created on different platforms. Then someone gives you the filename of one of those and tells you they need you to transfer it to a totally different platform. Whatchagonnado?

      I'll gift you a hint. Under ASCII mode [. . . snip . . .]

      And I've already gifted you the hint: don't use ASCII mode.

      Update:

      The point of ASCII mode, is that the source encodes the data into a known format: "8-bit NVT-ASCII". At the destination, that format is then converted to whatever local format is required. The point of this is that each system only needs to know how to convert from its local format and the "well-known format".

      You know, that whole paragraph explains a lot. Are you perhaps just so steeped in the old ways of doing things that you completely fail to recognize what advantages a universal cod(e)ing imparts and the problems it solves?

      -sauoq
      "My two cents aren't worth a dime.";

        Look you fatuous idiot. I didn't invent FTP. I'm simply explaining you the way it is designed to work.

        Designed & codified & verified and RFC'd, by people who have some idea of what they are doing.

        Unlike you who have no F***ing idea, but continue to spout crap anyway.

        YOU WERE WRONG! YOU ARE WRONG! YOU WILL CONTINUE TO BE WRONG! Accept it and move on.


        With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
        Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
        "Science is about questioning the status quo. Questioning authority".
        In the absence of evidence, opinion is indistinguishable from prejudice.

        The start of some sanity?

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://972998]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (4)
As of 2014-08-31 05:05 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The best computer themed movie is:











    Results (294 votes), past polls