|Perl Monk, Perl Meditation|
Re^6: Database vs XML output representation of two-byte UTF-8 characterby ikegami (Pope)
|on Sep 09, 2014 at 04:43 UTC||Need Help??|
No idea what that means.
Don't know. Don't care. Doesn't matter how they are stored, as those are internal details that aren't relevant.
What does matter is whether they returned decoded text or something else. That has nothing to do with the internal storage format.
It's completely false — nothing in Perl accepts or produces latin-1 — and it has nothing to do with anything discussed so far.
You were complaining that Perl let you concatenate decoded text and UTF-8 bytes. (Well, you called it something different, but this is the underlying issue.) It has no idea one of the the strings you are concatenating contains text and that the other contains UTF-8 bytes, so it can't let you know that you are doing something wrong.
This is all the information Perl currently has. Is that an error? You can't tell. Perl can't tell. Strings coming from a file handle with a decoding layer should be flagged "I'm decoded text!". Those coming from a file handle without a decoding layer should be flagged "I'm bytes!". Concatenating the two should be an error. These flags do not currently exist.