It's a bit strange, but the internal representation of the string shouldn't* matter.
What I do find very strange is that it doesn't croak when passed non-bytes.
use strict;
use warnings;
use Data::Dumper qw( Dumper );
$Data::Dumper::Useqq = 1;
$Data::Dumper::Terse = 1;
$Data::Dumper::Indent = 0;
my $s = chr(0xC9);
utf8::downgrade($s);
print(Dumper(pack('V/a*', $s)), "\n");
utf8::upgrade($s);
print(Dumper(pack('V/a*', $s)), "\n");
print(Dumper(pack('V/a*', "\x{C9}\x{2660}")), "\n");
5.10.0:
"\1\0\0\0\311" # Ok
"\1\0\0\0\x{c9}" # Ok
"\2\0\0\0\x{c9}\x{2660}" # Does this make sense???
On the other hand, 5.8.8 was very broken:
"\1\0\0\0\311" # Ok
"\1\0\0\0\303" # XXX
"\2\0\0\0\303\242" # XXX
* — I realize it matters all to often, but that's getting fixed. In plfaces where it does matter, you can use
utf8::upgrade and
utf8::downgrade to control the internal format.
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.