<?xml version="1.0" encoding="windows-1252"?>
<node id="979204" title="Re^5: UTF8 with YAML or JSON" created="2012-06-30 00:35:24" updated="2012-06-30 00:35:24">
<type id="11">
note</type>
<author id="715263">
zwon</author>
<data>
<field name="doctext">
&lt;blockquote&gt;&lt;em&gt;I should see UTF8 values ... YAML::Syck does this&lt;/em&gt;&lt;/blockquote&gt;
&lt;p&gt;I don't see this from your example. [mod://YAML::Syck] returns two latin1 characters instead of the single character &lt;c&gt;\x{103}&lt;/c&gt; that the file contains, which is exactly the opposite of what you say you want. [mod://YAML::XS] expects UTF-8 octets on input, checks that they are valid UTF-8, and returns decoded characters rather than octets. I have the impression that you don't realise what you are getting from the modules. Maybe you should use &lt;c&gt;Dump&lt;/c&gt; from [mod://Devel::Peek] to inspect the values instead of &lt;c&gt;Dumper&lt;/c&gt;. Also, if you add
&lt;c&gt;
use open ":utf8";
use open ":std";
&lt;/c&gt;
to your script, it will be clear that [mod://YAML::Syck] doesn't return &lt;b&gt;&amp;#259;&lt;/b&gt;, but &lt;b&gt;&amp;#196;&amp;#402;&lt;/b&gt;.</field>
<field name="root_node">
979143</field>
<field name="parent_node">
979164</field>
</data>
</node>
