<?xml version="1.0" encoding="windows-1252"?>
<node id="980286" title="Re: Mixed character encoding issues" created="2012-07-06 10:48:22" updated="2012-07-06 10:48:22">
<type id="11">
note</type>
<author id="658643">
nikosv</author>
<data>
<field name="doctext">
&lt;i&gt;I believe Excel stores data using cp1252&lt;/i&gt;
&lt;p&gt;
I don't think that's correct.Excel is Unicode enabled by default. 
Try it out by entering a character available in the Unicode domain:
&lt;/p&gt;
&lt;p&gt;
download a free Japanese font available &lt;a href="http://www.wazu.jp/gallery/Fonts_Japanese.html"&gt;here&lt;/a&gt; ,install it, open a worksheet and do Insert&gt;Symbol&gt;find the font and click on a letter,save it and then open it again.The character should be there in its original representation. 
&lt;/p&gt;
&lt;p&gt;
Since MS has a twisted notion of Unicode, I consider that the excel file is saved as UTF16, which is what is considered Unicode by MS (while UTF8 is considered multi-byte)
&lt;p&gt;
my hunch is that you do some sort of double encoding, so I would suggest to decode the character from UTF16 to UTF8 
&lt;/p&gt;
</field>
<field name="root_node">
980169</field>
<field name="parent_node">
980169</field>
</data>
</node>
