in reply to
Re: How to deal with malformed utf8 from XML parsing
in thread How to deal with malformed utf8 from XML parsing
print "$str\n\n"; is a mistake, though. You shouldn't print unicode text data without specifying an output :encoding on the filehandle, or encode()ing it manually.
I thought I can print unicode to STDOUT. I will read more on that.
By the way, instead of the confusing, error-prone, and tedious process of figuring out the internal state of a variable using is_utf8 and a normal print, please use Devel::Peek instead
If you pay closer attention you will see that I am using the validating capability of is_utf8 ($string, 'true_value').
I verified your claim, indeed it seems that this is not malformed utf. The reason I started digging is google complaining this is not a valid character. I will troubleshoot more.