Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?
 
PerlMonks  

Re: Re: Invalid UTF8 data: namespace suggestion needed or wheel reinvented?

by liz (Monsignor)
on Mar 02, 2004 at 11:33 UTC ( #333221=note: print w/ replies, xml ) Need Help??


in reply to Re: Invalid UTF8 data: namespace suggestion needed or wheel reinvented?
in thread Invalid UTF8 data: namespace suggestion needed or wheel reinvented?

Does the script use Encode::Guess?

Good point. I had forgotten about that module. Will look into that.

What does it do exactly?

With Encode: $string = decode("utf8", $string,FB_DEFAULT )

... cleaning up encodings is actually a task that is very difficult to automate in general.

I agree. But since XML is very picky about encoding errors, and the XML feed must continue in some way, this is (for now) the solution.

It's a problem that many people (will be|are) facing when migrating legacy systems to Unicode/XML aware systems, which is why I think it warrants someting on CPAN.

Liz


Comment on Re: Re: Invalid UTF8 data: namespace suggestion needed or wheel reinvented?
Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://333221]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (9)
As of 2015-07-06 06:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (70 votes), past polls