Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic
 
PerlMonks  

Re^2: UTF-8 text files with Byte Order Mark

by Joost (Canon)
on Feb 13, 2007 at 17:53 UTC ( #599730=note: print w/replies, xml ) Need Help??


in reply to Re: UTF-8 text files with Byte Order Mark
in thread UTF-8 text files with Byte Order Mark

Actually, I would be a little surprised to find a BOM in combination with UTF-8
Yeah, you don't usually see utf-8 files with a BOM. Nevertheless, it's perfectly valid

update: note that the utf-8 BOM consists of three bytes: "EF BB BF"

  • Comment on Re^2: UTF-8 text files with Byte Order Mark

Replies are listed 'Best First'.
Re^3: UTF-8 text files with Byte Order Mark
by muba (Priest) on Feb 13, 2007 at 20:03 UTC

    The test file seems to match that three-byte BOM indeed.

    I'm happy to know you don't usualy see utf-8 files with a BOM, but as pointed out below, some programs still store it, such as Notepad. One of my users seems to have a utf-8 file with a BOM too.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://599730]
help
Chatterbox?
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (8)
As of 2016-12-05 12:45 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    On a regular basis, I'm most likely to spy upon:













    Results (82 votes). Check out past polls.