
Re: can't get rid of BOM from UTF-8 webpage

by Anonymous Monk
on May 20, 2012 at 08:15 UTC (#971471)

in reply to can't get rid of BOM from UTF-8 webpage

Hi :)

My second question is why Perl insists on keeping the BOM and printing it later

Because it would be insane to throw it away without being told to throw it away.

I would have expected it to be stripped during the initial read of the text file, since it's just packaging, and omitted in Perl's internal character representation.

It isn't mere packaging, and it isn't "omitted" either; your expectation is wrong. After decoding, the BOM is an ordinary character (U+FEFF) at the start of the string.

Overall, though, I'd like to learn where to fix the problem. Do I configure Windows differently? Do I read the text file differently in Perl? Or do I just print things differently in Perl? Any insights or suggestions will be greatly appreciated.

For cmd.exe, try changing the console font; I have read that the font determines whether the BOM gets displayed.

Or try PowerShell. I hear it is Unicode by default, so it ought to come with fonts that know to hide the BOM.

Or, from Perl, strip the BOM, say by using :encoding(UTF-8):via(File::BOM), and/or skip printing the BOM when -t FILEHANDLE says the handle is opened to a tty (tty means the console, i.e. cmd.exe).
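
A minimal sketch of the "strip it yourself" route, using only core modules instead of File::BOM. The helper name strip_bom and the sample bytes are made up for illustration; the idea is just that the BOM survives UTF-8 decoding as U+FEFF and can be removed from the front of the decoded string, and that -t tells you whether STDOUT is a console.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: remove one leading U+FEFF from already-decoded text.
sub strip_bom {
    my ($text) = @_;
    $text =~ s/\A\x{FEFF}//;
    return $text;
}

# Stand-in for the raw bytes of a UTF-8 page that starts with a BOM (EF BB BF).
my $bytes = "\xEF\xBB\xBFhello\n";

# Decoding as UTF-8 keeps the BOM; it becomes the character U+FEFF.
my $text  = decode('UTF-8', $bytes);
my $clean = strip_bom($text);

# Encode on the way out so wide characters are written as UTF-8.
binmode(STDOUT, ':encoding(UTF-8)');

# -t is true when STDOUT is attached to a terminal such as cmd.exe;
# only then drop the BOM, otherwise pass the text through unchanged.
my $out = (-t STDOUT) ? $clean : $text;
print $out;
```

Whether you want to keep the BOM when writing to a file or pipe depends on what reads the output next, so treat the -t check as one possible policy, not the only one.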

I've seen lots of suggestions ...

Next time, include those links in your post :)

FWIW, Content-Type is not the same thing as charset; the charset is only an optional parameter of the Content-Type header.

FWIW, in Perl utf8 is not UTF-8: utf8 is Perl's lax variant, UTF-8 is the strict standard encoding, and the difference can be important.
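
A small sketch of how to see that difference with the core Encode module. The sample bytes are my own choice (an encoded surrogate, U+D800, which strict UTF-8 does not allow); the script only reports what each spelling resolves to and whether it accepts those bytes, so run it against your own Perl to see the result.

```perl
use strict;
use warnings;
use Encode qw(decode find_encoding FB_CROAK LEAVE_SRC);

# The two spellings resolve to different Encode encodings.
my $strict_name = find_encoding('UTF-8')->name;
my $lax_name    = find_encoding('utf8')->name;
print "UTF-8 resolves to: $strict_name\n";
print "utf8  resolves to: $lax_name\n";

# Bytes that follow the UTF-8 bit pattern but encode a surrogate (U+D800).
my $bytes = "\xED\xA0\x80";

for my $name ('UTF-8', 'utf8') {
    # FB_CROAK makes decode die on malformed input; LEAVE_SRC keeps $bytes intact.
    my $ok = eval { decode($name, $bytes, FB_CROAK | LEAVE_SRC); 1 };
    printf "%-5s %s the surrogate bytes\n", $name, $ok ? 'accepted' : 'rejected';
}
```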

BUT, FWIW, you shouldn't need to pass a charset (utf8 or UTF-8) to decoded_content; declaring the charset is the web server's job, so it should just work already.

My first question is why the CMD prompt isn't handling the BOM correctly,

seems to me something on MSDN would answer that :p

Re^2: can't get rid of BOM from UTF-8 webpage
by BeneSphinx (Acolyte) on May 20, 2012 at 20:25 UTC
