Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things

Re^7: Encoding problem(reposting in more detail)

by john_oshea (Priest)
on May 30, 2006 at 15:47 UTC ( #552529=note: print w/replies, xml ) Need Help??

in reply to Re^6: Encoding problem(reposting in more detail)
in thread Encoding problem(reposting in more detail)

This is my thinking on what you should be checking for - done as a list to get things straight in my own head more than anything ;-)

The back-end

  • In cmd.exe, chcp 65001 to set utf-8 code page (this will probably only last for as long as you have the same cmd.exe window open).
  • Open up the .txt files in an utf-8 capable editor and verify that the contents look ok.
  • Save the files using greek characters in the filename - I know that MS Word, for one, can do that, so you can always use that if needs be.
  • Check that dir shows you ok-looking greek characters.
  • If it does, proceed to the web side of things - if not, you'll have to figure out what the issue is before even attempting to get it sorted on your web pages.

The web

  • Check that your web pages are saved as utf-8, and that they're being served up as utf-8 (Firefox will show you this in 'Tools -> Page info').
  • Remove all the from_to code from your form
  • Make sure that your browser/font settings are sane (i.e. if you're specifying a font, that it actually has greek characters in it) - I'm sure you are though
  • ...
  • Profit! ;-)

That's about all that I can think of right now, but that should get you most, if not all, of the way there.

Replies are listed 'Best First'.
Re^8: Encoding problem(reposting in more detail)
by Nik on May 30, 2006 at 19:54 UTC
    Thanks agian Jon for yout help.
    I tried to do what you ssaid but when i try in cmd.exe to change the codepage to UTF8 then the greek file names treing appear liek this whe i "dir" inside the text dir of mine:
    22/05/2006 05:30 ΞΌΞΌ 6.546 Ξ&#1 +56;ΞΞΞΉΞ ΞΊΞΞΉ +ΞΞΞΉΟ 22/05/2006 05:30 ΞΌΞΌ 4.892 Ξ&#1 +56;ΞΟΞΞΌΟˆΟ 22/05/2006 05:30 ΞΌΞΌ 3.962 Ξ&#1 +59;ΞΉ ΟΞΟΟΟΞΏΞ +Ή ΟΞΟ ΞΞC 22/05/2006 05:30 ΞΌΞΌ 4.194 Ξ &# +926;―ΟΟΞΟΞ ΞΊΞ&#9 +26;Ή  22/05/2006 05:30 ΞΌΞΌ 4.528 Ξ &# +926;ΟΞ― ΟΞΟ ΞΞΞ& +#8213;Ξ 22/05/2006 05:30 ΞΌΞΌ 5.638 Ξ &# +926;ΟΞΉ ΟΟΞ ΞΈΞ&#9 +27;ΞΌΞΟΟΟŽΞ Ξ&#927 +; ΞΞΌΞΞΟ 22/05/2006 05:30 ΞΌΞΌ 1.889 Ξ &# +926;ΟΞΉΟΞΟΞΞΉ&#927 +; ΞΞΟŒΟ Ξ ΟΞΏΟ +Ξ ΊΟΞ* 22/05/2006 05:30 ΞΌΞΌ 6.105 Ξ &# +926;ΏΟ ΟΞΞΌΞ ΞΌ&#926 +;ΟΞ 22/05/2006 05:30 ΞΌΞΌ 5.167 Ξ &# +927;ΟŒΟΞΟΞ ΟΞΞ 22/05/2006 05:30 ΞΌΞΌ 7.104 Ξ&# +927;ΞΟΞ ΞΟΞΉΟΟ&# +926;ΉΞΞΞΏΟ 22/05/2006 05:30 ΞΌΞΌ 6.546 Ξ&# +926; ΟΟΞ―Ξ Ξ΄ 22/05/2006 05:30 ΞΌΞΌ 12.425 Ξ&# +926;Ώ ΟŒΟΞΞΌΞ Ο&#926 +;ΏΟ ΞΞΟΞΏΞ 22/05/2006 05:30 ΞΌΞΌ 1.967 Ξ&# +926;ΟΞΞΞΞΌΟΞΏ&#927 +; ΞΞΉΞΏΞΟΟΞΉ&# +926; 22/05/2006 05:30 ΞΌΞΌ 46.760 Ξ&# +926;ΟΞΉΟΟΞΌΞΞ&#926 +;Ο ΞΞΉΞ΄ΞΟΞΟ
    What does that tell us?! Somehting is wrong with the encoding isnt it?

      Blimey. Yes. You appear to have your filenames saved with the characters represented as HTML entities. Whatever you're doing to save the filenames themselves needs to change. If you have access to MS Word, open up one of the documents, and do a 'save as' (in text format) from there. I'm pretty certain that Word will give you 'proper' unicode filenames, though it may use UTF-16 to do it. If that's the case, you'll probably see something like {greek character}?{greek character}?{greek character}..., which will at least confirm that your cmd.exe codepage settings are correct.

        I alo tried to crate and save as utf8 within word. still i ge tquestion marks :(

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://552529]
[erix]: does anyone have a link to a video-stream to the Giro
[erix]: ?

How do I use this? | Other CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (11)
As of 2017-05-25 13:53 GMT
Find Nodes?
    Voting Booth?