Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

Re^4: Windows NTFS UTF-16LE File-Operations

by BrowserUk (Pope)
on Feb 20, 2012 at 23:51 UTC ( #955186=note: print w/ replies, xml ) Need Help??


in reply to Re^3: Windows NTFS UTF-16LE File-Operations
in thread Windows NTFS UTF-16LE File-Operations

Hm. Doesn't seem to work for me:

C:\test\junk>dir
12/11/2010 05:58 7 acentó.txt
20/11/2010 09:46 <DIR> ελληνικά
1 File(s) 7 bytes
3 Dir(s) 236,893,585,408 bytes free

C:\test\junk>perl -E"say for glob '*'"
acent¾.txt
DC44~1

C:\test\junk>perl -E"say Win32::GetLongPathName( $_ ) for glob '*'"
acent¾.txt
Wide character in print at -e line 1.
╬Á╬╗╬╗╬À╬¢╬╣╬║╬¼

(Code tags deliberately omitted to ensure that you can see exactly what I see on my console.)

Conversely, Win32::FindFile does work for me:


C:\test\junk>perl -E"say for glob '*'"
acent�.txt
DC44~1

C:\test\junk>perl -E"say Win32::GetLongPathName( $_ ) for glob '*'"
acent�.txt
Wide character in print at -e line 1.
ελληνικά
ικά


C:\test\junk>perl -C0 -MWin32::FindFile -E"say for FindFile( '*' )"
.
..
acentó.txt

ελληνικά
ικά

With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
"Science is about questioning the status quo. Questioning authority".
In the absence of evidence, opinion is indistinguishable from prejudice.

The start of some sanity?


Comment on Re^4: Windows NTFS UTF-16LE File-Operations
Re^5: Windows NTFS UTF-16LE File-Operations
by repellent (Priest) on Feb 21, 2012 at 00:29 UTC
    You need to encode back to octets the return of Win32::GetLongPathName(), because say is printing out octets. Win32::FindFile is most likely doing that for you already.

    Which encoding you choose should depend on what your console expects (to be able to redisplay the characters).

    Try:
    perl -MEncode -E "say encode( 'UTF-8', Win32::GetLongPathName( $_ ) ) +for glob '*'"

    If 'UTF-8' doesn't work out, choose another encoding that works well with your console. Bear in mind that the UTF-8 encoding may be fine as-is, and that the console just needs better fonts (which is not your case, because I infer that you can see the characters displayed properly using Win32::FindFile).

      Yes. That seems to work. I thought that -C0 would take care of that for me...I'm just very glad that I don't have to deal with this crap!


      \test\junk>perl -C0 -E"say Win32::GetLongPathName( $_ ) for glob '*'"
      acent�.txt
      Wide character in print at -e line 1.
      ελληνικά
      ικά


      C:\test\junk>perl -MEncode -E "say encode('UTF-8', Win32::GetLongPathName( $_ ) ) for glob '*'"
      acentó.txt

      ελληνικά
      ικά


      C:\test\junk>dir
      12/11/2010 05:58 7 acentó.txt
      20/11/2010 09:46 <DIR> ελληνικά

      With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
      Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
      "Science is about questioning the status quo. Questioning authority".
      In the absence of evidence, opinion is indistinguishable from prejudice.

      The start of some sanity?

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://955186]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others imbibing at the Monastery: (10)
As of 2014-09-18 10:44 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    How do you remember the number of days in each month?











    Results (111 votes), past polls