PerlMonks  

Re^3: find junk file

by thomas895 (Hermit)
on Jun 07, 2012 at 07:26 UTC (#974893)


in reply to Re^2: find junk file
in thread find junk file

Oh, well, in that case, it's quite simple:

    use strict;
    use warnings;
    use feature 'say';

    # These values may differ for you, depending on where you bought your keyboard.
    # There are some extra, non-keyboard chars in this range, as well.
    use constant {
        HIGHEST_CHAR_ON_KBD => 126,
        LOWEST_CHAR_ON_KBD  => 9,
    };

    # FILE is assumed to be a filehandle that has already been opened.
    LINE: while ( <FILE> ) {
        foreach ( split( "", $_ ) ) {
            if ( ( ord($_) > HIGHEST_CHAR_ON_KBD ) || ( ord($_) < LOWEST_CHAR_ON_KBD ) ) {
                say "It's a binary file";
                last LINE;    # stop reading the file after the first hit
            }
        }
    }

It isn't the best way of doing things, but it's a start.
Update: I completely forgot about spaces, tabs, carriage returns, and line feeds.
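
For what it's worth, here is a rough sketch of how the same check might look with the whitespace characters spelled out explicitly instead of relying on a numeric range. The helper names looks_binary and file_looks_binary are made up for this example, and the choice of "tab, LF, CR, plus printable ASCII" as the definition of text is an assumption, not a rule. Anything outside ASCII (UTF-8 text, for instance) will be flagged as binary by this sketch.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the original post): returns 1 if the string
# contains any byte other than tab, LF, CR, or printable ASCII (32..126).
sub looks_binary {
    my ($data) = @_;
    return $data =~ /[^\x09\x0A\x0D\x20-\x7E]/ ? 1 : 0;
}

# Hypothetical helper: reads a file as raw bytes and stops at the first
# line that contains a byte outside the allowed set.
sub file_looks_binary {
    my ($path) = @_;
    open my $fh, '<:raw', $path or die "Cannot open $path: $!";
    while ( my $line = <$fh> ) {
        if ( looks_binary($line) ) {
            close $fh;
            return 1;
        }
    }
    close $fh;
    return 0;
}

# Check every file named on the command line (does nothing without arguments).
for my $path (@ARGV) {
    print "$path: It's a binary file\n" if file_looks_binary($path);
}
```

Perl also has the built-in -T and -B file tests, which make a heuristic guess at text versus binary, so something like `print "binary\n" if -B $path;` may be enough if a guess is all you need.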

~Thomas~
confess( "I offer no guarantees on my code." );


Re^4: find junk file
by roboticus (Chancellor) on Jun 07, 2012 at 11:08 UTC

    thomas895:

    So a file is a text file unless someone uses a space?

        ...or a tab?

        ...or a carriage return?

        ...or a newline, escape sequence, ....?

    ...roboticus

    When your only tool is a hammer, all problems look like your thumb.
