Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl: the Markov chain saw
 
PerlMonks  

Re^3: compute the occurrence of words

by Tux (Monsignor)
on Feb 13, 2013 at 14:27 UTC ( #1018546=note: print w/ replies, xml ) Need Help??


in reply to Re^2: compute the occurrence of words
in thread compute the occurrence of words

In which case you will also have to define "numbers" :) integers?, floats? e-notation? Roman? Only ASCII-digits, or also other Unicode numerals?

Let me assume simple integers and floats represented in ASCII (no triad-sep, radix-sep = '.', so valid numbers include 1234 and 0.23, but not DCVII, 2.34e12 or 1,234,567.00

my %count; while (<FH>) { $count{lc $_}++ for grep { !m{^[0-9]+(\.[0-9]+)?$} } m/\w+/g; }

For a complete regular expression to integers and reals, I'd like to refer to Regexp::Common (see $RE{num}).

update: /me just realized that it is overly complex, as \w+ can only match integers without a triad-sep, as . is not included in \w, reducing the loop-line to

$count{lc $_}++ for grep { !m{^[0-9]+$} } m/^\w+$/g;

Enjoy, Have FUN! H.Merijn


Comment on Re^3: compute the occurrence of words
Select or Download Code
Re^4: compute the occurrence of words
by BigGer (Novice) on Feb 13, 2013 at 14:40 UTC

    Thanks H.Merijn That's perfect. I will go and read up on the hash function. G

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1018546]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (5)
As of 2014-11-25 23:43 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My preferred Perl binaries come from:














    Results (160 votes), past polls