Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

Re: Find most frequently used word in text file.

by Kenosis (Priest)
on Dec 19, 2013 at 19:47 UTC ( [id://1067876]=note: print w/replies, xml ) Need Help??


in reply to Find most frequently used word in text file.

If there are multiple words used the most frequently, the program just chooses the first match.

Perhaps the following doesn't address your general question, but it provides the option of showing all words with the highest frequency:

use strict; use warnings; use List::Util qw/max/; my ( %words, %count ); my $file = $ARGV[0]; while (<>) { $words{ lc $_ }++ for split /\W+/; } push @{ $count{ $words{$_} } }, $_ for keys %words; my $max = max keys %count; print "The most frequent word(s) in $file: @{ $count{$max} }; Times se +en: $max.\n"

This uses a hash to store word/count pairs. Next, it creates a hash of arrays (HoA), where the key is the count and the value is a reference to a list of words associated with that count. It then uses List::Util to find the max count, and that count is used to display the word(s) and frequency.

Hope this helps!

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1067876]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others musing on the Monastery: (7)
As of 2024-04-23 12:32 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found