Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re: separate word count for each line of a file

by 2teez (Vicar)
on Nov 22, 2012 at 03:54 UTC ( [id://1005049]=note: print w/replies, xml ) Need Help??


in reply to separate word count for each line of a file

Hi melb100,

I think you are really not far away from the solution you "dearly" seek.
I will rather use a Perl Hash, to get at the answers, while going over each of the line in a given file.
As shown below, A foot in the door, would give an entrance to the whole solution, I believe.
Like this:

use warnings; use strict; use Data::Dumper; my %line; while (<DATA>) { chomp; # not chop # initialize total number of character to get for each line my $line_total_characters_without_space = 0; foreach my $w (split) { $line{'Total_number_of_words'}++; print " In line: ", $., " word: ", $w, " has length: ", length +($w), $/; $line_total_characters_without_space += length($w); } # get the number of characters for each line, space inclusive push @{ $line{$.}{'line_total_characters_with_space'} }, length($_ +); # get the number of characters for each line, without including sp +ace push @{ $line{"line $."} }, $line_total_characters_without_space; } $Data::Dumper::Sortkeys = 1; print Dumper \%line; __DATA__ Mary had a little lamb, whose fleece was white as snow. And everywhere that Mary went, the lamb was sure to go. It followed her to school one day which was against the rules.
Output
In line: 1 word: Mary has length: 4 In line: 1 word: had has length: 3 In line: 1 word: a has length: 1 In line: 1 word: little has length: 6 In line: 1 word: lamb, has length: 5 In line: 2 word: whose has length: 5 In line: 2 word: fleece has length: 6 In line: 2 word: was has length: 3 In line: 2 word: white has length: 5 In line: 2 word: as has length: 2 In line: 2 word: snow. has length: 5 In line: 4 word: And has length: 3 In line: 4 word: everywhere has length: 10 In line: 4 word: that has length: 4 In line: 4 word: Mary has length: 4 In line: 4 word: went, has length: 5 In line: 5 word: the has length: 3 In line: 5 word: lamb has length: 4 In line: 5 word: was has length: 3 In line: 5 word: sure has length: 4 In line: 5 word: to has length: 2 In line: 5 word: go. has length: 3 In line: 7 word: It has length: 2 In line: 7 word: followed has length: 8 In line: 7 word: her has length: 3 In line: 7 word: to has length: 2 In line: 7 word: school has length: 6 In line: 7 word: one has length: 3 In line: 7 word: day has length: 3 In line: 8 word: which has length: 5 In line: 8 word: was has length: 3 In line: 8 word: against has length: 7 In line: 8 word: the has length: 3 In line: 8 word: rules. has length: 6 $VAR1 = { '1' => { 'line_total_characters_with_space' => [ 23 ] }, '2' => { 'line_total_characters_with_space' => [ 31 ] }, '3' => { 'line_total_characters_with_space' => [ 0 ] }, '4' => { 'line_total_characters_with_space' => [ 30 ] }, '5' => { 'line_total_characters_with_space' => [ 24 ] }, '6' => { 'line_total_characters_with_space' => [ 0 ] }, '7' => { 'line_total_characters_with_space' => [ 33 ] }, '8' => { 'line_total_characters_with_space' => [ 28 ] }, 'Total_number_of_words' => 34, 'line 1' => [ 19 ], 'line 2' => [ 26 ], 'line 3' => [ 0 ], 'line 4' => [ 26 ], 'line 5' => [ 19 ], 'line 6' => [ 0 ], 'line 7' => [ 27 ], 'line 8' => [ 24 ] };

If you tell me, I'll forget.
If you show me, I'll remember.
if you involve me, I'll understand.
--- Author unknown to me

Replies are listed 'Best First'.
Re^2: separate word count for each line of a file
by melb100 (Initiate) on Nov 22, 2012 at 19:24 UTC
    Thank you for your tips monks,

    I am researcher and I needed to check lisst of experimental sentences very quickly (definitely not homework! ;-) ) so in the end I went for the most basic of solutions and am just running

    perl -ne "{print tr/ //, $/}" FILE
    on all my files and scanning down the resulting word counts for outliers. Not very elegant!!

    I am afraid that as a complete novice (this being the first thing I have ever tried to do in perl) I wasn't clever enough to work out number of words on each line from your hints. If there is a more elegant solution I'd be grateful but in the meantime this will do! :)

    Thanks again for trying to help, and I promise to persevere with the "proper" solution when time is less pressing!

    m

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1005049]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others pondering the Monastery: (6)
As of 2024-04-19 12:23 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found