PerlMonks  

Re^2: arabic alphabet ... how to deal with?

by Anonymous Monk
on Feb 12, 2009 at 16:53 UTC (#743389)


in reply to Re: arabic alphabet ... how to deal with?
in thread arabic alphabet ... how to deal with?

I tried it this way as well before, but this way I get no output ;)

    #!/usr/bin/perl
    open (STOPWORDS, '<:encoding(UTF-8)', $ARGV[1]) || die "Error opening the stopwords file\n";
    $count = 0;
    while ($word = <STOPWORDS>) {
        chop($word);
        $stopword[$count] = lc($word);
        $count++;
    }
    close(STOPWORDS);
    open (INFILE ,'<:encoding(UTF-8)', $ARGV[0]) || die "Error opening the input file\n";
    while ($line = <INFILE>) {
        chop($line);
        @entry = split(/ /, $line);
        $i = 0;
        while ($entry[$i]) {
            $found = 0;
            $j = 0;
            while (($j<=$count) && ($found==0)) {
                if (lc($entry[$i]) eq $stopword[$j]) {
                    $found = 1;
                }
                $j++;
            }
            if ($found == 0) {
                print FH "$entry[$i]\n";
            }
            $i++;
        }
    }
    close(INFILE);


Re^3: arabic alphabet ... how to deal with?
by kennethk (Monsignor) on Feb 12, 2009 at 17:22 UTC
    In this case, you have an orphaned file handle FH which is never associated with a file or channel.
      Even when I write it this way:

          open (OUTFILE ,'>>:encoding(UTF-8)', $ARGV[2]) || die "Error opening the output file\n";
          ... ... ...
          print OUTFILE "$entry[$i]\n";
          ... ... ...

      the words from my stop-word list still remain in the output ... :(
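
The thread does not establish why the stop words survive. One possibility worth ruling out is invisible characters: `chop` removes exactly one character, so a file with Windows line endings leaves a trailing "\r" on every stop word, and a leading byte-order mark would spoil the first one. The following is a minimal sketch under that assumption, not the poster's program. The helpers `load_stopwords` and `filter_words` and the in-memory sample data are hypothetical, and the hash lookup is a swapped-in replacement for the nested loop.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Encode qw(encode);

# Hypothetical helper: read stop words into a hash, stripping a leading
# U+FEFF (byte-order mark) and any trailing whitespace, including "\r".
sub load_stopwords {
    my ($fh) = @_;
    my %stop;
    while (my $line = <$fh>) {
        $line =~ s/\A\x{FEFF}//;    # BOM, if the file starts with one
        $line =~ s/\s+\z//;         # "\n", "\r\n", trailing spaces
        $stop{ lc($line) } = 1 if length $line;
    }
    return \%stop;
}

# Hypothetical helper: return the words that are not in the stop list.
sub filter_words {
    my ($fh, $stop) = @_;
    my @kept;
    while (my $line = <$fh>) {
        $line =~ s/\A\x{FEFF}//;
        # split ' ' splits on any run of whitespace, so "\r" and "\n" vanish
        push @kept, grep { not $stop->{ lc($_) } } split ' ', $line;
    }
    return @kept;
}

# Sample data (an assumption for this demo), written with escapes so the
# script itself stays plain ASCII.
my ($fi, $min)     = ("\x{641}\x{64A}", "\x{645}\x{646}");    # stop words
my ($kitab, $bayt) = ("\x{643}\x{62A}\x{627}\x{628}", "\x{628}\x{64A}\x{62A}");

# In-memory "files" with a BOM and CRLF line endings.
my $stop_bytes  = encode('UTF-8', "\x{FEFF}$fi\r\n$min\r\n");
my $input_bytes = encode('UTF-8', "$kitab $fi $bayt\r\n");

open my $stop_fh, '<:encoding(UTF-8)', \$stop_bytes
    or die "Error opening the stopwords data: $!\n";
open my $in_fh, '<:encoding(UTF-8)', \$input_bytes
    or die "Error opening the input data: $!\n";

my $stop = load_stopwords($stop_fh);
my @kept = filter_words($in_fh, $stop);
close $stop_fh;
close $in_fh;

binmode STDOUT, ':encoding(UTF-8)';
print "$_\n" for @kept;
```

Run as written, this prints the two words that are not in the stop list. With real files, the two in-memory handles would be replaced by handles opened on `$ARGV[1]` and `$ARGV[0]`.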
Re^3: arabic alphabet ... how to deal with?
by almut (Canon) on Feb 12, 2009 at 21:19 UTC

    Use Devel::Peek to get an ASCII-printable representation of the strings you're comparing, and then verify that what you think should match is in fact identical:

        use Devel::Peek;
        ...
        Dump lc($entry[$i]);
        Dump $stopword[$j];
        if (lc($entry[$i]) eq $stopword[$j]) {
        ...
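
To make the suggestion concrete, here is a small self-contained sketch of what such a comparison can reveal. The two sample strings are assumptions chosen for illustration (the same word, once with a stray carriage return), and `codepoints` is a hypothetical helper that offers a more compact view alongside `Dump`, which writes its report to STDERR.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Devel::Peek;    # core module; Dump() writes to STDERR

# Hypothetical helper: render a string as a space-separated list of
# Unicode code points.
sub codepoints {
    my ($str) = @_;
    return join ' ', map { sprintf 'U+%04X', ord($_) } split //, $str;
}

# Assumed sample data: the same word, once with a stray carriage return.
my $stopword = "\x{641}\x{64A}";
my $entry    = "\x{641}\x{64A}\r";

# Full internal representation of both strings, for side-by-side reading.
Dump($entry);
Dump($stopword);

print codepoints($entry),    "\n";    # U+0641 U+064A U+000D
print codepoints($stopword), "\n";    # U+0641 U+064A
print $entry eq $stopword ? "match\n" : "no match\n";    # no match
```

The two strings look identical when printed to a terminal, but the extra U+000D is enough to make `eq` fail.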
