Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

Re^2: arabic alphabet ... how to deal with?

by Anonymous Monk
on Feb 12, 2009 at 16:53 UTC ( #743389=note: print w/ replies, xml ) Need Help??


in reply to Re: arabic alphabet ... how to deal with?
in thread arabic alphabet ... how to deal with?

I tried this way as well before, this way no output ;)

#!/usr/bin/perl open (STOPWORDS, '<:encoding(UTF-8)', $ARGV[1]) || die "Error opening +the stopwords file\n"; $count = 0; while ($word = <STOPWORDS>) { chop($word); $stopword[$count] = lc($word); $count++; } close(STOPWORDS); open (INFILE ,'<:encoding(UTF-8)', $ARGV[0]) || die "Error opening the + input file\n"; while ($line = <INFILE>) { chop($line); @entry = split(/ /, $line); $i = 0; while ($entry[$i]) { $found = 0; $j = 0; while (($j<=$count) && ($found==0)) { if (lc($entry[$i]) eq $stopword[$j]) { $found = 1; } $j++; } if ($found == 0) { print FH "$entry[$i]\n"; } $i++; } } close(INFILE);


Comment on Re^2: arabic alphabet ... how to deal with?
Download Code
Re^3: arabic alphabet ... how to deal with?
by kennethk (Abbot) on Feb 12, 2009 at 17:22 UTC
    In this case, you have an orphaned file handle FH which is never associated with a file or channel.
      when I write in this way also :
      open (OUTFILE ,'>>:encoding(UTF-8)', $ARGV[2]) || die "Error opening t +he output file\n"; ... ... ... print OUTFILE "$entry[$i]\n"; ... ... ...
      still my words in the list of stop words would remain there ... :(
Re^3: arabic alphabet ... how to deal with?
by almut (Canon) on Feb 12, 2009 at 21:19 UTC

    Use Devel::Peek to get an ASCII-printable representation of the strings you're comparing, and then verify that what you think should match is in fact identical:

    use Devel::Peek; ... Dump lc($entry[$i]); Dump $stopword[$j]; if (lc($entry[$i]) eq $stopword[$j]) { ...

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://743389]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others browsing the Monastery: (15)
As of 2015-07-01 19:26 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (17 votes), past polls