Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl-Sensitive Sunglasses
 
PerlMonks  

Re^3: Help with a regular expression for file name parsing

by Anonymous Monk
on Dec 07, 2011 at 12:21 UTC ( #942237=note: print w/ replies, xml ) Need Help??


in reply to Re^2: Help with a regular expression for file name parsing
in thread Help with a regular expression for file name parsing

Any better suggestions?

Learn to copy paste better :) because the regex you're using, isn't the same one BrowserUk posted

His regex works, despite him posting the code in the context of his REPL (Read Eval Print Loop), see RFC: IPerl - Interactive Perl ( read-eval-print loop ), Re^6: RFC: IPerl - Interactive Perl ( read-eval-print loop ) (x)

I checked

#!/usr/bin/perl -- #~ 2011-12-07-04:10:56PDT by Anonymous Monk #~ perltidy -csc -otr -opr -ce -nibc -i=4 use strict; use warnings; use autodie; # dies if open/close... fail Main( @ARGV ); exit( 0 ); sub Main { if ( @_ == 2 ) { NotDemoMeaningfulName(@_); } else { Demo(); print '#' x 33 ,"\n", Usage(); } } ## end sub Main sub NotDemoMeaningfulName { my ( $inputFile, $outputFile ) = @_; open my ($inFh), '<', $inputFile; open my ($outFh), '>', $outputFile; while( defined( my $data = <$inFh>) ){ print $outFh "$_\n" for $data =~ m[\@include\s('[^']+'|"[^"]+"|.+?(?<!\\))\s]g +; # /\@include\s+('[^']+'|"[^"]+"|.+?(?<!\\))\s+ +/g } close $inFh; close $outFh; } ## end sub NotDemoMeaningfulName sub Usage { <<"__USAGE__"; $0 $0 dataFile newDataFile __USAGE__ } ## end sub Usage sub Demo { my ( $Input, $WantedOutput ) = DemoData(); NotDemoMeaningfulName( \$Input, \my $Output ); require Test::More; Test::More::is( $Output, $WantedOutput, ' NotDemoMeaningfulName Works Aas Designed' ); Test::More::done_testing(); print "\n$Output\n"; } ## end sub Demo sub DemoData { #~ http://perlmonks... my $One = <<'__One__'; @include test #some "random stuff" @include "some file" did you parse that? #more 'random' stuff @include 'another file' you sure? #and more random stuff @include yet\ another\ file positive? __One__ #~ http://perlmonks... my $Two = <<'__Two__'; test "some file" 'another file' yet\ another\ file __Two__ return $One, $Two; } ## end sub DemoData __END__ $ perl pm.re.942167.pl ok 1 - NotDemoMeaningfulName Works Aas Designed 1..1 test "some file" 'another file' yet\ another\ file ################################# pm.re.942167.pl pm.re.942167.pl dataFile newDataFile


Comment on Re^3: Help with a regular expression for file name parsing
Download Code
Re^4: Help with a regular expression for file name parsing
by bontchev (Sexton) on Dec 07, 2011 at 13:12 UTC

    You checked what? I asked for a regular expression - not for three pages of code and a link to somebody's totally irrelevant custom module!

    Meanwhile, I figured out that the horrible BBoard software which this site uses simply mangles the posted code and to get the real stuff, you have to click on the "download" linke (which doesn't download!) and cut-and-paste from the page that opens.

    So, I managed to make TJPride's script output something meaningful:

    "some file" 'another file' yet\ another\ file

    Unfortunately, it is also wrong. This is not the proper output. The proper output, which such data, would be:

    some file another file yet another file

    Furthermore, I don't want all the stuff loaded into some kind of an array and everything in that array matched simultaneously. I have to do other stuff with each line, you know? I want to process the file line-by-line and, for this particular problem, I need a regular expression that fetches the file name after the "@include" keyword.

    Let's make this very simple, shall we? Consider this snippet of code:

    my $data = SOME STRING; if ($data =~ SOME EXPRESSION) { print "File name: \"$_\"\n"; }

    Question: What should SOME EXPRESSION be, so that if SOME STRING is

    "\@include file"

    or

    "\@include \"some file\""

    or

    "\@include another\\ file"

    the output is respectively

    File name: "file"

    or

    File name: "some file"

    or

    File name: "another file"

    Please do not answer, unless you have tested that your answer actually produces the desired output.

      Question: What should SOME EXPRESSION be, so that if SOME STRING is...

      Such a regular expression cannot exist, not in an if block

      You can match the filename portion (for which you already got a regex that works), but you have to remove the quotes and escapes afterwards (using s///|substitution operator)

      In short, you want a function that does the job

        Such a regular expression cannot exist, not in an if block

        OK, understood. I'll have to do it in two or more steps, then.

        You can match the filename portion (for which you already got a regex that works), but you have to remove the quotes and escapes afterwards (using s///|substitution operator)

        OK, that's what I'll do, although just removing the backslashes seems too primitive to me. I'll try to find a better solution.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://942237]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (14)
As of 2014-08-27 18:16 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The best computer themed movie is:











    Results (248 votes), past polls