Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic

Re: header footer

by Eily (Priest)
on Mar 04, 2014 at 21:49 UTC ( #1076980=note: print w/ replies, xml ) Need Help??

in reply to header footer

You probably want to use substr instead of a regex, because there is a length parameter for counting from the start, or negative offset for counting from the end. And to only cut the first 50 chars in the first line, and last 30 in the last, you can use the variable $. (line number, which is 0, so false on the first line) and eof (which will be true on the last). So :

RemoveHeader() unless $.; RemoveFooter() if eof;

If you intend to use your script on several files at once, have a look at eof on how to reset $. at the start of each file.

Comment on Re: header footer
Download Code
Replies are listed 'Best First'.
Re^2: header footer
by gupr1980 (Acolyte) on Mar 04, 2014 at 22:22 UTC
    Found another example that doesnt store the entire file in a variable but reads it line by line like this,
    open ( my $input_fh, "<", $input_file ); open ( my $output_fh, ">", $output_file ); foreach my $line ( <$input_fh> ) { ########## i have to use regex here to first see if this line start with a HDR - +correct? then if it does i would do a substring to delete the first 50 and wr +ite the rest to the output file else write the entire line to output similarly check if it starts with EDR and substring again ########## } close ( $input_fh ); close ( $output_fh );
    I didnt quiet get your suggestion on using the line number and eof. The file will have numerous records that has headers and footers. Am I on the right track with the above approach? Storing in a variable vs reading line by line. Is one way better than the other?

      Yeah, I just missed the multiple records in the same file. Mea culpa

      Then you can do something like:

      %length = (HDR => 5, FTR => 10); while(<DATA>) { while(/(HDR|FTR)/) # find either HDR or FTR { substr($_, $-[1], $length{$1}) = ''; # $-[1] is the position of +the first capture groups (parenthesis) } print; } __DATA__ HDR--Hello this is a test FTR-------Should there be text here? HDR--Tw +o records on the same line FTR------- HDR--and here an incomplete footer FTR--

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1076980]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others about the Monastery: (6)
As of 2016-04-28 21:29 GMT
Find Nodes?
    Voting Booth?
    :nehw tseb si esrever ni gnitirW

    Results (436 votes). Check out past polls.