Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?

Re: text file problem

by hipowls (Curate)
on May 10, 2008 at 21:02 UTC ( #685909=note: print w/ replies, xml ) Need Help??

in reply to text file problem

I think your problem is

  • you have a large file
  • you have created a new file with line endings removed
  • now you want to read in characters between two positions in the new file
To do that you can use sysseek and sysread.
use Fcntl qw(SEEK_SET); my $input = '...'; my $start = 1_490_117; my $end = 1_492_312; my $length = $end - $start +1; open my $fh, '<', $input or die "Can't open $input: $!"; sysseek($fh, $start, SEEK_SET) or die "Can't seek to $start in $input: + $!"; my $sequence; my $read = sysread $fh, $sequence, $length; die "Failed to read $length bytes from $input, got $read" if $length != $read;
Now $sequence will contain the DNA sequence from 1,490,117 to 1,492,312 including both end points.

Note: sysread and sysseek use unbuffered IO, don't mix calls to them on a filehandle using other functions such as read, <> or eof.

Comment on Re: text file problem
Select or Download Code

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://685909]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (4)
As of 2016-02-14 02:15 GMT
Find Nodes?
    Voting Booth?

    How many photographs, souvenirs, artworks, trophies or other decorative objects are displayed in your home?

    Results (454 votes), past polls