find substring faster

by Diane4Luo
on May 29, 2011 at 14:53 UTC
Diane4Luo has asked for the wisdom of the Perl Monks concerning the following question:

In a set of unique sequences, I need to find the sequences that are substrings of other sequences and get rid of them. Do you know how to do it faster? My input file is 52 MB. Thanks,

Re: find substring faster
by roboticus (Chancellor) on May 29, 2011 at 16:07 UTC


    Ermmm.... faster than what?


Re: find substring faster
by planetscape (Chancellor) on May 29, 2011 at 17:40 UTC
      Yes. We are working together on this project. Thanks!
Re: find substring faster
by salva (Abbot) on May 29, 2011 at 15:43 UTC
Re: find substring faster
by thezip (Vicar) on May 29, 2011 at 15:01 UTC

Re: find substring faster
by SimonClinch (Deacon) on May 29, 2011 at 15:25 UTC
    See perlre and perlretut. Hint: it's exactly the same syntax as with sed in this case. But first switch to slurp mode by undefining the input record separator with local $/ = undef; so the whole file is read as a single string. It's more efficient than using multiline substitution.
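
For what it's worth, here is a minimal sketch of the slurp-mode read mentioned above, combined with one possible way to drop sequences that occur inside other sequences. It is not the approach any poster in this thread described: it swaps the regex substitution for a plain index() lookup. The helper name remove_contained, the sample data, and the in-memory filehandle are made up for illustration, and the sketch assumes one sequence per line with no newline inside a sequence.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): keep only the sequences that
# do not occur inside another sequence.
# Assumption: no sequence contains "\n", so "\n" is a safe separator.
sub remove_contained {
    # Longest first: a sequence can only be contained in one that is at
    # least as long, so every possible container is seen before it.
    my @seqs = sort { length($b) <=> length($a) } @_;

    my $haystack = '';   # all kept sequences, newline-separated
    my @kept;
    for my $seq (@seqs) {
        # One index() call checks $seq against every kept sequence at once.
        next if index($haystack, $seq) >= 0;
        push @kept, $seq;
        $haystack .= "$seq\n";
    }
    return @kept;
}

# Sample data standing in for the real input file (assumption).
my $data = "ACGTACGT\nCGTA\nTTTT\nGTAC\nTTTTA\n";
open my $fh, '<', \$data or die "cannot open in-memory file: $!";

# Slurp mode: with $/ undefined, a single read returns the whole file.
my $text = do { local $/; <$fh> };
close $fh;

my @kept = remove_contained(split /\n/, $text);
print "$_\n" for @kept;
```

Each sequence still gets scanned against everything kept so far, so this is only a starting point for a 52 MB file. If it turns out too slow, a suffix-array or Aho-Corasick style approach would be the next thing to try.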

