Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation

Re: Faster file read, text search and replace

by Jenda (Abbot)
on Feb 14, 2018 at 12:18 UTC ( #1209132=note: print w/replies, xml ) Need Help??

in reply to Faster file read, text search and replace

It's Perl (the language) or perl (the "interpreter"), not PERL. And no, there's nothing on the Perl side that can make this quicker. The IO costs will greatly overweight anything you can do on the Perl side. The data should not be in the XML format. It's one of the least space efficient ways to store data and when reading and writing is involved, space equals speed.

If you can't change the way you store the data, you might at least store it compressed and then decompress as you read and compress as you write. While it will mean more work for the CPU, the IO costs ought to be much lower. See PerlIO::gzip and PerlIO::via::Bzip2.

Also ... making changes to a XML file without the use of a module that actually understands the format is dangerous. Sooner or later you run into problems with encoding, entities or comments. I'm not saying you may never ever do it ... if it's a one time transformation of a known XML and the changes are simple enough, go ahead ... but do be careful.

Enoch was right!
Enjoy the last years of Rome.

  • Comment on Re: Faster file read, text search and replace

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1209132]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others chanting in the Monastery: (9)
As of 2018-06-18 13:55 GMT
Find Nodes?
    Voting Booth?
    Should cpanminus be part of the standard Perl release?

    Results (110 votes). Check out past polls.