PerlMonks  

Re^2: DBI::SQLite slowness

by Endless (Beadle)
on Sep 20, 2013 at 12:23 UTC (#1055013)


in reply to Re: DBI::SQLite slowness
in thread DBI::SQLite slowness

Brilliant! With that little fix, my speed is up to 2022 per second; that's almost workable, and I understand what was happening. Now time to start looking through the other suggestions.


Re^3: DBI::SQLite slowness
by vincent_veyron (Acolyte) on Sep 20, 2013 at 16:20 UTC
    
    This is what I get on an Atom Eee PC (1.6 GHz), after I removed
    
    use v5.16.0;
    
    and changed 
    
    say "Total time: ", (time - $start); # 180 seconds 
    
    to 
    
    print "Total time: ", (time - $start); # 180 seconds
    
    
    time perl db.pl
    
    Total time: 5
    real	0m5.348s
    user	0m0.360s
    sys	0m0.820s
    
    
    
    marica.fr : Management of contracts, litigation cases and insurance claims
      I've never heard reports that 5.16.0 will significantly slow a program, or that say is so much slower. What's going on here?
Re^3: DBI::SQLite slowness
by Laurent_R (Vicar) on Sep 21, 2013 at 09:33 UTC

    Well, 200 million records at a rate of 2,000 per second is still 100,000 seconds, or almost 28 hours. That's still pretty long, isn't it? Having said that, you may be able to live with it; a full day of processing is still manageable in a number of cases. Beware, though, that the rate might slow down as your database grows larger.
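For anyone who wants to redo that estimate with their own numbers, here is a minimal sketch of the arithmetic. The record count and the rounded rate come from the posts above; the helper name `estimate_seconds` is made up for illustration, and the calculation assumes the insert rate stays constant, which it may not as the database grows.

```perl
use strict;
use warnings;

# Figures from the thread: 200 million records, roughly 2,000 inserts per second.
my $records = 200_000_000;
my $rate    = 2_000;

# Hypothetical helper: seconds needed at a constant insert rate.
sub estimate_seconds {
    my ($count, $per_second) = @_;
    return $count / $per_second;
}

my $seconds = estimate_seconds($records, $rate);
printf "%d seconds, about %.1f hours\n", $seconds, $seconds / 3600;
# prints "100000 seconds, about 27.8 hours"
```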

    If you are really only looking to filter out duplicates, the ideas discussed by BrowserUk are probably much better than using a database.
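As a rough illustration of filtering duplicates without a database, here is the common Perl hash idiom. This is not necessarily what BrowserUk proposed, and the sub name `unique_records` is hypothetical. It also assumes the set of distinct keys fits in memory, which at 200 million records depends heavily on key size and may not hold.

```perl
use strict;
use warnings;

# Return the records in first-seen order, dropping later duplicates.
# %seen counts how often each record has appeared so far.
sub unique_records {
    my @records = @_;
    my %seen;
    return grep { !$seen{$_}++ } @records;
}

my @unique = unique_records(qw(a b a c b));
print "@unique\n";   # prints "a b c"
```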
