Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: Hash Search is VERY slow

by karlgoethebier (Abbot)
on Sep 29, 2021 at 11:13 UTC ( [id://11137108]=note: print w/replies, xml ) Need Help??


in reply to Hash Search is VERY slow

«Die Geschichte wiederholt sich immer zweimal – das erste Mal als Tragödie, das zweite Mal als Farce.» (Karl Marx)

See Threads From Hell #2: How To Search A Very Huge File [SOLVED] for further inspiration. Regards, Karl

«The Crux of the Biscuit is the Apostrophe»

Replies are listed 'Best First'.
Re^2: Hash Search is VERY slow
by Tux (Canon) on Sep 29, 2021 at 11:32 UTC

    Unless you can guarantee that the CSV you are dealing with is absolutely free of fields with embedded newlines, CSV parsing cannot be threaded or parsed in parallel.

    The OP only deals with fields 7 and 31, which are unlikely to contain new-lines, but we have no idea what the other fields may hold.


    Enjoy, Have FUN! H.Merijn
      We do have some idea of what the other fields may hold, in that the original post states the CSV files are firewall logs. I'm not aware of any common firewall log format which includes data that might contain embedded newlines, so it's probably not an issue in this case. But OP would know the specifics of the format they're dealing with better than I do, of course.

      Yes sure. Anyway:

      And yes, i'm aware that i shouldn't split etc.

      Best regards, Karl

      «The Crux of the Biscuit is the Apostrophe»

      Right, there are no new-lines in there and it's pretty uniform data overall.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11137108]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others cooling their heels in the Monastery: (8)
As of 2024-04-18 10:19 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found