Re^2: Efficient Way to Parse a Large Log File with a Large Regex




by tlm (Prior)

on Apr 13, 2005 at 01:06 UTC ( [id://447224] )


in reply to Re: Efficient Way to Parse a Large Log File with a Large Regex
in thread Efficient Way to Parse a Large Log File with a Large Regex

It's fun to read all the replies; there are a lot of good ideas. I don't have anything new to add, other than this pointer to a Perl snippet by Lincoln Stein for logging httpd requests to a DBMS. This approach reduces the problem of parsing log files to the much cleaner one of constructing SQL queries. And, as CountZero already pointed out, you can build in some hooks for preprocessing log records, including one that checks each record against your table of IP addresses. Then all you have to do is check the entries recorded with a timestamp more recent than the last check. (Incidentally, I vote for holli's hash lookup approach.)
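
For what it's worth, here is a rough sketch of what the "only check what's new" query might look like once the log lives in a database. It is not Lincoln Stein's code: it is written in Python, with the built-in sqlite3 module standing in for whatever DBMS you use, and the table names (access_log, watched_ips), the columns, and the integer timestamps are all made up for illustration.

```python
import sqlite3

# In-memory SQLite database standing in for the real DBMS (an assumption).
conn = sqlite3.connect(":memory:")

# Hypothetical schema: one table of logged requests, one table of IPs to watch.
conn.execute("CREATE TABLE access_log (ts INTEGER, ip TEXT, request TEXT)")
conn.execute("CREATE TABLE watched_ips (ip TEXT PRIMARY KEY)")

# Sample rows; in practice the httpd logging hook would insert these.
conn.executemany(
    "INSERT INTO access_log VALUES (?, ?, ?)",
    [
        (100, "10.0.0.1", "GET /index.html"),
        (200, "10.0.0.2", "GET /admin"),
        (300, "10.0.0.1", "GET /login"),
        (400, "10.0.0.3", "GET /about"),
    ],
)
conn.executemany(
    "INSERT INTO watched_ips VALUES (?)",
    [("10.0.0.1",), ("10.0.0.2",)],
)


def new_hits(conn, last_check):
    """Return entries from watched IPs logged after last_check, oldest first."""
    return conn.execute(
        "SELECT l.ts, l.ip, l.request "
        "FROM access_log AS l "
        "JOIN watched_ips AS w ON w.ip = l.ip "
        "WHERE l.ts > ? "
        "ORDER BY l.ts",
        (last_check,),
    ).fetchall()


# Only rows newer than the previous check are examined.
hits = new_hits(conn, 150)
```

The join against the IP table does the work that the big regex did in the original question, and the timestamp condition keeps each run from rescanning old records. You would store the newest timestamp seen and pass it in as last_check on the next run.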

the lowliest monk