PerlMonks  

Re: Re: Remove Duplicates from a mbox file

by coolmichael (Deacon)
on Sep 24, 2003 at 05:41 UTC (#293777)


in reply to Re: Remove Duplicates from a mbox file
in thread Remove Duplicates from a mbox file

I had, but it seemed like a little bit of overkill for what I was doing. And I got to learn a little more Perl doing it.

--
negativespace.net - all things inbetween.


Re^3: Remove Duplicates from a mbox file
by Anonymous Monk on Oct 11, 2007 at 03:20 UTC
    I couldn't get the Perl code above to work right, so I kept searching and found the script at the URL below. It seems to work great! It removed 2400 duplicates from a 200MB mbox file, and it automatically creates a backup for you. www.wdr1.com/hacks/mbox-dedup.pl
      Yes, but beware: it skips messages that do not have a Message-ID header, and those messages are not stored in the resulting file, so you'll have to keep the backup file anyway. However, every message that was skipped will be output.
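      For anyone bitten by that, here is a minimal sketch of the alternative: drop only repeated Message-IDs and always keep messages that have no Message-ID at all. This is not the code from mbox-dedup.pl. The sub names (split_mbox, message_id, dedup_messages) are made up for illustration, and it assumes a classic mbox where every message starts with a "From " line and body lines beginning with "From " are already escaped as ">From ".

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Split raw mbox text into messages.
# Assumption: each message begins with a "From " separator at line start.
sub split_mbox {
    my ($text) = @_;
    return grep { length } split /^(?=From )/m, $text;
}

# Return the Message-ID found in the header block, or undef if there is none.
sub message_id {
    my ($msg) = @_;
    my ($header) = split /\n\n/, $msg, 2;    # headers end at first blank line
    $header =~ s/\n[ \t]+/ /g;               # unfold continuation lines
    return $header =~ /^Message-ID:[ \t]*(\S+)/mi ? $1 : undef;
}

# Keep the first message seen for each Message-ID.
# Messages without a Message-ID are always kept, never dropped.
sub dedup_messages {
    my @messages = @_;
    my %seen;
    return grep {
        my $id = message_id($_);
        !defined $id || !$seen{$id}++;
    } @messages;
}

# Usage: perl mbox-dedup-sketch.pl in.mbox > out.mbox
if (@ARGV) {
    open my $fh, '<', $ARGV[0] or die "Cannot open $ARGV[0]: $!";
    local $/;                                # slurp; acceptable for a sketch
    my $text = <$fh> // '';
    print dedup_messages(split_mbox($text));
}
```

      It slurps the whole file into memory, so for a 200MB mbox you may prefer to read message by message. It writes to STDOUT and never touches the input file, so the original doubles as your backup.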
