Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris

RE: What's eating all your disk space?

by ivey (Beadle)
on Jul 12, 2000 at 22:09 UTC ( [id://22233]=note: print w/replies, xml ) Need Help??

in reply to What's eating all your disk space?

excellent! i saved it as bin/tdu and will use it instead of the little shell bit i had written. thanks!

michael d. ivey,

Replies are listed 'Best First'.
RE: RE: What's eating all your disk space?
by knight (Friar) on Jul 25, 2000 at 19:22 UTC
    I like this script a lot; very handy. It was taking too long on some of my larger directory trees, though, so I took the liberty of speeding it up. The following does the sorting in Perl, and also calculates the sum internally to eliminate the '.' from the du call. This saves du from having to walk the directory tree twice (once for '.' and once for the individual '*' arguments) and sped things up a lot for me.
    #! /usr/bin/env perl open(DU, "du -sk *|") || die "Can't exec du: $!\n"; while (<DU>) { ($size, $inode)=split; chop($size); $sum += $size; push @entries, { size => $size, inode => $inode }; } close(DU); @entries = sort { $b->{size} <=> $a->{size} } @entries; foreach $e (@entries[0 .. 10]) { printf("%30s | %5d | %2.2f%%\n",$e->{inode},$e->{size},$e->{siz +e}/$sum*1000); }
    Thanks for a cool script!

      I believe your script above has issues:

      • Why did you chop $size? You are only taking off the last digit. If you meant to take off the metric notation (G, M, or k) you could just s/G|M|k//o
      • STRICT and WARNINGS!
      • You have have a percentage multiplied by 1000. I believe you meant 100.
      • If you want the metric notations (G,M, or k), you will have to do some funny math to get them all to the same measurement (kilobytes).
      J. J. Horner
      Linux, Perl, Apache, Stronghold, Unix
        To adress each point:
        • That's a goof left over from development. Oops. Fixes the 3rd point you make as well.
        • I agree that using -w and 'use strict;' are excellent things to do. However, I intended this to be used on the command line either as an alias or shell script somewhere in the your $PATH. I want it to be faster, rather than slower. Granted, the differences are small, I admit. Blindly following the dogma of "must use strict and -w all the time" is dangerous, just as blindly following any dogma is. That said, I use strict; and -w almost always.
        • Noted.
        • I've already taken care of this, or did you not see the -k flag I passed to 'df'? This forces df to print it's numbers in 1Kb blocks instead of the system default block size (which is not always 1024 bytes). Since the numbers are in Kb, I don't have to worry about stripping any metrics, and the 'funny math' becomes trivial. Not all versions of 'du' support Linux's -h tag, but they do support -k.
        In any case, I'll make a few refinments...
      I think that you meant $sum*100 in the end...

      Here is a small improvement. It adds a '/' to directory names,
      . . . ($size, $inode)=split; $inode .= "/" if (-d $inode); chop($size); . . .
      also, the printf should be changed. Instead of "%2.2f" you probably meant "%5.2f". The first digit is the total number of digits, including the period.
      I thought about building the sorting but decided against it ("One tool does one thing"). I figure that I'll leave sorting to 'sort'. ;-)

      As to 'df' walking the tree twice, I looked at that as well. In my tests, it looked like the results were cached somewhere, and thus 'df -sk . *' is quite fast. A prior version of the script did something horrible along the lines of: 'du -sk *|perl -e `$sum=du -sk .` while(<>) {....}, so this is an improvement already.

      This is pretty quick, and I use it on 40GB raid arrays. :-)

RE: What's eating all your disk space?
by orthanc (Monk) on Jul 25, 2000 at 17:30 UTC
    A very handy little script, i will use it a lot


Log In?

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://22233]
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others studying the Monastery: (3)
As of 2024-07-20 01:37 GMT
Find Nodes?
    Voting Booth?

    No recent polls found

    erzuuli‥ 🛈The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.