http://www.perlmonks.org?node_id=1043737


in reply to Re^5: grouping numbers
in thread grouping numbers

That's why I had $avg - I don't really like taking the average distance to do this (I'd much prefer to have a score system where I get everything and then filter out after) but as I can't even figure this out, I figure this is a good starting point. I could work with it if I got this working at least.

Replies are listed 'Best First'.
Re^7: grouping numbers
by QM (Parson) on Jul 11, 2013 at 14:41 UTC
    What you're trying to do is cluster analysis - naturally grouping data together in clusters (for some value of "naturally").

    Most approaches I'm aware of require you to know the number of clusters ahead of time (which sort of defeats the purpose).

    However, if you can come up with some heuristic, such as "any element of a cluster must be within 10% of the center point of the cluster's range", you might be able to quickly compute the results, and live with them. (Of course there are pathological cases where adding a new element changes the center point, causing other elements to be cast out.)

    -QM
    --
    Quantum Mechanics: The dreams stuff is made of