in reply to Re^4: grouping numbers in thread grouping numbers
Compared to 5000 and +5000, they're all 'close' together.
I think you first need to think about how you would like to define 'close together' and 'far apart' in practical terms, and when you've worked that out, write some code.
I'm happy to be corrected if I'm missing something...
Good luck!
Re^6: grouping numbers by ag4ve (Monk) on Jul 11, 2013 at 14:28 UTC 
That's why I had $avg  I don't really like taking the average distance to do this (I'd much prefer to have a score system where I get everything and then filter out after) but as I can't even figure this out, I figure this is a good starting point. I could work with it if I got this working at least.
 [reply] 

What you're trying to do is cluster analysis  naturally grouping data together in clusters (for some value of "naturally").
Most approaches I'm aware of require you to know the number of clusters ahead of time (which sort of defeats the purpose).
However, if you can come up with some heuristic, such as "any element of a cluster must be within 10% of the center point of the cluster's range", you might be able to quickly compute the results, and live with them. (Of course there are pathological cases where adding a new element changes the center point, causing other elements to be cast out.)
QM

Quantum Mechanics: The dreams stuff is made of
 [reply] 
