http://www.perlmonks.org?node_id=1082385


in reply to Optimizing a naive clustering algorithm

The dataset does not seem that large that one couldn't start with a distance matrix (or half-matrix, distance being commutative).

Wikipedia links: Hierarchical clustering, Single-linkage clustering. CPAN gives this (among others): Algorithm::Cluster.

Is a pure perl solution expected?

  • Comment on Re: Optimizing a naive clustering algorithm

Replies are listed 'Best First'.
Re^2: Optimizing a naive clustering algorithm
by BUU (Prior) on Apr 15, 2014 at 20:03 UTC
    Pure perl is not required, but the modules I found on cpan seemed awfully specific to their niche types of data, usually dna or genome related things. I've actually just implemented a slightly similar idea in pure sql.