Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number

Re^3: recommendations on scientific computing with Perl

by educated_foo (Vicar)
on Feb 13, 2007 at 22:05 UTC ( #599793=note: print w/replies, xml ) Need Help??

in reply to Re^2: recommendations on scientific computing with Perl
in thread recommendations on scientific computing with Perl

I work in machine learning and use Perl for most of my scripting, but have never bothered to use CPAN's machine learning modules. First, you often need to do some additional linear algebra on your data (e.g. centering, finding eigenvalues, SVD, etc.), and these modules don't share a common matrix representation. The lack of a common format for compact storage and a rich library of numerical algorithms makes it hard to do things quickly in pure Perl. Second, many CPAN modules I've looked at seem to have been written either for their authors' edification or without caring about large datasets (e.g. Algorithm::SVMLight requires you to add your datapoints one at a time in bulky hash-refs), while most of the problems I care about involve huge amounts of data.

I think the PDL statistics paper someone else mentioned is the best "perl for statistics" resource I've seen. Depending on your problems and level of familiarity with the field, there may be some articles on of interest. As much as I loathe Java, I would actually recommend Weka as an implementation of lots of machine learning algorithms that work well together. But unless PDL does what you want, I'd suggest something other than Perl (including CPAN modules) for your core algorithms.

  • Comment on Re^3: recommendations on scientific computing with Perl

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://599793]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others contemplating the Monastery: (9)
As of 2018-05-24 10:40 GMT
Find Nodes?
    Voting Booth?