Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number

Re: How do I find peaks in noisy data?

by dokkeldepper (Friar)
on Apr 29, 2006 at 12:26 UTC ( #546487=note: print w/replies, xml ) Need Help??

in reply to How do I find peaks in noisy data?


one important point is missing so far.

Most answers more or less assume implicitely that

  1. there is a signal in the data
  2. the data is stationary
  3. the data has a distribution that has a finite variance

As general as you stated the question none of these assumption needs to be fulfilled. For example a random walk with drift can produce extremely large values very fast. Or if the underlying data generating process has a distribution without a finite variance (f.e. Cauchy-Distribution) then the usual signal extraction mechanisms fail. The most prominent examples are stock returns which possibly have no finite variance.

To make my complaints operational, you first need some good reason to assume that your data has some particular parametric distribution. Think about it. Then one can try to estimate the parameters and can identify the most unprobable values as "islands".

Another way would try to do this nonparametrically. Usually one needs much more data for these techniques.

Check: R

and the related R-modules on CPAN

Without some idea about the nature of the data generating process there are to much whens and ifs to consider to give some usable perl code for your question.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://546487]
and the monks are chillaxin'...

How do I use this? | Other CB clients
Other Users?
Others pondering the Monastery: (5)
As of 2018-02-25 16:49 GMT
Find Nodes?
    Voting Booth?
    When it is dark outside I am happiest to see ...

    Results (314 votes). Check out past polls.