Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??
Personally, I would not view this as a Perl question, nor as a problem best solved using Perl -- except for building any sort of "wrapper" utility that would make it easier to use the existing tools that are available in C and C++.

For example, http://www.isip.msstate.edu/ has a fairly comprehensive set of signal processing tools (including an HMM toolkit). These tend to prefer raw pcm data so you can use SoX to strip the WAV headers (it also does a lot of other useful stuff -- you need it anyway).

A lot will depend on the scope and actual nature of your project: how many files to compare, what criteria define "same" vs. "not same", how confusable the samples are on these criteria, what error rate is acceptable. If you're looking for cases of two files that replicate the same portion of a single digital source with little or no alteration, then DSP approaches are likely succeed quite well -- but any other condition will have a measurable error rate on both "same" and "not-same" decisions.

Another approach to consider, if the job allows it, would be to build a Perl/Tk interface that makes it very easy, fast and efficient for a human to compare the audio files and make the decisions.

Update: It's not at all clear to me that HMM's are appropriate for classifying music data. The first thing to try should probably just be comparing DFT vectors, both "narrow band" (long analysis window) and "wide band" (short window). I believe the ISIP toolkit includes a vector quantization process, which will make the statistical assessments easier.


In reply to Re: How to compare 2 wav files. by graff
in thread How to compare 2 wav files. by shadox

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others cooling their heels in the Monastery: (2)
As of 2024-04-26 02:14 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found