Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Re: similar string matching

by dakkar (Hermit)
on Jul 05, 2004 at 16:08 UTC ( [id://371921]=note: print w/replies, xml ) Need Help??


in reply to similar string matching

The algorithms are quite well established, but they're not obvious. If you try to roll your own, you'll quite probably end up with very slow matching. Look at http://www.bioperl.org/ and you'll find all the information you need.

The short version: use BLAST or derived methods.

-- 
        dakkar - Mobilis in mobile

Most of my code is tested...

Perl is strongly typed, it just has very few types (Dan)

Replies are listed 'Best First'.
Re^2: similar string matching
by Anonymous Monk on Jul 05, 2004 at 19:22 UTC
    Nice hint but what if I want to find single amino acid sequences? like cystine bridges? Murcia

      Ehm.. I'm not a biologist, I only know someone... so you almost lost me there.

      The BLAST and related algorithms do exactly what you asked for: they find all the possible matches between two sequences, ranking them by 'edit distance', i.e. the number of operations needed to obtain a perfect match.

      This is really all I know, for details ask an expert ;-)

      -- 
              dakkar - Mobilis in mobile
      

      Most of my code is tested...

      Perl is strongly typed, it just has very few types (Dan)

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://371921]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others exploiting the Monastery: (2)
As of 2024-03-19 06:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found