Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl: the Markov chain saw

Re: De Duping Street Addresses Fuzzily

by Crian (Chaplain)
on Feb 01, 2005 at 12:19 UTC ( #426869=note: print w/replies, xml ) Need Help??

in reply to De Duping Street Addresses Fuzzily

We are doing something simular in our company (for addresses is germany) as one small part of our jobs. Therefore the addresses get plausibilised (if I can say this in this way(?)) with some kind of technic simulare to the technics described in the posts above.

If you want accurate results, don't think it's fast done. Perhaps it could be cheaper to buy a complete solution.

Data that comes from humans contains all kind of errors and rubish you can and can not think of. Specially if it's a large mass of data (such as 50000000 addresses or else), there will be almost everything in your data - be aware.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://426869]
[erix]: one can only hope the Pope will administer the same humiliation
Discipulus the pope is the last good politic here around..sigh

How do I use this? | Other CB clients
Other Users?
Others avoiding work at the Monastery: (6)
As of 2017-05-24 07:04 GMT
Find Nodes?
    Voting Booth?