Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Extensive List of Names by Gender?

by frozenwithjoy (Curate)
on Jul 12, 2013 at 17:01 UTC ( #1044029=note: print w/ replies, xml ) Need Help??


in reply to Extensive List of Names by Gender?

Here is a great dataset that will likely help you. It contains the top 1000 boy names and top 1000 girl names per year for babies born in the US from 1880 to 2009. Ranks and percents are also provided. Keep in mind that some names are shared between males and females, so a workaround could be to choose the sex w/ the highest percentage for conflict names.

The raw data originally comes from the Social Security Administration, but I am familiar with it because it comes with an R package as a sample dataset to play with.

UPDATE: the link I provided also has some R scripts to download the raw data from SSA and parse it. If you use these scripts, you can specify the dates and which states you want to pull the data from. Also, there is a script to grab the top X names for each sex. This could help you avoid shared name conflicts.


Comment on Re: Extensive List of Names by Gender?

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1044029]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (4)
As of 2014-10-25 09:02 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    For retirement, I am banking on:










    Results (142 votes), past polls