
Re: Extensive List of Names by Gender?

by frozenwithjoy (Priest)
on Jul 12, 2013 at 17:01 UTC (#1044029)

in reply to Extensive List of Names by Gender?

Here is a great dataset that will likely help you. It contains the top 1000 boy names and the top 1000 girl names per year for babies born in the US from 1880 to 2009. Ranks and percentages are also provided. Keep in mind that some names are shared between males and females, so one workaround is to assign each shared name to the sex with the higher percentage.
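To make the workaround concrete, here is a minimal sketch in Python. It assumes each record has `name`, `sex`, and `percent` fields; those field names, the `assign_sex` helper, and the sample rows are all made up for illustration and are not taken from the dataset itself. It also assumes that summing the percentages across years is an acceptable way to compare the two sexes; picking a single year or the maximum would be equally defensible.

```python
from collections import defaultdict


def assign_sex(rows):
    """Map each name to the sex with the larger summed percent.

    `rows` is an iterable of dicts with (assumed) keys
    "name", "sex" and "percent".
    """
    # totals[name][sex] accumulates the percent over all rows (years).
    totals = defaultdict(lambda: defaultdict(float))
    for row in rows:
        totals[row["name"]][row["sex"]] += float(row["percent"])

    # For each name, keep the sex whose accumulated percent is highest.
    return {name: max(by_sex, key=by_sex.get) for name, by_sex in totals.items()}


# Made-up sample rows, for illustration only (not real SSA figures).
sample = [
    {"year": 1990, "name": "Jordan", "sex": "boy", "percent": 0.006},
    {"year": 1990, "name": "Jordan", "sex": "girl", "percent": 0.002},
    {"year": 1990, "name": "Mary", "sex": "girl", "percent": 0.004},
]

print(assign_sex(sample))
```

With real data you would read the rows from the downloaded file (for example with `csv.DictReader`) and adjust the field names to match its header.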

The raw data originally comes from the Social Security Administration (SSA), but I am familiar with it because it ships with an R package as a sample dataset to play with.

UPDATE: The link I provided also has some R scripts that download the raw data from the SSA and parse it. If you use these scripts, you can specify the dates and the states you want to pull data for. There is also a script that grabs the top X names for each sex, which could help you avoid shared-name conflicts.
