Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number
 
PerlMonks  

Re: Extensive List of Names by Gender?

by frozenwithjoy (Curate)
on Jul 12, 2013 at 17:01 UTC ( #1044029=note: print w/ replies, xml ) Need Help??


in reply to Extensive List of Names by Gender?

Here is a great dataset that will likely help you. It contains the top 1000 boy names and top 1000 girl names per year for babies born in the US from 1880 to 2009. Ranks and percents are also provided. Keep in mind that some names are shared between males and females, so a workaround could be to choose the sex w/ the highest percentage for conflict names.

The raw data originally comes from the Social Security Administration, but I am familiar with it because it comes with an R package as a sample dataset to play with.

UPDATE: the link I provided also has some R scripts to download the raw data from SSA and parse it. If you use these scripts, you can specify the dates and which states you want to pull the data from. Also, there is a script to grab the top X names for each sex. This could help you avoid shared name conflicts.


Comment on Re: Extensive List of Names by Gender?

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1044029]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others making s'mores by the fire in the courtyard of the Monastery: (3)
As of 2014-12-27 09:52 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    Is guessing a good strategy for surviving in the IT business?





    Results (176 votes), past polls