Skip to content

Predicts religion, gender and ethnicity given a name string for Indian names.

Notifications You must be signed in to change notification settings

AdityaRoongta/ethnicity

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ethnicity.io

Predicts religion, gender and ethnicity given a name string for Indian names.

How it works

Using bayesian models trained using string sequences. Data for gender was obtained by scraping CBSE results, while data for ethnicity and religion was scraped from a matrimonial site called SimplyMarry.

Takes in a name string as input, and returns the predicted gender, ethnicity and religion with associated probabilities.

TODO

  • Get more training data (!)
  • Use better models - consider using RNNs. They are better for sequence based data.
  • Design a (rate-limited) API for Ethnicity.io

About

Predicts religion, gender and ethnicity given a name string for Indian names.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 63.2%
  • Python 31.2%
  • HTML 5.6%