Using NLP to analyze OkCupid profile texts for my class
The Dataset okcupid_profiles.csv comes from Kaggle and can be directly downloaded here.
Demographic Variables are:
- age
- status
- sex
- orientation
- body type
- diet
- drinks
- drugs
- education
- ethnicity
- height
- income
- job
- last_online
- location
- offspring
- pets
- religion
- sign
- smokes
- speaks
Essay Questions are:
- essay0: About Me / Self summary
- essay1: Current Goals / Aspirations
- essay2: My Golden Rule / My traits
- essay3: I could probably beat you at / Talent
- essay4: The last show I binged / Hobbies
- essay5: A perfect day / Moments
- essay6: I value / Needs
- essay7: The most private thing I'm willing to admit / Secrets
- essay8: What I'm actually looking for / Dating