
materials for the study on mental health subreddits. If you use this code in your work, please cite George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai and Rina Dutta. The Language of Mental Health Problems in Social Media, Computational Linguistics and Clinical Psychology 2016


gkotsis/reddit-mental-health


reddit-mental-health

This repository contains the methods for producing language features from subreddits. If you use the code and want to cite our work, please use the following paper:

George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai and Rina Dutta. The Language of Mental Health Problems in Social Media. Computational Linguistics and Clinical Psychology (CLPsych, NAACL 2016).

paper

supplement

The repository includes two pandas DataFrames, each a small subset of the original datasets used in our study. They are provided mainly for demonstration purposes.

The complete dataset we used is available on Reddit (comments, posts).

Installation

Install the dependencies listed in requirements.txt. Note that spaCy requires an extra step beyond the package installation.

Language features

For the syntactic features, run:

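import pandas as pd
import content  # module from this repository

# Load the bundled SuicideWatch sample
df = pd.read_pickle("suicidewatch-sample.pickle")
# Add the syntactic features to the DataFrame
df = content.addSyntacticFeatures(df)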
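To check which columns a feature step added, the column sets before and after the call can be compared. The snippet below is a minimal sketch on a toy DataFrame: `new_columns` is a hypothetical helper, and the column names (`title`, `selftext`, `n_tokens`) are placeholders rather than the columns this repository actually produces.

```python
import pandas as pd


def new_columns(before: pd.DataFrame, after: pd.DataFrame) -> list:
    """Return the columns present in `after` but not in `before`, in order."""
    return [c for c in after.columns if c not in before.columns]


# Toy stand-in for a DataFrame before and after a feature step.
# All column names here are placeholders, not the repository's schema.
before = pd.DataFrame({"title": ["a title"], "selftext": ["some text"]})
after = before.assign(n_tokens=[2])

added = new_columns(before, after)
print(added)
```

With the real data, keep a copy of `df` before calling `content.addSyntacticFeatures` and pass both DataFrames to the helper.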

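For the affective (sentiment) features, continue with the DataFrame df loaded above and run:

import content  # already imported in the previous snippet
import afinnsenti
import labmt

# Build a single text column from each record
df['text'] = df.apply(content.getTextFromRecord, axis=1)
# Add one emotional feature per module
df = afinnsenti.addEmotionalFeature(df)
df = labmt.addEmotionalFeature(df)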
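For readers unfamiliar with lexicon-based scoring, the general idea can be sketched as follows. This is an illustration only, not the code in `afinnsenti` or `labmt`: the lexicon is a hand-made toy, and the names `TOY_LEXICON`, `lexicon_score`, `add_toy_sentiment` and the `toy_sentiment` column are all hypothetical.

```python
import re

import pandas as pd

# Hypothetical toy lexicon mapping words to valence scores.
# Real lexicons contain thousands of scored entries.
TOY_LEXICON = {"good": 3, "happy": 3, "bad": -3, "sad": -2}


def lexicon_score(text: str, lexicon=TOY_LEXICON) -> int:
    """Sum the lexicon scores of all lowercase word tokens in `text`."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return sum(lexicon.get(token, 0) for token in tokens)


def add_toy_sentiment(df, text_col="text", out_col="toy_sentiment"):
    """Return a copy of `df` with a lexicon-based score column added."""
    out = df.copy()
    # Missing texts are treated as empty strings, which score 0.
    out[out_col] = out[text_col].fillna("").map(lexicon_score)
    return out


demo = pd.DataFrame(
    {"text": ["I feel sad and bad today", "A good, happy day", None]}
)
demo = add_toy_sentiment(demo)
print(demo["toy_sentiment"].tolist())  # [-5, 6, 0]
```

Summing word scores is the simplest aggregation. Averaging over the matched words is a common alternative that reduces the effect of text length.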

Binary classification

import binaryClassification
binaryClassification.main()
rs = binaryClassification.readResults()

The complete output of the classification results is also stored as a dictionary in pickle format (file: combinations-10fold.pickle)
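The stored results can also be inspected directly with the standard library. The sketch below assumes only what is stated above, namely that the file holds a pickled dictionary. The layout of its keys and values is not documented here, so the code just prints a few keys and the type of each value. `load_results` is a hypothetical helper, not part of this repository.

```python
import pickle
from pathlib import Path


def load_results(path="combinations-10fold.pickle"):
    """Load a pickled results dictionary, or return None if the file is absent."""
    p = Path(path)
    if not p.exists():
        return None
    with p.open("rb") as fh:
        # Only unpickle files from a source you trust.
        results = pickle.load(fh)
    if not isinstance(results, dict):
        raise TypeError(f"expected a dict, got {type(results).__name__}")
    return results


results = load_results()
if results is not None:
    # Peek at the first few entries without assuming their structure.
    for key in list(results)[:5]:
        print(key, "->", type(results[key]).__name__)
```

If the file was written under Python 2 and fails to load under Python 3, passing `encoding="latin1"` to `pickle.load` may help. Whether that applies to this file is an assumption.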

Wordclouds

Follow the link
