reddit-mental-health

This repository contains the methods for producing language features from subreddits. If you use the code and want to cite our work, please use the following paper:

George Gkotsis, Anika Oellrich, Tim Hubbard, Richard Dobson, Maria Liakata, Sumithra Velupillai and Rina Dutta. The Language of Mental Health Problems in Social Media, Computational Linguistics and Clinical Psychology (clpsych, NAACL 2016).

paper

supplement

The repository includes two Pandas Dataframes that are a small subset of the original datasets used in our study. The data provided here are mostly for demonstration purposes.

The complete dataset we used can be found in reddit (comments, posts).

Installation

Follow requirements.txt (spaCy has an extra step)

Language features

For the syntactic features, run:

import pandas as pd
import content
df = pd.read_pickle("suicidewatch-sample.pickle")
df = content.addSyntacticFeatures(df)

For the affection features, run:

import afinnsenti
import labmt
df['text'] = df.apply(content.getTextFromRecord, axis=1)
df = afinnsenti.addEmotionalFeature(df)
df = labmt.addEmotionalFeature(df)

Binary classification

import binaryClassification
binaryClassification.main()
rs = binaryClassification.readResults()

The complete output of the classification results is also stored as a dictionary in pickle format (file: combinations-10fold.pickle)

Wordclouds

Follow the link

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
AFINN		AFINN
wordclouds		wordclouds
CLPsych7.pdf		CLPsych7.pdf
CLPsych7_OptionalAttachment.pdf		CLPsych7_OptionalAttachment.pdf
README.md		README.md
afinnsenti.py		afinnsenti.py
ageGender.py		ageGender.py
binaryClassification.py		binaryClassification.py
content.py		content.py
depression-sample.pickle		depression-sample.pickle
labmt.py		labmt.py
labmt.txt		labmt.txt
ml.py		ml.py
readability.py		readability.py
requirements.txt		requirements.txt
suicidewatch-sample.pickle		suicidewatch-sample.pickle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

reddit-mental-health

Installation

Language features

Binary classification

Wordclouds

About

Releases

Packages

Languages

gkotsis/reddit-mental-health

Folders and files

Latest commit

History

Repository files navigation

reddit-mental-health

Installation

Language features

Binary classification

Wordclouds

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages