Accepted author manuscript, 136 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to conference - Without ISBN/ISSN › Conference paper › peer-review
TY - CONF
T1 - Feasibility of Emotions as Features for Suicide Ideation Detection in Social Media
AU - Arreerard, Ratchakrit
AU - Piao, Scott
PY - 2023/6/16
Y1 - 2023/6/16
N2 - Suicide-related social media message detection is an important issue. Such messages can reveal a warning sign of suicidal behaviour. This paper examines the efficacy of using emotions as sole features to detect suicide-related messages. We investigated two methods which use a single emotion and a set of seven emotions as features respectively. For emotion classification, we used a classifier based on BERT named "Emotion English DistilRoBERTa-base". For detecting suicide-related messages, we tested Naive Bayes and Support Vector Machine. As our training/test data for suicide message detection, we used a publicly available dataset collected from Reddit in which each post is labelled as "suicide" or "non-suicide". Our method obtained accuracies of 76.2% and 76.8% for detecting suicide-related messages with Naive Bayes and Support Vector Machine respectively. Our experiment also shows that three emotion categories, "anger", "fear" and "sadness", have the strongest correlation with suicide-related messages.
AB - Suicide-related social media message detection is an important issue. Such messages can reveal a warning sign of suicidal behaviour. This paper examines the efficacy of using emotions as sole features to detect suicide-related messages. We investigated two methods which use a single emotion and a set of seven emotions as features respectively. For emotion classification, we used a classifier based on BERT named "Emotion English DistilRoBERTa-base". For detecting suicide-related messages, we tested Naive Bayes and Support Vector Machine. As our training/test data for suicide message detection, we used a publicly available dataset collected from Reddit in which each post is labelled as "suicide" or "non-suicide". Our method obtained accuracies of 76.2% and 76.8% for detecting suicide-related messages with Naive Bayes and Support Vector Machine respectively. Our experiment also shows that three emotion categories, "anger", "fear" and "sadness", have the strongest correlation with suicide-related messages.
KW - Natural Language Processing
KW - Suicide Ideation Detection
KW - Social Media Analytics
KW - Emotion Detection
KW - Large Language Models
KW - Machine Learning
M3 - Conference paper
T2 - HEALTHCARE TEXT ANALYTICS CONFERENCE 2023
Y2 - 15 June 2023 through 16 June 2023
ER -