The BPD and Behaviour Reddit Dataset (BBRD) comprises public Reddit data (posts and meta-data), from the r/BPD and r/BorderlinePDisorder subreddits, of 992 unique Reddit users with self-identified borderline personality disorder (BPD; BPD self-identification was manually verified from Reddit posts). This dataset spans from October 2011 - December 2023, and includes all posts (i.e., submissions and comments) made by the 992 users with self-identified BPD to the BPD subreddits named above between these dates (excluding the majority of short posts that contain fewer than 25 words). In total, this dataset contains 68,590 posts from the 992 users.
Specifically, the BBRD contains the posts themselves (including both submissions and comments; text data has been cleaned) in addition to post meta-data, such as post dates and times, post scores (upvotes minus downvotes), number of comments that submissions received, and the subreddit the post was made on. Further, this dataset also comprises manually annotated (by trained raters) demographic characteristics of users (e.g., age, gender, etc.) and manually annotated clinically meaningful behaviours and events (i.e., behaviours related to suicidality [suicide attempts and ideation] and deliberate self-harm, distinguishing between recent and past occurrences; psychiatric medication usage; therapy behaviours; substance use; impulsive behaviours; psychotic symptoms; social behaviours; intense emotions). The manual annotation of clinically meaningful behaviours and events was done for around ~17,000 posts. Refer to the attached CSV file to see all of the column names and descriptions for the BBRD.
For ethical reasons, the data is only available for non-commercial research upon request after signing a data usage agreement, due to its sensitive nature relating to severe mental health issues, combined with thorough manual coding of characteristics and events. Please complete and sign the attached Data Usage Agreement when requesting access to the dataset.
Date made available | 12/12/2024 |
---|
Publisher | Lancaster University |
---|
Temporal coverage | 10/2011 - 12/2023 |
---|
Date of data production | 02/2020 - 12/2024 |
---|
Legal/ethical | Ethical approval: For discussion around ethics, see description and Data Usage Agreement document. Data protection: See Data Usage Agreement document regarding data protection concerns. Commercial constraints: The data is only available for non-commercial research.
|
---|