Uncovering Patterns in r/TIFU and r/Confession: An Exploratory Analysis
Introduction
Reddit is a unique digital landscape where users share experiences, confessions, and mishaps. In this exploratory analysis, we dive into two popular subreddits, r/TIFU (Today I F*ed Up)** and r/confession, to understand the dynamics of user interactions and engagement. We aim to answer questions like: Who are the most active and impactful authors? What words and themes dominate these posts? How does controversiality affect engagement? By analyzing these patterns, we uncover the elements that make these subreddits such captivating spaces for storytelling and connection.
1. Who are the Top Contributors?
To start our journey, we examine the most active and impactful authors in these subreddits. Each community has its dedicated contributors, but who stands out the most?
The Top 10 Authors by Frequency with Average Comments and Score plot reveals intriguing insights. Some authors post frequently, yet may receive relatively lower scores or comments on average. On the other hand, a few authors stand out with fewer posts but much higher engagement per post. This contrast highlights a balance between quantity and quality—authors who consistently engage users despite limited posts versus those who post frequently but with varied reception. This tells us that impactful stories resonate more than frequent submissions.
2. What Do Users Talk About the Most?
Next, we delve into the language of these posts. What words are frequently used in posts across r/TIFU and r/confession? Analyzing these words helps us understand the common themes and tones that resonate with these communities.
The Word Cloud of common words in selftext shows dominant words like “know,” “like,” “get,” and “said.” These words suggest a confessional, storytelling style, with users often narrating personal experiences or sharing lessons learned. The prevalence of words such as “felt,” “wanted,” and “think” further emphasizes the introspective, reflective tone in both subreddits, as users recount moments of regret, humor, or self-discovery. This language invites readers into personal stories, creating a space where users can relate to each other’s experiences.
3. What Are Most of the Words Mentioned in the Title?
In exploring the most frequently mentioned words in post titles, a pattern emerges—many titles revolve around friendships or social interactions. This observation aligns with the idea that users often share stories involving friends, whether about real-life incidents or imaginary scenarios. These commonly used words shed light on the themes that resonate with users and draw them into posts.
Below is a table of the top 20 most frequently used words in titles:
word | count | |
---|---|---|
0 | get | 1911 |
1 | friend | 1321 |
2 | I’m | 697 |
3 | know | 592 |
4 | try | 517 |
5 | accident | 511 |
6 | work | 485 |
7 | go | 484 |
8 | year | 453 |
9 | tell | 432 |
10 | school | 412 |
11 | someone | 405 |
12 | girl | 402 |
13 | one | 400 |
14 | mak | 390 |
15 | something | 382 |
16 | best | 379 |
5. Does Being Controversial Affect a Post’s Score?
Controversiality is a unique aspect of Reddit posts. A post marked as controversial often indicates mixed reactions, either due to divisive content or differing opinions in the comments. We look into how controversiality impacts post scores in both subreddits.
r/TIFU
Subreddit
In r/TIFU, we see distinct Score Distributions based on controversiality. The distribution for non-controversial posts is narrow, while controversial posts display a wider range of scores, suggesting that divisive topics tend to evoke stronger, more varied reactions. Some controversial posts may receive low scores, reflecting disapproval, while others achieve high scores, indicating that some users appreciate the boldness or relatability of the content.
r/confession
Subreddit
In r/confession
, the score distribution reveals a similar trend but with a narrower range of controversial scores than in r/tifu
. This implies that while controversiality influences scores in both subreddits, r/tifu
has a more pronounced spread, perhaps due to its broader range of storytelling topics and humor-driven content.
6. How Common Are Controversial Comments?
To further explore controversiality, we examine the volume of controversial versus non-controversial comments. Do users engage more in controversial discussions, or do they prefer to keep interactions neutral?
The Count of Controversial vs. Non-Controversial Comments reveals that both subreddits have predominantly non-controversial comments. Interestingly, r/TIFU shows a higher volume of comments overall, suggesting a more active user base. The low number of controversial comments indicates that while controversial posts may spark diverse reactions, most users tend to respond neutrally or supportively, fostering a largely positive atmosphere in both subreddits.
7. Do More Comments Mean Higher Scores?
A key question in understanding engagement is the relationship between comments and scores. Does a higher number of comments correlate with a higher score, indicating greater overall engagement?
In the Comments vs Score Scatter Plot, we observe that while more comments often align with higher scores, there are posts with high scores but fewer comments, particularly in r/confession. This suggests that impactful posts can score highly even if they don’t generate extensive discussions. In r/TIFU, the relationship between comments and scores is more pronounced, possibly due to the community’s preference for interactive, story-driven content.
8. Which Characters Capture the Audience’s Attention?
In the world of storytelling, characters often play a pivotal role. This is no different in r/TIFU and r/confession, where the mention of certain people—friends, family members, or significant others—can influence a post’s engagement. By analyzing which characters are frequently mentioned in posts, we gain insight into the relational dynamics that resonate with readers.
Data Table:author | subreddit | selftext | title | most_frequent_selftext_word | word_count_selftext | most_frequent_title_word | word_count_title |
---|---|---|---|---|---|---|---|
goreptorator | confession | I’m 22 M from Wes… | 20 M Summer with … | sure | 4 | awkward…. | 1 |
SadTelevision6911 | confession | I started the job… | I am currently wo… | get | 3 | let | 1 |
Funkthoughts | tifu | This morning my t… | TIFU by letting m… | dog | 5 | dog | 1 |
No_Fly3027 | confession | Post Covid 19 and… | My mental health … | became | 6 | health | 1 |
mostly_poetic | confession | So, I’m currently… | It’s not always g… | i’m | 3 | side……………………. | 1 |
The Bubble Plot visualizes various characters mentioned in the selftext of posts. Each bubble represents a specific character (e.g., “wife,” “boyfriend,” “friend”), with the size and color of the bubble indicating the frequency and unique interactions these characters drive. The x-axis represents the average number of comments per mention, while the y-axis represents the average score associated with these mentions.
Characters like “wife,” “boyfriend,” and “girlfriend” receive higher average scores and comments, highlighting that relationship-focused posts tend to draw significant engagement. Mentions of close family members such as “mom” and “dad,” along with friends, also generate considerable interactions, though at slightly lower levels of engagement compared to romantic partners. This suggests that stories involving personal relationships, whether positive or challenging, are a powerful engagement driver in these subreddits.
Conclusion
Through this exploratory analysis, we have uncovered the dynamics that make r/TIFU and r/confession unique communities. From top contributors and common themes to the role of controversiality and engagement, each insight adds to our understanding of how these communities connect through shared stories and experiences. Our findings show that both subreddits foster spaces for storytelling and connection, with specific themes—like relationships and personal reflections—resonating deeply with their audiences. This analysis sets the stage for deeper insights, such as sentiment analysis and user behavior modeling, to further explore the nuances of user interaction on Reddit.