Uncovering Patterns in r/TIFU and r/Confession: An Exploratory Analysis

Introduction

Reddit is a unique digital landscape where users share experiences, confessions, and mishaps. In this exploratory analysis, we dive into two popular subreddits, r/TIFU (Today I F*ed Up)** and r/confession, to understand the dynamics of user interactions and engagement. We aim to answer questions like: Who are the most active and impactful authors? What words and themes dominate these posts? How does controversiality affect engagement? By analyzing these patterns, we uncover the elements that make these subreddits such captivating spaces for storytelling and connection.

1. Who are the Top Contributors?

To start our journey, we examine the most active and impactful authors in these subreddits. Each community has its dedicated contributors, but who stands out the most?

The Top 10 Authors by Frequency with Average Comments and Score plot reveals intriguing insights. Some authors post frequently, yet may receive relatively lower scores or comments on average. On the other hand, a few authors stand out with fewer posts but much higher engagement per post. This contrast highlights a balance between quantity and quality—authors who consistently engage users despite limited posts versus those who post frequently but with varied reception. This tells us that impactful stories resonate more than frequent submissions.

Figure 1: The plot above is sorted based on the frequency of posts. The user account with the highest average score has been deleted, which shows [deleted].

2. What Do Users Talk About the Most?

Next, we delve into the language of these posts. What words are frequently used in posts across r/TIFU and r/confession? Analyzing these words helps us understand the common themes and tones that resonate with these communities.

The Word Cloud of common words in selftext shows dominant words like “know,” “like,” “get,” and “said.” These words suggest a confessional, storytelling style, with users often narrating personal experiences or sharing lessons learned. The prevalence of words such as “felt,” “wanted,” and “think” further emphasizes the introspective, reflective tone in both subreddits, as users recount moments of regret, humor, or self-discovery. This language invites readers into personal stories, creating a space where users can relate to each other’s experiences.

3. What Are Most of the Words Mentioned in the Title?

In exploring the most frequently mentioned words in post titles, a pattern emerges—many titles revolve around friendships or social interactions. This observation aligns with the idea that users often share stories involving friends, whether about real-life incidents or imaginary scenarios. These commonly used words shed light on the themes that resonate with users and draw them into posts.

Below is a table of the top 20 most frequently used words in titles:

word count
0 get 1911
1 friend 1321
2 I’m 697
3 know 592
4 try 517
5 accident 511
6 work 485
7 go 484
8 year 453
9 tell 432
10 school 412
11 someone 405
12 girl 402
13 one 400
14 mak 390
15 something 382
16 best 379

5. Does Being Controversial Affect a Post’s Score?

Controversiality is a unique aspect of Reddit posts. A post marked as controversial often indicates mixed reactions, either due to divisive content or differing opinions in the comments. We look into how controversiality impacts post scores in both subreddits.

r/TIFU Subreddit

In r/TIFU, we see distinct Score Distributions based on controversiality. The distribution for non-controversial posts is narrow, while controversial posts display a wider range of scores, suggesting that divisive topics tend to evoke stronger, more varied reactions. Some controversial posts may receive low scores, reflecting disapproval, while others achieve high scores, indicating that some users appreciate the boldness or relatability of the content.

Figure 2: Distribution by controversiality for r/tifu

r/confession Subreddit

In r/confession, the score distribution reveals a similar trend but with a narrower range of controversial scores than in r/tifu. This implies that while controversiality influences scores in both subreddits, r/tifu has a more pronounced spread, perhaps due to its broader range of storytelling topics and humor-driven content.

Figure 3: Distribution by controversiality for r/confession

6. How Common Are Controversial Comments?

To further explore controversiality, we examine the volume of controversial versus non-controversial comments. Do users engage more in controversial discussions, or do they prefer to keep interactions neutral?

The Count of Controversial vs. Non-Controversial Comments reveals that both subreddits have predominantly non-controversial comments. Interestingly, r/TIFU shows a higher volume of comments overall, suggesting a more active user base. The low number of controversial comments indicates that while controversial posts may spark diverse reactions, most users tend to respond neutrally or supportively, fostering a largely positive atmosphere in both subreddits.

Figure 4: Count of controversiality for r/TIFU and r/confession

7. Do More Comments Mean Higher Scores?

A key question in understanding engagement is the relationship between comments and scores. Does a higher number of comments correlate with a higher score, indicating greater overall engagement?

In the Comments vs Score Scatter Plot, we observe that while more comments often align with higher scores, there are posts with high scores but fewer comments, particularly in r/confession. This suggests that impactful posts can score highly even if they don’t generate extensive discussions. In r/TIFU, the relationship between comments and scores is more pronounced, possibly due to the community’s preference for interactive, story-driven content.

Figure 5: Number of Comments vs. popularity scores for different subreddits.

8. Which Characters Capture the Audience’s Attention?

In the world of storytelling, characters often play a pivotal role. This is no different in r/TIFU and r/confession, where the mention of certain people—friends, family members, or significant others—can influence a post’s engagement. By analyzing which characters are frequently mentioned in posts, we gain insight into the relational dynamics that resonate with readers.

Data Table:
author subreddit selftext title most_frequent_selftext_word word_count_selftext most_frequent_title_word word_count_title
goreptorator confession I’m 22 M from Wes… 20 M Summer with … sure 4 awkward…. 1
SadTelevision6911 confession I started the job… I am currently wo… get 3 let 1
Funkthoughts tifu This morning my t… TIFU by letting m… dog 5 dog 1
No_Fly3027 confession Post Covid 19 and… My mental health … became 6 health 1
mostly_poetic confession So, I’m currently… It’s not always g… i’m 3 side……………………. 1

The Bubble Plot visualizes various characters mentioned in the selftext of posts. Each bubble represents a specific character (e.g., “wife,” “boyfriend,” “friend”), with the size and color of the bubble indicating the frequency and unique interactions these characters drive. The x-axis represents the average number of comments per mention, while the y-axis represents the average score associated with these mentions.

Figure 6: Bubble plot of words in Selftext - The size of the bubble meaning the frequency of the word

Characters like “wife,” “boyfriend,” and “girlfriend” receive higher average scores and comments, highlighting that relationship-focused posts tend to draw significant engagement. Mentions of close family members such as “mom” and “dad,” along with friends, also generate considerable interactions, though at slightly lower levels of engagement compared to romantic partners. This suggests that stories involving personal relationships, whether positive or challenging, are a powerful engagement driver in these subreddits.

Conclusion

Through this exploratory analysis, we have uncovered the dynamics that make r/TIFU and r/confession unique communities. From top contributors and common themes to the role of controversiality and engagement, each insight adds to our understanding of how these communities connect through shared stories and experiences. Our findings show that both subreddits foster spaces for storytelling and connection, with specific themes—like relationships and personal reflections—resonating deeply with their audiences. This analysis sets the stage for deeper insights, such as sentiment analysis and user behavior modeling, to further explore the nuances of user interaction on Reddit.