Feedback
This page aims to summarize each meaningful piece of feedback received from instructors and peers as well as the approach taken to address said feedback.
Project Plans Feedback
- No feedback was provided.
EDA Feedback
Instructor Specific:
- Website Title: Change the title from “Big Data Project” to the actual project name.
- Website Appearance: Improve the quality of tables, charts, etc. The current design resembles a Jupyter Notebook rather than a professional website.
- Introduction Absence: Include an introduction before diving into business questions.
- Subreddits Used: Specify which subreddits are being analyzed for the project.
- Mapping EDA to Questions: Clearly mention how each section of the Exploratory Data Analysis (EDA) addresses specific business questions.
- Wordcloud Visualization: Highlight useful insights from the wordcloud.
- Frequency of Comment Mentions vs. Average Yearly Compensation Chart: Describe the insights derived from the chart in this section.
- Questions Posed Over Time Chart: Expand on the findings in this section. A line chart was suggested represent the fraction of questions to all posts and its constancy.
- Ordering of Bar Charts: Order the bar chart in descending order.
All of the feedback above was addressed according to the suggestions mentioned. By changing our quarto theme, enhancing the professionalism of our plots, removing unessary tables, and adding further analysis to the report, the quality of the website was significantly improved.
Peer Specific:
- Post Engagement Explanation: Add explanation for why individuals post and engage in the subreddits.
- Search Terms: Look for terms such as “Remote work,” “Benefits,” and more to understand user interests.
- 2022 Posts Increase: Explain the observed increase in posts in 2022.
- Differentiating Technical Roles: Include in the technical proposal the intent to differentiate between various technical positions and their salary ranges.
- FAANG Companies Identification: Mention how we identified and extracted the names of FAANG companies.
- Connecting Topics 2 and 7: Attempt to find and establish connections between topics 2 and 7.
- Topic 8 - Value Addition: Provide more context on the value added by understanding features contributing to high popularity.
- Font Size Adjustment: Increase the font size of the x-axis and y-axis labels in plots.
- Comment/Submission Counts Representation: Convert the “Breakdown of Comment and Submission Counts by Subreddit” into percentages, and create a table for raw counts due to the dominance of ‘leetcode’.
- Plot Consistency: Ensure fixed sizes for heights, weights, and fonts in all plots for uniformity.
- Column Name Clarity: Clean and transform column names like “IS_TECH_COMMENTS” to “tech comments” for easier understanding in tables.
Almost all of the feedback above was addressed according to the suggestions mentioned. The suggestions to represent Comment/Submission Counts using percentages were not implemented. Actual numbers were used because they effectively demonstrate our utilization of big data. Using raw figures, as opposed to percentages, provides a clearer insight into the scale of data being analyzed.
NLP Work
- No feedback was provided. However, we applied most if not all appropriate feedback provided for EDA to this page which resulted in multiple upgrades.
ML Work
- No feedback was provided. However, we applied most if not all appropriate feedback provided for EDA to this page which resulted in multiple upgrades.
Website/Results
Instructor Specific:
- In terms of the website itself, instructor feedback was received specifying that the website looked similar to a jupyter notebook. Because this project was an analytic exploration of big data where usage of robust computing languages like PySpark was transparent, the aim was to strike a balance between accessibility and aesthetics. Therefore, we adopted Quarto, which allowed us to integrate code within the website yet keep it concealed in its respective sections. However, in an attempt to make it look less like a jupyter notebook, we transitioned to a more professional quarto theme. Additionally, we enhanced the website’s narrative flow with more text analysis, re-formatted the visuals for a more polished look, and transformed the tables to coincide with the professionalism of a business website.