Skip to main content

Reddit Scraper

Collects company discussions from Reddit to provide social sentiment data.

Overview

AttributeValue
SourceReddit API (PRAW)
AuthOAuth2
Rate Limit60 requests/minute
Cache6 hours

Data Collected

FieldDescription
titlePost title
textPost body + top comments
scoreUpvotes
subredditSource subreddit
datePost creation date

Target Subreddits

SubredditContent
r/jobsJob hunting, reviews
r/careerguidanceCareer advice
r/cscareerquestionsTech companies
r/antiworkWorkplace issues
r/germanyDACH-specific

Search Strategy

# Search across multiple subreddits
posts = search_company_posts("BMW", limit=100)

Relevance Filtering

  • Minimum 5 upvotes
  • Must mention company name
  • Exclude job listings
  • Exclude promotional posts

Sentiment Derivation

No star ratings, so we derive sentiment from:

  • Post score (upvotes)
  • Comment tone
  • Keyword detection
  • Subreddit context

Privacy

  • Only public posts collected
  • Author names not stored
  • Comply with Reddit API terms

Reddit provides unfiltered employee perspectives.