Most “what watch should I buy?” discussions online skew heavily male. A friend wanted to launch a women’s watch, so I helped with a small data analysis. In this post I’ll walk through a small but complete Python pipeline I built: Scrape relevant posts and comments from Reddit with no API keys Filter out irrelevant posts (e.g. men asking for themselves) Run NLP analysis: sentiment, brands, features, prices, keywords, clustering, topic modeling Generate visualizations and CSVs you can explore further Everything here is powered by standard Python libraries: requests, pandas, nltk, scikit‑learn, and wordcloud. 1. Collecting Reddit data without API keys We didn’t use the official Reddit API; instead we hit the public JSON endpoints directly using requests.…