Your ICP is already telling you what they want — on Reddit. They're discussing pain points, evaluating competitors, and sharing unfiltered opinions in the exact subreddits where your buyers hang out.
This tool scrapes those conversations and uses Claude AI to extract structured, actionable insights — buying signals, pain point clusters, content ideas, and competitor sentiment — for any B2B vertical. Point it at different subreddits and search queries to research any market.
- Scrapes targeted subreddits (r/sales, r/b2bmarketing, r/SaaS, etc.) for discussions about visitor identification, intent data, and lead enrichment
- Analyzes threads with Claude to extract structured insights
- Generates markdown reports with:
- Buying signals (people actively looking for solutions)
- Pain points (grouped by theme)
- Content ideas (questions that could become blog posts)
- Competitor sentiment analysis
- Emerging industry themes
-
Clone and install dependencies
cd reddit-customer-insights pip install -r requirements.txt -
Configure API credentials
cp .env.example .env
Edit
.envwith your Anthropic API key:ANTHROPIC_API_KEY=your_key_hereGet your key from https://console.anthropic.com
-
Run the tool
python main.py --output report.md --limit 5
| Option | Default | Description |
|---|---|---|
--output, -o |
insights_report.md |
Output file path |
--limit, -l |
10 |
Max threads per subreddit/query combo |
--with-comments |
disabled | Fetch comments for each thread (slower) |
Edit reddit_insights/config.py to modify:
TARGET_SUBREDDITS: Which subreddits to searchSEARCH_QUERIES: What terms to search forTIME_FILTER: How far back to search (hour, day, week, month, year, all)
See sample_output.md for an example of what the generated report looks like.
-
Scraping: Uses Reddit's public JSON API to search configured subreddits with relevant queries. Includes rate limiting (3s between requests) with exponential backoff for reliability. Deduplicates threads that match multiple queries.
-
Analysis: Sends all thread content to Claude in a single batch request. Claude extracts structured insights following a detailed prompt template.
-
Reporting: Transforms Claude's JSON response into a readable markdown report with proper formatting, links, and tables.
- Python 3.9+
- Anthropic API key (usage-based pricing)
No Reddit API credentials required - uses Reddit's public JSON endpoints.
A typical run analyzing ~300 threads uses approximately 30-50K input tokens and 3-5K output tokens with Claude Sonnet, costing roughly $0.10-0.20 per run.