Social Media Datasets
Bulk exports of public Twitter/X, Instagram, TikTok, and Reddit data. Download as CSV or JSON. Free 100-record sample on every dataset.
12
datasets
3B+
records
from $0.0012
per record
$250
min order
Browse datasets
Filter by platform, record type, or search by name
Twitter / X — Posts
2.7B+ records
id, text, authorId, authorUsername, +16 more
Twitter / X — Profiles
570M+ records
id, username, name, description, +16 more
Instagram — Profiles
124M+ records
id, username, fullName, biography, +10 more
Instagram — Posts
13M+ records
id, postType, userId, username, +16 more
Instagram — Comments
64M+ records
id, text, parentPostId, type, +13 more
TikTok — Profiles
34M+ records
id, username, nickname, signature, +12 more
TikTok — Posts
3.6M+ records
id, postType, isPrivate, userId, +17 more
TikTok — Comments
50M+ records
id, text, postId, userId, +5 more
TikTok — Sounds
1.2M+ records
id, title, author, album, +5 more
Reddit — Profiles
15M+ records
id, username, profileUrl, profilePicUrl, +16 more
Reddit — Posts
4.6M+ records
id, title, selftext, url, +16 more
Reddit — Comments
120M+ records
id, body, parentPostId, parentId, +16 more
Build a custom dataset
Need specific filters, date ranges, or niche record types? Configure a custom order.
Platform
Record type
Price estimate
Custom (filtered) datasets are priced above standard data sets. Free 100-record sample available before purchase.
Fields included
20 fieldsTwitter / X — Posts · 2.7B+ available
Pricing & volume tiers
Flat per-record pricing across every platform — rates step down with volume. 1 record = one entity (one post, profile, or comment).
| Volume | Data set ($/record) | Custom ($/record) |
|---|---|---|
| 1M records | $0.0012 | $0.0020 |
| 10M records | $0.0010 | $0.0015 |
| 100M records | $0.0005 | $0.0009 |
Free sample: Every dataset includes 100 records at no charge so you can validate field structure and data quality before committing to an order. Minimum order is $250.
Need live queries instead?
Datasets are point-in-time exports. For real-time or ongoing data, use the Social Data API or SDK.
What teams build with datasets
Common workflows powered by bulk social data exports
AI / LLM training data
Fine-tune language models on authentic social conversations. Posts, comments, and threads from Reddit, Twitter, and TikTok provide rich, diverse training signals at scale.
Social listening
Track brand mentions, product sentiment, and competitor activity across all four platforms. Filter by keyword, date range, language, and engagement tier.
Influencer discovery
Build qualified creator lists from profile datasets with follower counts, engagement history, and verified status. Works across Twitter, Instagram, and TikTok.
Sentiment & market research
Run sentiment models on millions of comments and posts to surface opinion trends before they hit mainstream media. Ideal for financial research and product teams.
Trust & compliance
Every dataset is built on public data, handled responsibly
Public data only
All records sourced from publicly accessible posts and profiles.
Ethically sourced
No scraping of private accounts, DMs, or gated content.
Privacy-aware
Datasets contain no special-category data and support deletion requests.
Refreshed regularly
Snapshots updated monthly, quarterly, or on a one-time basis per your order.
Frequently asked questions
Start with a free sample. Scale when you are ready.
Every dataset ships a free 100-record sample. Minimum order is $250. Per-record rates step down at higher volumes.
