Power your product or AI Agent with billions of datapoints on hundreds of millions of people, refreshed monthly. Build your AI SDR, recruiting platform, internal or commercial investment tool with Crustdata's Full People Dataset. Delivered as Parquet files
Hi Product Hunt, Garry, CEO of Y Combinator, here and I’m excited to again support and help launch Crustdata (YC F24)'s newest product: Full People Dataset. This product delivers billions of datapoints on hundreds of millions of people, refreshed monthly.
Crustdata tracks B2B people and company data. Most of Crustdata’s 200 customers (including Y Combinator!) build their platforms - whether AI SDRs, AI recruiting platforms, internal or external investment platforms - on top of Crustdata’s APIs, but for teams who need extremely high levels of volume, the Crustdata People Dataset is an alternative option.
Benefits of the Full Dataset over APIs: -higher volume for lower cost -lower latency -more control over the data Built as the infra layer for AI Agent platforms: Access 180 million+ professional profiles Delivered via S3 as Parquet files
Each profile includes: Current job title Current Company Headline Location Profile photo Past job titles, companies, durations, and descriptions Education (university, school, degree) Activities and societies Skills Bio Licenses Certifications
How do they do this? They’ve developed technology that accesses the web in real time to gather information on people and companies. They unify live data and deliver it via APIs or Parquet files (full datasets).
Who is this for?
Recruiting Platforms
Use Crustdata’s full dataset product to build your tool/platform over their people data which would serve as your candidate data warehouse. You wouldn’t have to worry about getting the data yourself and spending time and manpower to keep it updated.
AI SDRs / Sales Automation platforms Get access to tens of millions of decision makers and prospects without wasting credits with APIs. Use the dataset for bulk lead generation or for preloading your sales agent with a prospect database. See changes in monthly refreshes that can act as intent signals. Investment platforms or teams Get a large dataset of people to track founder movements, map leadership teams, or build internal founder databases without relying on fragmented data sources.
Anyone building a tool that uses people data
For people who need clean, up-to-date people profiles but don’t want to work with APIs or need more control over large amounts of data you can access Crustdata’s people dataset.
Wohooo. Exactly, what I want. Thanks team!! :D
Would like to know if there is some plan to support startups which have limited budgets?
Best wishes!
This is super useful, especially for early-stage projects that need solid data without jumping through hoops. Love how straightforward it is. Great job putting this together—and congrats to the team on the launch!
@rishabh_raj13@danthevc I didn't see any obvious information about pricing — can you elaborate?
How much of this data is LinkedIn data? Products like Apollo are practically 99% LinkedIn data
Finally! Been waiting for something like this since Apollo.io's API started feeling clunky (´• ω •\`) Crustdata's dataset looks way more intelligent lol – already imagining how we'll build smarter AI SDR tools without burning API credits. Teams execution here is chefs kiss, congrats on launch!
I'm looking for an API like this for a product I'm working on called Kithbook. This looks incredibly powerful - congrats on the launch.
I'm curious when you guys are planning to do self-serve and do you have any thoughts on opening a credits-based system to try the dataset? I'm not VC-backed yet so I wanted to try and see if it works for Kithbook.
Why the dataset is only refreshed monthly and does not provide event-driven tracking like APIs?
Congrats on the launch! 🚀 Quick Questions: 1. From where you get these data and how you will update them. 2. is there any privacy concern? 3. What product can be build on top of this? 4. How to know data is correct?
Dataset is going to be key to drive both AI advancement and as well as academic research. Upvoted!
Quick question: I cannot see pricing model on the website. How can i know more about same. Also any special discount for academic researchers?
This sounds like a game-changer for teams that need comprehensive and up-to-date people data! The ability to access such vast and detailed datasets with lower latency and cost is definitely going to empower a lot of platforms. Congrats to the Crustdata team and thanks for supporting innovative tools, Garry! 🌟
Congrats on the launch! 🚀 This looks like a super valuable resource for anyone working with data-driven projects, especially in recruitment, analytics, or AI training. Curious to know more about how often the dataset is updated and if there are any privacy considerations built into how the data is sourced. Great work!