Raindrop

Sentry for AI Products

AI engineers use Raindrop to get alerts about hidden issues and successes in their AI products. Raindrop sends alerts when your AI misbehaves and links straight to events, so you can dig into conversations or traces, understand the root cause, and fix it, fast.

Top comment

Hey PH! We’re excited to launch Raindrop. You can sign up for Raindrop to get issue alerts for your AI product.

AI products fail constantly—in ways both hilarious and terrifying.

Regular software throws exceptions. But AI products fail silently.

Raindrop is the first Sentry-like monitoring platform for AI products.

The Problem
Traditionally, when a user hits an error, it’s easy to detect and easy to get notified once it becomes an issue (this is exactly what Sentry does!). But when building AI products, these issues go undetected.

These issues can be serious:

- Figma had to roll back their AI design product when it copied Apple’s designs, and they haven’t relaunched since
- Air Canada got sued — and lost — when their chatbot incorrectly offered a customer a refund
- Virgin Money came under fire when their chatbot scolded customers for using the company name, ‘Virgin’

The status quo is sifting through millions of logs and trying to debug flaky evals.

Evals are not enough. Like unit tests, they only confirm that your model passes specific test cases. But in the real world, AI chatbots and agents encounter millions of unpredictable actions. AI engineers need issue monitoring to discover production issues so they can make AI products that not only pass tests but also perform well in the real world.

Solution
Raindrop sends you alerts when your AI misbehaves and links straight to the events, so you can dig into the conversations or traces, understand the root cause, and fix it, fast.

Daily Alerts Include
- Issues: detects issues like Assistant Forgetting, Laziness, Task Failure, User Frustration, and more, depending on the type of AI app
- Wins: surfaces what your product does well, so you can double down on those experiences and create great evals

The Pro tier lets you go even further with:

- Custom Issues / Topics: define and track any issue or topic
- Topic Clustering: clusters data in real time to find your AI product’s most popular use cases, and which use cases have the most issues
- Signals: finds patterns in explicit signals like thumbs up / thumbs down
- Deep Research: search your production AI data in natural language for any kind of behavior
- Traces: track every step of your AI call
- Edge PII Redaction: intelligently strip PII from any user and model messages
- Dataset Creation: create custom datasets out of any set of events
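
To make the Edge PII Redaction item concrete, here is a minimal sketch that masks email addresses and phone-like numbers in a message before it is sent anywhere. This is not Raindrop's implementation or API: the `redact_pii` helper and both regex patterns are assumptions invented for this example, and plain regexes are much cruder than "intelligently strip" implies.

```python
import re

# Hypothetical patterns for illustration only; real PII detection
# needs to cover far more cases (names, addresses, IDs, and so on).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def redact_pii(text: str) -> str:
    """Replace email addresses and phone-like digit runs with placeholders."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    return PHONE_RE.sub("[PHONE]", text)


if __name__ == "__main__":
    print(redact_pii("Mail me at jane.doe@example.com or call +1 (555) 010-2030."))
```

Running the redaction on the client side ("at the edge") means the raw values never leave your app, which is the point of doing it before logging.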

Companies like Clay.com, Tolans, Websim, and more have been using Raindrop to improve their AI products. They’ve been able to quickly iterate on fixes, see how issue incidence rates decrease in production, and confidently ship changes.

“Raindrop has been invaluable as we've been growing quickly. It's critical for us to keep issue incidence below an acceptable threshold and become aware of any spikes. It's like if we see an iOS crash report in Sentry, but for our AI capabilities.” - Evan Goldschmidt, CTO, Tolans
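
For anyone wondering what "keeping issue incidence below a threshold" looks like in practice, here is a minimal sketch: it computes the share of events flagged with an issue per day and lists the days that exceed a limit. The `Event` type, the function names, and the threshold value are all assumptions made up for this example, not Raindrop's data model or API.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Event:
    """Hypothetical record of one AI interaction."""
    day: str                # e.g. "2025-05-01"
    issue: Optional[str]    # issue label, or None if nothing was flagged


def incidence_by_day(events):
    """Return {day: flagged events / total events} for each day seen."""
    totals = Counter(e.day for e in events)
    flagged = Counter(e.day for e in events if e.issue is not None)
    return {day: flagged[day] / totals[day] for day in sorted(totals)}


def days_over_threshold(rates, threshold):
    """Return the days whose incidence rate is above the threshold."""
    return [day for day, rate in rates.items() if rate > threshold]


if __name__ == "__main__":
    events = [
        Event("2025-05-01", None),
        Event("2025-05-01", None),
        Event("2025-05-01", None),
        Event("2025-05-01", "Task Failure"),
        Event("2025-05-02", "User Frustration"),
        Event("2025-05-02", "Task Failure"),
    ]
    rates = incidence_by_day(events)
    print(rates)
    print(days_over_threshold(rates, threshold=0.5))
```

Tracking a rate rather than a raw count keeps the number comparable as traffic grows, so a spike means the product got worse, not just busier.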

Excited to get your feedback!!