Evaluate your AI app with the most accurate LLM Judge
Selene 1 is an LLM-as-a-Judge that evaluates AI responses with human-like precision. Get eval scores and actionable feedback via our API to boost your AI's reliability. Measure what matters to you by building custom evals in our Alignment Platform.
Hey Product Hunt! Maurice here, CEO and co-founder of Atla.
At Atla, we’re a team of researchers and engineers dedicated to training models and building tools that monitor AI performance.
If you’re building with AI, you know that good evals are critical to ensuring your AI apps perform as intended.
Turns out, getting accurate evals that assess what matters for your use case is challenging. Human evaluations don’t scale, and general-purpose LLMs are inconsistent evaluators. We’ve also heard that default eval metrics aren’t precise enough for most use cases, and prompt engineering custom evals from scratch is a lot of work.
🌖 Our solution
Selene 1: an LLM Judge trained specifically for evals. Selene outperforms all frontier models (OpenAI’s o-series, Claude 3.5 Sonnet, DeepSeek R1, etc.) across 11 benchmarks covering scoring, classification, and pairwise comparison tasks.
Alignment Platform: a tool that helps you automatically generate, test, and refine custom evaluation metrics from just a description of your task, with little-to-no prompt engineering required.
🛠️ Who is it for?
Builders of GenAI apps who need accurate and customizable evals, whether you’re fine-tuning LLMs, comparing outputs, or monitoring performance in production. Evaluate your GenAI products with Selene and ship with confidence.
You can start with our API for free. Our Alignment Platform is available to all users.
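To give a concrete picture of the workflow, here is a minimal sketch of scoring a single AI response over HTTP. The endpoint URL, request fields, and response keys below are illustrative assumptions rather than our documented API schema; check the docs for the real interface.

```python
import os
import requests

# Hypothetical endpoint and payload shape, shown only to illustrate the flow:
# send the prompt, your AI's response, and your criteria; get back a score
# plus a written critique.
API_URL = "https://api.atla-ai.com/v1/eval"  # placeholder URL

payload = {
    "model_input": "What is the capital of France?",     # the user prompt
    "model_output": "The capital of France is Paris.",   # your AI's response
    "evaluation_criteria": "The reply must be factually accurate and concise.",
}

response = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {os.environ['ATLA_API_KEY']}"},
    json=payload,
    timeout=30,
)
response.raise_for_status()

result = response.json()
# Assumed response keys: a numeric eval score and an actionable critique.
print(result.get("score"), result.get("critique"))
```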
We’d love your feedback in the comments! What challenges have you faced with evals?