Product Thumbnail

Airtrain.ai LLM Playground

Vibe-check many open-source and proprietary LLMs at once

Developer Tools
Artificial Intelligence
Data Science

A no-code LLM playground to vibe-check and compare quality, performance, and cost at once across a wide selection of open-source and proprietary LLMs: Claude, Gemini, Mistral AI models, Open AI models, Llama 2, Phi-2, etc.

Top comment

Hello Product Hunt community! 🚀 We're very proud to introduce the Airtrain.ai LLM Playground, a no-code tool to prompt many open-source and proprietary LLMs at once: Claude, Gemma, GPT-4, Llama 2, Gemini, Phi-2, Mistral models, and more. Compare quality, cost, and performance. We built this playground to help AI enthusiasts and practitioners of all stripes easily “vibe check” popular LLMs. Key features include: 📌 Prompt multiple models at once 📌 18 models supported (8 open-source, 10 proprietary) 📌 Inference metrics (i/o token counts, throughput, inference cost) 📌 Persisted sessions (review and resume previous chat sessions) We'd love for you to try it out and share your feedback with us. Feel free to ask any questions, and we'll be more than happy to answer them. Thanks so much for your support, and we hope you enjoy using the LLM Playground! ✨

Comment highlights

Create a similar tool inside of our tool in a couple minutes: https://app.wordware.ai/r/26dcd1... Check it out :) Let us know if you would like to expand the functionality of airtrain we would be happy to help :)

Hi @emmanuel_turlay I love Airtrain.ai's comprehensive playground for LLM comparison! It's a brilliant tool for developers and AI enthusiasts like me :) Have you considered integrating a community-driven feature where users can share their findings or best practices with specific models? I think this would help with the platform's utility and have shared insights.

Congrats on the launch, Emmanuel and team! This looks really amazing and I’ll give it a try soon :)

Very cool! As a full stack dev who's primarily used OpenAI and Gemini models for code generation, this is super useful for comparing performance and quality of other LLMs.

Hey! This is really cool Thanks for what you're doing for the LLM community. I think investments like this in accessibility around these models is going to be critical to fulfill it's full potential. Love the vibe check positioning too lol

Hi, guys, congrats! How frequently are the supported LLM models updated or refreshed in the Airtrain.ai playground? Good luck with the developing ✨

This is awesome. Really like the broad coverage across both OS and closed LLMs. Good luck with the launch!

Just Saw a post About Airtrain on Social Media. It is a really great tool. Great work, Airtrain Team

Congrats on the launch! The availability of open-source LLM is only part of the solution - the real big missing piece for faster adoption of them is tools like these to easily evaluate, pick, and customize models. Looking forward to trying it out!

The side-by-side comparison is very nice, makes it really easy to compare the different models.

Super interesting! Gotta say I love the inference metrics -- makes it way easier to compare costs than what I've been doing. Claude 3 is so pricey!