Langtail 1.0

The low-code platform for testing AI apps

SaaS
Developer Tools
Artificial Intelligence

LLM testing made easy with a spreadsheet-like interface. Score tests with natural language, pattern matching, or code. Optimize LLM apps by experimenting with models, parameters, and prompts. Gain insights from test results and analytics.

Top comment

Great to see Langtail evolving into a full-fledged testing suite for AI apps! The spreadsheet-style interface is exactly what I've been looking for - been struggling with messy prompt iterations in my recent projects. Love that you've added hosted tools and that AI firewall feature (seriously, prompt injection has been keeping me up at night 😅). The self-hosting option is a huge plus for enterprise teams who need to keep everything in-house. Feels like you guys really listened to the community pain points and delivered. Definitely giving this a spin on my next LLM project! 👍

Comment highlights

I recommend Langtail 1.0 for those who want to simplify the testing and optimization of applications based on large language models (LLMs). The platform offers a table-like interface, making testing accessible and easy to understand.

Its side-by-side model comparison is such a practical feature. It's helpful for quickly evaluating different setups to find the best results. @durk0

I really like the new light theme option. It's always nice to have a choice of themes for a better user experience. @durk0

The stateful assistants with memory management are a big win. It makes building conversational models more manageable. @durk0

Having shareable AI apps is perfect for collaboration. It's nice to let others test your app without extra sign ups. @durk0

Langtail's hosted tools make testing much easier without the need for external setup. Great for prototyping on the go. @durk0

I appreciate the AI firewall feature. It's reassuring to have those security measures to prevent prompt injections and other vulnerabilities. @durk0

The Magic Buttons feature is a clever way to automate repetitive tasks. It definitely speeds up prompt iteration and testing. @khanenyao

The cost analytics feature is a smart addition. It's so useful to track and control spending when working with large-scale models. @khanenyao

I love the versatility of Langtail's integration. The option to self host gives teams added flexibility and control. @durk0

This platform looks great for streamlining LLM testing. The spreadsheet interface makes test case organization straightforward and efficient. @khanenyao

This tool sounds promising, but I would like to see more examples of its effectiveness in real-world scenarios.

The learning curve fascinates me. Will it be simple for non-technical people to adjust?

Great job on the launch! Can you tell me more about the analytics feature and how it can help improve LLM apps?

I have used quite a few options out there, and this is probably the nicest UI I've seen. Now, it does lack (or maybe I didn't see) a couple of things I'd like to see:
- Can I do bulk updates from the API?
- How would I unify version control with the prompts in my system?
- Prompts in actual systems are usually composed (i.e. dynamically created), so the evaluation should ideally pick up from that moment on.
- More preset evaluations, particularly for RAG evals.
Really great work @petrbrzek, happy to chat if you want to brainstorm some options!

The ability to have self-hosting options makes Langtail flexible for both large teams and solo devs! :)

Huge congrats to the Langtail team on the 1.0 launch! I love how you've simplified LLM testing with a spreadsheet-like interface - who wouldn't want to tame AI complexity with a familiar feel? Quick question: Can you share an example of a surprising insight or optimization that a beta tester achieved using Langtail's analytics?

I was always testing LLMs quite manually, but this can be a game changer. Congrats on the launch, I will definitely try this out!

This approach could really accelerate AI deployment and feedback cycles. Excited to see how Langtail handles integration with various AI models and if there are tools for automating complex test scenarios!