Product Thumbnail

Giskard

Open-source testing framework for LLMs & ML models

Open Source
Developer Tools
Artificial Intelligence
GitHub

Fast LLM & ML testing at scale πŸ›‘οΈ Detect hallucinations & biases automatically πŸ” Enterprise Testing Hub ☁️ Self-hosted / cloud 🀝 Integrated with πŸ€—, MLFlow, W&B From tabular models to LLMs, Giskard handles everything! https://github.com/Giskard-AI/giskard

Top comment

Hello Product Hunt, I'm Alex, here with Jean-Marie, Andrey, and the rest of the Giskard team. We're thrilled and slightly nervous to present Giskard 2.0. This has been 2 years in the making, involving a group of passionate ML engineers, ethicists, and researchers, in partnerships with leading standard organizations, such as AFNOR and ISO. So, why Giskard? Because we understand the dilemma you face. Manually creating test cases, crafting reports, building dashboards, and enduring endless review meetings - testing ML models can take weeks, even months! With the new wave of Large Language Models (LLMs), testing models becomes an even more impossible mission. The questions keep coming: Where to start? What issues to focus on? How to implement the tests? 🫠 Meanwhile, the pressure to deploy quickly is constant, often pushing models into production with unseen vulnerabilities. The bottleneck? ML Testing systems. Our experience includes leading ML Engineering at Dataiku and years of research in AI Ethics. We saw many ML teams struggling with the same issues: slowed down by inefficient testing, allowing critical errors and biases to slip into production. Current MLOps tools fall short. They lack transparency and don’t cover the full range of AI risks: robustness, fairness, security, efficiency, you name it. Add to this compliance to AI regulations, some of which could be punitive, costing up to 6% of your revenue (EU AI Act). Enter Giskard: πŸ“¦ A comprehensive ML Testing framework for Data Scientists, ML Engineers, and Quality specialists. It offers automated vulnerability detection, customizable tests, CI/CD integration, and collaborative dashboards. πŸ”Ž An open-source Python library for automatically detecting hidden vulnerabilities in ML and LLMs, tackling issues from robustness to ethical biases. πŸ“Š An enterprise-ready Testing Hub application with dashboards and visual debugging, built to enable collaborative AI Quality Assurance and compliance at scale. ∭ Compatibility with the Python ML ecosystem, including Hugging Face, MLFlow, Weights & Biases, PyTorch, Tensorflow, and Langchain. ↕️ A model-agnostic approach that serves tabular models, NLP, and LLMs. Soon, we'll also support Computer Vision, Recommender Systems, and Time Series. Equip yourself with Giskard to defeat your AI Quality issues! πŸ’πŸ›‘οΈ We build in the open, so we’re welcoming your feedback, feature requests and questions. For further information: Website: https://www.giskard.ai/ GitHub: https://github.com/Giskard-AI/gi... Discord Community: https://gisk.ar/discord Best, Alex & the Giskard Team

Comment highlights

Hey Team Giskard! Really pumped about Giskard 2.0 hitting the market! The automation of tests, compatibility with a variety of ML tools, and a strong commitment to standards – that sounds like the future of MLOps. One thing to watch out for is ensuring that expanding to accommodate growing data and models doesn't compromise performance or ease of use. The potential downside I see is the challenge of integrating with clients' established systems. Convincing teams to switch to new tech can be tough, even when it's better. How do you plan to tackle this adoption hurdle? And what cool features are you lining up to stay ahead of the curve?

Finally, a solution for the ML testing bottleneck! I'm curious to know, can Giskard handle testing for models trained on non-English languages effectively?

Wow, Giskard 2.0 sounds like it's on another level! What's the first thing you'd recommend trying out in Giskard for a newbie?

Amazing software highly recommend to everyone and thanks for sharing this with us

Fantastic work as always Giskard team! Congrats on this PH Launch and can't wait to test it

Congratulation on the successful launch of Giskard on Product Hunt ! I can't wait to try and provide you my feedback!