Llama Stack
Build Once and Deploy Anywhere
Open Source
Developer Tools
Artificial Intelligence
GitHub
YouTube

Featured onJanuary 27th, 2025

Llama Stack defines and standardizes genAI agentic application development in various environments (on-prem, cloud, single-node, on-device) through a standard API interface and developer experience that’s optimized for use with Llama models.

Top comment

Upvotes187

▲ 187View on ProductHunt ⧉

Comments13

13 commentsSee comments on PH ⧉

Product of the Day7th

Hey Makers! 👋 I'm Raghotham Murthy from the Llama Stack team at Meta. I’m thrilled to share our latest stable API release of Llama Stack with the Product Hunt community. We built Llama Stack because we wanted to make it easy for developers to get a fast and reliable experience to build with Meta’s Llama models while giving them the ability to move their applications to inference and other API providers of their choice. What is Llama Stack? Llama Stack is an open source framework with a comprehensive and coherent interface that simplifies AI application development and codifies best practices across the Llama ecosystem. More specifically, it provides: Unified API layer for Inference, RAG, Agents, Tools, Safety, Evals, and Telemetry. Plugin architecture to support the rich ecosystem of implementations of the different APIs in different environments like local development, on-premises, cloud, and mobile. Prepackaged verified distributions which offer a one-stop solution for developers to get started quickly and reliably in any environment Multiple developer interfaces like CLI and SDKs for Python, Node, iOS, and Android Standalone applications as examples for how to build production-grade AI applications with Llama Stack Why Llama Stack? Flexible Options: Developers can choose their preferred infrastructure without changing APIs, enjoy flexible deployment choice, and use pre-configured toolkits to build upon and customize according to their needs. Consistent Experience: With its unified APIs Llama Stack makes it easier to build, test, and deploy AI applications with consistent application behavior and reduces reliance on multiple service providers for different AI capabilities. Robust Network: Llama Stack supports collaboration with distribution partners, which are cloud providers, hardware vendors, and AI-focused companies that offer tailored infrastructure, software, and services for deploying Llama models. We believe that by reducing friction and complexity, Llama Stack empowers developers to focus on what they do best: building transformative generative AI applications. We want your input! 🎯 Help us improve by sharing: What tools/platforms are you currently using to build applications on Llama models? What's your current stage of adoption? 🔍 Consideration 🧪 Pilot 🚀 Production Drop your thoughts, questions, and feedback below. We're listening and eager to learn from the community! Happy Hunting!

Comment highlights

where was this months ago?? The amount of custom code this would've saved me... 😅

This looks extremely powerful! 🧙‍♂️ It looks like this makes it possible to build a swift ios app that uses a locally running llama model within a llamastack (!!) In that deployment model what would the options be for a memory bank (sqlite?), or is that still not supported yet? Similar question for cloud hosting on aws bedrock, what would the recommendation/options be for a memory bank? Does aws bedrock support it natively or would a separate server/service be required?

Llama Stack’s potential to support various AI models beyond Llama is truly exciting! Expanding compatibility would empower developers to create more versatile applications, benefiting the entire AI community. Looking forward to seeing this evolution! 🎉🤖

Congrats on the release .How does @Llama Stack handle safety and evaluation processes at scale, especially for teams deploying across diverse environments like mobile and cloud?

Congratulations on the launch of Llama Stack! It's impressive to see a standardized approach to genAI application development. How does the standard API interface accommodate different deployment environments while ensuring consistent performance across all platforms?