Make Claude Code faster and cheaper without losing context
Context Gateway cuts latency and token spend for Claude Code, Codex, and OpenClaw by compressing tool output while preserving important context. Setup takes less than a minute. Quality-of-life features include instant context compaction and a spend limit for Claude Code.
Hey all! 👋 We are releasing Context Gateway, our context compression proxy, which cuts token spend and improves accuracy and latency for Claude Code, OpenClaw, Codex, and other agents.
We built it because agents struggle to manage long context efficiently: each tool call can return thousands of redundant tokens, leading to unnecessary spend, higher latency, and lower generation quality. The proxy fixes that transparently by compressing the context before the agent has to deal with it.
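To make the idea concrete, here is a deliberately naive sketch of what "compressing tool output" can mean. It is not how Context Gateway works (the real compression uses models, which are not described here); the function name `compress_tool_output` and the `max_lines` parameter are hypothetical and exist only for illustration. It collapses runs of identical lines and keeps the head and tail of long output.

```python
from itertools import groupby


def compress_tool_output(text: str, max_lines: int = 40) -> str:
    """Naive, rule-based stand-in for context compression (illustrative only)."""
    if max_lines < 2:
        raise ValueError("max_lines must be at least 2")

    # Step 1: collapse runs of consecutive identical lines (e.g. repeated log lines).
    collapsed = []
    for line, run in groupby(text.splitlines()):
        count = sum(1 for _ in run)
        collapsed.append(line if count == 1 else f"{line}  [repeated {count}x]")

    if len(collapsed) <= max_lines:
        return "\n".join(collapsed)

    # Step 2: keep the start and end of the output and mark what was dropped.
    head = max_lines // 2
    tail = max_lines - head
    omitted = len(collapsed) - max_lines
    marker = f"[... {omitted} lines omitted ...]"
    return "\n".join(collapsed[:head] + [marker] + collapsed[-tail:])
```

A proxy sitting between the agent and the model API could apply a step like this to each tool result before forwarding the request, so the agent itself needs no changes.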
We've also added a number of quality-of-life features that Claude Code lacks: instant context compaction (the same /compact, but without the 3-minute wait), a spend cap, Slack notifications, and more.
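For a rough idea of how a spend cap can be enforced at the proxy level, here is a minimal sketch. Everything in it is an assumption made for illustration: the `SpendCap` class, its field names, and the per-token prices in the usage note are hypothetical and are not taken from Context Gateway or from any provider's pricing.

```python
from dataclasses import dataclass


@dataclass
class SpendCap:
    """Tracks estimated spend and reports whether further requests are allowed."""
    limit_usd: float
    usd_per_million_input: float   # hypothetical price, set by the user
    usd_per_million_output: float  # hypothetical price, set by the user
    spent_usd: float = 0.0

    def record(self, input_tokens: int, output_tokens: int) -> None:
        # Add the estimated cost of one completed request to the running total.
        cost = (
            input_tokens * self.usd_per_million_input
            + output_tokens * self.usd_per_million_output
        ) / 1_000_000
        self.spent_usd += cost

    def allows_request(self) -> bool:
        # Requests are allowed only while the running total is below the cap.
        return self.spent_usd < self.limit_usd
```

In use, a proxy would call `record` with the token counts reported in each API response and check `allows_request` before forwarding the next one, for example `SpendCap(limit_usd=5.0, usd_per_million_input=2.0, usd_per_million_output=10.0)`.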
We are open-sourcing everything except the models we use for context compression, which are free to use during the launch.
Excited to hear your feedback and the features you'd want next! 🚀