Currently AI browser agents send screenshots to the model. Each screenshot costs thousands of tokens. Over a multi-step task, that means high latency and high API cost.
This package takes a different approach: it renders pages as ASCII wireframes with numbered elements. The agent sees [12]Sign Up instead of a 1280x720 image. Same information, far fewer tokens.
It started as a way to make my own agents cheaper to run. Then I build a package around it. Fully open source and open to feedbacks!