Foundry (YC F24) Is Hiring
Published on: 2025-05-12 03:01:00
Browser agents are broken—and whoever fixes them will shape the next decade of software.
Today, even the best browser agents from labs like OpenAI, Anthropic, and Google fail on more than 80% of real-world tasks and often take three times as long as humans to complete simple actions. Foundry is addressing this by building the first robust simulator, RL training environment, and evaluation platform designed specifically for browser agents. Historically, simulation environments and standardized benchmarks were critical to advancing self-driving cars (e.g., Waymo Sim, KITTI) and LLMs (e.g., HELM, MMLU). We're applying the same proven approach to browser automation, enabling accurate benchmarking, rapid iteration, and real-world reliability.
For example, OpenAI could use Foundry to build a perfect replica of DoorDash's website, enabling them to run millions of ordering tests without ever touching real-world complexities like CAPTCHAs, payments, or anti-bot measures. This approach lets them clearly pinp