Skip to content
Tech News
← Back to articles

Show HN: Smart model routing directly in Claude, Codex and Cursor

read original more articles
Why This Matters

This new routing solution simplifies and optimizes model selection across multiple AI providers, enabling developers to seamlessly integrate and manage diverse models with a single endpoint. Its ability to automatically choose the best model for each request enhances performance, cost-efficiency, and security, making it a significant advancement for AI deployment and scalability in the industry.

Key Takeaways

One endpoint. Every model. Always the right one. A drop-in proxy for Anthropic, OpenAI, and Gemini that picks the best model for every request: using a tiny on-box embedder, not a vibes-based prompt. 🥇 #1 on the RouterArena leaderboard 1 — Acc-Cost Arena 76.09. Built by Weave: The #1 engineering intelligence platform, loved by Robinhood, PostHog, Reducto, and hundreds of others.

What it does

Point Claude Code, Codex, Cursor, or your own app at localhost:8080 . The router:

🎯 Routes per request. A cluster scorer derived from Avengers-Pro 2 picks the right model from your enabled providers, every turn.

A cluster scorer derived from Avengers-Pro picks the right model from your enabled providers, every turn. 🔌 Speaks everyone's API. Anthropic Messages, OpenAI Chat Completions, Gemini native. Streaming, tools, vision, the works.

Anthropic Messages, OpenAI Chat Completions, Gemini native. Streaming, tools, vision, the works. 🧠 Knows OSS too. DeepSeek, Kimi, GLM, Qwen, Llama, Mistral via OpenRouter (or any OpenAI-compatible endpoint).

DeepSeek, Kimi, GLM, Qwen, Llama, Mistral via OpenRouter (or any OpenAI-compatible endpoint). 🔒 BYOK by default. Provider keys stay on your box, encrypted at rest.

Provider keys stay on your box, encrypted at rest. 📊 Observable. OTLP traces out of the box. See your dashboard in the Weave dashboard (http://localhost:8080/ui/dashboard) or drop in Honeycomb, Datadog, Grafana, whatever.

30-second quickstart

The fastest way: point Claude Code, Codex, or opencode at the hosted Weave Router with one command. No clone, no Docker, no Postgres.

... continue reading