Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: routing

Show HN: ArchGW – An intelligent edge and service proxy for agents

Arch is a proxy server designed as a modular edge and AI gateway for agentic apps. Arch handles the pesky low-level work in building agentic apps, such as applying guardrails, clarifying vague user input, routing prompts to the right agent, and unifying access to any LLM. It’s a language- and framework-friendly infrastructure layer designed to help you build and ship agentic apps faster.

New 1.5B router model achieves 93% accuracy without costly retraining

Researchers at Katanemo Labs have introduced Arch-Router, a new routing model and framework designed to intelligently map user queries to the most suitable large language model (LLM). For enterprises building products that rely on multiple LLMs, Arch-Router aims to solve a key challenge: how to direct queries to the best model for the job

Optimizing Tool Selection for LLM Workflows with Differentiable Programming

Modern agentic architectures rely heavily on chaining LLM calls. A typical pattern looks like:

1. Use an LLM to decide which tool to invoke.
2. Call the tool (e.g. search, calculator, API).
3. Use another LLM call to interpret the result and generate a final response.

This structure is easy to reason about, simple to prototype, and generalizes well. But it scales poorly. Each LLM call incurs latency, cost, and token overhead. More subtly, it compounds context: every step includes not only the original q
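The three-step pattern described in this excerpt can be sketched roughly as follows. This is a minimal illustration, not the article's code: `fake_llm_choose_tool` and `fake_llm_summarize` are hypothetical stand-ins for real LLM calls, and the `calculator` and `search` tools are stubs invented for the example. The call counter only makes visible that each request pays for two LLM round trips.

```python
import re
from typing import Callable, Dict, Tuple


def fake_llm_choose_tool(query: str) -> str:
    """Hypothetical stand-in for LLM call #1: decide which tool to invoke."""
    return "calculator" if re.search(r"\d", query) else "search"


def calculator(query: str) -> str:
    """Stub tool: sum every integer that appears in the query."""
    return str(sum(int(n) for n in re.findall(r"\d+", query)))


def search(query: str) -> str:
    """Stub tool: pretend to run a search."""
    return f"(stub search results for: {query})"


# Tool registry keyed by the name the "LLM" returns.
TOOLS: Dict[str, Callable[[str], str]] = {
    "calculator": calculator,
    "search": search,
}


def fake_llm_summarize(query: str, tool_name: str, tool_output: str) -> str:
    """Hypothetical stand-in for LLM call #2: interpret the tool result."""
    return f"Answer to '{query}' via {tool_name}: {tool_output}"


def run_pipeline(query: str) -> Tuple[str, int]:
    """Run the chained pattern and report how many LLM calls it used."""
    llm_calls = 0

    tool_name = fake_llm_choose_tool(query)  # step 1: LLM picks a tool
    llm_calls += 1

    tool_output = TOOLS[tool_name](query)  # step 2: call the tool

    answer = fake_llm_summarize(query, tool_name, tool_output)  # step 3
    llm_calls += 1

    return answer, llm_calls


if __name__ == "__main__":
    print(run_pipeline("What is 2 + 3?"))
```

In a real system both stand-ins would be network calls to a model, so the latency and token cost noted in the excerpt are incurred twice per request, and the second call also has to carry the original query plus the tool output in its context.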