Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.
-
Updated
Jun 29, 2026 - Rust
Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.
List of Permanent Free LLM API (API Keys)
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
Context engineering for AI agents. ~80% fewer tokens. Fix tool overload. Skills and memory with in-process BM25 retrieval. No vector DB. No embeddings.
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.
Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.
Open-source, OpenAI-compatible LLM gateway you run yourself. One endpoint for 40+ providers, with virtual keys, budgets, and usage tracking.
Qualixar OS: The Universal OS for AI Agents. Claw-compatible. 12 topologies, Forge AI team designer, 24-tab dashboard, skill marketplace. PAPER: https://arxiv.org/abs/2604.06392
Edge-native AI API gateway — cost-optimized routing across providers, multi-protocol support, built on Cloudflare Workers.
Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.
A prompt-aware LLM router that predicts which models can complete each request, then selects the cheapest capable one: 53.2% lower cost and +1.9 pts completion on our tested dataset.
Automatic cost-aware model routing plugin for Hermes Agent
Web-based command center for managing Telegram bots — multi-bot dashboard, reverse proxy, inter-bot routing, protocol bridges, and LLM-powered smart routing
Combine the results from a panel of models into an enhanced response
Ultra-low-latency LLM gateway with microsecond caching, dynamic routing, budgets, analytics, and forecasting.
Route, manage, and analyze your LLM requests across multiple providers with a unified API interface
Tools, libraries, papers, and patterns for reducing the cost of running large language models in production.
Budget-Aware Agentic Routing (BAAR) — Intelligent LLM model selection with a zero-call financial kill-switch. Save 90% on costs without losing accuracy.
Add a description, image, and links to the llm-routing topic page so that developers can more easily learn about it.
To associate your repository with the llm-routing topic, visit your repo's landing page and select "manage topics."