llm-routing

Here are 171 public repositories matching this topic...

katanemo / plano

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

proxy routing gateway prompt proxy-server openai envoy envoyproxy llms generative-ai llmops llm-inference llm-proxy ai-gateway llm-gateway llm-routing ai-gateway-support

Updated Jun 29, 2026
Rust

mnfst / awesome-free-llm-apis

Star

List of Permanent Free LLM API (API Keys)

awesome router gemini openai awesome-list ai-agents llm anthropic ollama llm-router llm-routing openclaw openclaw-plugin

Updated Jun 16, 2026
JavaScript

thushan / olla

Sponsor

Star

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

Updated Jun 28, 2026
Go

junchenzhi / Awesome-LLM-Ensemble

Star

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

multi-agent moe ensemble ensemble-learning routing-algorithm multi-agent-systems ensemble-prediction ensemble-models ensemble-machine-learning ensemble-methods large-language-models llms llm-agents llm-routing llm-collaboration llm-ensemble multi-llms

Updated Jun 29, 2026
HTML

ratel-ai / ratel

Star

Context engineering for AI agents. ~80% fewer tokens. Fix tool overload. Skills and memory with in-process BM25 retrieval. No vector DB. No embeddings.

skills memory optimization mcp context accuracy agents harness rag llm tool-selection tool-calling llm-routing mcp-server token-optimization claude-skills

Updated Jul 1, 2026
TypeScript

RouteWorks / RouterArena

Star

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

arena routing multi-agent multi-agent-systems router-benchmark llm llm-router llm-routing router-evaluation router-leaderboard

Updated Jun 23, 2026
Python

Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.

multi-tenant self-hosted anthropic openai-proxy llm-proxy cost-tracking ai-gateway llm-gateway llm-routing ai-router budget-enforcement

Updated Jun 29, 2026
TypeScript

mozilla-ai / otari

Star

Open-source, OpenAI-compatible LLM gateway you run yourself. One endpoint for 40+ providers, with virtual keys, budgets, and usage tracking.

python open-source ai gateway self-hosted openai multi-provider budgets llm llmops anthropic llm-proxy cost-tracking ai-gateway api-key-management llm-gateway llm-routing openai-compatible litellm-alternative

Updated Jul 1, 2026
Python

qualixar / qualixar-os

Star

Qualixar OS: The Universal OS for AI Agents. Claw-compatible. 12 topologies, Forge AI team designer, 24-tab dashboard, skill marketplace. PAPER: https://arxiv.org/abs/2604.06392

typescript mcp multi-agent ai-agents agent-framework llm-routing agent-orchestration agent-reliability agent-os qualixar judge-pipeline

Updated May 25, 2026
TypeScript

BingoWon / keyaos

Star

Edge-native AI API gateway — cost-optimized routing across providers, multi-protocol support, built on Cloudflare Workers.

react typescript api-proxy edge-computing multimodal cloudflare-workers openai-api anthropic ai-gateway llm-routing

Updated May 21, 2026
TypeScript

kalibr-ai / kalibr-sdk-python

Star

Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.

Updated Jun 3, 2026
Python

xorbitsai / xrouter-llm

Star

A prompt-aware LLM router that predicts which models can complete each request, then selects the cheapest capable one: 53.2% lower cost and +1.9 pts completion on our tested dataset.

ai model-selection ai-agents cost-optimization llm llmops openrouter llm-router llm-routing model-routing prompt-routing

Updated Jun 29, 2026
Python

open-world-project / model-router

Star

Automatic cost-aware model routing plugin for Hermes Agent

python plugin ai-agent openrouter llm-routing hermes-agent

Updated May 10, 2026
Python

skrashevich / botmux

Star

Web-based command center for managing Telegram bots — multi-bot dashboard, reverse proxy, inter-bot routing, protocol bridges, and LLM-powered smart routing

bot docker golang telegram dashboard sqlite proxy webhook self-hosted admin-panel botapi bot-management longpolling slack-bridge llm-routing webupdates

Updated Jul 1, 2026
Go

animaios / animarouter

Star

smart routing that learns

free-tier llm-router llm-routing free-llm

Updated Jul 1, 2026
TypeScript

shahar-dagan / openfusion

Star

Combine the results from a panel of models into an enhanced response

open-source machine-learning ai ml fusion model-fusion llm llm-evaluation ai-gateway llm-gateway llm-routing

Updated Jul 1, 2026
Python

Hyperion-HQ / Hyperion

Star

Ultra-low-latency LLM gateway with microsecond caching, dynamic routing, budgets, analytics, and forecasting.

Updated Apr 2, 2026
Go

deltawi / deltallm

Star

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface

kubernetes api-gateway mcp self-hosted multi-llm llm-proxy ai-gateway ai-infrastructure llm-gateway llm-routing model-context-protocol openai-compatible

Updated Jun 29, 2026
Python

ankitvirdi4 / awesome-llm-cost

Star

Tools, libraries, papers, and patterns for reducing the cost of running large language models in production.

awesome gemini openai awesome-list quantization finops cost-engineering llm prompt-caching anthropic llm-observability llm-cost llm-routing llm-caching ai-cost

Updated Jun 5, 2026

orvi2014 / Baar-Core

Star

Budget-Aware Agentic Routing (BAAR) — Intelligent LLM model selection with a zero-call financial kill-switch. Save 90% on costs without losing accuracy.

python-library openai budget-management ai-safety cost-optimization langchain agentic-ai llm-routing

Updated May 20, 2026
Python

Improve this page

Add a description, image, and links to the llm-routing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-routing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-routing

Here are 171 public repositories matching this topic...

katanemo / plano

mnfst / awesome-free-llm-apis

thushan / olla

junchenzhi / Awesome-LLM-Ensemble

ratel-ai / ratel

RouteWorks / RouterArena

Inebrio / Routerly

mozilla-ai / otari

qualixar / qualixar-os

BingoWon / keyaos

kalibr-ai / kalibr-sdk-python

xorbitsai / xrouter-llm

open-world-project / model-router

skrashevich / botmux

animaios / animarouter

shahar-dagan / openfusion

Hyperion-HQ / Hyperion

deltawi / deltallm

ankitvirdi4 / awesome-llm-cost

orvi2014 / Baar-Core

Improve this page

Add this topic to your repo