LiteLLM is an open-source AI gateway and Python SDK that unifies access to more than 100 large language model (LLM) APIs. Its primary purpose is to give developers and enterprises a single, consistent interface to a wide array of LLM providers (including OpenAI, Anthropic, Google Gemini, AWS Bedrock, Azure, Vertex AI, Cohere, Hugging Face, SageMaker, vLLM, NVIDIA NIM, and many others) using the familiar OpenAI API format. This eliminates the need to manage separate SDKs, authentication patterns, request formats, and error-handling conventions for each provider.
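As a minimal sketch of that unified interface (model names and environment variables below are illustrative, not prescriptive): the same `completion()` call shape works for any supported provider, with an optional provider prefix in the model string selecting the backend.

```python
# Illustrative sketch of LiteLLM's unified completion() interface.
# Model strings and env-var names are examples, not recommendations.
import os

messages = [{"role": "user", "content": "Summarize LiteLLM in one sentence."}]

try:
    from litellm import completion  # pip install litellm

    if os.environ.get("OPENAI_API_KEY"):
        # OpenAI-format response object, regardless of provider:
        resp = completion(model="gpt-4o-mini", messages=messages)
        print(resp.choices[0].message.content)

    if os.environ.get("ANTHROPIC_API_KEY"):
        # Same call shape; the "anthropic/" prefix routes to Anthropic.
        resp = completion(
            model="anthropic/claude-3-5-sonnet-20240620", messages=messages
        )
        print(resp.choices[0].message.content)
except ImportError:
    pass  # litellm not installed; the call shape above is the point
```

Switching providers means changing one string, not rewriting request construction or response parsing.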
LiteLLM can be used in two main ways: as a Python SDK for direct integration into applications, or as a self-hosted proxy server (AI Gateway) that centralizes LLM access for teams and organizations. The proxy server acts as a drop-in replacement for the OpenAI API, allowing users to switch between providers without rewriting their code. This flexibility is particularly valuable for organizations seeking to optimize costs, performance, or compliance by leveraging multiple LLMs.
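A minimal proxy configuration might look like the following (the model aliases and key references here are illustrative). The proxy exposes OpenAI-compatible endpoints, so existing OpenAI clients only need their base URL changed to point at it.

```yaml
# config.yaml -- minimal, illustrative LiteLLM proxy configuration
model_list:
  - model_name: gpt-4o                 # public alias clients request
    litellm_params:
      model: openai/gpt-4o             # actual provider/model behind it
      api_key: os.environ/OPENAI_API_KEY
  - model_name: claude-sonnet
    litellm_params:
      model: anthropic/claude-3-5-sonnet-20240620
      api_key: os.environ/ANTHROPIC_API_KEY
```

The server is started with `litellm --config config.yaml`, after which clients send standard OpenAI-format requests to the proxy (by default on port 4000) using the aliases defined under `model_name`.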
Key features of LiteLLM include unified API endpoints for chat completions, embeddings, image generation, audio transcription, batch processing, reranking, and more. The gateway supports advanced production-ready capabilities such as virtual API keys, spend tracking, guardrails for safety and compliance, load balancing across providers, and an admin dashboard for monitoring and management. LiteLLM is engineered for high performance, with benchmarks showing 8ms P95 latency at 1,000 requests per second, making it suitable for demanding enterprise workloads.
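The load-balancing capability can also be used client-side through `litellm.Router`, sketched below under assumed deployment names (the Azure deployment name is hypothetical, and a real Azure entry would also need its endpoint details such as `api_base`).

```python
# Illustrative sketch: load balancing two deployments behind one alias
# with litellm.Router. Deployment names here are assumptions.
import os

model_list = [
    {
        "model_name": "gpt-4o",  # alias the application requests
        "litellm_params": {"model": "openai/gpt-4o"},
    },
    {
        "model_name": "gpt-4o",  # second deployment under the same alias
        "litellm_params": {"model": "azure/my-gpt4o-deployment"},
    },
]

try:
    from litellm import Router

    router = Router(model_list=model_list)
    if os.environ.get("OPENAI_API_KEY"):
        # The router picks a deployment for the "gpt-4o" alias and can
        # fail over to the other if one provider errors out.
        resp = router.completion(
            model="gpt-4o",
            messages=[{"role": "user", "content": "ping"}],
        )
        print(resp.choices[0].message.content)
except Exception:
    pass  # litellm missing or credentials unset; model_list shows the shape
```

Registering multiple deployments under one alias is what lets applications keep a single model name while traffic is spread, or shifted, across providers.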
Beyond basic LLM access, LiteLLM supports agent-based workflows via the A2A (Agent2Agent) protocol, enabling invocation of agents from providers such as LangGraph, Vertex AI Agent Engine, Azure AI Foundry, Bedrock AgentCore, and Pydantic AI. This lets developers build more complex, multi-step AI applications. The repository also includes MCP (Model Context Protocol) tools, which facilitate integration with MCP servers and expose tool usage in the OpenAI format, further extending the platform's versatility.
LiteLLM’s compatibility extends to a wide range of endpoints and providers, as detailed in its documentation and supported models list. The project is actively adopted by major organizations such as Stripe, Netflix, Google, and others, demonstrating its reliability and scalability in real-world production environments. Deployment is streamlined with support for platforms like Render and Railway, and the project offers extensive documentation, community support via Discord and Slack, and an enterprise tier for organizations with advanced needs.
In summary, LiteLLM is a robust, enterprise-ready solution for managing LLM access across multiple providers: it abstracts away provider-specific APIs, delivers high performance and scalability, and provides the features needed for cost control, security, and operational management. Whether used as a Python SDK or as a centralized proxy server, LiteLLM lets developers and organizations build, deploy, and manage AI applications efficiently and flexibly in the rapidly evolving landscape of generative AI.