Description: LiteRT, the successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms via efficient conversion, runtime, and optimization.
Repository: google-ai-edge/LiteRT on GitHub
LiteRT, developed by Google, is an on-device framework designed for high-performance machine learning (ML) and generative AI (GenAI) deployment on edge platforms. It serves as the successor to TensorFlow Lite, building upon its legacy to provide a more efficient and optimized solution for running AI models directly on devices. The primary purpose of LiteRT is to enable developers to bring the power of AI to the edge, offering a streamlined and performant experience for various hardware platforms.
The core functionality of LiteRT revolves around efficient model conversion, runtime execution, and optimization. It facilitates the deployment of ML and GenAI models on a wide range of devices, including smartphones, tablets, and embedded systems. Key features include advanced GPU/NPU acceleration, strong ML and GenAI performance, and a simplified developer experience. The framework offers a new LiteRT Compiled Model API, which streamlines development through automated accelerator selection, asynchronous execution, and efficient I/O buffer handling, reducing the time and complexity of integrating AI models into applications.
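The automated accelerator selection described above can be pictured with a small sketch. This is purely illustrative logic written in plain Python, not the LiteRT API: the names `AVAILABLE` and `select_accelerator` are invented here to show the fallback idea (prefer the most capable accelerator, fall back gracefully).

```python
# Illustrative sketch only: these names are hypothetical, not LiteRT's API.
# It models the idea behind the Compiled Model API's automated accelerator
# selection: try the most capable accelerator first, then fall back.

AVAILABLE = {"cpu"}  # pretend only the CPU is present on this device


def select_accelerator(preference=("npu", "gpu", "cpu")):
    """Return the first requested accelerator the device actually has."""
    for accel in preference:
        if accel in AVAILABLE:
            return accel
    raise RuntimeError("no usable accelerator")


# On this simulated device, NPU and GPU are absent, so selection
# falls through to the CPU.
print(select_accelerator())
```

The real API hides this decision entirely; the point is that application code states a preference once and the runtime resolves it per device.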
One of the significant advancements in LiteRT is its unified NPU acceleration. It provides seamless access to NPUs from major chipset providers, ensuring a consistent developer experience across different hardware. This allows developers to leverage the specialized processing capabilities of NPUs for faster and more energy-efficient inference. Furthermore, LiteRT boasts best-in-class GPU performance, utilizing state-of-the-art GPU acceleration techniques. The framework's buffer interoperability minimizes latency and enables zero-copy operations across various GPU buffer types, further enhancing performance.
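Zero-copy buffer interoperability can be shown in miniature. The following self-contained snippet uses Python's built-in `memoryview` to illustrate the concept (two readers over one allocation, no bytes duplicated); it does not use LiteRT's actual buffer API.

```python
# Conceptual sketch of zero-copy I/O: the consumer reads through a
# memoryview over the producer's buffer, so no bytes are duplicated.
# This illustrates the idea behind buffer interoperability; it is not
# the framework's real buffer-sharing mechanism.

frame = bytearray(b"\x00" * 8)  # e.g. a camera frame owned by the producer
view = memoryview(frame)        # zero-copy window onto the same memory

frame[0] = 255                  # producer writes a new pixel value
assert view[0] == 255           # consumer sees it without any copy
```

In LiteRT's case the same principle applies across GPU buffer types, which is what eliminates copy latency between stages of an inference pipeline.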
LiteRT supports a broad range of platforms, including Android, iOS, Linux, macOS, Windows, Web, and IoT devices. This cross-platform compatibility allows developers to target a wide audience and deploy their AI-powered applications on various devices. The framework provides CPU, GPU, and NPU support on these platforms, enabling developers to choose the optimal hardware acceleration for their specific needs. The documentation highlights the supported hardware, including Google Tensor, Qualcomm, MediaTek, and other chipsets for NPU acceleration.
The repository provides clear guidance on getting started with LiteRT, including installation instructions and various "Choose Your Adventure" paths. These paths cater to different developer needs, such as converting PyTorch models, running pre-trained models, and maximizing performance. For developers working with PyTorch models, LiteRT offers tools like the LiteRT Torch Converter and LiteRT Generative Torch API to facilitate model conversion and deployment. For those new to on-device ML, the framework provides step-by-step instructions and sample applications to help them get started quickly. Developers seeking to maximize performance can leverage the LiteRT API to accelerate their existing models. For those working with GenAI models, LiteRT-LM is available for efficient deployment.
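The PyTorch path above (convert a model, then run it with the on-device runtime) can be sketched end to end. Every name in this snippet (`TorchModel`, `convert`, `CompiledModel`) is a stand-in invented for illustration; consult the LiteRT Torch Converter documentation for the real API.

```python
# Sketch of the PyTorch-to-LiteRT workflow described above. All names
# here are hypothetical placeholders, not the real converter API.

class TorchModel:
    """Stand-in for a trained PyTorch module."""

    def forward(self, x):
        return x * 2


class CompiledModel:
    """Stand-in for the on-device artifact the converter produces."""

    def __init__(self, source):
        self._source = source

    def run(self, x):
        # A real compiled model would dispatch to CPU/GPU/NPU kernels.
        return self._source.forward(x)


def convert(torch_model):
    """Stand-in converter: authoring format in, deployable format out."""
    return CompiledModel(torch_model)


compiled = convert(TorchModel())
print(compiled.run(21))  # -> 42
```

The shape of the workflow is the takeaway: conversion happens once at build time, and applications interact only with the compiled artifact at runtime.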
The repository also outlines the project's roadmap, which includes expanding hardware acceleration, optimizing for GenAI models, improving developer tools, and enhancing platform support. The project actively encourages contributions and provides resources for getting help, including GitHub issues and discussions. LiteRT is part of a larger ecosystem of Google AI Edge tools, including LiteRT Samples, LiteRT Torch Converter, LiteRT-LM, XNNPACK, and MediaPipe, providing developers with a comprehensive suite of resources for on-device ML development. The project is licensed under the Apache-2.0 License and adheres to a Code of Conduct to foster a welcoming and collaborative community.