miopen
by
rocm

Description: [DEPRECATED] Moved to ROCm/rocm-libraries repo

View rocm/miopen on GitHub ↗

Summary Information

Updated 1 hour ago
Added to GitGenius on August 31st, 2025
Created on June 27th, 2017
Open Issues/Pull Requests: 6 (+0)
Number of forks: 272
Total Stargazers: 1,188 (+0)
Total Subscribers: 85 (+0)
Detailed Description

MIOpen is AMD’s high-performance, highly tuned library of deep learning primitives, focused on accelerating neural network operations on AMD GPUs. It is a key component of the ROCm (Radeon Open Compute) ecosystem, providing optimized implementations of common deep learning layers and functions. In effect, it aims to be the cuDNN equivalent for AMD hardware, enabling developers to achieve optimal performance when training and deploying AI models. The repository contains the source code, documentation, and build instructions for MIOpen.

The core functionality of MIOpen revolves around highly optimized kernels for a wide range of deep learning operations. These include convolutions (forward and backward passes, data layouts such as NHWC and NCHW, and multiple convolution algorithms), pooling (max, average), activation functions (ReLU, Sigmoid, Tanh, and others), normalization layers (Batch Normalization, Layer Normalization), recurrent neural network operations (LSTM, GRU), and matrix multiplication (GEMM). Crucially, MIOpen does not offer just a single implementation for each operation; it selects among candidate algorithms based on input sizes, data types, and hardware capabilities to choose the fastest available kernel. This dynamic algorithm selection is a significant performance booster.

MIOpen is designed to be integrated with popular deep learning frameworks such as TensorFlow, PyTorch, and ONNX Runtime. It achieves this through a plugin mechanism: a framework queries MIOpen to determine whether it can accelerate a particular operation and, if so, offloads the computation to the GPU using MIOpen’s optimized kernels. This integration lets developers leverage MIOpen’s performance benefits without rewriting their models or training loops. The repository includes examples and instructions for integrating with these frameworks, simplifying the process for users. The goal is seamless interoperability, allowing users to switch between CPU and GPU execution with minimal code changes.
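The query-then-offload pattern described above can be sketched as follows. The registry, `backend_supports`, and `dispatch` names are hypothetical; this shows the general mechanism, not MIOpen's real integration interface.

```python
# Illustrative sketch of a framework querying an accelerator backend and
# offloading only the operations the backend claims to support. All names
# here are hypothetical, not MIOpen's actual integration API.
ACCELERATED_OPS = {"conv2d", "relu", "batch_norm"}  # ops the backend claims

def backend_supports(op_name: str) -> bool:
    """The framework asks: can the backend accelerate this operation?"""
    return op_name in ACCELERATED_OPS

def gpu_relu(xs):
    # Pretend-offloaded implementation (would run on the GPU in practice).
    return [max(0.0, v) for v in xs]

def cpu_relu(xs):
    # Reference CPU fallback; numerically identical to the offloaded path.
    return [v if v > 0.0 else 0.0 for v in xs]

def dispatch(op_name, xs):
    """Offload when supported, otherwise fall back to the CPU path."""
    if op_name == "relu":
        return gpu_relu(xs) if backend_supports("relu") else cpu_relu(xs)
    raise NotImplementedError(op_name)
```

Because both paths produce the same result, the framework can switch between CPU and accelerated execution without any change to user code, which is exactly the interoperability goal described above.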

The repository itself is structured to facilitate development and contribution. It includes extensive unit tests to ensure the correctness and stability of the library. Build scripts are provided for various platforms and configurations, allowing users to compile MIOpen for their specific AMD GPU and ROCm version. Documentation is available and continually evolving, covering API usage, performance tuning, and integration details. The project actively encourages community contributions, with guidelines for submitting bug reports, feature requests, and code patches.

Beyond the core functionality, MIOpen continues to evolve with new features and optimizations. Recent developments include improved support for INT8 quantization, which reduces memory usage and accelerates inference, and performance enhancements for transformer models, a crucial architecture in modern NLP. The project also tracks newer AMD GPU architectures and ROCm releases, ensuring that users can benefit from the latest hardware advancements. The repository’s commit history and issue tracker provide insight into ongoing development efforts and the project’s future direction.
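To make the INT8 point concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization, the general technique behind the memory and inference savings mentioned above. The function names and the simple list representation are illustrative, not MIOpen code.

```python
# Minimal sketch of symmetric INT8 quantization (illustrative, not MIOpen's
# implementation): floats are mapped to int8 via a per-tensor scale, the
# int8 values are computed on, and results are dequantized afterwards.
def quantize_int8(values):
    """Map floats to integers in [-127, 127] with a symmetric scale."""
    scale = (max(abs(v) for v in values) / 127.0) or 1.0  # avoid div by zero
    q = [round(v / scale) for v in values]
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate float values from the quantized integers."""
    return [v * scale for v in q]
```

Each int8 value occupies a quarter of the space of a float32, which is where the memory reduction comes from; the price is a small rounding error bounded by half a quantization step.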

