gemma_pytorch
by
google

Description: The official PyTorch implementation of Google's Gemma models

View on GitHub ↗

Summary Information

Updated 9 minutes ago

Added to GitGenius on February 21st, 2024

Created on February 20th, 2024

Open Issues & Pull Requests: 31 (+0)

Number of forks: 599

Total Stargazers: 5,707 (+0)

Total Subscribers: 41 (+0)

Issue Activity (beta)

Open issues: 21

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 241 days

Stale 30+ days: 21

Stale 90+ days: 13

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

type:support (22)
stat:awaiting response (10)
bug (8)
enhancement (5)
duplicate (2)
stat:awaiting internal (1)

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 41.7 days

Mean response time: 38.5 days

90th percentile: 62.2 days

Tracked items: 60

Most active contributors

Gopi-Uppari - 71 events, 20 issues
tilakrayal - 53 events, 36 issues
Balakrishna-Chennamsetti - 11 events, 9 issues
pengchongjin - 8 events, 5 issues
sonali-kumari1 - 6 events, 6 issues

Related by overlapping contributors

Detailed Description

The google/gemma_pytorch repository is the official PyTorch implementation of Google's Gemma family of large language models. Gemma represents a collection of lightweight, state-of-the-art open models derived from research and technology developed for Google's Gemini models. The repository provides implementations for both text-only and multimodal decoder-only large language models with open weights, supporting pre-trained and instruction-tuned variants across multiple model sizes.

The repository supports a range of Gemma model versions. Gemma 3, the most recent addition as of March 2025, includes text-only 1B variants and multimodal options in 4B, 12B, and 27B_v3 sizes. Gemma 2, released in June 2024, offers text-only models in 2B-v2, 9B, and 27B configurations. The original Gemma line provides 2B and 7B text-only variants. Additionally, the repository includes support for CodeGemma, a specialized variant released in April 2024. Model checkpoints are available through both Kaggle and Hugging Face Hub, with programmatic downloading supported through the huggingface_hub library.

A key feature of this implementation is its flexible hardware support. The repository enables inference on CPU, GPU, and TPU through dual implementation approaches using both standard PyTorch and PyTorch/XLA. Docker-based setup is provided for different hardware configurations, with separate Dockerfiles for CPU/TPU and GPU variants when using PyTorch/XLA. This multi-platform approach allows users to deploy Gemma models across diverse computational environments without requiring separate codebases.

The repository includes comprehensive documentation and accessibility features. Users can try Gemma models for free on Google Colab through guided steps at the official documentation site. The tokenizer includes 99 reserved unused tokens in the format `<unused[0-97]>` with token IDs ranging from 7 to 104, designed to facilitate more efficient training and fine-tuning workflows.

Community engagement metrics tracked by GitGenius reveal active maintenance and support operations. The repository shows a median issue and pull request response latency of 1001.7 hours with a mean of 924.8 hours across 60 tracked items. Support-related issues represent the most active category with 22 labeled items, followed by items awaiting response with 10 instances and bug reports with 8 instances. The most active contributors include Gopi-Uppari with 71 recorded events, tilakrayal with 53 events, and Balakrishna-Chennamsetti with 11 events, indicating sustained development and maintenance efforts.

The repository maintains connections with other major machine learning projects through overlapping contributors, linking to ggml-org/llama.cpp, pytorch/pytorch, and huggingface/transformers. This integration within the broader open-source machine learning ecosystem positions Gemma as part of a larger landscape of language model implementations and frameworks. The repository is classified across multiple domains including neural networks, language modeling, transformer architecture, generative models, sequence generation, and NLP research, reflecting its comprehensive scope as a production-ready implementation of advanced language models.

gemma_pytorch
by
google

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

gemma_pytorch
by
googlegoogle/gemma_pytorch

Repository Details

gemma_pytorch by google

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

gemma_pytorch by googlegoogle/gemma_pytorch

Repository Details

gemma_pytorch
by
google

gemma_pytorch
by
googlegoogle/gemma_pytorch