seaweedfs
by
seaweedfs

Description: SeaweedFS is a distributed storage system for object storage (S3), file systems, and Iceberg tables, designed to handle billions of files with O(1) disk access and effortless horizontal scaling.

View seaweedfs/seaweedfs on GitHub ↗

Summary Information

Updated 28 minutes ago

Added to GitGenius on October 30th, 2025

Created on July 14th, 2014

Open Issues & Pull Requests: 735 (+0)

Number of forks: 2,845

Total Stargazers: 32,736 (+0)

Total Subscribers: 520 (+0)

Issue Activity (beta)

Open issues: 577

New in 7 days: 27

Closed in 7 days: 30

Avg open age: 592 days

Stale 30+ days: 552

Stale 90+ days: 505

Recent activity

Opened in 7 days: 24

Closed in 7 days: 26

Comments in 7 days: 43

Events in 7 days: 88

Top labels

enhancement (11)
help wanted (9)
NeedMoreInfo (2)
duplicate (2)
wontfix (2)
Knowledge (1)
bug (1)
java (1)

Most active issues this week

#9816 Volume grow stuck / doesn't complete - 10 events / 6 comments
#9813 After upgrading Filer from version 4.01 to 4.31, starting the Filer service results in memory exhaustion. - 9 events / 4 comments
#8110 seaweedfs mini - enabling webhook notifications ? - 8 events / 3 comments
#9739 fs.log.purge followed by volume.vacuum does not release disk space for /topic/.system/log metadata events - 6 events / 4 comments
#9807 Versioning in S3 buckets deletes data on it's own and is not capable of deleting data at the same time - 6 events / 5 comments

Explore full issue details

Detailed Description

SeaweedFS is a highly scalable and distributed object store, file system, and key-value store designed to efficiently store and serve billions of files. Written in Go, it prioritizes simplicity, high performance, and massive scalability, making it an ideal solution for handling both numerous small files and very large files with ease. Its core design philosophy revolves around a master-volume server architecture, where the master server manages metadata like volume allocation and garbage collection, while the volume servers are responsible for storing the actual file data. This separation ensures that the master server is never a bottleneck for data access, allowing for extremely high throughput and low latency.

At its heart, SeaweedFS functions as a distributed object store, providing a simple HTTP API for file operations. When a file is uploaded, the master server assigns a unique file ID and a volume server location, and the client directly uploads the data to that volume server. This direct client-to-volume server communication bypasses the master for data transfer, significantly boosting performance. It efficiently handles small files by packing them into larger volumes, reducing inode overhead, and large files by streaming them directly. Furthermore, SeaweedFS offers an S3-compatible API, allowing existing applications and tools designed for Amazon S3 to seamlessly integrate and operate with SeaweedFS, providing a cost-effective and self-hosted alternative.

Beyond its object store capabilities, SeaweedFS extends its functionality through the Filer component, transforming it into a full-fledged distributed file system. The Filer provides traditional file system interfaces such such as POSIX, WebDAV, NFS, and FUSE, enabling users to mount SeaweedFS as a local file system. This allows standard applications to interact with SeaweedFS as if it were a local disk, supporting directories, symbolic links, and access control lists (ACLs). The Filer intelligently maps traditional file system paths to the underlying object store, offering a familiar experience while leveraging the scalability and performance of the SeaweedFS backend.

SeaweedFS incorporates robust features for data durability and availability. It supports configurable data replication across multiple volume servers, ensuring data redundancy in case of hardware failures. For even greater storage efficiency and fault tolerance, it offers erasure coding, which can protect data with less storage overhead compared to full replication. This makes it suitable for a wide range of applications, from content delivery networks (CDNs) and big data analytics to general-purpose cloud storage and backup solutions. Its operational simplicity, low resource footprint, and lack of complex dependencies like HDFS or Java make it remarkably easy to deploy and manage, contributing to its overall cost-effectiveness.

In summary, SeaweedFS stands out as a versatile and powerful distributed storage system. It combines the simplicity and scalability of an object store with the familiarity of a traditional file system, all while delivering high performance and robust data protection. Its design choices, such as the master-volume architecture, direct data access, and the Filer component, address common challenges in distributed storage, making it an excellent choice for organizations needing to manage vast amounts of data efficiently and reliably.

seaweedfs
by
seaweedfs

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week