milvus
by
milvus-io

Description: Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

View on GitHub ↗

Summary Information

Updated 1 hour ago

Added to GitGenius on March 7th, 2024

Created on September 16th, 2019

Open Issues & Pull Requests: 1,003 (+2)

Number of forks: 4,113

Total Stargazers: 45,169 (+6)

Total Subscribers: 333 (+0)

Issue Activity (beta)

Open issues: 733

New in 7 days: 28

Closed in 7 days: 45

Avg open age: 372 days

Stale 30+ days: 422

Stale 90+ days: 366

Recent activity

Opened in 7 days: 21

Closed in 7 days: 18

Comments in 7 days: 34

Events in 7 days: 243

Top labels

kind/bug (9,442)
triage/accepted (6,062)
stale (4,657)
kind/enhancement (2,635)
needs-triage (1,614)
priority/critical-urgent (1,157)
kind/feature (1,021)
triage/needs-information (752)

Most active issues this week

#47404 [Enhancement]: Add Trace for WAL Append operation - 43 events / 0 comments
#51078 [Bug]: Milvus 2.6.18 + Woodpecker (remote OSS): streaming recovery failures with reader temp info not found and pchannels stuck in ASSIGNING - 20 events / 4 comments
#51069 [Bug]: [Nightly] array_contains on arr_float returns 0 rows while ground truth expects row 0 - 18 events / 2 comments
#49706 [Bug]: Snapshot restore returns extra rows after snapshot point under etcd chaos - 13 events / 5 comments
#51033 [Bug]: Query output fails for ARRAY<STRUCT> when struct-array vector subfield uses DISKANN - 11 events / 0 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 1541.0 days

Mean response time: 1073.4 days

90th percentile: 1677.2 days

Tracked items: 675

Most active contributors

yanliang567 - 15,624 events, 4,116 issues
xiaofan-luan - 4,279 events, 2,038 issues
zhuwenxing - 3,369 events, 581 issues
chyezh - 2,253 events, 539 issues
wangting0128 - 2,198 events, 365 issues

Related by overlapping contributors

Detailed Description

Milvus is a high-performance vector database written in Go and C++ that enables scalable vector similarity search for AI applications. The system is designed to efficiently organize and search vast amounts of unstructured data such as text, images, and multi-modal information by storing and indexing vector embeddings alongside scalar data types like integers, strings, and JSON objects.

The architecture separates compute and storage layers, allowing horizontal scaling across distributed Kubernetes clusters. This design enables Milvus to handle tens of thousands of concurrent search queries on billions of vectors while maintaining real-time data freshness through streaming updates. The system offers both a fully distributed deployment mode for production environments and a Standalone mode for single-machine deployments, with Milvus Lite providing a lightweight Python-based option for quick prototyping via pip installation.

Milvus implements hardware acceleration for both CPU and GPU processing to achieve high vector search performance. The system supports multiple vector index types optimized for different scenarios, including HNSW, IVF, FLAT brute-force search, SCANN, and DiskANN, with quantization-based variations and memory-mapped file support. Beyond dense vector search, Milvus natively supports sparse vectors for full-text search using BM25 and learned sparse embeddings like SPLADE and BGE-M3, enabling hybrid search capabilities that combine semantic and full-text search within the same collection.

The repository demonstrates significant community engagement with 6521 tracked issues and pull requests. The median response latency for issues and PRs is 0.0 hours, while the mean latency is 2697.1 hours, indicating rapid initial triage followed by longer resolution timelines for complex issues. The most active issue labels are kind/bug with 4465 items, triage/accepted with 3417 items, and stale with 2375 items. Top contributors include yanliang567 with 15592 events, xiaofan-luan with 4274 events, and zhuwenxing with 3362 events, reflecting sustained development activity.

Milvus provides flexible multi-tenancy strategies with isolation at the database, collection, partition, or partition key level, allowing a single cluster to serve hundreds to millions of tenants. The system implements hot and cold storage mechanisms to optimize costs by keeping frequently accessed data in memory or on SSDs while storing less-accessed data on slower, more cost-effective storage. Enterprise security features include mandatory user authentication, TLS encryption for network communications, and Role-Based Access Control for fine-grained permission management.

The project is distributed under the Apache 2.0 license and operates under the LF AI & Data Foundation with Zilliz as its major contributor. A fully managed cloud service called Zilliz Cloud offers Serverless, Dedicated, and Bring-Your-Own-Cloud deployment options. The repository overlaps with contributors from microsoft/vscode, microsoft/typescript, and rust-lang/rust, indicating cross-project collaboration. Milvus powers production applications including text and image search, Retrieval-Augmented Generation systems, and recommendation engines for both startups and enterprises.

milvus
by
milvus-io

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

milvus
by
milvus-iomilvus-io/milvus

Repository Details

milvus by milvus-io

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

milvus by milvus-iomilvus-io/milvus

Repository Details

milvus
by
milvus-io

milvus
by
milvus-iomilvus-io/milvus