cassandra
by
apache

Description: Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure without compromising performance.

View apache/cassandra on GitHub ↗

Summary Information

Updated 16 minutes ago

Added to GitGenius on March 25th, 2026

Created on May 21st, 2009

Open Issues & Pull Requests: 436 (+0)

Number of forks: 3,860

Total Stargazers: 9,738 (-1)

Total Subscribers: 427 (-1)

Issue Activity (beta)

Open issues: 0

New in 7 days: 0

Closed in 7 days: 0

Avg open age: N/A days

Stale 30+ days: 0

Stale 90+ days: 0

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

No label distribution available yet.

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Detailed Description

Apache Cassandra is a highly scalable, distributed NoSQL database designed to handle massive amounts of data across multiple commodity servers. This repository, hosted by the Apache Software Foundation, contains the source code and related resources for the Cassandra database. Its primary purpose is to provide a robust and reliable data storage solution for applications that require high availability, fault tolerance, and the ability to scale horizontally.

The core functionality of Cassandra revolves around its partitioned row store architecture. Data is organized into tables, similar to relational databases, with each row identified by a primary key. The key feature of Cassandra is its ability to distribute this data across a cluster of machines in a transparent manner. This partitioning allows for horizontal scalability, meaning that as the data volume grows, more machines can be added to the cluster to handle the increased load. Cassandra automatically re-partitions the data as nodes are added or removed, ensuring continuous availability and performance.

Cassandra's data model is based on a row store, which is analogous to the structure of relational databases, organizing data by rows and columns. It utilizes the Cassandra Query Language (CQL), which is closely related to SQL, making it relatively easy for developers familiar with SQL to learn and use. However, unlike traditional SQL databases, Cassandra does not support joins or subqueries, focusing instead on efficient data retrieval and storage across a distributed environment.

The repository provides all the necessary components for building and deploying a Cassandra cluster. It includes the source code, build scripts, and documentation. The "Getting Started" guide within the README provides a basic introduction to setting up and running a single-node Cassandra cluster. It walks users through the process of unpacking the archive, starting the server, and interacting with the database using the `cqlsh` command-line client. This client allows users to execute CQL commands for creating keyspaces, tables, inserting data, and querying data.

The repository also provides links to various resources for users and developers. These include links to the official Apache Cassandra website, where users can find comprehensive documentation, downloads, and community resources. The README also provides links to the Cassandra Jira for reporting issues, as well as links to the ASF Slack channel, mailing lists, and social media platforms like Bluesky, LinkedIn, and YouTube, fostering a strong community for support and collaboration.

The requirements for running Cassandra are relatively straightforward, primarily involving a supported version of Java. The repository's build.xml file specifies the supported Java versions. Additionally, the `cqlsh` client requires Python.

In essence, the Apache Cassandra repository offers a powerful and flexible solution for managing large datasets. Its distributed architecture, fault tolerance, and scalability make it suitable for a wide range of applications, including those requiring high availability, such as e-commerce platforms, social networks, and IoT applications. The repository serves as the central hub for the development, distribution, and community support of this critical database technology.

cassandra
by
apache

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week