pandas-ai
by
sinaptik-ai

Description: Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

View on GitHub ↗

Summary Information

Updated 1 hour ago

Added to GitGenius on March 29th, 2024

Created on April 22nd, 2023

Open Issues & Pull Requests: 19 (+0)

Number of forks: 2,335

Total Stargazers: 23,640 (+0)

Total Subscribers: 169 (+0)

Issue Activity (beta)

Open issues: 6

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 16 days

Stale 30+ days: 4

Stale 90+ days: 0

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

bug (263)
enhancement (137)
good first issue (21)
documentation (17)
duplicate (9)
test (6)
dev (5)
help wanted (4)

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 3.4 hours

Mean response time: 37.2 days

90th percentile: 91.4 days

Tracked items: 425

Most active contributors

gdcsinaptik - 296 events, 144 issues
gventuri - 115 events, 67 issues
ArslanSaleem - 27 events, 22 issues
SimranAnand1 - 22 events, 4 issues
Subhashchandra-P - 17 events, 2 issues

Related by overlapping contributors

Detailed Description

PandasAI is a Python library that enables natural language interaction with data sources including SQL databases, CSV files, and Parquet files. The library leverages large language models and retrieval-augmented generation to convert conversational queries into executable data analysis operations, making data exploration accessible to both technical and non-technical users.

The core functionality allows users to ask questions about their data using plain English rather than writing code. The library supports asking questions about single or multiple DataFrames simultaneously, enabling comparative analysis across related datasets. Beyond simple queries, PandasAI can generate visualizations such as charts and histograms in response to user requests, automating the visualization creation process. The library also provides a Docker sandbox environment for secure code execution, isolating operations to mitigate security risks from potentially malicious inputs.

PandasAI requires Python 3.8 or higher up to version 3.11 and can be installed via pip or poetry package managers. The project maintains comprehensive documentation at docs.pandas-ai.com and provides example notebooks demonstrating various use cases. The library is distributed under the MIT expat license, with the exception of the pandasai/ee directory which contains enterprise edition code under separate licensing terms. The project also offers managed cloud and self-hosted enterprise offerings for organizations requiring additional capabilities.

According to GitGenius activity tracking, the repository has demonstrated steady growth with 23,635 stargazers as of the most recent check. The project maintains active development with 425 tracked issues and pull requests showing a median response latency of 3.4 hours, indicating responsive maintainer engagement. The most frequently reported issues relate to bugs (143 instances), followed by enhancement requests (44) and documentation improvements (10). The core development team includes gdcsinaptik with 296 tracked events, gventuri with 115 events, and ArslanSaleem with 27 events, representing the primary contributors managing the project's evolution.

The repository is classified across multiple domains reflecting its broad applicability: it functions as a pandas extension and dataframe enhancement tool while simultaneously serving as an AI-driven analytics automation platform. It bridges data science and machine learning integration, enabling predictive modeling workflows through conversational interfaces. The project overlaps with contributors from related repositories including avaiga/taipy, modin-project/modin, and avelino/awesome-go, suggesting connections to broader data processing and development tool ecosystems.

The library addresses a specific gap in data analysis workflows by reducing friction for users who need to perform exploratory data analysis without deep programming expertise. For technical users, it accelerates common data analysis tasks by eliminating boilerplate code writing. The combination of LLM-powered query interpretation with RAG capabilities allows the system to understand context about data structure and content, enabling more sophisticated analysis than simple keyword matching would permit.

pandas-ai
by
sinaptik-ai

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

pandas-ai
by
sinaptik-aisinaptik-ai/pandas-ai

Repository Details

pandas-ai by sinaptik-ai

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

pandas-ai by sinaptik-aisinaptik-ai/pandas-ai

Repository Details

pandas-ai
by
sinaptik-ai

pandas-ai
by
sinaptik-aisinaptik-ai/pandas-ai