chatgpt-retrieval-plugin
by
openai

Description: The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

View on GitHub ↗

Summary Information

Updated 36 minutes ago

Added to GitGenius on May 20th, 2023

Created on March 23rd, 2023

Open Issues & Pull Requests: 269 (+0)

Number of forks: 3,587

Total Stargazers: 21,190 (+0)

Total Subscribers: 312 (+0)

Issue Activity (beta)

Open issues: 130

New in 7 days: 1

Closed in 7 days: 0

Avg open age: 800 days

Stale 30+ days: 129

Stale 90+ days: 125

Recent activity

Opened in 7 days: 1

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

No label distribution available yet.

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 0.0 hours

Mean response time: 119.1 days

90th percentile: 541.5 days

Tracked items: 26

Most active contributors

jymmyjr - 4 events, 3 issues
caseyclements - 2 events, 1 issues
Coder-Krishna-189251 - 1 events, 1 issues
Daniyal67 - 1 events, 1 issues
Ein-Tim - 1 events, 1 issues

Related by overlapping contributors

Detailed Description

The ChatGPT Retrieval Plugin is a standalone retrieval backend written in Python that enables semantic search and retrieval of personal or organizational documents through natural language queries. It serves as a flexible solution for integrating document retrieval capabilities with ChatGPT custom GPTs, function calling via the chat completions and assistants APIs, or the deprecated ChatGPT plugins model. The plugin leverages OpenAI's embeddings model, specifically text-embedding-3-large with 256 dimension embeddings by default, to generate embeddings of document chunks which are then stored and queried using a vector database backend.

The repository is comprehensively organized into multiple functional directories. The datastore directory contains core logic for storing and querying document embeddings across various vector database providers. The docs directory provides setup and usage documentation for each supported vector database provider, webhooks, and dependency management. Examples directory offers configuration samples, authentication methods, and provider-specific implementations. The local_server directory contains a localhost testing implementation, while models houses the data structures for documents and metadata. Scripts are provided for processing and uploading documents from different data sources, and the server directory implements the main FastAPI server. Services handle utility functions like document chunking, metadata extraction, and PII detection. Tests include integration tests for various vector database providers, and the .well-known directory stores the plugin manifest and OpenAPI schema defining the plugin configuration and API specification.

The plugin supports an extensive array of vector database providers including Pinecone, Elasticsearch, MongoDB Atlas, Weaviate, Zilliz, Milvus, Qdrant, Redis, Llama Index, Chroma, Azure Cognitive Search, Azure CosmosDB Mongo vCore, Supabase, Postgres, and AnalyticDB, allowing developers to select their preferred backend. A FastAPI server exposes endpoints for upserting, querying, and deleting documents, with users able to refine search results through metadata filters by source, date, author, or other criteria. The plugin implements retrieval-augmented generation to enable models to write answers informed by relevant retrieved context.

For deployment, the plugin can be hosted on cloud platforms supporting Docker containers such as Fly.io, Heroku, Render, or Azure Container Apps. It supports continuous document updates through incoming webhooks to upsert and delete endpoints, with tools like Zapier or Make helping configure webhooks based on events or schedules. The plugin offers multiple authentication methods including API key authentication with Basic or Bearer tokens and OAuth support.

According to GitGenius activity tracking, the repository shows a median issue and pull request response latency of 0.0 hours across 26 tracked items, with a mean latency of 2857.3 hours. The most active contributors tracked include jymmyjr with 4 events, caseyclements with 2 events, and Coder-Krishna-189251 with 1 event. The repository shares overlapping contributors with github/gh-aw, meteor/meteor, and langchain-ai/langchain, indicating cross-project collaboration within the broader development ecosystem.

chatgpt-retrieval-plugin
by
openai

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

chatgpt-retrieval-plugin
by
openaiopenai/chatgpt-retrieval-plugin

Repository Details

chatgpt-retrieval-plugin by openai

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

chatgpt-retrieval-plugin by openaiopenai/chatgpt-retrieval-plugin

Repository Details

chatgpt-retrieval-plugin
by
openai

chatgpt-retrieval-plugin
by
openaiopenai/chatgpt-retrieval-plugin