pageindex-mcp
by
VectifyAI

Description: MCP server for PageIndex. PageIndex is a vectorless reasoning-based RAG system which uses multi-step reasoning and tree search to retrieve information like a...

View on GitHub ↗

Summary Information

Updated 2 hours ago

Added to GitGenius on January 31st, 2026

Created on August 25th, 2025

Open Issues & Pull Requests: 1 (+0)

Number of forks: 44

Total Stargazers: 373 (+0)

Total Subscribers: 1 (+0)

Issue Activity (beta)

Open issues: 1

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 5 days

Stale 30+ days: 0

Stale 90+ days: 0

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 1

Events in 7 days: 3

Top labels

No label distribution available yet.

Most active issues this week

#5 failed to initialize MCP client - 3 events / 1 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 5.1 days

Mean response time: 5.1 days

90th percentile: 5.1 days

Tracked items: 1

Most active contributors

ahonn - 1 events, 1 issues

Related by overlapping contributors

No overlapping-contributor repos identified yet.

Detailed Description

PageIndex MCP is a Model Context Protocol server that integrates PageIndex, a vectorless reasoning-based retrieval-augmented generation system, directly into LLM platforms and agent frameworks. Written in TypeScript, the repository provides the infrastructure for connecting PageIndex's document indexing and retrieval capabilities to LLMs like Claude, Cursor, and other MCP-compatible clients without requiring vector databases or traditional similarity-based search.

The core innovation of PageIndex is its approach to document retrieval. Rather than converting documents into vector embeddings and performing similarity searches, PageIndex represents documents as hierarchical tree structures and uses multi-step reasoning and tree search to locate relevant information. This approach mirrors how human experts navigate documents, such as using a book's index to find information. The system eliminates several traditional RAG pain points: it requires no vector database infrastructure, performs no document chunking that loses context, and retrieves all relevant passages automatically rather than relying on top-K selection methods.

PageIndex MCP exposes this reasoning-based index directly to LLMs through the MCP protocol, enabling them to reason over document structure and retrieve information without external vector databases. The primary use case addresses a common limitation users face when working with long documents: context limit errors when trying to chat with lengthy PDFs. By adding files to PageIndex, users can seamlessly interact with long documents across any MCP-compatible agent or LLM platform.

The repository supports multiple deployment and authentication options. For developers, PageIndex MCP integrates with major AI frameworks including the Claude Agent SDK, Vercel AI SDK, OpenAI Agents SDK, and LangChain. Authentication uses a simple API key system without OAuth complexity, with keys generated through the PageIndex Dashboard. For Claude Desktop users, installation is streamlined through a downloadable .mcpb file that handles OAuth authentication automatically. The system also supports running as a local MCP server for users who need to upload local PDF files, requiring only Node.js version 18.0.0 or higher.

The service offers a free tier with 1000 pages of indexing and unlimited conversations, making it accessible for initial exploration. PageIndex also provides a fully hosted alternative called PageIndex Chat, which offers a dedicated document analysis interface using the same reasoning-based workflow. The repository is licensed under the MIT open source license, making it freely available for both commercial and personal use.

According to GitGenius classification data, the repository spans multiple technical domains including page indexing, real-time processing, scalability, multi-tenant architecture, search services, content processing, vector search alternatives, semantic search, information retrieval, and data indexing. This breadth reflects PageIndex MCP's position as a comprehensive solution for modern document retrieval that challenges conventional vector-based approaches while maintaining compatibility with existing LLM ecosystems.

pageindex-mcp
by
VectifyAI

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

pageindex-mcp
by
VectifyAIVectifyAI/pageindex-mcp

Repository Details

pageindex-mcp by VectifyAI

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

pageindex-mcp by VectifyAIVectifyAI/pageindex-mcp

Repository Details

pageindex-mcp
by
VectifyAI

pageindex-mcp
by
VectifyAIVectifyAI/pageindex-mcp