Description: xLLM 1.0, smart crawling, knowledge graph discovery.
The GitHub repository `vincentgranville/large-language-models` by Vincent Granville provides a comprehensive, interactive exploration of Large Language Models (LLMs), focusing on their capabilities and limitations across a large collection of models. It is, in effect, a continuously updated knowledge base and interactive tool for navigating the rapidly evolving LLM landscape. At its core is a web application built with Python (Flask) and JavaScript, offering a user-friendly interface for querying and comparing different LLMs.
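The repository's actual code is not reproduced here, but a query endpoint of the kind described (Flask serving model records to a JavaScript front end) might look like the following minimal sketch. The route, field names, and sample records are assumptions for illustration, not the project's real API:

```python
# Minimal sketch of a Flask query/compare endpoint.
# The route, field names, and sample records are assumptions,
# not the repository's actual API.
from flask import Flask, jsonify, request

app = Flask(__name__)

# Toy in-memory store standing in for the project's database.
MODELS = [
    {"name": "model-a", "category": "reasoning", "score": 0.81},
    {"name": "model-b", "category": "coding", "score": 0.74},
]

@app.route("/models")
def list_models():
    """Return all models, optionally filtered by ?category=<name>."""
    category = request.args.get("category")
    rows = [m for m in MODELS if category in (None, m["category"])]
    return jsonify(rows)
```

A front end would then call `GET /models?category=coding` and render the returned JSON as a table or chart.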
The repository began with a smaller collection of models but has grown substantially through automated scraping and data collection. Its primary goal is to track and document the performance of various LLMs (GPT-3, GPT-4, Claude, Gemini, Llama 2, and many others) across a wide range of tasks. These tasks are grouped into several key areas:
* **Reasoning:** Evaluating the models' ability to solve logical puzzles, mathematical problems, and common-sense reasoning challenges, using benchmarks such as GSM8K and BIG-Bench Hard that are designed to test complex cognitive abilities.
* **Coding:** Assessing proficiency at generating code in various programming languages (Python, JavaScript, etc.) and at debugging and explaining code.
* **Creative Writing:** Measuring the capacity to generate different creative text formats, such as poems, stories, and scripts.
* **Question Answering:** Testing the ability to answer questions accurately based on provided context or general knowledge.
* **Summarization:** Evaluating the ability to condense lengthy texts into concise summaries.
* **Translation:** Assessing the quality of translations between different languages.
The data is presented visually, with charts and graphs illustrating performance differences. The interactive web app lets users filter models by these categories, select specific tasks, and compare the results. Crucially, the data is updated continuously to reflect the latest model releases and performance improvements. The repository also includes a documentation section detailing the methodology, data sources, and the metrics used to evaluate the models.
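The filter-and-compare operation described above can be sketched in plain Python. The model names, tasks, and scores below are made up for illustration; they are not benchmark data from the repository:

```python
# Sketch of filter-and-compare logic over per-task benchmark results.
# Model names, tasks, and scores are illustrative, not real data.
from collections import defaultdict

RESULTS = [
    {"model": "model-a", "task": "GSM8K", "category": "reasoning", "score": 0.82},
    {"model": "model-b", "task": "GSM8K", "category": "reasoning", "score": 0.76},
    {"model": "model-a", "task": "HumanEval", "category": "coding", "score": 0.64},
    {"model": "model-b", "task": "HumanEval", "category": "coding", "score": 0.71},
]

def compare(results, category):
    """Pivot results for one category into {task: {model: score}}."""
    table = defaultdict(dict)
    for row in results:
        if row["category"] == category:
            table[row["task"]][row["model"]] = row["score"]
    return dict(table)
```

For example, `compare(RESULTS, "reasoning")` yields a per-task table that maps each model to its score, which is the shape a charting front end would consume.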
Beyond the core interactive application, the repository contains scripts for data collection, analysis, and visualization, so users can reproduce the results, contribute their own data, or extend the project's capabilities. Much of the project's value comes from its automated data collection pipeline, which continuously monitors model releases and updates the database. It is a useful resource for researchers, developers, and anyone interested in the current state of LLM technology and its diverse capabilities, and its ongoing maintenance and expansion reflect a commitment to keeping pace with this rapidly changing field.
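The automated pipeline described above can be sketched as a simple polling step that adds only unseen releases to the database. The `fetch_releases` callable and the record format are hypothetical stand-ins for the repository's actual scraping code:

```python
# Sketch of one polling step in an automated collection pipeline:
# fetch the current list of releases and append only unseen entries.
# fetch_releases() is a hypothetical stand-in for real scraping code.

def update_database(db, fetch_releases):
    """Add releases not already in db (keyed by name); return the new ones."""
    seen = {entry["name"] for entry in db}
    new = [r for r in fetch_releases() if r["name"] not in seen]
    db.extend(new)
    return new  # newly added entries, useful for logging

# Example usage with a stubbed fetcher:
def stub_fetch():
    return [{"name": "model-a"}, {"name": "model-c"}]

db = [{"name": "model-a"}]
added = update_database(db, stub_fetch)
```

A real pipeline would wrap this step in a scheduler (e.g. a cron job) and replace the stub with HTTP requests to the monitored sources.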