LMOps
by
microsoft

Description: General technology for enabling AI capabilities w/ LLMs and MLLMs

View on GitHub ↗

Summary Information

Updated 18 minutes ago

Added to GitGenius on July 29th, 2024

Created on December 13th, 2022

Open Issues & Pull Requests: 117 (+0)

Number of forks: 373

Total Stargazers: 4,431 (+0)

Total Subscribers: 54 (+0)

Issue Activity (beta)

Open issues: 67

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 487 days

Stale 30+ days: 67

Stale 90+ days: 61

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

No label distribution available yet.

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 11.1 hours

Mean response time: 31.4 days

90th percentile: 51.0 days

Tracked items: 98

Most active contributors

t1101675 - 80 events, 46 issues
zhouchang123 - 49 events, 14 issues
cdxeve - 36 events, 15 issues
donglixp - 30 events, 21 issues
YTianZHU - 18 events, 9 issues

Related by overlapping contributors

Detailed Description

LMOps is a Microsoft research initiative focused on fundamental research and technology for building AI products with foundation models, particularly large language models and multimodal generative AI systems. The repository serves as a central hub for multiple research projects and implementations that advance the practical capabilities and efficiency of LLMs across various dimensions.

The project is organized around several core research areas. The Prompt Intelligence section encompasses technologies for optimizing how users interact with language models. Promptist uses reinforcement learning to automatically optimize user inputs into model-preferred prompts, effectively training a language model to serve as a prompt interface. Structured Prompting enables efficient consumption of long-sequence prompts, allowing systems to scale in-context learning to thousands of examples and handle scenarios like retrieval-augmented generation with many retrieved documents. X-Prompt extends prompting capabilities beyond natural language, providing an extensible interface for fine-grained specifications through context-guided imaginary word learning.

The LLMA section addresses inference acceleration. The Lossless Acceleration of LLMs approach achieves two to three times speed-up by identifying and copying text spans from reference documents into LLM inputs, then verifying these spans, without requiring additional models. This technique applies to important scenarios including retrieval-augmented generation and multi-turn conversations.

Beyond optimization and acceleration, LMOps includes research on fundamental understanding of how LLMs operate. Work on in-context learning reveals that GPT models produce meta gradients through forward computation that are applied via attention mechanisms, establishing a dual view between in-context learning and explicit parameter finetuning. The repository also covers LLM alignment, domain customization, and context extension through length-extrapolatable transformers.

The repository maintains active development with significant community engagement. GitGenius tracking shows a median issue and pull request response latency of 11.1 hours across 98 items, with mean latency of 754.3 hours reflecting occasional complex discussions. The most active contributors tracked are t1101675 with 80 events, zhouchang123 with 49 events, and cdxeve with 36 events. The project shares overlapping contributors with major ecosystem repositories including huggingface/transformers, hiyouga/llamafactory, and huggingface/datasets, indicating integration with the broader open-source machine learning community.

The repository is classified across 25 distinct categories spanning natural language processing, transformer models, text generation, machine learning operations, inference optimization, and related areas. Recent paper releases from 2022 and 2023 demonstrate ongoing research output, with publications appearing at major conferences including EMNLP 2023. The project is written primarily in Python and maintains connections to related Microsoft initiatives including the unilm and torchscale repositories. The initiative actively recruits researchers and interns interested in foundation models, AGI, NLP, machine translation, speech, document AI, and multimodal AI.

LMOps
by
microsoft

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

LMOps
by
microsoftmicrosoft/LMOps

Repository Details

LMOps by microsoft

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

LMOps by microsoftmicrosoft/LMOps

Repository Details

LMOps
by
microsoft

LMOps
by
microsoftmicrosoft/LMOps