lsdefine/GenericAgent

Description: Self-evolving agent: grows skill tree from 3.3K-line seed, achieving full system control with 6x less token consumption

View on GitHub ↗Jump to charts ↓

Summary Information

Updated 20 minutes ago

Added to GitGenius on April 25th, 2026

Created on January 16th, 2026

Open Issues & Pull Requests: 168 (+1)

Number of forks: 1,578

Total Stargazers: 13,577 (+0)

Total Subscribers: 36 (+0)

Issue Activity (beta)

Open issues: 107

New in 7 days: 3

Closed in 7 days: 1

Avg open age: 6 days

Stale 30+ days: 92

Stale 90+ days: 8

Recent activity

Opened in 7 days: 3

Closed in 7 days: 1

Comments in 7 days: 4

Events in 7 days: 6

Top labels

No label distribution available yet.

Most active issues this week

#665 Race condition: `user_prompt_{int(time.time())}.md` temp file collides under concurrent `--func` / `run_subagents` - 3 events / 1 comments
#683 [Bug][桌面] 切换到处理中会话时不会滚到最新输出 - 2 events / 2 comments
#680 st.text_area height=1 crashes on newer Streamlit (min 68px required) - 1 events / 1 comments
#692 有微信交流群吗？请入群学习，交流。 - 1 events / 1 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 4.4 hours

Mean response time: 3.1 days

90th percentile: 6.9 days

Tracked items: 140

Most active contributors

lsdefine - 128 events, 75 issues
desmonna - 6 events, 5 issues
szdtzpj - 6 events, 6 issues
benemorphy - 5 events, 2 issues
Solaris-star - 4 events, 3 issues

Related by overlapping contributors

Detailed Description

GenericAgent is a minimal, self-evolving autonomous agent framework built on approximately 3,000 lines of core Python code. The repository demonstrates a novel approach to agent design by starting with a lean seed codebase and allowing the system to grow its capabilities organically through task execution. Rather than preloading extensive skill libraries, GenericAgent automatically crystallizes execution paths into reusable skills that accumulate over time, forming a personal skill tree that expands with each new task the agent solves.

The framework grants large language models system-level control over local computers through nine atomic tools and a roughly 100-line agent loop. These tools provide comprehensive system access including browser automation, terminal commands, filesystem operations, keyboard and mouse input, screen vision capabilities, and mobile device control via ADB. A distinctive feature is the use of TMWebdriver, which injects directly into real browser instances while preserving login sessions, enabling the agent to handle real-world challenges like CAPTCHA verification without relying on headless browser workarounds.

GenericAgent achieves significant token efficiency, consuming less than 30,000 tokens in context windows compared to other agents that require 200,000 to 1,000,000 tokens. This efficiency translates to reduced hallucinations, higher task success rates, and substantially lower operational costs. The framework supports multiple major language models including Claude, Gemini, Kimi, and MiniMax, with cross-platform compatibility across Windows, macOS, and Linux.

The repository's self-bootstrap proof is notable: the entire codebase, from initial Git setup through all commit messages, was completed autonomously by GenericAgent itself without human terminal interaction. This demonstrates the framework's practical viability for autonomous task execution. The demonstration gallery showcases diverse capabilities including real-browser CAPTCHA survival, autonomous web exploration, food delivery ordering, quantitative stock screening, expense tracking, and batch messaging across multiple platforms.

Installation is deliberately tiered to minimize dependencies. The core agent requires only the requests library plus four lightweight packages for the local server supporting TMWebdriver. Optional UI extras including Streamlit and terminal UI components can be installed separately. The framework provides multiple frontend interfaces: a terminal UI built on prompt_toolkit and rich, a Streamlit web interface, and instant messaging integrations with Telegram, Discord, and Lark.

Advanced capabilities are unlocked through natural language instructions rather than configuration files or documentation reading. Users can enable web automation, OCR, vision capabilities, and computer use by simply instructing the agent to set up these features. The agent then consults pre-installed standard operating procedures, installs missing dependencies, adapts to the operating system, and persists configurations to memory.

According to GitGenius activity tracking, the repository shows median issue and pull request response latency of 3.0 hours across 87 tracked items, with mean latency of 62.2 hours. Primary contributor lsdefine has logged 44 events, with secondary contributors desmonna and szdtzpj each contributing 6 events. The repository connects to related projects through overlapping contributors, including alirezarezvani/claude-skills, huggingface/transformers, and huggingface/datasets. The framework is classified across multiple AI domains including agent systems, LLM integration, tool usage, memory management, and task automation, reflecting its comprehensive approach to autonomous agent design.

lsdefine/GenericAgent

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

GenericAgent
by
lsdefinelsdefine/GenericAgent

Repository Details

lsdefine/GenericAgent

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

GenericAgent by lsdefinelsdefine/GenericAgent

Repository Details

GenericAgent
by
lsdefinelsdefine/GenericAgent