Maintaining a thriving open source project often feels like a losing battle against the accumulation of digital debris. Years of unresolved questions, stale bug reports, and abandoned feature requests can transform a healthy repository into a cluttered graveyard, making it nearly impossible for maintainers to distinguish between active development and historical noise. The openclaw/openclaw repository recently addressed this mounting technical debt by deploying ClawSweeper, an AI-driven management bot designed to categorize and prune neglected issues and pull requests without stripping away human oversight.

The Three-Stage Automation Pipeline

ClawSweeper operates through a structured three-stage pipeline: Plan, Review, and Apply. In the planning phase, the system inventories the backlog and prioritizes which items to examine. During the review phase, OpenAI’s Codex model analyzes code, commit history, and discussion threads in read-only mode to determine whether each item is still relevant. The final application phase requires a human to manually trigger the closure of any issue, ensuring that the AI acts as a decision-support tool rather than an autonomous janitor.
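The separation between the three stages can be sketched as follows. This is an illustrative outline only: the type and function names (`plan`, `apply`, `ReviewResult`) are assumptions for the sake of the example, not ClawSweeper's actual API, and the planning heuristic shown (oldest first) is a stand-in.

```typescript
// Hypothetical sketch of the Plan → Review → Apply separation.
type Verdict = "close" | "keep";

interface ReviewResult {
  issueNumber: number;
  verdict: Verdict;
  rationale: string; // evidence gathered read-only during review
}

// Plan: order the backlog (oldest activity first, a stand-in heuristic).
function plan(issues: { number: number; updatedAt: number }[]): number[] {
  return [...issues]
    .sort((a, b) => a.updatedAt - b.updatedAt)
    .map((i) => i.number);
}

// Apply: nothing closes without an explicit human trigger.
function apply(result: ReviewResult, humanApproved: boolean): string {
  if (result.verdict === "close" && humanApproved) {
    return `closed #${result.issueNumber}`;
  }
  return `left #${result.issueNumber} open`;
}
```

The key design point is that `apply` takes human approval as a separate input: the model's verdict alone can never close anything.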

The bot is governed by five strict criteria for closure: features already implemented, non-reproducible errors, requests better suited for external plugins, ambiguous inquiries, and items left untouched for over 60 days. Anything that does not fit these narrow parameters remains open to prevent accidental data loss. To handle the massive scale of 13,000 items, the system utilizes 40 parallel shards with a 10-minute timeout per item to maintain stability. Every decision is logged in a Markdown file, providing a transparent audit trail that includes the specific file paths and commit SHA identifiers that informed the AI’s recommendation.
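A minimal sketch of how the five criteria and the shard layout might fit together, assuming a simple modulo assignment of items to workers; the names (`Criterion`, `shardFor`, `isStale`) are illustrative, not ClawSweeper's actual schema:

```typescript
// Constants from the article: 60-day staleness window, 40 shards,
// 10-minute per-item budget.
const STALE_DAYS = 60;
const SHARD_COUNT = 40;
const ITEM_TIMEOUT_MS = 10 * 60 * 1000;

// The five closure criteria described in the text.
type Criterion =
  | "already-implemented"
  | "not-reproducible"
  | "plugin-territory"
  | "ambiguous"
  | "stale";

function isStale(lastActivity: Date, now: Date): boolean {
  const ageDays = (now.getTime() - lastActivity.getTime()) / 86_400_000;
  return ageDays > STALE_DAYS;
}

// Anything without a matched criterion stays open by default.
function verdict(matched: Criterion[]): "close-candidate" | "keep-open" {
  return matched.length > 0 ? "close-candidate" : "keep-open";
}

// Simple modulo sharding spreads the backlog across 40 parallel workers.
function shardFor(issueNumber: number): number {
  return issueNumber % SHARD_COUNT;
}
```

The conservative default in `verdict` mirrors the article's point: an item must positively match a criterion to become a candidate, so unmatched items can never be lost.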

Human-in-the-Loop Architecture

Unlike traditional automated scripts that blindly delete content, ClawSweeper enforces a strict separation between suggestion and execution. The bot is hard-coded to ignore any issues created by project maintainers or known contributors, protecting core development history from accidental pruning. Furthermore, the system employs snapshot hashes to verify data integrity; if an issue is updated between the time the bot makes a recommendation and the time a human attempts to apply it, the system automatically skips the task to prevent outdated logic from overwriting recent progress.

Technically, the tool is built as a single TypeScript file of approximately 2,500 lines, minimizing external dependencies. The build process is optimized using the Go-based `tsgo` compiler, while linting and formatting are handled by the Rust-based `oxlint` and `oxfmt` tools. The bot runs via GitHub Actions using a cron schedule, incorporating built-in throttling logic to respect API rate limits and ensure consistent performance.

Performance Metrics and AI Utility

As of April 2025, the impact of ClawSweeper on repository health has been significant. Over a seven-day period, the bot reviewed 13,000 items, identifying 33.7% of issues and 11.4% of pull requests as candidates for closure. Ultimately, 3,907 items were cleared, with only 17 failures, roughly a tenth of a percent of the total volume. With a daily review coverage of 98.2%, the tool demonstrates that the true value of AI in open source management lies in its ability to filter high-confidence candidates, leaving the nuanced final judgment to the human maintainers. By keeping the closure rate to roughly 30% of the total backlog, ClawSweeper proves that the most effective AI tools are those that know exactly when to stop.

Effective open source maintenance is no longer about manual labor, but about defining the precise boundary where automation ends and human expertise begins.