Best Claude Skills for Testing & TDD in 2026: 15 Compared

Testing & TDD is the use-case category where Claude Code's installed base shows the clearest preference. The top two skills aren't test runners or framework wrappers; they're TDD methodology skills from obra/superpowers (167.5K, 4/5 quality) and NousResearch/hermes-agent (124.8K, 4/5 quality). Both encode the RED → GREEN → REFACTOR loop and force Claude to write tests before implementation. Combined install signal: roughly 292K, more than the other 13 entries in the top 15 combined (about 201K). The pattern is clear: developers using Claude Code for testing want philosophy more than tools.

Quick Pick

test-driven-development (obra/superpowers): the highest-installed testing skill in the catalog. 167.5K signal, 4/5 quality. It encodes "write the test first, then the code." Install this before picking a Playwright wrapper.

What These Skills Actually Do

Testing skills cluster into three patterns:

(1) Methodology skills (#1, #2) encode the TDD discipline itself. They don't run tests; they make sure you write the test first. Both top entries do this, and the install signal suggests the audience cares more about discipline than tooling.
(2) Test-runner wrappers (#3 Playwright Best Practices, #4 Playwright CLI, #5 Browser Use, #8 Playwright Commander, #12 E2E Testing Patterns) are five entries about running tests, mostly Playwright-flavored.
(3) Strategy skills (#6 Agent Skills by Addy Osmani, #7 Code, #10 Test Runner, #11 Test Master, #14 Developer, #15 Testing Patterns) are broader software-quality skills that include testing as one component.

What separates great from mediocre here is whether the skill enforces behavior change (methodology skills do; CLI wrappers don't) and whether the author shows their work (Currents' Playwright skill at #3 is detailed and specific; the generic "Developer" at #14 is shallow by comparison).

How We Ranked

We sorted 15 candidate skills by a composite score:

Popularity signal — the highest of GitHub stars, install count, or ClawHub download count. Log-scaled so a 100-star skill doesn't get buried under a 100,000-star one if the smaller one is meaningfully better.
Quality score — when set, a 0–5 rubric that breaks ties within popularity tiers. Roughly 15% of catalog skills carry a quality score today; we surface it in the comparison table when available.

The formula is identical across the entire Best-Of 2026 series, so you can compare apples to apples between categories.
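The article does not publish the exact formula, so the following is only a minimal sketch of how a log-scaled popularity signal and an optional quality score could be combined. The additive form, the `QUALITY_WEIGHT` value, and the function names are all assumptions chosen for illustration; the signal and quality figures are the first six rows of this article's table.

```python
import math
from typing import Optional

# Assumption: one quality point is worth 0.1 on the log10 scale.
# The real weighting is not stated in the article.
QUALITY_WEIGHT = 0.1


def popularity_signal(stars: int = 0, installs: int = 0, downloads: int = 0) -> int:
    """Highest of GitHub stars, install count, or ClawHub download count."""
    return max(stars, installs, downloads)


def composite_score(signal: int, quality: Optional[int] = None) -> float:
    """Log-scaled popularity plus a small bonus when a quality score is set."""
    base = math.log10(max(signal, 1))  # guard against log10(0)
    bonus = QUALITY_WEIGHT * quality if quality is not None else 0.0
    return base + bonus


# (name, popularity signal, quality score or None), deliberately shuffled.
candidates = [
    ("Browser Use", 33_300, None),
    ("Playwright CLI", 26_600, 3),
    ("test-driven-development (NousResearch/hermes-agent)", 124_800, 4),
    ("Agent Skills by Addy Osmani", 21_000, None),
    ("test-driven-development (obra/superpowers)", 167_500, 4),
    ("Playwright Best Practices (Currents)", 33_200, 3),
]

ranked = sorted(candidates, key=lambda c: composite_score(c[1], c[2]), reverse=True)
for position, (name, signal, quality) in enumerate(ranked, start=1):
    print(position, name, round(composite_score(signal, quality), 3))
```

Under these assumed weights, Playwright CLI (26.6K, quality 3/5) lands above Browser Use (33.3K, no quality score), which matches the order in the table. That is consistent with the description of quality breaking ties within a popularity tier, but it does not confirm the actual formula.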

The Top 15

1. test-driven-development

Skill · obra/superpowers · 167.5K signal · quality 4/5

Use when implementing any feature or bugfix, before writing implementation code.

The take: The single highest-installed testing skill in the catalog. It comes from the obra/superpowers repo, whose author is behind several top entries across the series. The skill is short (the discipline is simple), but the effect is large: Claude actually writes the failing test first instead of skipping ahead to the implementation it "knows" is right.

Comparison Table

#	Skill	Type	Stars / Installs	Quality	License
1	test-driven-development	Skill	167.5K	4/5	—
2	test-driven-development (Hermes)	Skill	124.8K	4/5	MIT
3	Playwright Best Practices (Currents)	Plugin	33.2K	3/5	MIT
4	Playwright CLI	Plugin	26.6K	3/5	MIT
5	Browser Use	Skill	33.3K	—	—
6	Agent Skills by Addy Osmani	Plugin	21.0K	—	MIT
7	Code	Skill	17.9K	—	—
8	Playwright Commander	Skill	15.8K	—	—
9	Debug Pro	Skill	15.6K	—	—
10	Test Runner	Skill	11.9K	—	—
11	Test Master	Skill	7.4K	—	—
12	E2E Testing Patterns	Skill	6.5K	—	—
13	CI/CD Pipeline	Skill	4.2K	—	—
14	Developer	Skill	3.9K	—	—
15	Testing Patterns	Skill	3.8K	—	—
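As a quick arithmetic check on the claim that the top two skills carry more signal than the rest of the list combined, the Stars / Installs column above can be summed directly. The figures are copied from the table (in thousands); the variable names are only illustrative.

```python
# Stars / Installs column from the table above, in thousands, in rank order.
signals_k = [
    167.5, 124.8,            # ranks 1-2: the two TDD methodology skills
    33.2, 26.6, 33.3, 21.0,  # ranks 3-6
    17.9, 15.8, 15.6, 11.9,  # ranks 7-10
    7.4, 6.5, 4.2, 3.9, 3.8, # ranks 11-15
]

top_two = sum(signals_k[:2])
rest = sum(signals_k[2:])

print(f"top two: {top_two:.1f}K")  # prints: top two: 292.3K
print(f"rest:    {rest:.1f}K")     # prints: rest:    201.1K
print(top_two > rest)              # prints: True
```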


