Testing Skills Comparison: Find the Right QA Automation

Testing is where good intentions meet reality. Every developer knows tests are important. Fewer write comprehensive tests consistently. The gap isn't knowledge; it's friction. Writing tests takes time, and in the pressure of delivery, testing often gets shortchanged.

Claude Code testing skills reduce this friction by automating test generation, improving test quality, and making comprehensive testing the path of least resistance. In this comparison, we'll examine the major approaches to testing skills and help you choose the right tools for your QA needs.

The Testing Landscape

Testing comes in layers, and different skills target different layers:

Unit Tests: Test individual functions and classes in isolation. Fast, focused, many of them.

Integration Tests: Test components working together. Slower, fewer, more realistic.

End-to-End Tests: Test complete user flows. Slowest, fewest, most confidence-building.

Property-Based Tests: Generate test cases from specifications. Finds edge cases humans miss.

Mutation Tests: Verify test quality by introducing bugs. Measures test effectiveness.

Most testing skills focus on one or two layers. Understanding the tradeoffs helps you build a comprehensive testing strategy.

Skill Category 1: Unit Test Generation

The most common testing skills generate unit tests from existing code.

test-generator (Basic)

The straightforward approach: analyze a function, generate tests for it.

# Test Generator

Generate unit tests for functions and classes.

## Process

1. Read the target function/class
2. Identify:
   - Input parameters and types
   - Return values and types
   - Side effects
   - Dependencies
3. Generate test cases for:
   - Happy path (typical usage)
   - Edge cases (empty, null, boundary values)
   - Error cases (invalid inputs)
4. Output in test framework format

## Frameworks Supported

- Jest (JavaScript/TypeScript)
- pytest (Python)
- RSpec (Ruby)
- JUnit (Java)
- Go testing
- Vitest (JavaScript/TypeScript)

## Example

Input:
```typescript
function divide(a: number, b: number): number {
  if (b === 0) throw new Error('Division by zero')
  return a / b
}

Output:

describe('divide', () => {
  it('divides two positive numbers', () => {
    expect(divide(10, 2)).toBe(5)
  })

  it('divides negative numbers', () => {
    expect(divide(-10, 2)).toBe(-5)
  })

  it('returns decimal results', () => {
    expect(divide(1, 3)).toBeCloseTo(0.333, 2)
  })

  it('throws on division by zero', () => {
    expect(() => divide(10, 0)).toThrow('Division by zero')
  })

  it('handles zero dividend', () => {
    expect(divide(0, 5)).toBe(0)
  })
})


### Strengths

- **Fast generation**: Get tests quickly for existing code
- **Comprehensive coverage**: Covers cases you might forget
- **Framework-appropriate**: Generates idiomatic tests

### Limitations

- **Implementation-focused**: Tests what code does, not what it should do
- **May miss intent**: Doesn't understand business requirements
- **Brittle to changes**: Tests tied to current implementation

### Best For

Legacy code without tests. Getting started with testing. Filling coverage gaps.

### test-generator-pro (Advanced)

Goes deeper with semantic analysis and behavior inference.

```markdown
# Test Generator Pro

Intelligent test generation with behavior inference.

## Analysis Depth

### Signature Analysis
- Parameter types and constraints
- Return type and possible values
- Thrown exceptions

### Body Analysis
- Conditional branches (each path tested)
- Loops (boundary conditions)
- External calls (mocked appropriately)

### Semantic Analysis
- Function name suggests expected behavior
- Variable names indicate intent
- Comments explain edge cases

### Context Analysis
- How is this function called?
- What does the caller expect?
- What are related functions?

## Advanced Features

### Branch Coverage
Generates tests for each code path:
```python
def categorize(score):
    if score >= 90:
        return 'A'
    elif score >= 80:
        return 'B'
    elif score >= 70:
        return 'C'
    else:
        return 'F'

Generates:

test_score_90_returns_A
test_score_89_returns_B
test_score_80_returns_B
test_score_79_returns_C
test_score_70_returns_C
test_score_69_returns_F
test_score_negative
test_score_100
test_score_above_100

Mock Generation

For functions with dependencies:

@patch('module.database')
def test_get_user_calls_database(mock_db):
    mock_db.query.return_value = {'id': 1, 'name': 'Test'}
    result = get_user(1)
    mock_db.query.assert_called_once_with('SELECT * FROM users WHERE id = ?', [1])

Parameterized Tests

When many similar cases exist:

@pytest.mark.parametrize('score,expected', [
    (90, 'A'), (95, 'A'), (100, 'A'),
    (80, 'B'), (85, 'B'), (89, 'B'),
    (70, 'C'), (75, 'C'), (79, 'C'),
    (0, 'F'), (50, 'F'), (69, 'F'),
])
def test_categorize(score, expected):
    assert categorize(score) == expected


### Strengths

- **High coverage**: Targets all code paths
- **Proper mocking**: Isolates units correctly
- **Efficient**: Parameterized tests reduce duplication

### Limitations

- **Complex output**: May generate too many tests
- **Mock complexity**: Mocks can become maintenance burden
- **Still implementation-focused**: Tests structure, not requirements

### Best For

Comprehensive coverage goals. Complex functions with many branches. Code with external dependencies.

## Skill Category 2: Behavior-Driven Testing

These skills start from requirements, not implementation.

### bdd-test-generator

Generates tests from behavior specifications:

```markdown
# BDD Test Generator

Generate tests from behavior descriptions.

## Input Format

Describe behavior in plain English:

"Users should be able to:
- Create an account with email and password
- Password must be at least 8 characters
- Email must be valid format
- Duplicate emails should be rejected
- Successful signup returns user ID"

## Output

### Feature File (Gherkin)
```gherkin
Feature: User Registration

  Scenario: Successful registration
    Given a valid email "user@example.com"
    And a password "securepass123"
    When I register a new account
    Then I should receive a user ID
    And my account should be created

  Scenario: Password too short
    Given a valid email "user@example.com"
    And a password "short"
    When I register a new account
    Then I should receive an error "Password must be at least 8 characters"

  Scenario: Invalid email format
    Given an invalid email "not-an-email"
    And a password "securepass123"
    When I register a new account
    Then I should receive an error "Invalid email format"

  Scenario: Duplicate email
    Given an existing user with email "user@example.com"
    When I register with email "user@example.com"
    Then I should receive an error "Email already registered"

Step Definitions

Given('a valid email {string}', async function(email) {
  this.email = email
})

When('I register a new account', async function() {
  this.result = await registerUser(this.email, this.password)
})

Then('I should receive a user ID', async function() {
  expect(this.result.userId).toBeDefined()
})


### Strengths

- **Requirements-driven**: Tests what should happen, not what does happen
- **Readable by non-developers**: Product managers can review
- **Living documentation**: Tests describe system behavior

### Limitations

- **Setup overhead**: Requires step definition infrastructure
- **Slower execution**: Often integration-level tests
- **Maintenance burden**: Step definitions can proliferate

### Best For

Feature-level testing. Teams with product managers involved in QA. Documentation-heavy projects.

### spec-first-testing

Generate implementation from test specifications:

```markdown
# Spec-First Testing

Write tests first, generate implementation guidance.

## Process

1. Accept test cases from user
2. Analyze what implementation must do to pass
3. Generate implementation skeleton
4. Verify tests pass after implementation

## Example

Input tests:
```typescript
describe('EmailValidator', () => {
  it('accepts valid email formats', () => {
    expect(isValidEmail('user@example.com')).toBe(true)
    expect(isValidEmail('user.name@example.co.uk')).toBe(true)
  })

  it('rejects invalid formats', () => {
    expect(isValidEmail('no-at-sign')).toBe(false)
    expect(isValidEmail('@no-local-part.com')).toBe(false)
    expect(isValidEmail('no-domain@')).toBe(false)
  })

  it('rejects empty input', () => {
    expect(isValidEmail('')).toBe(false)
  })
})

Output implementation guide:

function isValidEmail(email: string): boolean {
  // Must handle:
  // 1. Empty string -> false
  // 2. No @ sign -> false
  // 3. Nothing before @ -> false
  // 4. Nothing after @ -> false
  // 5. Valid format with subdomains -> true
  // 6. Valid format with dots in local part -> true

  // Suggested implementation:
  if (!email || email.length === 0) return false
  const parts = email.split('@')
  if (parts.length !== 2) return false
  if (!parts[0] || !parts[1]) return false
  // Additional validation for domain format
  return /^[^\s@]+@[^\s@]+\.[^\s@]+$/.test(email)
}


### Strengths

- **True TDD**: Tests drive implementation
- **Clear requirements**: Tests define acceptance criteria
- **Focused implementation**: Only build what tests require

### Limitations

- **Requires discipline**: Must write tests first
- **Learning curve**: Different mental model
- **Upfront investment**: Slower start, faster finish

### Best For

Teams committed to TDD. New feature development. Complex domain logic.

## Skill Category 3: Integration Testing

Testing components working together.

### api-test-generator

Generate tests for REST/GraphQL APIs:

```markdown
# API Test Generator

Generate API integration tests.

## Input Sources

- OpenAPI/Swagger specifications
- GraphQL schemas
- Existing request/response examples
- API documentation

## Generated Tests

### REST API
```typescript
describe('Users API', () => {
  describe('POST /users', () => {
    it('creates user with valid data', async () => {
      const response = await request(app)
        .post('/users')
        .send({ email: 'new@example.com', password: 'password123' })
        .expect(201)

      expect(response.body).toMatchObject({
        id: expect.any(String),
        email: 'new@example.com',
      })
    })

    it('returns 400 for invalid email', async () => {
      const response = await request(app)
        .post('/users')
        .send({ email: 'invalid', password: 'password123' })
        .expect(400)

      expect(response.body.error).toContain('email')
    })

    it('returns 409 for duplicate email', async () => {
      await createUser({ email: 'existing@example.com' })

      await request(app)
        .post('/users')
        .send({ email: 'existing@example.com', password: 'password123' })
        .expect(409)
    })
  })

  describe('GET /users/:id', () => {
    it('returns user for valid ID', async () => {
      const user = await createUser({ email: 'test@example.com' })

      const response = await request(app)
        .get(`/users/${user.id}`)
        .expect(200)

      expect(response.body.email).toBe('test@example.com')
    })

    it('returns 404 for unknown ID', async () => {
      await request(app)
        .get('/users/unknown-id')
        .expect(404)
    })
  })
})

GraphQL API

describe('GraphQL Users', () => {
  it('creates user via mutation', async () => {
    const mutation = `
      mutation CreateUser($input: CreateUserInput!) {
        createUser(input: $input) {
          id
          email
        }
      }
    `

    const response = await graphql(schema, mutation, null, context, {
      input: { email: 'test@example.com', password: 'password123' }
    })

    expect(response.data.createUser).toMatchObject({
      id: expect.any(String),
      email: 'test@example.com',
    })
  })
})


### database-test-helper

Handle database state in tests:

```markdown
# Database Test Helper

Manage database state for integration tests.

## Capabilities

### Test Isolation
Each test runs in a transaction that's rolled back:
```typescript
beforeEach(async () => {
  await db.beginTransaction()
})

afterEach(async () => {
  await db.rollback()
})

Fixture Generation

Create test data:

const user = await fixtures.createUser({
  email: 'test@example.com',
  // Other fields have sensible defaults
})

const order = await fixtures.createOrder({
  userId: user.id,
  items: [
    await fixtures.createOrderItem({ productId: 'prod-1' }),
  ],
})

Data Verification

Assert database state:

await expect(db.users.findById(userId)).toMatchObject({
  email: 'updated@example.com',
  updatedAt: expect.any(Date),
})

Seeding

Populate test database:

await seed.runScript('minimal')  // Just required data
await seed.runScript('full')     // Complete test dataset


## Skill Category 4: End-to-End Testing

Testing complete user flows.

### e2e-test-generator

Generate browser-based end-to-end tests:

```markdown
# E2E Test Generator

Generate end-to-end browser tests.

## Frameworks

- Playwright (recommended)
- Cypress
- Puppeteer

## Test Generation

From user flow descriptions:

"User signs up, verifies email, logs in, and sees dashboard"

Generates:
```typescript
import { test, expect } from '@playwright/test'

test.describe('User Onboarding', () => {
  test('complete signup flow', async ({ page }) => {
    // Navigate to signup
    await page.goto('/signup')
    await expect(page).toHaveTitle(/Sign Up/)

    // Fill signup form
    await page.fill('[name="email"]', 'newuser@example.com')
    await page.fill('[name="password"]', 'SecurePassword123!')
    await page.click('button[type="submit"]')

    // Verify email sent message
    await expect(page.locator('.success-message')).toContainText(
      'Check your email'
    )

    // Simulate email verification (test mode)
    await page.goto('/verify?token=test-token')
    await expect(page.locator('.verification-success')).toBeVisible()

    // Login
    await page.goto('/login')
    await page.fill('[name="email"]', 'newuser@example.com')
    await page.fill('[name="password"]', 'SecurePassword123!')
    await page.click('button[type="submit"]')

    // Verify dashboard
    await expect(page).toHaveURL('/dashboard')
    await expect(page.locator('h1')).toContainText('Dashboard')
  })
})

Best Practices

Page Objects

class SignupPage {
  constructor(private page: Page) {}

  async fillEmail(email: string) {
    await this.page.fill('[name="email"]', email)
  }

  async submit() {
    await this.page.click('button[type="submit"]')
  }
}

Test Data Management

Use unique identifiers per test run
Clean up created data
Isolate tests from each other

Reliability

Wait for elements, not arbitrary timeouts
Handle flaky elements with retries
Use stable selectors (data-testid)


### visual-regression

Catch visual bugs:

```markdown
# Visual Regression Testing

Detect unintended visual changes.

## Process

1. Capture baseline screenshots
2. On each test run, capture new screenshots
3. Compare against baselines
4. Flag differences for review
5. Update baselines when changes are intentional

## Integration

```typescript
test('homepage looks correct', async ({ page }) => {
  await page.goto('/')
  await expect(page).toHaveScreenshot('homepage.png')
})

test('button states', async ({ page }) => {
  await page.goto('/components/button')

  // Default state
  await expect(page.locator('.button-demo')).toHaveScreenshot('button-default.png')

  // Hover state
  await page.hover('.button-demo button')
  await expect(page.locator('.button-demo')).toHaveScreenshot('button-hover.png')
})

Handling Differences

When screenshot differs:

Show side-by-side comparison
Highlight changed pixels
Ask: intentional change or bug?
Update baseline or fix bug


## Skill Category 5: Test Quality

Skills that measure and improve test effectiveness.

### coverage-analyzer

Analyze and improve test coverage:

```markdown
# Coverage Analyzer

Analyze test coverage and suggest improvements.

## Metrics Collected

- Line coverage
- Branch coverage
- Function coverage
- Statement coverage

## Analysis

### Coverage Gaps
Identify untested code:

Uncovered code in src/api/users.ts:

Lines 45-52: Error handling for database failure Lines 78-85: Edge case for empty results Lines 120-135: Batch update logic


### Test Suggestions
For each gap, suggest tests:
```typescript
// For lines 45-52:
it('handles database connection failure', async () => {
  mockDb.query.mockRejectedValue(new Error('Connection lost'))
  await expect(getUser(1)).rejects.toThrow('Database unavailable')
})

Coverage Quality

Not just percentage, but meaningful coverage:

Critical paths covered?
Error handling tested?
Edge cases verified?


### mutation-tester

Verify test effectiveness:

```markdown
# Mutation Testing

Verify tests catch bugs by introducing them.

## Process

1. Parse source code
2. Apply mutations (change operators, remove statements)
3. Run tests against mutated code
4. Calculate mutation score (% of mutants killed)

## Mutation Types

### Value Mutations
- Change constants: 1 -> 0, true -> false
- Change strings: "hello" -> "mutated"

### Operator Mutations
- Arithmetic: + -> -, * -> /
- Comparison: > -> >=, == -> !=
- Logical: && -> ||, ! removed

### Statement Mutations
- Remove statement
- Replace return value
- Remove function call

## Output

### Mutation Score: 85%

15% of mutants survived (tests didn't catch the bug).

### Surviving Mutants

**src/utils/validate.ts:23**
Mutation: Changed `>` to `>=`
Original: `if (length > 8)`
Mutated: `if (length >= 8)`
Missing test: No test for exactly 8 characters

**src/api/users.ts:67**
Mutation: Removed null check
Original: `if (user === null) return notFound()`
Mutated: `/* removed */`
Missing test: No test for null user case

Choosing the Right Skills

Decision Framework

Situation	Recommended Skills
No tests exist	test-generator (basic) to start
Coverage gaps	coverage-analyzer + test-generator-pro
New features	bdd-test-generator or spec-first
API development	api-test-generator
UI changes	e2e-test-generator + visual-regression
Test quality concerns	mutation-tester

Building a Testing Stack

Combine skills for comprehensive testing:

## Our Testing Stack

### Unit Tests
- test-generator-pro for implementation
- coverage-analyzer for gap analysis
- Target: 80% line coverage

### Integration Tests
- api-test-generator for endpoints
- database-test-helper for fixtures
- Target: All endpoints tested

### E2E Tests
- e2e-test-generator for critical flows
- visual-regression for UI
- Target: Key user journeys covered

### Quality Verification
- mutation-tester quarterly
- Target: 75% mutation score

Practical Tips

Start with Generation, Improve with Review

Generated tests are a starting point. Review them for:

Missing business cases
Incorrect assumptions
Unnecessary tests

Balance Coverage with Value

100% coverage isn't the goal. Focus coverage on:

Business-critical code
Error-prone areas
Frequently changed code

Integrate with CI

Run different test levels at different stages:

Unit tests: Every commit
Integration tests: Every PR
E2E tests: Before release
Mutation tests: Weekly/monthly

Evolve Your Strategy

Testing needs change over time:

Early stage: Focus on critical paths
Growth stage: Expand coverage
Mature stage: Optimize for maintenance

Conclusion

Testing skills transform QA from a burden into a streamlined process. Generation skills bootstrap coverage quickly. BDD skills align tests with requirements. Integration skills verify components work together. E2E skills confirm user flows function. Quality skills ensure tests actually catch bugs.

The right combination depends on your situation:

Starting from zero? Use basic test generation to build momentum.
Have coverage but miss bugs? Add mutation testing to verify effectiveness.
Building new features? Try BDD for requirements-driven tests.
Shipping UI? Include visual regression.

Start with the skill that addresses your biggest pain point. A few good tests beat no tests, and generated tests beat the tests you never wrote.

Want to improve other aspects of code quality? Check out Code Review Skills Roundup for PR automation, or explore TDD Skill Analysis for deep-dive testing methodology.

Testing Skills Comparison: Find the Right QA Automation

The Testing Landscape

Testing comes in layers, and different skills target different layers:

Unit Tests: Test individual functions and classes in isolation. Fast, focused, many of them.

Integration Tests: Test components working together. Slower, fewer, more realistic.

End-to-End Tests: Test complete user flows. Slowest, fewest, most confidence-building.

Property-Based Tests: Generate test cases from specifications. Finds edge cases humans miss.

Mutation Tests: Verify test quality by introducing bugs. Measures test effectiveness.

Most testing skills focus on one or two layers. Understanding the tradeoffs helps you build a comprehensive testing strategy.

Skill Category 1: Unit Test Generation

The most common testing skills generate unit tests from existing code.

test-generator (Basic)

The straightforward approach: analyze a function, generate tests for it.

# Test Generator

Generate unit tests for functions and classes.

## Process

1. Read the target function/class
2. Identify:
   - Input parameters and types
   - Return values and types
   - Side effects
   - Dependencies
3. Generate test cases for:
   - Happy path (typical usage)
   - Edge cases (empty, null, boundary values)
   - Error cases (invalid inputs)
4. Output in test framework format

## Frameworks Supported

- Jest (JavaScript/TypeScript)
- pytest (Python)
- RSpec (Ruby)
- JUnit (Java)
- Go testing
- Vitest (JavaScript/TypeScript)

## Example

Input:
```typescript
function divide(a: number, b: number): number {
  if (b === 0) throw new Error('Division by zero')
  return a / b
}

Output:

describe('divide', () => {
  it('divides two positive numbers', () => {
    expect(divide(10, 2)).toBe(5)
  })

  it('divides negative numbers', () => {
    expect(divide(-10, 2)).toBe(-5)
  })

  it('returns decimal results', () => {
    expect(divide(1, 3)).toBeCloseTo(0.333, 2)
  })

  it('throws on division by zero', () => {
    expect(() => divide(10, 0)).toThrow('Division by zero')
  })

  it('handles zero dividend', () => {
    expect(divide(0, 5)).toBe(0)
  })
})


### Strengths

- **Fast generation**: Get tests quickly for existing code
- **Comprehensive coverage**: Covers cases you might forget
- **Framework-appropriate**: Generates idiomatic tests

### Limitations

- **Implementation-focused**: Tests what code does, not what it should do
- **May miss intent**: Doesn't understand business requirements
- **Brittle to changes**: Tests tied to current implementation

### Best For

Legacy code without tests. Getting started with testing. Filling coverage gaps.

### test-generator-pro (Advanced)

Goes deeper with semantic analysis and behavior inference.

```markdown
# Test Generator Pro

Intelligent test generation with behavior inference.

## Analysis Depth

### Signature Analysis
- Parameter types and constraints
- Return type and possible values
- Thrown exceptions

### Body Analysis
- Conditional branches (each path tested)
- Loops (boundary conditions)
- External calls (mocked appropriately)

### Semantic Analysis
- Function name suggests expected behavior
- Variable names indicate intent
- Comments explain edge cases

### Context Analysis
- How is this function called?
- What does the caller expect?
- What are related functions?

## Advanced Features

### Branch Coverage
Generates tests for each code path:
```python
def categorize(score):
    if score >= 90:
        return 'A'
    elif score >= 80:
        return 'B'
    elif score >= 70:
        return 'C'
    else:
        return 'F'

Generates:

test_score_90_returns_A
test_score_89_returns_B
test_score_80_returns_B
test_score_79_returns_C
test_score_70_returns_C
test_score_69_returns_F
test_score_negative
test_score_100
test_score_above_100

Mock Generation

For functions with dependencies:

@patch('module.database')
def test_get_user_calls_database(mock_db):
    mock_db.query.return_value = {'id': 1, 'name': 'Test'}
    result = get_user(1)
    mock_db.query.assert_called_once_with('SELECT * FROM users WHERE id = ?', [1])

Parameterized Tests

When many similar cases exist:

@pytest.mark.parametrize('score,expected', [
    (90, 'A'), (95, 'A'), (100, 'A'),
    (80, 'B'), (85, 'B'), (89, 'B'),
    (70, 'C'), (75, 'C'), (79, 'C'),
    (0, 'F'), (50, 'F'), (69, 'F'),
])
def test_categorize(score, expected):
    assert categorize(score) == expected


### Strengths

- **High coverage**: Targets all code paths
- **Proper mocking**: Isolates units correctly
- **Efficient**: Parameterized tests reduce duplication

### Limitations

- **Complex output**: May generate too many tests
- **Mock complexity**: Mocks can become maintenance burden
- **Still implementation-focused**: Tests structure, not requirements

### Best For

Comprehensive coverage goals. Complex functions with many branches. Code with external dependencies.

## Skill Category 2: Behavior-Driven Testing

These skills start from requirements, not implementation.

### bdd-test-generator

Generates tests from behavior specifications:

```markdown
# BDD Test Generator

Generate tests from behavior descriptions.

## Input Format

Describe behavior in plain English:

"Users should be able to:
- Create an account with email and password
- Password must be at least 8 characters
- Email must be valid format
- Duplicate emails should be rejected
- Successful signup returns user ID"

## Output

### Feature File (Gherkin)
```gherkin
Feature: User Registration

  Scenario: Successful registration
    Given a valid email "user@example.com"
    And a password "securepass123"
    When I register a new account
    Then I should receive a user ID
    And my account should be created

  Scenario: Password too short
    Given a valid email "user@example.com"
    And a password "short"
    When I register a new account
    Then I should receive an error "Password must be at least 8 characters"

  Scenario: Invalid email format
    Given an invalid email "not-an-email"
    And a password "securepass123"
    When I register a new account
    Then I should receive an error "Invalid email format"

  Scenario: Duplicate email
    Given an existing user with email "user@example.com"
    When I register with email "user@example.com"
    Then I should receive an error "Email already registered"

Step Definitions

Given('a valid email {string}', async function(email) {
  this.email = email
})

When('I register a new account', async function() {
  this.result = await registerUser(this.email, this.password)
})

Then('I should receive a user ID', async function() {
  expect(this.result.userId).toBeDefined()
})


### Strengths

- **Requirements-driven**: Tests what should happen, not what does happen
- **Readable by non-developers**: Product managers can review
- **Living documentation**: Tests describe system behavior

### Limitations

- **Setup overhead**: Requires step definition infrastructure
- **Slower execution**: Often integration-level tests
- **Maintenance burden**: Step definitions can proliferate

### Best For

Feature-level testing. Teams with product managers involved in QA. Documentation-heavy projects.

### spec-first-testing

Generate implementation from test specifications:

```markdown
# Spec-First Testing

Write tests first, generate implementation guidance.

## Process

1. Accept test cases from user
2. Analyze what implementation must do to pass
3. Generate implementation skeleton
4. Verify tests pass after implementation

## Example

Input tests:
```typescript
describe('EmailValidator', () => {
  it('accepts valid email formats', () => {
    expect(isValidEmail('user@example.com')).toBe(true)
    expect(isValidEmail('user.name@example.co.uk')).toBe(true)
  })

  it('rejects invalid formats', () => {
    expect(isValidEmail('no-at-sign')).toBe(false)
    expect(isValidEmail('@no-local-part.com')).toBe(false)
    expect(isValidEmail('no-domain@')).toBe(false)
  })

  it('rejects empty input', () => {
    expect(isValidEmail('')).toBe(false)
  })
})

Output implementation guide:

function isValidEmail(email: string): boolean {
  // Must handle:
  // 1. Empty string -> false
  // 2. No @ sign -> false
  // 3. Nothing before @ -> false
  // 4. Nothing after @ -> false
  // 5. Valid format with subdomains -> true
  // 6. Valid format with dots in local part -> true

  // Suggested implementation:
  if (!email || email.length === 0) return false
  const parts = email.split('@')
  if (parts.length !== 2) return false
  if (!parts[0] || !parts[1]) return false
  // Additional validation for domain format
  return /^[^\s@]+@[^\s@]+\.[^\s@]+$/.test(email)
}


### Strengths

- **True TDD**: Tests drive implementation
- **Clear requirements**: Tests define acceptance criteria
- **Focused implementation**: Only build what tests require

### Limitations

- **Requires discipline**: Must write tests first
- **Learning curve**: Different mental model
- **Upfront investment**: Slower start, faster finish

### Best For

Teams committed to TDD. New feature development. Complex domain logic.

## Skill Category 3: Integration Testing

Testing components working together.

### api-test-generator

Generate tests for REST/GraphQL APIs:

```markdown
# API Test Generator

Generate API integration tests.

## Input Sources

- OpenAPI/Swagger specifications
- GraphQL schemas
- Existing request/response examples
- API documentation

## Generated Tests

### REST API
```typescript
describe('Users API', () => {
  describe('POST /users', () => {
    it('creates user with valid data', async () => {
      const response = await request(app)
        .post('/users')
        .send({ email: 'new@example.com', password: 'password123' })
        .expect(201)

      expect(response.body).toMatchObject({
        id: expect.any(String),
        email: 'new@example.com',
      })
    })

    it('returns 400 for invalid email', async () => {
      const response = await request(app)
        .post('/users')
        .send({ email: 'invalid', password: 'password123' })
        .expect(400)

      expect(response.body.error).toContain('email')
    })

    it('returns 409 for duplicate email', async () => {
      await createUser({ email: 'existing@example.com' })

      await request(app)
        .post('/users')
        .send({ email: 'existing@example.com', password: 'password123' })
        .expect(409)
    })
  })

  describe('GET /users/:id', () => {
    it('returns user for valid ID', async () => {
      const user = await createUser({ email: 'test@example.com' })

      const response = await request(app)
        .get(`/users/${user.id}`)
        .expect(200)

      expect(response.body.email).toBe('test@example.com')
    })

    it('returns 404 for unknown ID', async () => {
      await request(app)
        .get('/users/unknown-id')
        .expect(404)
    })
  })
})

GraphQL API

describe('GraphQL Users', () => {
  it('creates user via mutation', async () => {
    const mutation = `
      mutation CreateUser($input: CreateUserInput!) {
        createUser(input: $input) {
          id
          email
        }
      }
    `

    const response = await graphql(schema, mutation, null, context, {
      input: { email: 'test@example.com', password: 'password123' }
    })

    expect(response.data.createUser).toMatchObject({
      id: expect.any(String),
      email: 'test@example.com',
    })
  })
})


### database-test-helper

Handle database state in tests:

```markdown
# Database Test Helper

Manage database state for integration tests.

## Capabilities

### Test Isolation
Each test runs in a transaction that's rolled back:
```typescript
beforeEach(async () => {
  await db.beginTransaction()
})

afterEach(async () => {
  await db.rollback()
})

Fixture Generation

Create test data:

const user = await fixtures.createUser({
  email: 'test@example.com',
  // Other fields have sensible defaults
})

const order = await fixtures.createOrder({
  userId: user.id,
  items: [
    await fixtures.createOrderItem({ productId: 'prod-1' }),
  ],
})

Data Verification

Assert database state:

await expect(db.users.findById(userId)).toMatchObject({
  email: 'updated@example.com',
  updatedAt: expect.any(Date),
})

Seeding

Populate test database:

await seed.runScript('minimal')  // Just required data
await seed.runScript('full')     // Complete test dataset


## Skill Category 4: End-to-End Testing

Testing complete user flows.

### e2e-test-generator

Generate browser-based end-to-end tests:

```markdown
# E2E Test Generator

Generate end-to-end browser tests.

## Frameworks

- Playwright (recommended)
- Cypress
- Puppeteer

## Test Generation

From user flow descriptions:

"User signs up, verifies email, logs in, and sees dashboard"

Generates:
```typescript
import { test, expect } from '@playwright/test'

test.describe('User Onboarding', () => {
  test('complete signup flow', async ({ page }) => {
    // Navigate to signup
    await page.goto('/signup')
    await expect(page).toHaveTitle(/Sign Up/)

    // Fill signup form
    await page.fill('[name="email"]', 'newuser@example.com')
    await page.fill('[name="password"]', 'SecurePassword123!')
    await page.click('button[type="submit"]')

    // Verify email sent message
    await expect(page.locator('.success-message')).toContainText(
      'Check your email'
    )

    // Simulate email verification (test mode)
    await page.goto('/verify?token=test-token')
    await expect(page.locator('.verification-success')).toBeVisible()

    // Login
    await page.goto('/login')
    await page.fill('[name="email"]', 'newuser@example.com')
    await page.fill('[name="password"]', 'SecurePassword123!')
    await page.click('button[type="submit"]')

    // Verify dashboard
    await expect(page).toHaveURL('/dashboard')
    await expect(page.locator('h1')).toContainText('Dashboard')
  })
})

Best Practices

Page Objects

class SignupPage {
  constructor(private page: Page) {}

  async fillEmail(email: string) {
    await this.page.fill('[name="email"]', email)
  }

  async submit() {
    await this.page.click('button[type="submit"]')
  }
}

Test Data Management

Use unique identifiers per test run
Clean up created data
Isolate tests from each other

Reliability

Wait for elements, not arbitrary timeouts
Handle flaky elements with retries
Use stable selectors (data-testid)


### visual-regression

Catch visual bugs:

```markdown
# Visual Regression Testing

Detect unintended visual changes.

## Process

1. Capture baseline screenshots
2. On each test run, capture new screenshots
3. Compare against baselines
4. Flag differences for review
5. Update baselines when changes are intentional

## Integration

```typescript
test('homepage looks correct', async ({ page }) => {
  await page.goto('/')
  await expect(page).toHaveScreenshot('homepage.png')
})

test('button states', async ({ page }) => {
  await page.goto('/components/button')

  // Default state
  await expect(page.locator('.button-demo')).toHaveScreenshot('button-default.png')

  // Hover state
  await page.hover('.button-demo button')
  await expect(page.locator('.button-demo')).toHaveScreenshot('button-hover.png')
})

Handling Differences

When screenshot differs:

Show side-by-side comparison
Highlight changed pixels
Ask: intentional change or bug?
Update baseline or fix bug


## Skill Category 5: Test Quality

Skills that measure and improve test effectiveness.

### coverage-analyzer

Analyze and improve test coverage:

```markdown
# Coverage Analyzer

Analyze test coverage and suggest improvements.

## Metrics Collected

- Line coverage
- Branch coverage
- Function coverage
- Statement coverage

## Analysis

### Coverage Gaps
Identify untested code:

Uncovered code in src/api/users.ts:

Lines 45-52: Error handling for database failure Lines 78-85: Edge case for empty results Lines 120-135: Batch update logic


### Test Suggestions
For each gap, suggest tests:
```typescript
// For lines 45-52:
it('handles database connection failure', async () => {
  mockDb.query.mockRejectedValue(new Error('Connection lost'))
  await expect(getUser(1)).rejects.toThrow('Database unavailable')
})

Coverage Quality

Not just percentage, but meaningful coverage:

Critical paths covered?
Error handling tested?
Edge cases verified?


### mutation-tester

Verify test effectiveness:

```markdown
# Mutation Testing

Verify tests catch bugs by introducing them.

## Process

1. Parse source code
2. Apply mutations (change operators, remove statements)
3. Run tests against mutated code
4. Calculate mutation score (% of mutants killed)

## Mutation Types

### Value Mutations
- Change constants: 1 -> 0, true -> false
- Change strings: "hello" -> "mutated"

### Operator Mutations
- Arithmetic: + -> -, * -> /
- Comparison: > -> >=, == -> !=
- Logical: && -> ||, ! removed

### Statement Mutations
- Remove statement
- Replace return value
- Remove function call

## Output

### Mutation Score: 85%

15% of mutants survived (tests didn't catch the bug).

### Surviving Mutants

**src/utils/validate.ts:23**
Mutation: Changed `>` to `>=`
Original: `if (length > 8)`
Mutated: `if (length >= 8)`
Missing test: No test for exactly 8 characters

**src/api/users.ts:67**
Mutation: Removed null check
Original: `if (user === null) return notFound()`
Mutated: `/* removed */`
Missing test: No test for null user case

Choosing the Right Skills

Decision Framework

Situation	Recommended Skills
No tests exist	test-generator (basic) to start
Coverage gaps	coverage-analyzer + test-generator-pro
New features	bdd-test-generator or spec-first
API development	api-test-generator
UI changes	e2e-test-generator + visual-regression
Test quality concerns	mutation-tester

Building a Testing Stack

Combine skills for comprehensive testing:

## Our Testing Stack

### Unit Tests
- test-generator-pro for implementation
- coverage-analyzer for gap analysis
- Target: 80% line coverage

### Integration Tests
- api-test-generator for endpoints
- database-test-helper for fixtures
- Target: All endpoints tested

### E2E Tests
- e2e-test-generator for critical flows
- visual-regression for UI
- Target: Key user journeys covered

### Quality Verification
- mutation-tester quarterly
- Target: 75% mutation score

Practical Tips

Start with Generation, Improve with Review

Generated tests are a starting point. Review them for:

Missing business cases
Incorrect assumptions
Unnecessary tests

Balance Coverage with Value

100% coverage isn't the goal. Focus coverage on:

Business-critical code
Error-prone areas
Frequently changed code

Integrate with CI

Run different test levels at different stages:

Unit tests: Every commit
Integration tests: Every PR
E2E tests: Before release
Mutation tests: Weekly/monthly

Evolve Your Strategy

Testing needs change over time:

Early stage: Focus on critical paths
Growth stage: Expand coverage
Mature stage: Optimize for maintenance

Conclusion

The right combination depends on your situation:

Starting from zero? Use basic test generation to build momentum.
Have coverage but miss bugs? Add mutation testing to verify effectiveness.
Building new features? Try BDD for requirements-driven tests.
Shipping UI? Include visual regression.

Start with the skill that addresses your biggest pain point. A few good tests beat no tests, and generated tests beat the tests you never wrote.

Want to improve other aspects of code quality? Check out Code Review Skills Roundup for PR automation, or explore TDD Skill Analysis for deep-dive testing methodology.