
shift-right-testing


from proffesor-for-testing/agentic-qe

What it does

Enables progressive testing in production using feature flags, canary deployments, synthetic monitoring, and chaos engineering to validate software reliability and performance.

Installation

Install skill:
npx skills add https://github.com/proffesor-for-testing/agentic-qe --skill shift-right-testing
Added Jan 27, 2026

Skill Details

SKILL.md

"Testing in production with feature flags, canary deployments, synthetic monitoring, and chaos engineering. Use when implementing production observability or progressive delivery."

Overview

# Shift-Right Testing

When testing in production or implementing progressive delivery:

  1. IMPLEMENT feature flags for progressive rollout (1% → 10% → 50% → 100%)
  2. DEPLOY with canary releases (compare metrics before full rollout)
  3. MONITOR with synthetic tests (proactive) + RUM (reactive)
  4. INJECT failures with chaos engineering (build resilience)
  5. ANALYZE production data to improve pre-production testing

Quick Shift-Right Techniques:

  • Feature flags → Control who sees what, instant rollback
  • Canary deployment → 5% traffic, compare error rates
  • Synthetic monitoring → Simulate users 24/7, catch issues before users
  • Chaos engineering → Netflix-style failure injection
  • RUM (Real User Monitoring) → Actual user experience data

Critical Success Factors:

  • Production is the ultimate test environment
  • Ship fast with safety nets, not slow with certainty
  • Use production data to improve shift-left testing

Quick Reference Card

When to Use

  • Progressive feature rollouts
  • Production reliability validation
  • Performance monitoring at scale
  • Learning from real user behavior

Shift-Right Techniques

| Technique | Purpose | When |
|-----------|---------|------|
| Feature Flags | Controlled rollout | Every feature |
| Canary | Compare new vs old | Every deployment |
| Synthetic Monitoring | Proactive detection | 24/7 |
| RUM | Real user metrics | Always on |
| Chaos Engineering | Resilience validation | Regularly |
| A/B Testing | User behavior validation | Feature decisions |

Progressive Rollout Pattern

```
1%  →  10%  →  25%  →  50%  →  100%
 ↓      ↓       ↓       ↓
Check  Check   Check   Monitor
```
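
The staged pattern above can be read as a small state machine: advance one stage while the check passes, and drop back to 0% when it fails. The following is a minimal sketch of that gate, not part of the skill itself. `ROLLOUT_STAGES`, `RolloutDecision`, and `nextStage` are hypothetical names, and the `healthy` flag is assumed to come from whatever metric check guards each stage.

```typescript
// Rollout stages from the pattern above, as percentages of users.
const ROLLOUT_STAGES: readonly number[] = [1, 10, 25, 50, 100];

interface RolloutDecision {
  action: "advance" | "hold" | "rollback";
  percentage: number;
}

// Decide the next step from the current stage and the result of its check.
function nextStage(current: number, healthy: boolean): RolloutDecision {
  if (!healthy) {
    // Any failed check sends all traffic back to the old code path.
    return { action: "rollback", percentage: 0 };
  }
  const index = ROLLOUT_STAGES.indexOf(current);
  if (index === -1) {
    throw new Error(`Unknown rollout stage: ${current}%`);
  }
  if (index === ROLLOUT_STAGES.length - 1) {
    // Already at 100%: nothing left to advance to, keep monitoring.
    return { action: "hold", percentage: current };
  }
  return { action: "advance", percentage: ROLLOUT_STAGES[index + 1] };
}

const afterFirstCheck = nextStage(1, true); // advance to 10%
const afterBadCheck = nextStage(25, false); // roll back to 0%
```

Keeping the decision a pure function of the current stage and one boolean makes it easy to unit test before wiring it to real metrics.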

Key Metrics to Monitor

| Metric | SLO Target | Alert Threshold |
|--------|------------|-----------------|
| Error rate | < 0.1% | > 1% |
| p95 latency | < 200ms | > 500ms |
| Availability | 99.9% | < 99.5% |
| Apdex | > 0.95 | < 0.8 |
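
As a rough illustration of how three of these thresholds could be evaluated from raw request data, the sketch below computes a nearest-rank p95, an Apdex score, and an error rate, then reports which alert thresholds are crossed. The function names are hypothetical, and the Apdex target `T = 200ms` is an assumption borrowed from the p95 SLO rather than something the table states.

```typescript
// Alert thresholds taken from the table above.
const ALERT_THRESHOLDS = {
  errorRate: 0.01, // alert when > 1%
  p95LatencyMs: 500, // alert when > 500ms
  apdex: 0.8, // alert when < 0.8
};

// Nearest-rank percentile of a non-empty list of samples.
function percentile(samples: number[], p: number): number {
  if (samples.length === 0) throw new Error("percentile needs at least one sample");
  const sorted = samples.slice().sort((a, b) => a - b);
  const rank = Math.max(1, Math.ceil((p / 100) * sorted.length));
  return sorted[rank - 1];
}

// Apdex = (satisfied + tolerating / 2) / total, where a request is
// satisfied at <= T and tolerating at <= 4T. T is an assumed target.
function apdexScore(latenciesMs: number[], targetMs: number): number {
  if (latenciesMs.length === 0) throw new Error("apdex needs at least one sample");
  let satisfied = 0;
  let tolerating = 0;
  for (const ms of latenciesMs) {
    if (ms <= targetMs) satisfied += 1;
    else if (ms <= 4 * targetMs) tolerating += 1;
  }
  return (satisfied + tolerating / 2) / latenciesMs.length;
}

// Return the names of the metrics that cross their alert threshold.
function breachedAlerts(
  latenciesMs: number[],
  failedRequests: number,
  totalRequests: number,
  apdexTargetMs: number = 200, // assumed T, not from the table
): string[] {
  const breached: string[] = [];
  if (failedRequests / totalRequests > ALERT_THRESHOLDS.errorRate) breached.push("error rate");
  if (percentile(latenciesMs, 95) > ALERT_THRESHOLDS.p95LatencyMs) breached.push("p95 latency");
  if (apdexScore(latenciesMs, apdexTargetMs) < ALERT_THRESHOLDS.apdex) breached.push("apdex");
  return breached;
}

// Four sampled latencies with 2 failures out of 100 requests.
const sampleBreaches = breachedAlerts([100, 150, 300, 900], 2, 100);
```

Availability is left out because it is normally measured over a time window, not per request.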

---

Feature Flags

```javascript
// Progressive rollout with LaunchDarkly/Unleash pattern
const newCheckout = featureFlags.isEnabled('new-checkout', {
  userId: user.id,
  percentage: 10, // 10% of users
  allowlist: ['beta-testers']
});

if (newCheckout) {
  return; // serve the new checkout
} else {
  return; // serve the existing checkout
}

// Instant rollback on issues
await featureFlags.disable('new-checkout');
```
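
A percentage rollout only behaves well if the same user gets the same answer on every request. One common way to get that is to hash the flag key and user ID into a stable bucket. The sketch below shows the idea under stated assumptions: it is not the LaunchDarkly or Unleash implementation, the function names are hypothetical, and the allowlist is treated as a list of user IDs (the snippet above uses a group name instead).

```typescript
// FNV-1a-style 32-bit hash over the string's UTF-16 code units.
function hashString(input: string): number {
  let hash = 0x811c9dc5;
  for (let i = 0; i < input.length; i++) {
    hash ^= input.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193);
  }
  return hash >>> 0; // force an unsigned 32-bit result
}

// Map a (flag, user) pair to a stable bucket in [0, 100).
function rolloutBucket(flagKey: string, userId: string): number {
  return hashString(`${flagKey}:${userId}`) % 100;
}

// A user sees the flag if allowlisted, or if their bucket falls under
// the current rollout percentage.
function isFlagEnabled(
  flagKey: string,
  userId: string,
  percentage: number,
  allowlist: string[] = [],
): boolean {
  if (allowlist.indexOf(userId) !== -1) return true;
  return rolloutBucket(flagKey, userId) < percentage;
}

// Same inputs always give the same answer, so a user does not flip
// between variants from one request to the next.
const firstCall = isFlagEnabled("new-checkout", "user-42", 10);
const secondCall = isFlagEnabled("new-checkout", "user-42", 10);
```

Because each bucket is fixed, raising the percentage from 10 to 50 keeps every user from the first 10% enabled and only adds new ones. Including the flag key in the hash keeps different flags from always selecting the same users.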

---

Canary Deployment

```yaml
# Flagger canary config
apiVersion: flagger.app/v1beta1
kind: Canary
spec:
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: checkout-service
  progressDeadlineSeconds: 60
  analysis:
    interval: 1m
    threshold: 5 # Max failed checks
    maxWeight: 50 # Max traffic to canary
    stepWeight: 10 # Increment per interval
    metrics:
      - name: request-success-rate
        threshold: 99
      - name: request-duration
        threshold: 500
```
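
To make the interaction of `threshold`, `maxWeight`, and `stepWeight` concrete, here is a simplified model of the analysis loop. It is a sketch, not Flagger's actual controller logic. It assumes each interval yields one sample, that a passing sample raises the canary weight by `stepWeight`, that failed checks accumulate until `threshold` triggers a rollback, and that promotion happens on the first passing check after `maxWeight` is reached. All type and function names are hypothetical.

```typescript
interface CanaryAnalysis {
  threshold: number; // max failed checks before rollback
  maxWeight: number; // max traffic percentage sent to the canary
  stepWeight: number; // traffic increment per passing check
}

interface MetricSample {
  successRate: number; // percent of successful requests
  durationMs: number; // request duration in milliseconds
}

interface CanaryResult {
  outcome: "promoted" | "rolled-back" | "in-progress";
  weight: number;
  failedChecks: number;
}

// Walk through one sample per analysis interval and report the outcome.
function simulateCanaryAnalysis(config: CanaryAnalysis, samples: MetricSample[]): CanaryResult {
  let weight = 0;
  let failedChecks = 0;
  for (const sample of samples) {
    // Same limits as the metrics in the config above: >= 99% success, <= 500ms.
    const passed = sample.successRate >= 99 && sample.durationMs <= 500;
    if (!passed) {
      failedChecks += 1;
      if (failedChecks >= config.threshold) {
        return { outcome: "rolled-back", weight: 0, failedChecks };
      }
      continue; // hold the current weight and wait for the next interval
    }
    if (weight >= config.maxWeight) {
      return { outcome: "promoted", weight, failedChecks };
    }
    weight = Math.min(weight + config.stepWeight, config.maxWeight);
  }
  return { outcome: "in-progress", weight, failedChecks };
}

const canaryConfig: CanaryAnalysis = { threshold: 5, maxWeight: 50, stepWeight: 10 };
const goodSample: MetricSample = { successRate: 99.9, durationMs: 120 };
const badSample: MetricSample = { successRate: 95, durationMs: 120 };
```

Under these assumptions the config above needs five passing intervals to reach 50% canary traffic and a sixth to promote, so a clean rollout takes about six minutes at `interval: 1m`.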

---

Synthetic Monitoring

```javascript
// Continuous production validation
await Task("Synthetic Tests", {
  endpoints: [
    { path: '/health', expected: 200, interval: '30s' },
    { path: '/api/products', expected: 200, interval: '1m' },
    { path: '/checkout', flow: 'full-purchase', interval: '5m' }
  ],
  locations: ['us-east', 'eu-west', 'ap-south'],
  alertOn: {
    statusCode: '!= 200',
    latency: '> 500ms',
    contentMismatch: true
  }
}, "qe-production-intelligence");
```
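
The `alertOn` rules above amount to three checks per probe: status code, latency, and content. A minimal sketch of that evaluation step follows. It assumes a probe has already been executed elsewhere and its result captured. `ProbeResult`, `ProbeExpectation`, and `evaluateProbe` are hypothetical names and not part of the agent API shown above.

```typescript
interface ProbeResult {
  path: string;
  location: string;
  statusCode: number;
  latencyMs: number;
  body: string;
}

interface ProbeExpectation {
  expectedStatus: number;
  maxLatencyMs: number;
  mustContain?: string; // optional content check
}

// Return a list of alert reasons; an empty list means the probe passed.
function evaluateProbe(result: ProbeResult, expectation: ProbeExpectation): string[] {
  const alerts: string[] = [];
  if (result.statusCode !== expectation.expectedStatus) {
    alerts.push(`status ${result.statusCode} != ${expectation.expectedStatus}`);
  }
  if (result.latencyMs > expectation.maxLatencyMs) {
    alerts.push(`latency ${result.latencyMs}ms > ${expectation.maxLatencyMs}ms`);
  }
  if (
    expectation.mustContain !== undefined &&
    result.body.indexOf(expectation.mustContain) === -1
  ) {
    alerts.push("content mismatch");
  }
  return alerts;
}

// A slow but otherwise correct response from one probe location.
const slowProbe: ProbeResult = {
  path: "/api/products",
  location: "eu-west",
  statusCode: 200,
  latencyMs: 750,
  body: '{"products":[]}',
};
const slowProbeAlerts = evaluateProbe(slowProbe, {
  expectedStatus: 200,
  maxLatencyMs: 500,
  mustContain: "products",
});
```

Returning reasons instead of a boolean lets the alert message say which rule fired and from which location.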

---

Chaos Engineering

```typescript
// Controlled failure injection
await Task("Chaos Experiment", {
  hypothesis: 'System handles database latency gracefully',
  steadyState: {
    metric: 'error_rate',
    expected: '< 0.1%'
  },
  experiment: {
    type: 'network-latency',
    target: 'database',
    delay: '500ms',
    duration: '5m'
  },
  rollback: {
    automatic: true,
    trigger: 'error_rate > 5%'
  }
}, "qe-chaos-engineer");
```
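
The experiment above combines two guardrails: a steady-state condition that must hold before and during the run, and a rollback trigger that aborts it. One way to reason about the possible outcomes is sketched below. The names and the four outcome labels are hypothetical, and error rates are expressed as fractions (0.001 for 0.1%).

```typescript
interface ChaosGuardrails {
  steadyStateMaxErrorRate: number; // 0.001 for "error_rate < 0.1%"
  rollbackErrorRate: number; // 0.05 for "error_rate > 5%"
}

type ChaosOutcome = "not-started" | "aborted" | "hypothesis-held" | "hypothesis-refuted";

// Classify a run from the error rate before injection and the error
// rates sampled while the fault was active.
function evaluateChaosRun(
  guardrails: ChaosGuardrails,
  baselineErrorRate: number,
  errorRatesDuringRun: number[],
): ChaosOutcome {
  // Never inject failures into a system that is already unhealthy.
  if (baselineErrorRate >= guardrails.steadyStateMaxErrorRate) return "not-started";
  let degraded = false;
  for (const rate of errorRatesDuringRun) {
    // The rollback trigger stops the experiment immediately.
    if (rate > guardrails.rollbackErrorRate) return "aborted";
    if (rate >= guardrails.steadyStateMaxErrorRate) degraded = true;
  }
  return degraded ? "hypothesis-refuted" : "hypothesis-held";
}

const chaosGuardrails: ChaosGuardrails = {
  steadyStateMaxErrorRate: 0.001,
  rollbackErrorRate: 0.05,
};
const gracefulRun = evaluateChaosRun(chaosGuardrails, 0.0005, [0.0004, 0.0008]);
```

Separating "refuted" from "aborted" is a design choice: a refuted hypothesis is a useful finding within the safety budget, while an abort means the blast radius was larger than planned.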

---

Production → Pre-Production Feedback Loop

```typescript
// Convert production incidents to regression tests
await Task("Incident Replay", {
  incident: {
    id: 'INC-2024-001',
    type: 'performance-degradation',
    conditions: { concurrent_users: 500, cart_items: 10 }
  },
  generateTests: true,
  addToRegression: true
}, "qe-production-intelligence");

// Output: New test added to prevent recurrence
```

---

Agent Coordination Hints

Memory Namespace

```
aqe/shift-right/
├── canary-results/* - Canary deployment metrics
├── synthetic-tests/* - Monitoring configurations
├── chaos-experiments/* - Experiment results
├── production-insights/* - Issues → test conversions
└── rum-analysis/* - Real user data patterns
```

Fleet Coordination

```typescript
const shiftRightFleet = await FleetManager.coordinate({
  strategy: 'shift-right-testing',
  agents: [
    'qe-production-intelligence', // RUM, incident replay
    'qe-chaos-engineer', // Resilience testing
    'qe-performance-tester', // Synthetic monitoring
    'qe-quality-analyzer' // Metrics analysis
  ],
  topology: 'mesh'
});
```

---

Related Skills

  • [shift-left-testing](../shift-left-testing/) - Pre-production testing
  • [chaos-engineering-resilience](../chaos-engineering-resilience/) - Failure injection deep dive
  • [performance-testing](../performance-testing/) - Load testing
  • [agentic-quality-engineering](../agentic-quality-engineering/) - Agent coordination

---

Remember

Production is the ultimate test environment. Feature flags enable instant rollback. Canary releases catch issues before a full rollout. Synthetic monitoring detects problems before users do. Chaos engineering builds resilience. RUM shows real user experience.

With Agents: Agents monitor production, replay incidents as tests, run chaos experiments, and convert production insights to pre-production tests. Use agents to maintain continuous production quality.
