Product Execution 0 uses

Trustworthy Experiments

Helps design, run, and interpret controlled experiments correctly. Based on Ronny Kohavi's framework from "Trustworthy Online Controlled Experiments".

Name: trustworthy-experiments

Use when asked to "run an A/B test", "design an experiment", "check statistical significance", "trust our results", "avoid false positives", or "experiment guardrails". Helps design, run, and interpret controlled experiments correctly. Based on Ronny Kohavi's framework from "Trustworthy Online Controlled Experiments".

What It Is

Trustworthy Experiments is a framework for running controlled experiments (A/B tests) that produce reliable, actionable results. The core insight: most experiments fail, and many "successful" results are actually false positives.

The key shift: Move from "Did the experiment show a positive result?" to "Can I trust this result enough to act on it?"

Ronny Kohavi, who built experimentation platforms at Microsoft, Amazon, and Airbnb, found that:

66-92% of experiments fail to improve the target metric
8% of experiments have invalid results due to sample ratio mismatch alone
When the base success rate is 8%, a P-value of 0.05 still means 26% false positive risk

When to Use It

Use Trustworthy Experiments when you need to:

Design an A/B test that will produce valid, actionable results
Determine sample size and runtime for statistical power
Validate experiment results before making ship/no-ship decisions
Build an experimentation culture at your company
Choose metrics (OEC) that balance short-term gains with long-term value
Diagnose why results look suspicious (Twyman's Law)
Speed up experimentation without sacrificing validity

When Not to Use It

Don't use controlled experiments when:

You don't have enough users — Need tens of thousands minimum
The decision is one-time — Can't A/B test mergers or acquisitions
There's no real user choice — Employer-mandated software
You need immediate decisions — Experiments need time
The metric can't be measured — No experiment without observable outcomes

Resources

Book:

Trustworthy Online Controlled Experiments by Ronny Kohavi, Diane Tang, and Ya Xu

Install via CLI

Run one command in your terminal. Works with Claude Code and other AI assistants that support the Skills CLI.

Install just this skill

Terminal

npx skills add pmprompt/claude-plugin-product-management --skill trustworthy-experiments

Or install all 28 skills

Terminal

npx skills add pmprompt/claude-plugin-product-management

Manual Install (Advanced)

Create the skill file manually. Recommended for advanced users who want full control.

Create the skill file

Run this command to create the directory and SKILL.md file:

mkdir -p .claude/skills/trustworthy-experiments && touch .claude/skills/trustworthy-experiments/SKILL.md

This creates the directory and an empty SKILL.md file.

Open the skill file

Open the SKILL.md file in your favorite editor:

nano .claude/skills/trustworthy-experiments/SKILL.md

Or use code .claude/skills/trustworthy-experiments/SKILL.md for VS Code

Add the content

Copy the skill content and paste it into the SKILL.md file:

Then save the file. Now you can use the skill by typing /trustworthy-experiments in your AI assistant, or it will automatically use it when relevant.

Using a different AI assistant?

Claude Code: .claude/skills/

OpenCode: .opencode/skills/

Related Skills

Product Execution

Stakeholder Update Generator

Create compelling progress updates and release notes

View Skill

Product Execution

A/B Test Designer

Design robust A/B test experiments

View Skill

Product Execution

PMF Survey (Product-Market Fit Survey)

Helps quantify product-market fit and systematically improve it. The PMF Survey framework (created by Sean Ellis, popula...

View Skill

Go Beyond Copy-Paste

This skill is great for Claude. But with PMPrompt Pro, you can generate Product Execution documents instantly in your browser—no setup, no context switching.

Upgrade to Pro or try the dashboard free

Generate in one click Save to documents Export to PDF/Word