Skip to content

Rules

The warning layer sits on top of deterministic calculations. It does not replace the math; it adds interpretation and feasibility checks that the UI can highlight separately.

  • Severity: high
  • Trigger: estimated duration exceeds the recommended maximum duration
  • Meaning: test may be too slow to be practical
  • Severity: medium
  • Trigger: effective daily traffic is low
  • Meaning: duration and variance risk increase
  • Severity: high
  • Trigger: continuous metric without std_dev
  • Meaning: deterministic continuous sample size is not trustworthy
  • Severity: high
  • Trigger: more than two variants combined with weak effective traffic
  • Meaning: the design is too ambitious for available traffic
  • Severity: medium
  • Trigger: payload flags seasonality
  • Meaning: test should cover at least a full weekly cycle
  • Severity: medium
  • Trigger: active campaigns are present
  • Meaning: external acquisition effects can bias lift estimates
  • Severity: medium
  • Trigger: requested power is below 0.8
  • Meaning: higher risk of false negatives
  • Severity: medium
  • Trigger: more than two variants
  • Meaning: Bonferroni alpha correction is applied and can inflate required sample size
  • Severity: high
  • Trigger: estimated duration is long while the payload says a long test is not possible
  • Meaning: the plan is operationally inconsistent
  • Severity: medium
  • Trigger: more than one planned interim look (n_looks > 1) with a sequential inflation factor above 1.0
  • Meaning: group-sequential monitoring inflates the required sample size; the warning reports the percentage increase
  • Severity: high
  • Trigger: actual_counts are provided and the chi-square test against the planned traffic_split rejects at p < 0.001
  • Meaning: observed assignment counts deviate from the planned split (sample ratio mismatch); randomization or tracking is likely broken
  • catalog metadata: app/backend/app/rules/catalog.py
  • trigger logic: app/backend/app/rules/engine.py
  • warnings are shown in a dedicated accordion section
  • severity is rendered via red / yellow / green styling
  • multivariant designs also surface a dedicated Bonferroni note in the summary cards