Statistical Details
Both our Bayesian and our Frequentist engines begin with a similar foundation for estimating experiment effects. We estimate the experiment effect (either the relative lift or the absolute effect), and its standard error. We create decision making tools from those estimates (e.g., frequentist confidence intervals and p-values, or Bayesian credible intervals and risks) to help you make rollout/rollback decisions.
Our estimates compare an experimental variation (henceforth the treatment) to some baseline variation (henceforth control). Define as the population control mean, and define as the population treatment mean.
The absolute treatment effect is .
The relative treatment effect (or lift) is if and is undefined otherwise.
Throughout for any population parameter denote its sample counterpart as . For example, the sample absolute effect is .
Lift (Relative Effects)
By default, we estimate lift (i.e. relative effects or percentage changes) from the control to the treatment variations. No matter which engine we use, the statistics we leverage are