Topline and Projected Impact
The topline impact is the average daily effect that an experiment has on the overall metric value as compared between two groups. This is the real daily impact to a metric resulting from running the experiment, measured amongst the two groups being evaluated. The projected launch impact is an estimate of the daily impact we expect to see in the metric measured globally if a decision is made and the test group is launched to all users (beyond just those in the experiment). This impact is computed relative to the expected baseline value of the metric if the experiment wasn't running at all.
Topline Impact and Projected Launch Impact are shown in both absolute and relative units. Topline Impact and Projected Launch Impact currently do not use CUPED when measuring any impact induced by your experiment.
Take a simple example experiment with a Control group of 1000 users and a Test group of another 1000 users, which ran for 30 days. For an event_count metric, we observed an Experiment Delta of +1.0 events per user (abs). The Topline Impact for this metric would be +33.33 events per day (abs).
Computing Topline Impact
The topline impact is computed over the total duration of the experiment. This gives the most accurate estimate and tight confidence interval. The exact calculation depends on whether the metric represents an absolute quantity or a ratio:
Count and Sum Metrics (event_count, sum)
The absolute topline impact is derived directly from the experiment results. It depends on the difference in means between test and control, and the average number of users in the test group per day.
Knowing the absolute impact and the overall metric value (as seen in the metrics dashboard), we can compute the relative impact. This is the percentage change in the overall metric value over the rollup window that is attributed to the active experiment.
Ratio and Mean Metrics
To properly derive the topline impact on a ratio metric we must understand the impact on the numerator (X) and denominator (Y) separately. The topline impact is the current value of the ratio metric minus the baseline value we obtain by subtracting the numerator and denominator impacts:
Where the baseline value is the expected value of the topline metric if the experiment wasn't running:
The relative impact for ratio metrics is obtained by dividing the absolute impact by the baseline value:
Computing Projected Launch Impact
The layer allocation of the experiment and the size of the test group are used to estimate a scaling factor m, which represents the increase in absolute impact expected when a decision is made to launch the test group.
The launch factor over a rollup window is calculated as
to accommodate changes in allocation during the experiment.
Note that the targeting gate isn't factored in. The projected impact calculation assumes that the target gate remains the same after the experiment is launched.
Count and Sum Metrics (event_count, event_dau, sum)
For count and sum metrics, the projected absolute impact is simply the current topline impact scaled by a factor of m. For example: Consider an experiment running with 50% layers allocation and 50/50 test/control split, so that 25% of all users are in the test group. If the allocation has been changing during this experiment, we will use a weighted average based on historical allocations. If the topline impact is currently +10 events per day, then launching the experiment would lead to +40 events per day.
The relative projected impact is expected percentage change in the topline metric, relative to the baseline value of the metric without the experiment running.
Ratio and Mean Metrics
Similar to the topline impact calculation above, the projected impact of ratio metrics depends on the numerator and denominator impacts. We use the same scaling factor m to obtain the projected impact for each term:
Where the first term represents the projected metric value after launch.
Finally, the projected relative impact of a ratio metric is the projected absolute impact divided by the baseline value of the ratio:
Confidence intervals
The confidence intervals for topline and projected impact are computed in the same way as the confidence intervals for experiment deltas.
In the case of absolute impact of count and sum metrics, the variance calculation is simply a linear combination of the test and control variances:
And for projected launch impact we get:
For ratio metrics and relative impacts, the variance is calculated using the Delta method. This properly accounts for the correlation between the various numerator and denominator terms, leveraging Taylor expansion to linearize expressions containing non-linear combinations of experiment variables.
For example, the variance in the relative impact of a count metric is given by: