Metric Deltas
Computing Metric Deltas
A metric delta refers to the difference in metric values between two groups, by default the test and control groups. This is usually the impact we care about when looking at experiment results. To account for the different number of users (or units) that constitute each group, we compare the mean metric value per user, not the total.
We define all deltas here to be the difference between a "treatment" group as compared to a presumably unchanged "control" group. However, Statsig enables comparison between any two groups as desired.
Two different metric deltas are available in Pulse. The absolute delta is simply the difference between the two means:
It's often helpful to understand the impact relative to the baseline value of the metric. For example, an absolute delta of +1 clicks/user has different meanings with a baseline value of 1 (+100% increase) vs. a baseline value of 100 (+1% increase). The relative delta is computed using the mean of the control group as the baseline:
If you reverse the order of group comparison in Pulse to be "control" vs "treatment", then all deltas will be reversed and the direction of change will be inverted.
Computing Means
Properly computing the groups means is critical for obtained meaningful metric deltas. The exact methodology for calculating the metric means depends on the type of metric.
Event Count and Sum Metrics
These metrics represent totals: Number of times an event occurs, sum of time spent, total purchase amount, etc. The mean is the average user-level total during the time period of the analysis.
The mean value of the metric for a group is given by:
where:
- is the number of users in the group
- is the number of days during the analysis period that user was the experiment
- is the metric value for user on day
Note that only user metrics recorded after a user has been exposed to the experiment are included in the group mean.
User Accounting and Event User Metrics (and legacy Event DAU)
Event User metrics set to "Daily Participation Rate" capture the number of distinct users that have the event each day. In Pulse results, they are normalized by the number of days the user has been in the experiment. This represents the probability that a user is daily active for that event, i.e. the daily participation rate. In the terms defined above, the group mean is given by:
where:
- takes value 0 or 1 depending on if user has the event on a given day .
The following user accounting metrics are computed in the same may: DAU, WAU, MAU_28day, L7, L14, L28
For new user accounting (new_DAU, new_WAU, new_MAU_28day) we count users that are new xAU at some point during the analysis window. So the group mean is given by:
Where is the maximum value of the new xAU metric for user .
event_dau metrics are now in legacy support only and are no longer created for new events. Existing event_dau metrics will continue to be available for any of your new experiments and will continue to be computed daily. For all new events, you should create an event_user metric to measure daily active users.
Custom Ratios, Means, Retention and Stickiness Metrics
These are metrics such as click through rate, average purchase value, sessions per user, etc. They're obtained by diving a numerator value, , by a denominator value, . The mean value of a ratio metric for an experiment group is given by:
Where is the number of users in the experiment group that participate in the metric, i.e. have a non-zero denominator value. and are the and values for user on day .
Different approaches exist for dealing with ratio metrics in experiments. This implementation was selected because it's statistically sound as well as interpretable, given that:
- is the ratio of two means of independent observations: A set of user-level values and a set of user-level values. This means the central limit theorem can be used to separately obtain the summary statistics of and .
- The group means are computed in the same way as the topline metric value, making it easier to interpret the means and relate them to the topline metric.
Event User One-Time Event
For custom event_user metrics with "One-Time Event" selected, statsig computes how many users have the event at any time after the user has been in the experiment. This result is not normalized by the number of days a user is in the experiment. The group mean is given by:
where:
- is the number of users in the group
- takes value 0 or 1 depending on if user has the event at any point after entering the experiment