-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Experiments statistics: migration plan #26713
Comments
@jurajmajerik Could you share a bit more detail about what you're thinking w/r/t test cases for each? Also, should we have an "Update documentation" item? |
@danielbachhuber I was thinking of testing all the permutations of the test cases suggested in the list, so for example:
There are also tests for the existing methods, I just haven't had time to look closer at those.
|
@jurajmajerik Is there some documentation on why each methodology for each scenario? e.g. why does Trends continuous take the mean and then apply some log variance? |
@danielbachhuber some of this is covered in our main jupyter notebook: https://colab.research.google.com/drive/1hcWMsaS2GvMM0YeFCVctXfVWiNM5WDwq?usp=sharing At a high level, the goal is to choose a probability distribution that reflects the kind of values you'd expect in real life. For a continuous value like revenue, the distribution starts at zero and extends into positive values. This is why a logarithm is applied - it ensures the values start at zero. As for why the mean is used for a continuous value, I assume it's because you need a way to fairly compare the two groups. Comparing the sums wouldn’t work since the sample sizes might be different. Taking the mean per user gives a more accurate comparison. @andehen does the above make sense and can you provide more detail? |
https://posthoghelp.zendesk.com/agent/tickets/21955 is interested in trying this out when it's ready |
@jurajmajerik One thing worth noting that came up in conversation with @andehen today... Our current implementation of posthog/posthog/hogql_queries/experiments/funnels_statistics_v2.py Lines 160 to 172 in f4d7603
Higher probability and expected loss are inversely correlated, so a probability of >90% means that expected loss is probably less than 1%. Also worth noting that the current implementation of posthog/posthog/hogql_queries/experiments/trends_statistics.py Lines 110 to 115 in ecaed9a
However, it's only ever used in this condition, so probably not an immediate problem for us: posthog/frontend/src/scenes/experiments/experimentLogic.tsx Lines 1213 to 1226 in ecaed9a
|
Feature request
Debug info
The text was updated successfully, but these errors were encountered: