-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(config): extract gwas_significance parameter to step configuration #628
feat(config): extract gwas_significance parameter to step configuration #628
Conversation
5e6a181
to
323d0c3
Compare
@addramir just to clear things up here, The airflow layer that will run the data release will use the value of If you think we want to drop the original value, without maintaining the backwards compatibility for gentropy users, I will drop the warning and change the default value in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice clean up! All good, 2 comments:
- I'd consider removing the warning.
- We apply a significance threshold also in the GWAS Catalog top hits ingestion. You might want to apply the same variable as well? Check here.
@ireneisdoomed and @addramir thank you both for the feedback! With regards to the issues, I would like to make a clear statments:
|
@project-defiant @ireneisdoomed Thank you for comments! "We therefore recommend to use a threshold of 1e-8 for GWAS with common variants, which might be slightly conservative for current datasets but should be appropriate for data from WGS or imputation-based studies in the future because the number of variants is expected to increase with the increase of sample size [14]" It is not really something to fight for really. The idea behind this data realise was to change the threshold only for GWAS Catalog for now. We can change it everywhere else (that is a correct thing) but we can do it later. |
@ireneisdoomed I have dropped the warning and added the default threshold to the code you mentioned here., as it seems logical to keep the clumping step and gwas annotation in sync. Also created a separate issue for updating other thresholds, so we do not forget about it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👌
…on (#628) * feat(clumping): lower p-value significance threshold for clumping step * feat(config): extract gwas_significance to config * feat(config): synced p-value in association parsing --------- Co-authored-by: Szymon Szyszkowski <ss60@mib117351s.internal.sanger.ac.uk> Co-authored-by: Yakov <yt4@sanger.ac.uk>
✨ Context
opentargets/issues#3327
🛠 What does this PR implement
This is the first part of the #3327.
gwas_significance
(p-value) threshold parameter for the window-based-clumping step.gwas_significance
to the airflow config with the new value (1e-8
) overwriting default one (5e-8
)WindowBasedClumpingStep
config toWindowBasedClumpingStepConfig
The main reason for this change is that we want to use the value of
1e-8
and possibly even more stringent value of1e-9
in the future.🙈 Missing
🚦 Before submitting
dev
branch?make test
)?poetry run pre-commit run --all-files
)?