Ground Truth #6

Open
williamdlees opened this issue Jul 23, 2020 · 0 comments
williamdlees commented Jul 23, 2020

We would like to incorporate simulated datasets, for which the ground truth of which alleles were combined is known. This is not a panacea, because the real-life recombination mechanism is not completely understood, but it should provide some insight into the sensitivity and limitations of annotation tools.

While there is benefit in having simulated datasets that match real ones as closely as possible, there is also benefit in having very simple datasets that can be used to verify correct operation at the most fundamental level of the tool: for example, a dataset that includes every allele, has known CDR3s, and has non-functional sequences of various kinds. It could also include some more exotic sequences, e.g. a heavy chain with no D gene present, or one with a duplicated D gene.
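
As a rough illustration of the "very simple dataset" idea, here is a minimal Python sketch that builds sequences by plain concatenation and stores the alleles used alongside each one. Everything in it is an assumption made for illustration: the allele names and nucleotide strings are toy placeholders rather than real germline sequences, the record fields (`v_call`, `d_calls`, `j_call`) and the function `simulate_minimal_dataset` are hypothetical, and it models no trimming, N-additions, or mutation. It covers every toy allele at least once and emits the no-D and duplicated-D cases; non-functional sequences and known CDR3s are not modelled.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Toy placeholder sequences, NOT real germline alleles (assumption).
V_ALLELES = {"V1*01": "CAGGTGCAGCTG", "V1*02": "CAGGTGCAGCTC", "V2*01": "GAGGTGCAGTTG"}
D_ALLELES = {"D1*01": "GGTATAACTGG", "D2*01": "AGGATATTGT"}
J_ALLELES = {"J1*01": "ACTGGTTCGAC", "J2*01": "CTACTGGGGC"}


@dataclass
class SimulatedRecord:
    """One simulated sequence together with its ground truth."""
    sequence_id: str
    sequence: str
    v_call: str
    d_calls: Tuple[str, ...]  # empty for no-D, two entries for duplicated D
    j_call: str


def simulate_minimal_dataset() -> List[SimulatedRecord]:
    """Build a small deterministic dataset in which every allele appears."""
    v_names = sorted(V_ALLELES)
    d_names = sorted(D_ALLELES)
    j_names = sorted(J_ALLELES)
    # Cycle through the shorter lists so that every allele is used at least once.
    n = max(len(v_names), len(d_names), len(j_names))
    records = []
    for i in range(n):
        v = v_names[i % len(v_names)]
        d = d_names[i % len(d_names)]
        j = j_names[i % len(j_names)]
        # Three variants per combination: ordinary, no D gene, duplicated D gene.
        for kind, ds in (("single_d", (d,)), ("no_d", ()), ("double_d", (d, d))):
            seq = V_ALLELES[v] + "".join(D_ALLELES[x] for x in ds) + J_ALLELES[j]
            records.append(SimulatedRecord(f"sim{i}_{kind}", seq, v, ds, j))
    return records


if __name__ == "__main__":
    for rec in simulate_minimal_dataset():
        print(rec.sequence_id, rec.v_call, rec.d_calls, rec.j_call, rec.sequence)
```

An annotation tool's calls for each `sequence_id` could then be compared against the stored `v_call`, `d_calls`, and `j_call` to check basic correctness before moving on to more realistic simulations.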
