Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add option to output closest relative(s) per sample #218

Merged
merged 4 commits into from
Mar 7, 2022

Conversation

amkram
Copy link
Contributor

@amkram amkram commented Mar 5, 2022

Adds extract --closest-relatives which outputs a TSV line for each sample (selected with other options) with the following fields:

  1. Target sample
  2. Comma-separated list of the closest relatives (multiple entries if tied for closest)
  3. Distance in branch length to target sample

If --break-ties is specified, only the lexicographically smallest sample ID is chosen in field 2 (recommended if selecting many samples to save space)

E.g.
matUtils extract --closest-relatives output.tsv [--break-ties] -i input.pb.gz

@yatisht yatisht merged commit b7e5a17 into yatisht:master Mar 7, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants