-
Notifications
You must be signed in to change notification settings - Fork 55
Task: refquery
Lee Katz edited this page Jun 23, 2022
·
2 revisions
This looks up information about a sequence or a cluster, using the output from prepareref. It can:
-
Given a cluster name, return the names of all the sequences in that cluster.
-
Given a sequence name, return all the metadata for that sequence, plus the nucleotide sequence.
Suppose we have prepared the data from CARD using the following two commands:
ariba getref card getref
ariba prepareref -f getref.fa -m getref.tsv ref_dir
To find all the sequences in cluster "ErmD":
ariba refquery ref_dir cluster ErmD
which outputs:
Sequences belonging to cluster ErmD: ErmD.3000495.L08389.390-1254.240 ErmD.3000495.M29832.0-864.242 ErmD.3000495.M77505.771-1635.241
### Get sequence information
To get the information about the sequence "ErmD.3000495.L08389.390-1254.240":
ariba refquery ref_dir seq ErmD.3000495.L08389.390-1254.240
which outputs:
Name ErmD.3000495.L08389.390-1254.240 Is gene 1 Variant only 0 Cluster ErmD Description ErmD confers MLSb phenotype. Description ErmD Sequence ATGAAGAAAAAAAATCATAAGTACAGAGGAAAAAAGTTAAACCGCGGGGAATCTCCGAATTTTTCCGGACAGCATTTGATGCATAATAAAAAATTAATTGAAGAAATTGTGGATCGGGCAAATATTAGCATAGACGATACGGTTTTAGAGTTAGGAGCGGGAAAAGGGGCTTTGACAACTGTGCTAAGTCAAAAAGCCGGTAAGGTATTGGCAGTGGAAAACGATTCTAAATTCGTTGATATACTCACACGTAAAACAGCACAGCATTCAAATACGAAAATTATTCATCAAGATATCATGAAGATTCATTTACCAAAAGAAAAGTTTGTGGTGGTCTCTAATATTCCCTATGCCATCACAACTCCCATCATGAAAATGCTCTTGAACAATCCTGCAAGCGGATTTCAAAAAGGGATCATCGTAATGGAAAAAGGGGCTGCTAAACGTTTCACATCAAAATTCATTAAAAATTCCTATGTTTTAGCTTGGAGAATGTGGTTTGATATTGGCATTGTCAGAGAAATATCGAAAGAGCATTTTTCTCCCCCTCCAAAAGTGGACTCGGCAATGGTCAGAATAACACGAAAAAAAGACGCGCCTCTATCACATAAACATTATATTGCGTTTCGGGGACTTGCCGAATACGCGCTAAAGGAGCCGAATATCCCTCTCTGTGTTCGTTTACGCGGAATTTTTACCCCGCGTCAAATGAAACACTTAAGAAAAAGTCTAAAAATCAACAATGAAAAAACCGTTGGAACGCTCACCGAAAACCAATGGGCGGTTATTTTTAACACGATGACTCAATATGTAATGCATCACAAATGGCCAAGAGCAAATAAGCGAAAACCCGGAGAAATATAA