-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about ouput #141
Comments
Thank you for testing TRUST4.
A_102_0023_FR2_barcode_airr.tsv is kind of independent from the A_102_0023_FR2_airr.tsv file. The abundance in the *_report.tsv and *_airr.tsv (bulk setting) are the sum of the cells bearing the same VJC genes, and is not the UMI count. The CDR3s in the "barcode" output file should be present in the "bulk" output file, which is chain1+secondary_chain1. Hope this helps.
|
Thank you very much for this explanation! Now, I do some post-check to see spatial pattering of the BCR has located the plasma zone using A_102_0023_FR2_barcode_airr.tsv for example. Thanks again, |
I just uploaded a script "barcoderep-expand.py" to create barcode entries for secondary chains in the "scripts" folder. You can run it as: By default, this script will filter the lowly abundant chains (default is 0.1), and you can adjust that with option "--frac". Will this help? |
Just fixed a few bugs. I think the scripts should work fine now. |
Hello mourisl, option --fact is useful for me, I have used frac == 0 to get complete output.
Question:
I just get less 500 BCR with these efforts. Thanks for your help, |
Thank you for noticing this issue. I have removed the debugging print line from the github repo. One posssible reason that you did not get much more BCR is that B cells from the same location may bear the same BCR as the proliferation feature? |
@mourisl Now is 6:30 PM for me, have a good weekend. |
The primary chain fields in the barcode_report file only have the productive chains. So if none of the chains in a barcode is productive, this barcode will be skipped in the report file. |
Hello Trust4,
Thanks for the most flexible tools.
My team developed custom 5' spatial BCR. Now I'm in the stage of preprocessing.
Spot like mini-bulk(5-20 cells) with additional spatial barcode. in my fastq Read2: 279nt & Read1: 28nt.
Below is CMD using Trust4 (Output all the required fields for AIRR format #138 version)
output : [Wed Jul 6 21:26:41 2022] TRUST4 finishes.
Questions:
1/ In my understanding, *_barcode_airr.tsv came from *_barcode_report.tsv and *_airr.tsv came from *_report.tsv. *_barcode_airr.tsv is single cell output and *_airr.tsv is bulk output.
For each barcode, Trust4 picked one highest umi count chain in file *_barcode_airr.tsv which came from *_airr.tsv. Why some sequence_id is in file *_barcode_airr.tsv, but this sequence_id isn’t in the file *_airr.tsv?
A_102_0023_FR2_barcode_airr.tsv is independent on A_102_0023_FR2_airr.tsv? In other words, I’m waiting for A_102_0023_FR2_report.tsv is sum of chain1 + secondary_chain1 in
A_102_0023_FR2_barcode_report.tsv. I’ve misunderstood.
2/ In my case, Read2 has 279nt which start from IGH FR2. Stage 1 assembly is necessary? Is it possible using Trust4 just skip stage1 and start from 0 read extraction.
3/ I used option UMI, where is umi information in output ?
Umi == consensus_count == read_fragment_count == read_cnt ?
4/ In my situation, I start from "secondary" chains in the A_102_0023_FR2_barcode_report.tsv file or A_102_0023_FR2_airr.tsv ?
How I can extract top n chains and rank them using trust-airr.pl if I use A_102_0023_FR2_barcode_report.tsv as input?
Thanks for your reply,
Chuang
The text was updated successfully, but these errors were encountered: