-
Notifications
You must be signed in to change notification settings - Fork 173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
lr-kallisto dorado unaligned bam files #450
Comments
You need to compile with -DUSE_BAM=ON in cmake. I haven’t throughly tested whether BAM works though, so FASTQ is the safer option in terms of avoiding bugs. Your other commands seem fine. I don’t think -x VISIUM works because that requires barcodes and UMIs to be at fixed positions in R1 with the sequence to be mapped being in R2 — ONT data doesn’t look like that. |
Thank you that's great I think I will be using pychopper fastq as I have already come into a bug. while building
It would have been good if VISIUM long reads would have been supported maybe something for the future Regards Mustafa |
You might be able to get it to work — I can compile htslib just fine but I think I needed to use the right C compiler or build in a docker. It’s a bit tricky. For VISIUM, you can probably get it to work — I’m just not familiar with the read structure. |
Hi! lr-kallisto is designed to run directly with fastq files. Please let me know if you run into any other issues! |
Apologies! I hadn't seen your other questions! I think you are missing the genemap for the -g flag which can be generated by kb ref. For the visium ont, the -x visium is for paired end reads, so it won't directly work. I'd be interested to hear more about your visium ont data though! I'd be interested in adding the processing steps for it to our pipelines! |
Hi! I'm wondering if the '-x 10xv2 or 10xv3' options are also for paired end reads. |
Hi
It is great to hear the support for long-reads coming to kallisto I would like to introduce it to our pipelines
I have few question regarding running lr-kallisto on ONT dorado basecalled reads.
kallisto bus --long -x bulk -i "$INDEX_PATH" -o "$OUTPUT_DIR/$SAMPLE_NAME" --bam "$BAM_FILE" -t $SLURM_NTASKS
this is using the dorado.bam
output file, is this correct? or should I usefastq
files from the likes ofpychopper
orientated full length files.I attempted to run --bam is through an error?
bustools sort -t $SLURM_NTASKS -o "$OUTPUT_DIR/$SAMPLE_NAME/sorted.bus" "$OUTPUT_DIR/$SAMPLE_NAME/output.bus" bustools count -o "$OUTPUT_DIR/$SAMPLE_NAME/count" -g "$GTF_PATH" -e "$OUTPUT_DIR/$SAMPLE_NAME/matrix.ec" -t "$OUTPUT_DIR/$SAMPLE_NAME/transcripts.txt" --cm "$OUTPUT_DIR$
is this correct ?
3.Third step being
kallisto quant-tcc -i "$INDEX_PATH" -o "$OUTPUT_DIR/$SAMPLE_NAME" --long -P ONT --gtf "$GTF_PATH" --matrix-to-files -t $SLURM_NTASKS "$OUTPUT_DIR/$SAMPLE_NAME/count.mtx"
Is this the correct approach ?
Additionally if I had 10x Genomics Visuim ONT reads can I process these using the -x Visium?
The text was updated successfully, but these errors were encountered: