No empty droplets with UMI counts over the lower cut off erorr #85

vkjain0006 · 2020-12-03T18:30:23Z

First time user of cellbender on snRNA seq data where cellranger reports presence of ambient RNA (using cellranger latest version 5). Trying to clean up the empty droplets using cellbender, bit it's giving following error:-
"AssertionError: There are no empty droplets with UMI counts over the lower cutoff of 1219. Some empty droplets are necessary for the analysis. Reduce the --low-count-threshold parameter"

Below is the command I used and also varied low count threshold parameter form 5 to 100 but still getting the same error everytime.
cellbender remove-background --inputs pathof cellranger matrix file/filtered_feature_bc_matrix --output cellbender_out --expected-cells 11203 --total-droplets-included 25000 --low-count-threshold 5 (tired 5,10,15,100)

Output when command is run
cellbender:remove-background: 2020-12-03 13:18:52
cellbender:remove-background: Running remove-background
cellbender:remove-background: Loading data from directory outs/filtered_feature_bc_matrix
cellbender:remove-background: CellRanger v3 format
cellbender:remove-background: Trimming dataset for inference.
cellbender:remove-background: Including 23891 genes that have nonzero counts.
cellbender:remove-background: Prior on counts in empty droplets is 2439
cellbender:remove-background: Prior on counts for cells is 6043
cellbender:remove-background: Excluding barcodes with counts below 1219
Traceback (most recent call last):
File "/data/Conda/anaconda3/envs/cellbender/bin/cellbender", line 33, in
sys.exit(load_entry_point('cellbender', 'console_scripts', 'cellbender')())
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/base_cli.py", line 101, in main
cli_dict[args.tool].run(args)
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/remove_background/cli.py", line 103, in run
main(args)
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/remove_background/cli.py", line 196, in main
run_remove_background(args)
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/remove_background/cli.py", line 153, in run_remove_background
fpr=args.fpr)
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/remove_background/data/dataset.py", line 100, in init
gene_blacklist=gene_blacklist)
File "/data/Conda/anaconda3/envs/cellbender/CellBender/cellbender/remove_background/data/dataset.py", line 271, in _trim_dataset_for_analysis
f"There are no empty droplets with UMI counts over the lower "
AssertionError: There are no empty droplets with UMI counts over the lower cutoff of 1219. Some empty droplets are necessary for the analysis. Reduce the --low-count-threshold parameter.

Also please find rank plot below generated by cellranger for the sample

72d8700-356b-11eb-8455-1fe66a1d9ca8.png)

It will be great if someone can please share what parameters would be right for this dataset so that I can get to working.
I need to run cellbender on 2 more similar samples from this experiment.

Thanks!

sjfleming · 2020-12-18T14:53:43Z

Hi @vkjain0006 !
(Thanks for reaching out via email, I have neglected to keep up with issues for a few weeks...)

I will reply here as well so that other people might see the answer if they have the same question:

I believe the problem is that you're using the "filtered" feature_bc_matrix file. The algorithm depends on having empty droplets in order to properly learn the ambient RNA profile. So you need to use the "raw" feature_bc_matrix (including all droplets, even empties) as the --input.

vkjain0006 · 2021-01-20T17:14:53Z

Thanks @sjfleming for sharing the solution and it worked once I changed the input file.

But I am facing a new challenge now, as Seurat is giving error when I am trying to import filtered file generated by cellbender.

Here's what I am running and the error I am getting:-

d1 <- Read10X_h5(filepath, use.names = TRUE, unique.features = TRUE)
Error in [[.H5File(infile, paste0(genome, "/", feature_slot)) :
An object with name matrix/gene_names does not exist in this group

Can someone please share what I am doing wrong here ?

Thanks

sjfleming · 2021-02-18T18:28:30Z

Hm, I will need to check on the latest version of Seurat and make sure loading is still working with any new changes they've made.

Which Seurat version are you using?

In the meantime, I know that loading the data in scanpy works.

Is this a CellRanger v3 format file?

vkjain0006 · 2021-04-09T22:12:32Z

I am using latest seurat 3.9, files are generated from cellranger version 5.0.

I did find a way to get around the issue, mentioned by someone in the comments section of related issue that deleting PYTABLES attributes fixes the issue and it did work for me too.

sjfleming · 2021-05-03T18:11:15Z

@vkjain0006 okay great, I'm glad you got it to work. I'm surprised that this Seurat fix
satijalab/seurat#3653
did not solve the issue... or maybe that fix hasn't been included in the Seurat 3.9 distribution?

sjfleming · 2023-08-08T19:00:22Z

Closed by #238

sjfleming self-assigned this Dec 18, 2020

sjfleming mentioned this issue Mar 28, 2023

v0.3.0 #189

Closed

sjfleming mentioned this issue Aug 6, 2023

v0.3.0 #238

Merged

sjfleming closed this as completed Aug 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

No empty droplets with UMI counts over the lower cut off erorr #85

No empty droplets with UMI counts over the lower cut off erorr #85

vkjain0006 commented Dec 3, 2020

sjfleming commented Dec 18, 2020

vkjain0006 commented Jan 20, 2021

sjfleming commented Feb 18, 2021

vkjain0006 commented Apr 9, 2021

sjfleming commented May 3, 2021

sjfleming commented Aug 8, 2023

No empty droplets with UMI counts over the lower cut off erorr #85

No empty droplets with UMI counts over the lower cut off erorr #85

Comments

vkjain0006 commented Dec 3, 2020

sjfleming commented Dec 18, 2020

vkjain0006 commented Jan 20, 2021

sjfleming commented Feb 18, 2021

vkjain0006 commented Apr 9, 2021

sjfleming commented May 3, 2021

sjfleming commented Aug 8, 2023