-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
High MT genes ratio at some samples #179
Comments
Is this a plot where each point is adata = # cellranger or cellbender [cell, gene] AnnData, for all droplets including empty droplets
mt_genes = # your mitochondrial gene list
y_value = adata[:, mt_genes].X.sum() / adata.X.sum() ? I guess one of my questions would be: does the CellRanger calculation include the counts in empty droplets? |
Apologize for not being clear, actually the Anndata is the Here is how I get this plot: For CellBender anndata, I remove the "cells" with null counts. And then I use
then I calculate the median mt gene ratio for each sample to draw plot
Thanks for your reply~ |
Hi @dyinboisry4u , okay thanks for that explanation. I think I do have a hypothesis! But it would need to be tested :) My hypothesis is that the extra "non-empty" droplets that are being retained in the CellBender filtered output (as compared to CellRanger's cell calling algorithm) are actually droplets with high MT fraction. This would in turn make it look like the median MT fraction per sample had actually gone up after CellBender. You can see an example here where some of the low-UMI count "non-empty" droplets that CellBender finds are actually high MT-fraction droplets. In that experiment, they might represent dying cells. If you consider the same set of droplets in both the CellRanger data and the CellBender data, then you will see that the mitochondrial read fraction only goes down, since CellBender only subtracts counts from the count matrix, it never adds counts. |
Hi @sjfleming , your hypothesis is correct, here is my test: I also have some questions:
Thanks!~ |
Okay, yes, in that case, I think we have a satisfactory answer to the MT part. As for your other questions:
|
@sjfleming, thanks for your detailed explanation!
I just found this: https://hal.science/hal-03378505 , by the way, how do you calculate the entropy of gene expression?
Yes, for the The rank plot indeed shows an elbow, but if I use a larger |
Re: entropy, I use the python package called I actually think those runs with larger numbers of Even if some of those droplets end up being "cell fragments" that deserve to be eliminated during cell QC, it is not CellBender that should be eliminating them. From the perspective of CellBender, those are non-empty droplets. |
Hi, Thanks again for your reply!
# raw counts matrix
np.apply_along_axis(ndd.entropy, axis=1, arr=adata.X.todense(), k=adata.n_vars) Sorry for not being familiar with bayesian entropy estimation :( |
|
Train and test curve wobbly-ness has been addressed to a large extent in v0.3.0. Hopefully that holds true when people test it out. Closed by #238 |
Hi,
I run cellbender(v0.2.2) for 60 samples, and all of the parameters are set manually after I check the cellranger barcode rank plot. but at some samples, I find mt gene ratio is higher than cellranger matrix
what should I do?
thanks!
The text was updated successfully, but these errors were encountered: