-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
barcoded P2 data, dorado-basecalled: TypeError: Invalid type for the value of the key n50: <class 'NoneType'> #28
Comments
... same error when using BAM as input ...
|
Hi @sklages, Thank you for reporting this issue. To reproduce the error, could you please provide a summary or a sample of the 2.4 million records? I suspect the problem might be related to dependencies. can you try it using the Docker command and see if the issue persists?
Looking forward to your response. Best regards, |
thanks for your fast response, .. This is a pip install, I have no docker available here ... so maybe there i another way to check the dependencies? |
Could you provide the output of Best, |
This is the
|
Putting ToulligQC in a fresh python-3.11 venv results in the same error. |
I tried on a small private Linux box [1] with docker just as described with the same 2.4M file.
No idea where the docker run wants to write its data. But in general, the parsing and creating of stats/plots seems to work with docker. On the same linux box I pip-installed toulligqc (venv, Python 3.12.4) resulting in the same error as described initially. [1] Manjaro Linux, 16G (7G free), Intel i7-10700T, root access |
Thanks, @sklages, for all the details. Make sure you use I think this issue is related to the newer versions of Numpy and Pandas. I'm trying to reproduce the error with my data. In the meantime, try installing I hope that will solve the problem. Best regards, |
Indeed it works like a charm with
So for now the problem is somewhat solved by simply downgrading numpy/pandas. Thank you for the hint (and a very nice piece of software) :-) |
I have a few prom P2 flowcells, barcoded (
SQK-NBD114
, barcodes 1-4) basecalled with currentdorado 0.7.2
.As I see from #27 -- the barcode issue has been fixed in version 2.7.
Original
*_summary.tsv
has some 23M records. RunningToulligQC
like this:.. results in an error:
Using a fraction of this file works fine with 2.3M records .. and fails when using 2.4M or more records ...
The file itself looks "sane" .. I am not really sure where to look at.
Any idea that puts me in the right direction? I probably miss here something ver basic, but I do not see what ..
The text was updated successfully, but these errors were encountered: