-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory Allocation Failed - Large Datasets #120
Comments
Thanks for posting the issue, @TonyX26. I think the problem is integer overflow. See how N*K gives a negative number here?
This is because N*K = 2.89E9 which is larger than the maximum integer 2.14E9. I think you need to change the definition of the function computeGaussianPerplexity() so that N and D are not declared as Then, N*D should not overflow, and calloc() will not be trying to allocate a "negative" amount of memory. If you have trouble making that change, I can also make it for you. If that fixes the problem, please make a pull request so we can update the repo. |
Thanks for the reply! That has fixed the negative memory problem... however it is still not working sadly. Thanks Error output:
Full Output
Script:
|
Same story. It is crashing here, and you can see the indices are integers, so they are overflowing. Try changing those to long int, particularly Although the algorithm should work, we did not test the actual code with a dataset of this size. There are likely other places too where we used int and it should be long int instead. It is a big help if you can go through and make those changes and then pull request. If you have difficulty or it's too much work, let me know and I can do it. |
Thanks for all the help! I've had a shot at doing it myself, but sadly haven't managed to get it to work still. If it's possible to get some help, that'd be much appreciated. I'll pull the request though, seeing that it is solved! Thank you |
Hey Toni, "pull request" does not mean closing the issue :-) I am reopening it, as it's clearly a bug. |
I'm so sorry. First time as you may have realised :D I've put in a pull request now. |
Hi All, I've implemented the above changes, and have been trying to find additional ways around it. The output below is the furthest I've managed to get sadly. I note the memory allocation is still negative, but I'm unsure of what else to change to avoid this problem. Any help would be very much appreciated!
This continues until:
Where the process then stops and ends.
|
Hi all,
I've been trying to run FIt-SNE on a FCS file 20 million events large. Unfortunately, despite allocating 1.5TB of memory, an error still arises (below). This does not occur when running the same file downsampled to 2 or 5 million cells. I have just been trying to run a small 20 iterations, just to identify the problem, however it never manages to get there...
Has anyone encountered this error before? I've attached the error file, the output, and my script.
Thanks!
Error file:
Script:
The text was updated successfully, but these errors were encountered: