-
-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Interest in a zarr.sparse module? #424
Comments
Hi Dale, demo notebook is very cool, thanks a lot for posting. I'm on leave
for a couple of weeks but look forward to digging a bit deeper.
…On Mon, 1 Apr 2019, 14:21 Dale Tovar, ***@***.***> wrote:
Hey there,
For a project I've been working on I wanted a zarr-based sparse matrix
class so I recently made one: https://github.com/daletovar/zsparse
I've added a notebook with a few examples. After it gets to a more stable
and faster place I was planning on making it a stand alone package.
However, I've been thinking it might make sense to just add it to zarr. I
won't be offended if you guys aren't interested. At the very least, I
thought you guys might like to know about it, especially because it solves
#152 <#152>.
Right now there's support for csr and csc matrices and saving and loading
pydata/sparse arrays. A potential problem with making a COO class for
pydata/sparse is that doing a large number of binary searches on zarr
arrays takes much longer than it does for numpy arrays. The code would also
need to be written in cython instead of numba because numba doesn't support
zarr. I'd like to see how cython does on the csr and csc classes as it's
all currently written in pure python. All of this is to say, if you were
wondering why there isn't a COO class, these are some of the concerns I've
had.
Thanks for listening. I'm curious what you guys think about all of this.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#424>, or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAq8Qs4zFK9aVFEo7TEV4scZh2oumsT_ks5vcaVzgaJpZM4cU8Do>
.
|
Thanks, I appreciate that. |
As the only open issue I could find about storing sparse arrays in Zarr, I thought I'd comment here that the AnnData project's |
Now that we have the Perhaps now is the time to revisit this feature. cc @alxmrs |
cc @ivirshup (who has also expressed interest in some form of sparse support in Zarr) |
Hey there,
For a project I've been working on I wanted a zarr-based sparse matrix class so I recently made one: https://github.com/daletovar/zsparse
I've added a notebook with a few examples. After it gets to a more stable and faster place I was planning on making it a stand alone package. However, I've been thinking it might make sense to just add it to zarr. I won't be offended if you guys aren't interested. At the very least, I thought you guys might like to know about it, especially because it solves #152.
Right now there's support for csr and csc matrices and saving and loading pydata/sparse arrays. A potential problem with making a COO class for pydata/sparse is that doing a large number of binary searches on zarr arrays takes much longer than it does for numpy arrays. The code would also need to be written in cython instead of numba because numba doesn't support zarr. I'd like to see how cython does on the csr and csc classes as it's all currently written in pure python. All of this is to say, if you were wondering why there isn't a COO class, these are some of the concerns I've had.
Thanks for listening. I'm curious what you guys think about all of this.
The text was updated successfully, but these errors were encountered: