TODO Improve docstrings for the ctypes wrapper functions. Add more doctests. Replace computationally inefficient kernels in various functions with more efficient algorithms. Add bindings for more useful CUDA-based libraries (e.g., MAGMA, CUDAPP).