A full pipeline AutoML tool for tabular data
-
Updated
Apr 16, 2025 - Python
A full pipeline AutoML tool for tabular data
AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach
Unified Distributed Execution
Parallel Lammps Python interface - control a mpi4py parallel LAMMPS instance from a serial python process or a Jupyter notebook
Loop like a pro, make parameter studies fun.
Scalable Cytometry Image Processing (SCIP) is an open-source tool that implements an image processing pipeline on top of Dask, a distributed computing framework written in Python. SCIP performs projection, illumination correction, image segmentation and masking, and feature extraction.
Launch a Dask cluster from a Poetry environment
Procurement: Dask Cluster as a Process.
HPC cluster deployment and management for the Hetzner Cloud
Python library to query and transform genomic data from indexed files
Efficiently read climate/meteorology data into Xarray using Dask for parallelization. Transform the data for your modelling needs.
Magic commands to support running MPI python code as well as multi-node Dask workloads on Jupyter notebooks.
Preserve all necessary runtime data of a Dask client in order to "replay" and analyze the performance and behavior of the client after the fact
Python 3 tools for distributed analysis and visualisation of big climate data on HPC systems.
Wukong: a fast and efficient serverless DAG engine.
A custom dask remote jobqueue for HTCondor.
Testing PyCaret, Fugue, and Dask
Python library for implementing state-of-the-art Bayesian filtering techniques like Kalman Filters and Particle Filters.
Distributed solution for Traveling Salesman Problem using Dask.distributed and OR-Tools
Add a description, image, and links to the dask-distributed topic page so that developers can more easily learn about it.
To associate your repository with the dask-distributed topic, visit your repo's landing page and select "manage topics."