Skip to content

Multi-processing parallel PyTorch Geometric InMemoryDataset.

Notifications You must be signed in to change notification settings

PatRyg99/PyG-ParallelDataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

PyG-ParallelDataset

Multi-processing parallel PyTorch Geometric InMemoryDataset.

Features:

  • Run prefiltering and preprocessing on multiple jobs
  • Save preprocessed data in chunks of specified length (zipping large files may run into RAM issues even when the dataset itself can fit into RAM)

Dependencies:

  • torch
  • pyg
  • pqdm

About

Multi-processing parallel PyTorch Geometric InMemoryDataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages