Dataset setup

Human3.6M

We provide two ways to set up the Human3.6M dataset on our pipeline. You can either convert the original dataset (recommended) or use the dataset preprocessed by Martinez et al. (no longer available as of May 22nd, 2020). The two methods produce the same result. After this step, you should end up with two files in the data directory: data_3d_h36m.npz for the 3D poses, and data_2d_h36m_gt.npz for the ground-truth 2D poses.

Setup from original source (recommended)

Update: we have updated the instructions to simplify the procedure. MATLAB is no longer required for this step.

Register to the Human3.6m website website (or login if you already have an account) and download the dataset in its original format. You only need to download Poses -> D3 Positions for each subject (1, 5, 6, 7, 8, 9, 11)

Instructions without MATLAB (recommended)

You first need to install cdflib Python library via pip install cdflib.

Extract the archives named Poses_D3_Positions_S*.tgz (subjects 1, 5, 6, 7, 8, 9, 11) to a common directory. Your directory tree should look like this:

/path/to/dataset/S1/MyPoseFeatures/D3_Positions/Directions 1.cdf
/path/to/dataset/S1/MyPoseFeatures/D3_Positions/Directions.cdf
...

Then, run the preprocessing script:

cd data
python prepare_data_h36m.py --from-source-cdf /path/to/dataset
cd ..

If everything goes well, you are ready to go.

Instructions with MATLAB (old instructions)

First, we need to convert the 3D poses from .cdf to .mat, so they can be loaded from Python scripts. To this end, we have provided the MATLAB script convert_cdf_to_mat.m in the data directory. Extract the archives named Poses_D3_Positions_S*.tgz (subjects 1, 5, 6, 7, 8, 9, 11) to a directory named pose, and set up your directory tree so that it looks like this:

/path/to/dataset/convert_cdf_to_mat.m
/path/to/dataset/pose/S1/MyPoseFeatures/D3_Positions/Directions 1.cdf
/path/to/dataset/pose/S1/MyPoseFeatures/D3_Positions/Directions.cdf
...

Then run convert_cdf_to_mat.m from MATLAB.

Finally, run the Python conversion script specifying the dataset path:

cd data
python prepare_data_h36m.py --from-source /path/to/dataset/pose
cd ..

Setup from preprocessed dataset (old instructions)

Update: the link to the preprocessed dataset is no longer available; please use the procedure above. These instructions have been kept for backwards compatibility in case you already have a copy of this archive. All procedures produce the same result.

Download the ~~h36m.zip archive~~ (source: 3D pose baseline repository) to the data directory, and run the conversion script from the same directory. This step does not require any additional dependency.

cd data
wget https://www.dropbox.com/s/e35qv3n6zlkouki/h36m.zip
python prepare_data_h36m.py --from-archive h36m.zip
cd ..

2D detections for Human3.6M

We provide support for the following 2D detections:

gt: ground-truth 2D poses, extracted through the camera projection parameters.
sh_pt_mpii: Stacked Hourglass detections (model pretrained on MPII, no fine tuning).
sh_ft_h36m: Stacked Hourglass detections, fine-tuned on Human3.6M.
detectron_pt_h36m: Detectron (Mask R-CNN) detections (model pretrained on COCO, no fine tuning).
detectron_ft_h36m: Detectron (Mask R-CNN) detections, fine-tuned on Human3.6M.
cpn_ft_h36m_dbb: Cascaded Pyramid Network detections, fine-tuned on Human3.6M. Bounding boxes from detectron_ft_h36m.
User-supplied (see below).

The 2D detection source is specified through the --keypoints parameter, which loads the file data_2d_DATASET_DETECTION.npz from the data directory, where DATASET is the dataset name (e.g. h36m) and DETECTION is the 2D detection source (e.g. sh_pt_mpii). Since all the files are encoded according to the same format, it is trivial to create a custom set of 2D detections.

Ground-truth poses (gt) have already been extracted by the previous step. The other detections must be downloaded manually (see instructions below). You only need to download the detections you want to use. For reference, our best results on Human3.6M are achieved by cpn_ft_h36m_dbb.

Mask R-CNN and CPN detections

You can download these directly and put them in the data directory. We recommend starting with:

cd data
wget https://dl.fbaipublicfiles.com/video-pose-3d/data_2d_h36m_cpn_ft_h36m_dbb.npz
wget https://dl.fbaipublicfiles.com/video-pose-3d/data_2d_h36m_detectron_ft_h36m.npz
cd ..

These detections have been produced by models fine-tuned on Human3.6M. We adopted the usual protocol of fine-tuning on 5 subjects (S1, S5, S6, S7, and S8). We also included detections from the unlabeled subjects S2, S3, S4, which can be loaded by our framework for semi-supervised experimentation.

Optionally, you can download the Mask R-CNN detections without fine-tuning if you want to experiment with these:

cd data
wget https://dl.fbaipublicfiles.com/video-pose-3d/data_2d_h36m_detectron_pt_coco.npz
cd ..

Stacked Hourglass detections

These detections (both pretrained and fine-tuned) are provided by Martinez et al. in their repository on 3D human pose estimation. The 2D poses produced by the pretrained model are in the same archive as the dataset (h36m.zip). The fine-tuned poses can be downloaded here. Put the two archives in the data directory and run:

cd data
python prepare_data_2d_h36m_sh.py -pt h36m.zip
python prepare_data_2d_h36m_sh.py -ft stacked_hourglass_fined_tuned_240.tar.gz
cd ..

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Dataset setup

Human3.6M

Setup from original source (recommended)

Instructions without MATLAB (recommended)

Instructions with MATLAB (old instructions)

Setup from preprocessed dataset (old instructions)

2D detections for Human3.6M

Mask R-CNN and CPN detections

Stacked Hourglass detections

Files

README.md

Latest commit

History

README.md

File metadata and controls

Dataset setup

Human3.6M

Setup from original source (recommended)

Instructions without MATLAB (recommended)

Instructions with MATLAB (old instructions)

Setup from preprocessed dataset (old instructions)

2D detections for Human3.6M

Mask R-CNN and CPN detections

Stacked Hourglass detections