D2FP: Learning Implicit Prior for Human Parsing

This the official code for D2FP: Learning Implicit Prior for Human Parsing.

Paper

This codebase is implemented using Detectron2.

Overview

Setup

Requirements

Linux or macOS with Python ≥ 3.6
PyTorch ≥ 1.9 and torchvision that matches the PyTorch installation. Install them together at pytorch.org to make sure of this. Note, please check PyTorch version matches that is required by Detectron2.
Detectron2: follow Detectron2 installation instructions.
OpenCV is optional but needed by demo and visualization

CUDA kernel for MSDeformAttn

After preparing the required environment, run the following command to compile CUDA kernel for MSDeformAttn:

CUDA_HOME must be defined and points to the directory of the installed CUDA toolkit.

cd m2fp/modeling/pixel_decoder/ops
sh make.sh

Building on another system

To build on a system that does not have a GPU device but provide the drivers:

TORCH_CUDA_ARCH_LIST='8.0' FORCE_CUDA=1 python setup.py build install

Download pretrained weights

Download the R-103.pkl weights from the official website https://github.com/facebookresearch/MaskFormer/blob/main/MODEL_ZOO.md and place it in D2FP/weights/R-103.pkl

Datasets

Set the DETECTRON2_DATASETS environment variable and organize the downloaded datasets in the following directory structure within the specified path. You can download the dataset from the official website: https://sysu-hcp.net/lip/overview.php.

Expected dataset structure for LIP:

lip/
  Training/
    Images/             # all training images
    Category_ids/       # all training category labels
  Validation/
    Images/
    Category_ids/

Expected dataset structure for CIHP:

cihp/
  Training/
    Images/             # all training images
    Category_ids/       # all training category labels
    Instance_ids/       # all training part instance labels
    Human_ids/          # all training human instance labels
  Validation/
    Images/
    Category_ids/
    Instance_ids/
    Human_ids/

Training

Set the number of GPUs and customize the configuration as needed.

./scripts/train.sh

Evaluation

Set the number of GPUs and customize the configuration as needed.

./scripts/test.sh

Acknowledgement

Code is largely based on M2FP (https://github.com/soeaver/M2FP).

Citing

If you find our work useful, please consider citing:

@InProceedings{Hong_2025_WACV,
    author    = {Hong, Junyoung and Yang, Hyeri and Kim, Ye Ju and Kim, Haerim and Kim, Shinwoong and Shim, Euna and Lee, Kyungjae},
    title     = {D2FP: Learning Implicit Prior for Human Parsing},
    booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
    month     = {February},
    year      = {2025},
    pages     = {9096-9106}
}

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
configs		configs
d2fp		d2fp
datasets		datasets
scripts		scripts
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
train_net.py		train_net.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

D2FP: Learning Implicit Prior for Human Parsing

Overview

Setup

Requirements

CUDA kernel for MSDeformAttn

Building on another system

Download pretrained weights

Datasets

Expected dataset structure for LIP:

Expected dataset structure for CIHP:

Training

Evaluation

Acknowledgement

Citing

About

Releases 1

Packages

Contributors 2

Languages

License

cvlab-yongin/D2FP

Folders and files

Latest commit

History

Repository files navigation

D2FP: Learning Implicit Prior for Human Parsing

Overview

Setup

Requirements

CUDA kernel for MSDeformAttn

Building on another system

Download pretrained weights

Datasets

Expected dataset structure for LIP:

Expected dataset structure for CIHP:

Training

Evaluation

Acknowledgement

Citing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages