[Arxiv 23.06] Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots
- Install ROS Noetic and MoveIt
- Clone GroundingDINO for open-vocabulary object detection:

git clone https://github.com/IDEA-Research/GroundingDINO.git
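For a quick sanity check of the detector, a call might look like the sketch below, built on GroundingDINO's inference helpers (`load_model`, `load_image`, `predict`). The config/checkpoint paths, image name, prompt, and thresholds are placeholder assumptions, not values shipped with this repo.

```python
# Minimal open-vocabulary detection sketch with GroundingDINO.
# Paths, prompt, and thresholds below are assumptions for illustration.
from groundingdino.util.inference import load_model, load_image, predict

model = load_model(
    "GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py",  # assumed config path
    "weights/groundingdino_swint_ogc.pth",                            # assumed checkpoint path
)
image_source, image = load_image("demo.jpg")  # any RGB image of the workspace

# The text prompt names the object to pick; thresholds filter weak detections.
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="red mug",
    box_threshold=0.35,
    text_threshold=0.25,
)
print(phrases, boxes)  # boxes are normalized cxcywh coordinates
```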
- Clone MixFormer for object tracking:

git clone https://github.com/MCG-NJU/MixFormer.git
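MixFormer serves as a single-object tracker: initialize it on the box the detector returns for the first frame, then update it per frame. The sketch below only illustrates that initialize-then-update loop; `tracker.initialize` and `tracker.update` are hypothetical method names standing in for whatever tracker class you wire up from the MixFormer repo.

```python
# Hypothetical per-frame tracking loop. MixFormer does not ship a one-call
# pip API; the tracker object and its methods are placeholders.
import cv2

def track_object(video_path, init_box, tracker):
    """init_box: (x, y, w, h) from the open-vocab detector on frame 0."""
    cap = cv2.VideoCapture(video_path)
    ok, frame = cap.read()
    if not ok:
        raise IOError(f"cannot read {video_path}")
    tracker.initialize(frame, init_box)       # hypothetical API
    boxes = [init_box]
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        boxes.append(tracker.update(frame))   # hypothetical API: returns (x, y, w, h)
    cap.release()
    return boxes
```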
- Clone Segment-Anything for segmentation:

git clone https://github.com/facebookresearch/segment-anything.git
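Given a (tracked) bounding box, Segment-Anything turns it into a pixel mask via box-prompted prediction. The sketch below uses the official `segment_anything` API; the ViT-H model type, checkpoint filename, and example box are assumptions.

```python
# Box-prompted segmentation with Segment-Anything.
# Checkpoint name, model type, and the example box are assumptions.
import numpy as np
import cv2
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="weights/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("demo.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Prompt SAM with the detector/tracker box in xyxy pixel coordinates.
box = np.array([100, 150, 400, 480])
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
print(masks.shape, scores)  # (1, H, W) boolean mask and its confidence
```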
- Prepare and process your own data
- Prepare the environment:
conda env create -f environment.yaml
- Train our two-stream policy:
python train.py
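The actual two-stream architecture is defined in `train.py`; purely to illustrate the idea, the sketch below assumes one stream encodes the RGB observation and the other the object mask produced by the foundation models above, with the fused features regressing an end-effector action. All layer sizes and the 7-D action space are assumptions, not this repo's configuration.

```python
# Generic two-stream policy sketch (NOT the architecture in train.py):
# an RGB stream plus a mask stream, fused and mapped to an action.
import torch
import torch.nn as nn

def conv_stream(in_ch):
    # Small conv encoder shared by both streams; sizes are illustrative.
    return nn.Sequential(
        nn.Conv2d(in_ch, 32, 5, stride=2), nn.ReLU(),
        nn.Conv2d(32, 64, 3, stride=2), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    )

class TwoStreamPolicy(nn.Module):
    def __init__(self, action_dim=7):
        super().__init__()
        self.rgb_stream = conv_stream(3)    # raw image observation
        self.mask_stream = conv_stream(1)   # object mask from detector/SAM
        self.head = nn.Sequential(
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, action_dim),     # e.g. 6-DoF pose + gripper
        )

    def forward(self, rgb, mask):
        feat = torch.cat([self.rgb_stream(rgb), self.mask_stream(mask)], dim=-1)
        return self.head(feat)

policy = TwoStreamPolicy()
action = policy(torch.rand(1, 3, 128, 128), torch.rand(1, 1, 128, 128))
print(action.shape)  # torch.Size([1, 7])
```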
- Run inference:
python inference/inference.py
Please cite the following papers if you find this repository useful for your research.
@InProceedings{Yang_2025_WACV,
  author    = {Yang, Jiange and Tan, Wenhui and Jin, Chuhao and Yao, Keling and Liu, Bei and Fu, Jianlong and Song, Ruihua and Wu, Gangshan and Wang, Limin},
  title     = {Transferring Foundation Models for Generalizable Robotic Manipulation},
  booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
  month     = {February},
  year      = {2025},
  pages     = {1999-2010}
}

@article{yang2023transferring,
  author  = {Yang, Jiange and Tan, Wenhui and Jin, Chuhao and Yao, Keling and Liu, Bei and Fu, Jianlong and Song, Ruihua and Wu, Gangshan and Wang, Limin},
  title   = {Transferring Foundation Models for Generalizable Robotic Manipulation},
  journal = {arXiv preprint arXiv:2306.05716},
  year    = {2023}
}

@article{yang2023pave,
  author  = {Yang, Jiange and Tan, Wenhui and Jin, Chuhao and Liu, Bei and Fu, Jianlong and Song, Ruihua and Wang, Limin},
  title   = {Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots},
  journal = {arXiv preprint arXiv:2306.05716},
  year    = {2023}
}
Thanks to the following open-source projects: