Deep Physics-aware Inference of Cloth Deformation for Monocular Human Performance Capture

Download Video: HD (MP4, 97 MB)

Abstract

Recent monocular human performance capture approaches have shown compelling dense tracking results of the full body from a single RGB camera. However, existing methods either do not estimate clothing at all or model cloth deformation with simple geometric priors instead of taking into account the underlying physical principles. This leads to noticeable artifacts in their reconstructions, \eg baked-in wrinkles, implausible deformations that seemingly defy gravity, and intersections between cloth and body. To address these problems, we propose a person-specific, learning-based method that integrates a simulation layer into the training process to provide for the first time physics supervision in the context of weakly supervised deep monocular human performance capture. We show how integrating physics into the training process improves the learned cloth deformations, allows modeling clothing as a separate piece of geometry, and largely reduces cloth-body intersections. Relying only on weak 2D multi-view supervision during training, our approach leads to a significant improvement over current state-of-the-art methods and is thus a clear step towards realistic monocular capture of the entire deforming surface of a clothed human.

Downloads


Citation

BibTeX, 1 KB

@INPROCEEDINGS {9665859,
	author = {Y. Li and M. Habermann and B. Thomaszewski and S. Coros and T. Beeler and C. Theobalt},
	booktitle = {2021 International Conference on 3D Vision (3DV)},
	title = {Deep Physics-aware Inference of Cloth Deformation for Monocular Human Performance Capture},
	year = {2021},
	volume = {},
	issn = {},
	pages = {373-384},
	abstract = {Recent monocular human performance capture approaches have shown compelling dense tracking results of the full body from a single RGB camera. However, existing methods either do not estimate clothing at all or model cloth deformation with simple geometric priors instead of taking into account the underlying physical principles. This leads to noticeable artifacts in their reconstructions, e.g. baked-in wrinkles, implausible deformations that seemingly defy gravity, and intersections between cloth and body. To address these problems, we propose a person-specific, learning-based method that integrates a simulation layer into the training process to provide for the first time physics supervision in the context of weakly supervised deep monocular human performance capture. We show how integrating physics into the training process improves the learned cloth deformations, allows modeling clothing as a separate piece of geometry, and largely reduces cloth-body intersections. Relying only on weak 2D multi-view supervision during training, our approach leads to a significant improvement over current state-of-the-art methods and is thus a clear step towards realistic monocular capture of the entire deforming surface of a clothed human.},
	keywords = {training;learning systems;deformable models;three-dimensional displays;tracking;dynamics;clothing},
	doi = {10.1109/3DV53792.2021.00047},
	url = {https://doi.ieeecomputersociety.org/10.1109/3DV53792.2021.00047
	},
	publisher = {IEEE Computer Society},
	address = {Los Alamitos, CA, USA},
	month = {dec}
	}
				

Acknowledgments

The authors would like to thank the anonymous reviewers for their valuable feedback, and Gereon Fox for the video narration. The authors from MPII were supported by the ERC Consolidator Grant 4DRepLy (770784).

Contact

For questions, clarifications, please get in touch with:
Yue Li
yue.li@inf.ethz.ch

This page is Zotero translator friendly. Page last updated Imprint. Data Protection.