Multi-view Neural Human Rendering (NHR)

Minye Wu1,3,4     Yuehao Wang1     Qiang Hu1     Jingyi Yu1,2    

1 ShanghaiTech University     2 DGene Inc.     3 University of Chinese Academy of Sciences    
4 Shanghai Institute of Microsystem and Information Technology


Our neural human renderer (NHR) produces photorealistic free-view-video (FVV) from multi-view dynamic human captures. NHR trains on multi-view videos and is composed of three modules: feature extraction (FE), projection and rasterization (PR), and rendering (RE). FE adopts PointNet++ to extract features from the reconstructed models over time even under strong topology/reconstruction inconsistencies based on structure and semantics.



We present an end-to-end Neural Human Renderer (NHR) for dynamic human captures under the multi-view setting. NHR adopts PointNet++ for feature extraction (FE) to enable robust 3D correspondence matching on low quality, dynamic 3D reconstructions. To render new views, we map 3D features onto the target camera as a 2D feature map and employ an anti-aliased CNN to handle holes and noises. Newly synthesized views from NHR can be further used to construct visual hulls to handle textureless and/or dark regions such as black clothing. Comprehensive experiments show NHR significantly outperforms the state-ofthe-art neural and image-based rendering techniques, especially on hands, hair, nose, foot, etc.





All experiments are conducted on 3D dynamic human data collected by a multi-camera dome system with up to 80 cameras arranged on a cylinder. All cameras are synchronized and capture at 25 frames per second. In this paper, we use 5 sets of datasets where the performers are in different clothing and perform different actions. [ More details]



Video below shows our results on the 5 datasets.



   title={Multi-View Neural Human Rendering},
   author={Wu, Minye and Wang, Yuehao and Hu, Qiang and Yu, Jingyi},
   booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},



Snapshot for paper " Multi-view Neural Human Rendering "
Minye Wu, Yuehao Wang, Qiang Hu, Jingyi Yu.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

  [Paper (pdf, 2.1MB)]     [Poster Slides (pdf, 1.3MB)]

  [Code (github)]

  [Datasets sport_1(7z, 2.4GB)  sport_2(7z, 2.7GB)  sport_3(7z, 2.7GB)  basketball(7z, 4.5GB)  dance(7z, 8.6GB)]

Last update: Sep. 26, 2020