Welcome to Journal of University of Chinese Academy of Sciences,Today is

Journal of University of Chinese Academy of Sciences ›› 2023, Vol. 40 ›› Issue (2): 268-279.DOI: 10.7523/j.ucas.2021.0058

• Research Articles • Previous Articles     Next Articles

Recognition-oriented facial 3D information estimation

CHEN Hanqin1, QIN Jin2, ZHAO Tong1,3, YAN Yao1   

  1. 1. School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China;
    2. School of Computer and Control, University of Chinese Academy of Sciences, Beijing 101408, China;
    3. Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100049, China
  • Received:2021-01-16 Revised:2021-04-09 Online:2023-03-15

Abstract: 3D face recognition has the advantages of recognition accuracy and anti-counterfeiting strength over the popular 2D face recognition, and represents the development direction of face recognition. Due to the high cost of 3D facial acquisition, 3D facial recognition can not establish and optimize face recognition algorithms directly rely on massive face data as 2D facial recognition methods. How to obtain augmented training data for 3D faces accurately and efficiently is the most pressing problem in driving the development of 3D face recognition applications. A large amount of related research focuses on how to get better 3D face reconstruction visualization,but does not give much consideration to the subsequent recognition task,so that the recognition accuracy of 3D face recognition algorithms trained with these reconstructed images is much lower than expected. To address this problem, a recognition-oriented method for estimating facial 3D information is proposed. Different from the general method, this method directly builds an interactive bridge between information estimation and subsequent recognition:during the training process of 3D face information estimation, it directly bases on the corresponding recognition network to supervise and improve the estimation effect of 3D face information. For this purpose, we first construct a 3D face information representation, the depth-surface normal vector map (DN map), and then train a facial CycleGAN model with a real 3D dataset to learn a mapping from 2D face to DN map with preserved identity information and represent it in the form of U-Net network. Experiments are conducted on five datasets to compare with other methods, and the improvement is particularly significant in the ND-2006 dataset, with an improvement of 31.8%. In addition, experiments on performance improvement under data augmentation are conducted, here the performance improvement of the augmentation method based on the facial CycleGAN is more obvious under the same conditions of data augmentation, with a maximum improvement of 14.9% on the CASIA 3D dataset.

Key words: facial 3D information estimation, cycle generative adversarial networks, facial DN map, identity preserving loss, parameter pre-train

CLC Number: