欢迎访问中国科学院大学学报,今天是

中国科学院大学学报 ›› 2020, Vol. 37 ›› Issue (6): 820-827.DOI: 10.7523/j.issn.2095-6134.2020.06.014

• 计算机科学 • 上一篇    下一篇

基于层次化标签的人体解析

胡莉娜1,2,3, 高盛华2   

  1. 1. 中国科学院上海微系统与信息技术研究所, 上海 200050;
    2. 上海科技大学信息学院, 上海 201210;
    3. 中国科学院大学, 北京 100049
  • 收稿日期:2019-03-15 修回日期:2019-05-29 发布日期:2020-11-15
  • 通讯作者: 胡莉娜
  • 基金资助:
    国家自然科学青年基金(61502304)资助

Hierarchical label-guided human parsing

HU Lina1,2,3, GAO Shenghua2   

  1. 1. Shanghai Institute of Microsyst&Information Technology, Chinese Academy of Sciences, Shanghai 200050, China;
    2. School of Information Science&Technology, ShanghaiTech University, Shanghai 201210, China;
    3. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2019-03-15 Revised:2019-05-29 Published:2020-11-15

摘要: 人体解析针对图像中人体不同部位进行语义分割,是近年来计算机视觉领域中的一个重要研究课题。不同于场景中的一般物体,人具有高度的结构化特征,并存在复杂的姿态变化和衣物遮挡情况。针对这一任务,提出一种基于卷积神经网络的层次化标签结构的人体解析方法。首先对精细的标签按照类别进行不同程度的合并,获得多个层级的解析图;然后改进具有金字塔特征抽取结构的卷积神经网络,使用解析图对金字塔不同层级的特征进行监督;最后将所有层级特征进行融合得到解析结果。在人体解析数据集LIP上的实验验证,与当前通用的语义分割算法相比,该算法可获得更高的人体解析准确性并改善了图像的分割效果。

关键词: 层次化标签, 卷积神经网络, 人体解析, 语义分割

Abstract: Human parsing is a type of semantic segmentation of different human body parts in an image. It is an emerging task in the field of computer vision. Compared with general objects, human body is much more structured but with wide variations in pose and occlusions caused by wearing. In this paper we present a hierarchical label network (HLNet). Firstly, fine categories are merged into body parts with different granularities to obtain multiple parsing maps for each image. Next, a convolutional neural network with a pyramid feature extraction structure is trained under supervision of these maps. Finally, the hierarchical features are fused together to predict the final parsing results. Experimental results on the LIP dataset show that the proposed algorithm achieves higher parsing accuracy and better segmentation performance, compared with common semantic segmentation algorithms.

Key words: hierarchical labeling, convolutional neural networks(CNN), human parsing, semantic segmentation

中图分类号: