Welcome to Journal of University of Chinese Academy of Sciences,Today is

Journal of University of Chinese Academy of Sciences ›› 2022, Vol. 39 ›› Issue (4): 512-523.DOI: 10.7523/j.ucas.2020.0040

• Research Articles • Previous Articles     Next Articles

Building extraction based on UNet++ network with different backbones

GU Yumin1,2, YAN Fuli1   

  1. 1. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China;
    2. College of Resources and Environment, University Academy of Sciences, Beijing 100190, China
  • Received:2020-05-12 Revised:2020-08-14 Online:2022-07-15

Abstract: Automatic building extraction methods based on deep learning theory have the technical characteristics of high accuracy and speed,and are of great significance in industrial applications, such as urban planning,disaster prevention and mitigation. This paper introduces the deep learning modules and the traditional remote sensing validation section in the proposed building extraction method in high-resolution remote sensing imageries, forming an operational deep-learning-theory based building extraction technical system that integrates different backbone modules, UNet + + networks,and remote sensing authenticity verification modules. The basic network is transformed through the traditional convolutional network model backbones,such as VGG,ResNet, and Inception to improve the model operational efficiency,strengthen the model feature learning capabilities,verify the effectiveness and applicability of the algorithm through authenticity validation. Taking the Massachusetts building dataset published by Mnih as the data source,a comparative analysis was carried out with the traditional non-full convolutional network model and full convolutional network model. The results show that an increasing in the depth and width of the model can substantially improve the building extraction results. The InceptionV3-UNet + + backbone model has the best performance in recall rate,accuracy,CSI,F1 score,Kappa coefficients, and total accuracy,reaching 85. 14%,90. 50%,0. 781 6,0. 877 4,0. 850 4, and 95. 57%,respectively,and its robustness is also verified on the WHU datasets. This method has significantly improved the extraction accuracy and the details of the buildings extracted, especially on complex and irregular buildings, which will facilitate the building extraction applications in real, complex, and large scene of high-resolution remote sensing imageries.

Key words: deep learning, high-resolution remote sensing image, convolutional neural network, building extraction, image segmentation

CLC Number: