欢迎访问中国科学院大学学报,今天是

中国科学院大学学报 ›› 2024, Vol. 41 ›› Issue (5): 687-694.DOI: 10.7523/j.ucas.2023.077

• 电子信息与计算机科学 • 上一篇    

基于多尺度特征和注意力机制的深度学习点云压缩

黄玉林1,2, 梁磊3, 李卫军1,2, 习晓环3   

  1. 1. 中国能源建设集团广东省电力设计研究院有限公司, 广州 510663;
    2. 广东科诺勘测工程有限公司, 广州 51066;
    3. 中国科学院空天信息创新研究院 数字地球重点实验室, 北京 100094
  • 收稿日期:2023-03-06 修回日期:2023-09-13 发布日期:2023-10-25
  • 通讯作者: 梁磊,E-mail:202231051040@mail.bnu.edu.cn
  • 基金资助:
    国家自然科学基金(41871264)和广西自然科学基金创新项目(2019GXNSFGA245001)资助

Point cloud compression of deep learning based on multi-scale feature and attention mechanism

HUANG Yulin1,2, LIANG Lei3, LI Weijun1,2, XI Xiaohuan3   

  1. 1. Guangdong Electric Power Design Institute of China Energy Engineering Group Co. Ltd, Guangzhou 510663, China;
    2. Guangdong Kenuo Surveying Engineering Co. Ltd, Guangzhou 51066;
    3. Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
  • Received:2023-03-06 Revised:2023-09-13 Published:2023-10-25

摘要: 三维点云广泛应用于无人驾驶、实景三维等领域,然而复杂场景的海量点云对存储、处理和传输等带来极大挑战。提出一种基于多尺度特征和注意力机制的深度变分自编码点云几何信息压缩算法MSA-GPCC,通过加入多尺度模型提取特征、变分自编码器构建熵模型,继而结合尺度注意力模块和多尺度特征,实现基于熵编码的点云几何信息高码率、低失真压缩。在MPEG数据集上进行的实验表明,相比G-PCC算法和基于深度学习的D-PCC算法,MSA-GPCC算法在点间等比特率下平均质量增益分别提升7.72和4.91 dB,点到面等比特率下平均质量增益分别提升5.56和3.09 dB。

关键词: 点云压缩, 深度学习, 注意力机制, 变分自编码器, 多尺度特征

Abstract: 3D point clouds have extensive applications in the auto-drive, 3D real scene, and other fields. But complex scene requires massive point clouds to represent which brings great challenges to storage space, data processing and transmission bandwidth. A multi-scale attention point cloud geometry compression (MSA-GPCC) is proposed to compress point cloud data based on multi-scale features, attention mechanism, and variational auto-encoders (VAE). Experiments and analysis are carried out based on MPEG data sets. The results show that MSA-GPCC performs better than those of the traditional G-PCC and deep-learning-based D-PCC algorithms, D1 BD-PSNR is improved by 7.72 and 4.91 dB respectively, and D2 BD-PSNR is improved by 5.56 and 3.09 dB respectively.

Key words: point cloud compression, deep learning, attention mechanism, variational auto-encoders, multi-scale feature

中图分类号: