
中国科学院大学学报 ›› 2026, Vol. 43 ›› Issue (1): 136-143. DOI: 10.7523/j.ucas.2024.021

• 电子信息与计算机科学 •

面向21CMA的分布式存储运维管理系统设计与实现

杨嘉宁1,2, 韩军1,3, 崔辰州1,2,3

  1. 中国科学院国家天文台,北京 100101
  2. 中国科学院大学,北京 100049
  3. 国家天文科学数据中心,北京 100101
  • 收稿日期:2023-12-21 修回日期:2024-04-07 发布日期:2024-05-22
  • 通讯作者: 杨嘉宁
  • 基金资助:
    SKA专项(2020SKA0120201);国家重点研发计划(2022YFF0711500);国家自然科学基金(12273077);国家自然科学基金(12103070);中国科学院网信专项2022年度应用示范项目(CAS-WX2021SF-0204)

Distributed storage operation and management system for 21CMA

Jianing YANG1,2, Jun HAN1,3, Chenzhou CUI1,2,3

  1. National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100101, China
  2. University of Chinese Academy of Sciences, Beijing 100049, China
  3. National Astronomical Data Center, Beijing 100101, China
  • Received: 2023-12-21 Revised: 2024-04-07 Published: 2024-05-22
  • Contact: Jianing YANG

摘要:

21 cm阵列望远镜(21CMA)是我国在平方公里阵列(SKA)低频波段的先导设备,为使其具备脉冲星观测能力,SKA专项启动了21CMA的升级计划。为解决升级后海量观测数据的接收与存储问题,项目设计并引入了基于高级精简指令集计算机(ARM)的分布式文件系统,但现有系统在便捷性和标准化上还存在不足。为此,针对21CMA存储设备定制了部署模块、故障恢复模块和系统监控界面,并对系统进行全面测试。测试内容涵盖部署时间、故障恢复时间以及监控系统数据库的稳定性。测试结果表明,本系统不仅解决了21CMA存储设备的运维管理问题,还提高了其可靠性和效率,满足21CMA多集群监控的需求。

关键词: 运维管理, 监控可视化, 分布式存储, 21 cm低频射电阵列

Abstract:

The 21 Centimeter Array (21CMA) is China's pathfinder facility for the low-frequency band of the Square Kilometre Array (SKA). To give it pulsar observation capability, the National SKA Program of China launched an upgrade plan for 21CMA. To handle the reception and storage of the massive volume of observation data produced after the upgrade, the project designed and introduced a distributed file system based on the Advanced RISC Machine (ARM) architecture, but the existing system still falls short in ease of use and standardization. This study therefore customizes a deployment module, a failure recovery module, and a system monitoring interface for the 21CMA storage devices, and tests the system comprehensively. The tests cover deployment time, failure recovery time, and the stability of the monitoring system's database. The results show that the system solves the operation and maintenance management problems of the 21CMA storage devices, improves their reliability and efficiency, and meets the need for multi-cluster monitoring at 21CMA. The work is also of significance for similar projects in the future.

Key words: operations and maintenance management, monitoring and visualization, distributed storage system, 21CMA
