
Journal of University of Chinese Academy of Sciences


A Ceph write performance optimization method based on double-control nodes*

HUANG Zunxiang, ZHU Leiji, XIONG Yong

  1. Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 201800, China;
    University of Chinese Academy of Sciences, Beijing 100049, China;
    Key Lab of Wireless Sensor Network and Communication, Chinese Academy of Sciences, Shanghai 201800, China
  • Received: 2020-07-13 Revised: 2020-11-04 Published: 2021-05-31
  • Corresponding author: E-mail: yong.xiong@mail.sim.ac.cn
  • Funding:
    *Supported by the Special Technology Project for Equipment Pre-research of the All-Army Common Information System (31511030302)

Abstract: The distributed storage system Ceph uses a multi-replica, strong-consistency write mechanism, which limits cluster write performance. To address this problem, this paper proposes a Ceph write performance optimization method based on double-control nodes. First, nodes with dual controllers and dual storage arrays (RAID) are used: when one controller fails, its partner controller in the same node creates a new OSD process and quickly takes over the failed controller's storage array, ensuring the safety and high reliability of stored data. Second, the write mechanism is changed so that the primary OSD returns write completion to the client as soon as it has written to its local journal disk; the primary OSD then continues to collect the completion status of the data-disk write and of the other replica writes, and carries out the subsequent callback operations. This reduces the impact of non-essential write operations on cluster write performance. Finally, data availability and cluster write performance are tested experimentally. The write performance tests compare the optimized method with Ceph's native write mechanism for sequential and random writes in terms of write latency, throughput, and IOPS, and verify that the optimized method improves write performance while maintaining high data availability.

Key words: distributed storage, Ceph, double-control node, replica consistency
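
The difference between the two acknowledgement points described in the abstract can be illustrated with a toy latency model. This is only a sketch under stated assumptions, not the authors' implementation or Ceph code: the names `ReplicaTimings`, `native_ack_latency`, and `early_ack_latency` are hypothetical, the timings are made-up illustrative numbers, and the model assumes the primary writes its journal in parallel with forwarding the write to the other replicas.

```python
from dataclasses import dataclass


@dataclass
class ReplicaTimings:
    """Hypothetical per-OSD costs in milliseconds (illustrative values only)."""
    network_ms: float  # round trip from the primary to this OSD (0 for the primary)
    journal_ms: float  # time to persist the write on this OSD's journal disk


def native_ack_latency(primary, replicas):
    """Strong-consistency model: the client is acknowledged only after the
    primary and every other replica have persisted the write.
    Assumption: the primary's journal write overlaps with replication."""
    slowest_replica = max(
        (r.network_ms + r.journal_ms for r in replicas), default=0.0
    )
    return max(primary.journal_ms, slowest_replica)


def early_ack_latency(primary, replicas):
    """Optimized model from the abstract: the client is acknowledged as soon
    as the primary's local journal write completes. Replica completions are
    collected afterwards and do not affect client-visible latency."""
    return primary.journal_ms


primary = ReplicaTimings(network_ms=0.0, journal_ms=2.0)
replicas = [
    ReplicaTimings(network_ms=1.0, journal_ms=2.0),
    ReplicaTimings(network_ms=3.0, journal_ms=2.5),
]

print("native ack latency (ms):", native_ack_latency(primary, replicas))
print("early ack latency (ms): ", early_ack_latency(primary, replicas))
```

In this model the early acknowledgement removes the slowest replica from the client's critical path. The cost is that durability at acknowledgement time rests on the primary's journal alone, which is why the abstract pairs the change with partner-controller takeover of the failed controller's storage array.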
