
Journal of University of Chinese Academy of Sciences ›› 2011, Vol. 28 ›› Issue (6): 776-785. DOI: 10.7523/j.issn.2095-6134.2011.6.011

• Article

Implementation and optimization of high speed AES algorithm based on GPGPU and CUDA

GU Qing1, GAO Neng2, BAO Zhen-Zhen3, XIANG Ji2

  1. National 863 Program Research Center for Information Security Infrastructure, Shanghai 200336, China;
  2. State Key Laboratory of Information Security, Graduate University, Chinese Academy of Sciences, Beijing 100049, China;
  3. University of Science and Technology of China, Hefei 230026, China
  • Received: 2010-07-23 Revised: 2010-11-01 Published: 2011-11-15
  • Supported by: the Knowledge Innovation Program of the Chinese Academy of Sciences (YYYJ-1013) and the National Key Technology R&D Program of China (2008BAH32B04)

Abstract:

As the demand for high-performance computing keeps growing, attention has turned to GPU devices, which offer high computing power and high memory bandwidth. Compared with the CPU, which is good at handling complex control logic, the GPGPU (general purpose graphic processing unit) is better suited to large-scale data-parallel processing. The emergence of CUDA (compute unified device architecture) has further accelerated the spread of GPGPU applications. We accelerate an implementation of the AES algorithm using GPGPU and CUDA and achieve an overall throughput of 6~7 Gbit/s. When the time for loading and storing data is excluded, the throughput can reach 20 Gbit/s for input sizes over 1 MB.

Key words: GPGPU (general purpose graphic processing unit), CUDA (compute unified device architecture), AES algorithm, parallel computing
