Journal of University of Chinese Academy of Sciences ›› 2017, Vol. 34 ›› Issue (3): 395-400. DOI: 10.7523/j.issn.2095-6134.2017.03.014

• Brief Report •

Design and implementation of postal delivery big data analytic system based on Hadoop

WANG Weifeng (王卫锋), YANG Lin (杨林)

  1. Information Dynamics and Engineering Applications Laboratory, School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received: 2016-09-14 Revised: 2016-11-18 Published: 2017-05-15
  • Corresponding author: WANG Weifeng, E-mail: ry_009@126.com

Abstract: When applied to massive postal delivery data, existing data warehouse systems built on relational databases suffer from high construction costs and bottlenecks in analysis capacity. Hadoop offers high scalability, high performance, and low cost, and is widely used for big data storage and analysis. Based on a study of the open-source Hadoop framework, we design a postal delivery big data analysis system and implement part of it. Through experiments driven by the engineering requirements of the postal safety supervision system, we obtain performance parameters for the big data analysis system, which provide a basis for subsequent engineering work.

Key words: postal delivery data, Hadoop, big data storage, big data analysis

CLC number: