黄震江
Research on Standardization of Key Techniques of Big Data
Analysis Based on the Hadoop Platform
Abstract: This paper examines the standardization problems facing the key techniques of
big data analysis on the Hadoop platform. It surveys and analyzes standardization in
four areas: data collection, parallel computing frameworks, analysis result output, and
parallel data analysis algorithms. It then proposes standardization directions in four
areas, including the architecture model, together with standardization recommendations
for the related APIs and other aspects.
Keywords: big data analysis; computing framework; parallel analysis algorithms; Hadoop