Key Techniques for Building a Big-Data-Oriented Unstructured Data
Management Platform
Abstract: Effective management of big data is the premise and foundation of big data
processing, and unstructured data now makes up the bulk of big data. To address the
key issues in building a big-data-oriented unstructured data management platform,
namely data representation, data manipulation, and data processing efficiency, this
paper presents and discusses a tetrahedral data model, an Unstructured Data Query
Language (UQL), and a distributed data storage and parallel processing architecture.
These techniques have been applied in an unstructured data management system named
AUDR (Advanced Unstructured Data Repository).
Keywords: unstructured data management; tetrahedral data model; unstructured data
query language; parallel processing