企業(yè)數(shù)據(jù)中心數(shù)據(jù)采集與建模
本文關(guān)鍵詞: 數(shù)據(jù) 數(shù)據(jù)中心 HADOOP 出處:《山東大學(xué)》2017年碩士論文 論文類型:學(xué)位論文
【摘要】:本論文包含某企業(yè)數(shù)據(jù)中心數(shù)據(jù)采集與建模部分設(shè)計與實現(xiàn),數(shù)據(jù)中心包含采集層、數(shù)據(jù)服務(wù)層、應(yīng)用層、訪問層等,本文論述采集層與服務(wù)層的建設(shè)。數(shù)據(jù)在本系統(tǒng)中分為結(jié)構(gòu)化與非結(jié)構(gòu)化,其中結(jié)構(gòu)化的數(shù)據(jù)存儲在GBASE庫中,非結(jié)構(gòu)化的數(shù)據(jù)存儲在HADOOP環(huán)境中,數(shù)據(jù)規(guī)模在PB級別,搭建成本比較低的分布式存儲環(huán)境來存儲與計算。開發(fā)使用OCDP大數(shù)據(jù)開發(fā)平臺,數(shù)據(jù)存儲采用HADOOP與GBASE數(shù)據(jù)庫,流程控制采用BDPE工具進(jìn)行配置管理。數(shù)據(jù)中心接入移動公司原有的三大業(yè)務(wù)支撐系統(tǒng),將分散于各業(yè)務(wù)系統(tǒng)不同數(shù)據(jù)庫、不同格式、不同類別的數(shù)據(jù)按照業(yè)務(wù)類型劃分為7大主題域數(shù)據(jù),通過統(tǒng)一數(shù)據(jù)接口采集到數(shù)據(jù)中心,依照移動公司現(xiàn)有的業(yè)務(wù)需求,將數(shù)據(jù)抽象為貼近業(yè)務(wù)需求的數(shù)據(jù)模型,實現(xiàn)將底層數(shù)據(jù)與上層的依賴于各業(yè)務(wù)的具體應(yīng)用之間的松耦合,簡化上層程序開發(fā)人員的開發(fā)難度。本文介紹數(shù)據(jù)采集層與數(shù)據(jù)服務(wù)層的開發(fā)工作,采集層將分布于不同系統(tǒng)的數(shù)據(jù)接入到數(shù)據(jù)中心,數(shù)據(jù)服務(wù)層將數(shù)據(jù)抽象為與業(yè)務(wù)相關(guān)的模型,為具體應(yīng)用解決底層數(shù)據(jù)問題。
[Abstract]:This paper includes the design and implementation of data acquisition and modeling in a certain enterprise data center. The data center includes acquisition layer, data service layer, application layer, access layer and so on. This paper discusses the construction of collection layer and service layer. Data is divided into structured and unstructured in this system, in which structured data is stored in GBASE library. The unstructured data is stored in the HADOOP environment and the data scale is in the PB level. The distributed storage environment with relatively low cost is built to store and compute. The development uses the OCDP big data development platform. Data storage uses HADOOP and GBASE database, process control uses BDPE tools for configuration management. The data center is connected to the original three business support systems of mobile company. The data scattered in different databases, different formats and different types of data are divided into 7 subject domain data according to the business types, and the data center is collected through the unified data interface. According to the existing business requirements of mobile companies, the data is abstracted as a data model close to the business requirements to realize the loose coupling between the underlying data and the specific applications that depend on each business. This paper introduces the development of data acquisition layer and data service layer, which connects the data distributed in different systems to the data center. The data service layer abstracts the data as a business-related model to solve the underlying data problems for specific applications.
【學(xué)位授予單位】:山東大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP308;TP274.2
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 楊書靜;河北中行實現(xiàn)全省一個數(shù)據(jù)中心[J];中國金融電腦;2001年06期
2 江南;數(shù)據(jù)中心如何應(yīng)付管理挑戰(zhàn)[J];互聯(lián)網(wǎng)周刊;2001年40期
3 ;簡化管理挑戰(zhàn)——惠普推實用數(shù)據(jù)中心解決方案[J];每周電腦報;2001年67期
4 李慶莉;去數(shù)據(jù)中心看一看——中國銀行華北信息中心計劃處處長云恩善談數(shù)據(jù)中心運行、管理[J];中國金融電腦;2002年12期
5 馬天蔚;;數(shù)據(jù)中心按需造[J];每周電腦報;2002年25期
6 戚麗,蔣東興,武海平,馮珂;校園數(shù)據(jù)中心建設(shè)與管理方法的探索[J];教育信息化;2002年S1期
7 何俊山;您企業(yè)的數(shù)據(jù)中心2003了嗎?[J];微電腦世界;2003年17期
8 ;挖潛數(shù)據(jù)中心[J];金融電子化;2004年07期
9 王琨月;;數(shù)據(jù)中心業(yè)務(wù)就緒[J];每周電腦報;2004年21期
10 包東智;新熱點:創(chuàng)建下一代數(shù)據(jù)中心[J];上海信息化;2005年10期
相關(guān)會議論文 前10條
1 姚,
本文編號:1457419
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1457419.html