Design and Implementation of the Data Warehouse Management Module for the Mobile Reading Platform
[Abstract]: As information technology develops, enterprises face increasingly abundant and complex data, much of it generated by their own computer systems. The maturing of data warehouse technology offers an effective solution for enterprise data management. As a data warehouse is built and used, massive data is collected, stored, processed and computed, and the organization and management of data assets, along with the complexity of day-to-day access, have become a focus of attention. Users face multiple heterogeneous data sources in which indicators are interpreted differently, statistical definitions are inconsistent, and business personnel's understanding is out of step with what developers actually implement. Business and technical personnel have to work across multiple systems, the definitions of business terms do not keep pace with business and system development, and there is no standard platform for carrying this information. How to build a sound data warehouse management system is therefore a key problem for enterprises facing complex data assets. The Hadoop Hive data warehouse platform of the China Mobile Mobile Reading Base has been running stably for nearly a year, and development work is largely confined to the application layer of the warehouse, serving business application needs. The management and maintenance of the warehouse itself, however, has not formed a complete feature set, and users can only obtain the information they need by manually reading large amounts of text. To support systematic management and maintenance of the whole data warehouse platform going forward, this thesis designs and implements a data warehouse management module tailored to the platform's characteristics.
A well-designed warehouse management module not only helps IT staff, technicians and maintenance personnel manage and use data warehouse resources more effectively, but also helps ordinary business personnel make flexible use of the massive data the warehouse provides. This thesis designs and develops a warehouse management system suited to the Hadoop Hive data warehouse of the Mobile Reading Base, providing three functions: metadata management, task scheduling monitoring and data lineage analysis. Metadata management lets users efficiently find the data they care about, and it is also the basis for task scheduling monitoring and data lineage analysis. Task scheduling monitoring lets users obtain the running status of Hive tasks in real time and see the information and running status of upstream and downstream nodes. Data lineage analysis provides users with a data map that makes the source and destination of data easy to understand and gives reliable data support for later optimization of the warehouse structure. The thesis is organized as follows. Chapter 1 is the introduction, covering the research background, research content, research significance, and the specific application in the mobile reading project. Chapter 2 briefly introduces the related background and technology, including the mobile reading BI (Business Intelligence) platform, data warehouses, metadata, Oozie, and the current state of data warehouse applications. Chapter 3 presents the requirements analysis and overall design of the data warehouse management module for the mobile reading platform. Chapter 4 describes the detailed design and implementation of the warehouse management module, including the implementation of each key module.
Chapter 5 covers system testing and analysis: it tests each function and compares results before and after the system was put into use. Chapter 6 summarizes the work and looks ahead to future research.
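The data lineage function described in the abstract can be pictured as a directed graph over warehouse tables. The following is only a minimal sketch under stated assumptions, not the thesis's implementation: the `LineageGraph` class, its method names, and the table names are all hypothetical, and lineage is assumed to be recorded at table level as (source, target) pairs.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Hypothetical table-level lineage graph: an edge runs from source to target."""

    def __init__(self):
        self.downstream = defaultdict(set)  # source table -> tables derived from it
        self.upstream = defaultdict(set)    # target table -> tables it reads from

    def add_edge(self, source, target):
        """Record that `target` is computed from `source`."""
        self.downstream[source].add(target)
        self.upstream[target].add(source)

    def _reach(self, start, neighbors):
        """Breadth-first search returning every table reachable from `start`."""
        seen, queue = set(), deque([start])
        while queue:
            node = queue.popleft()
            for nxt in neighbors.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def sources_of(self, table):
        """All direct and indirect upstream tables (where the data comes from)."""
        return self._reach(table, self.upstream)

    def consumers_of(self, table):
        """All direct and indirect downstream tables (where the data goes)."""
        return self._reach(table, self.downstream)


# Illustrative table names only; they do not come from the thesis.
graph = LineageGraph()
graph.add_edge("ods_read_log", "dw_user_reading")
graph.add_edge("ods_user_info", "dw_user_reading")
graph.add_edge("dw_user_reading", "rpt_daily_active")

print(sorted(graph.sources_of("rpt_daily_active")))
print(sorted(graph.consumers_of("ods_read_log")))
```

Keeping both an upstream and a downstream index means that "where did this data come from" and "what is affected if this table changes" are each a single traversal, which is the kind of question a data map is meant to answer.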
[Degree-granting institution]: Beijing University of Posts and Telecommunications
[Degree level]: Master's
[Year degree awarded]: 2016
[Classification number]: TP311.52