基于云平臺的高速公路交通數(shù)據(jù)倉庫設(shè)計與查詢優(yōu)化研究與實現(xiàn)
[Abstract]:With the development of Internet of things technology and the increase of intelligent sensors, the data collected by transportation industry is increasing rapidly. Especially in the freeway toll collection system, a large amount of highway toll collection station data are generated every day. By analyzing these structured data, we can get very valuable information such as freeway traffic flow, space-time distribution of carrying capacity, expressway transportation boom index, toll report forms, and so on. Provide data support for highway managers to make correct decisions. Currently, most management systems used by transportation departments are Oracle-driven databases. Faced with the increasingly large data volume of highway toll station data, these management systems have problems such as complex data integration process, long time, dependence on professionals, slow data query speed and so on. Therefore, this paper studies the highway traffic data warehouse design and query optimization technology based on cloud platform. Firstly, according to the characteristics of highway toll station data, this paper designs a data warehouse for mass highway toll station data. The construction process includes three core operation stages: data extraction, data preprocessing and data processing. Secondly, by comparing the query characteristics of Hive and Impala, this paper analyzes the partition granularity of data warehouse and the business characteristics of highway management, and puts forward three query optimization methods of data warehouse. Then, based on the distributed file storage system HDFS, data warehouse tool Hive and the data query engine Impala, this paper designs and implements the data visualization platform for highway management. Provides data query and project analysis functions. Finally, the function and performance of the data warehouse are verified by the actual toll station data in this paper. The results show that the data query optimization method proposed in this paper can effectively improve the efficiency of data query and shorten the query time.
【學位授予單位】:北京郵電大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP311.13;TP393.09
【參考文獻】
相關(guān)期刊論文 前7條
1 吳黎兵;邱鑫;葉璐瑤;王曉棟;聶雷;;基于Hadoop的SQL查詢引擎性能研究[J];華中師范大學學報(自然科學版);2016年02期
2 趙文英;;當前大數(shù)據(jù)管理技術(shù)探究[J];信息與電腦(理論版);2015年22期
3 曾萍;韋杰;;數(shù)據(jù)倉庫技術(shù)在高校信息化建設(shè)中的應(yīng)用研究[J];軟件;2014年05期
4 李小強;何珊;何金明;;通過對比數(shù)據(jù)庫來理解數(shù)據(jù)倉庫[J];考試周刊;2013年91期
5 邱衛(wèi)云;;智能交通大數(shù)據(jù)分析云平臺技術(shù)[J];中國交通信息化;2013年10期
6 黃文依;王勁松;林勝;;HDFS可視化操作研究與實現(xiàn)[J];天津理工大學學報;2012年01期
7 許春玲;張廣泉;;分布式文件系統(tǒng)Hadoop HDFS與傳統(tǒng)文件系統(tǒng)Linux FS的比較與分析[J];蘇州大學學報(工科版);2010年04期
相關(guān)碩士學位論文 前5條
1 張鵬;多數(shù)據(jù)庫環(huán)境數(shù)據(jù)集成與轉(zhuǎn)換技術(shù)研究[D];北方工業(yè)大學;2016年
2 費仕憶;Hadoop大數(shù)據(jù)平臺與傳統(tǒng)數(shù)據(jù)倉庫的協(xié)作研究[D];東華大學;2014年
3 王遠志;基于Hadoop的全網(wǎng)絡(luò)流量異常監(jiān)測算法研究[D];鄭州大學;2014年
4 韓歡;基于大數(shù)據(jù)的智能交通運輸平臺的研究[D];成都理工大學;2014年
5 常濤;改進型MapReduce框架的研究與設(shè)計[D];北京郵電大學;2011年
,本文編號:2397894
本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2397894.html