
Research on Big-Data Graph Queries Based on Parallel Processing

Published: 2018-08-02 12:39

[Abstract]: With the rapid growth of the Internet, we have entered an era in which data is king: data volumes have become enormous and the data itself increasingly complex. Finding useful data within such large and heterogeneous collections has become an urgent problem to optimize. At the same time, distributed cloud storage has become a common storage solution, so the problem becomes one of querying data held in distributed storage. Graphs are a common and powerful tool for on-demand querying of large-scale distributed data. The graph data structure is particularly well suited to data with reference relationships, so querying big data can be recast as a graph query algorithm problem. An important class of graph query problems asks, given two nodes in a data graph, whether one can be reached from the other; this is the graph reachability query problem. It has a wide range of practical applications and is of considerable research significance. Traditional approaches to reachability queries are either restricted to tree-based graph queries or designed for specific graph database systems. Most of them rely on indexing, and they show serious shortcomings in accuracy and performance when applied to distributed big-data graphs. To address these problems, this thesis proposes a parallel graph reachability query algorithm built on the MapReduce programming model of the Hadoop distributed computing platform, and proposes an index based on six-degree reachability queries to handle reachability queries at the level of local queries. With these algorithms, the work aims to optimize reachability queries on large distributed graphs. The algorithms were evaluated in repeated experiments on several real-world datasets, across multiple metrics and perspectives, which verified their accuracy and efficiency.
[Degree-granting institution]: North China Electric Power University (Beijing)
[Degree level]: Master's
[Year degree awarded]: 2017
[Classification number]: TP311.13;O157.5
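
The abstract does not give the details of the thesis's algorithm or of its six-degree index, so the following is only a minimal sketch of the general idea: a reachability query answered by breadth-first frontier expansion, with each round written as a map step and a reduce step. It runs on a single machine and does not use Hadoop. The function names, the `max_hops` parameter, and the sample graph are all assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(frontier, adjacency):
    """Map step: every frontier node emits a (neighbor, parent) pair per outgoing edge."""
    for node in frontier:
        for neighbor in adjacency.get(node, ()):
            yield neighbor, node


def reduce_phase(pairs, visited):
    """Reduce step: group the emitted pairs by neighbor and keep only unseen nodes."""
    grouped = defaultdict(list)
    for neighbor, parent in pairs:
        grouped[neighbor].append(parent)
    return {node for node in grouped if node not in visited}


def reachable(adjacency, source, target, max_hops=None):
    """Return True if target can be reached from source in a directed graph.

    max_hops is a hypothetical bound on path length (for example 6 for a
    six-degree style query); None means no bound.
    """
    if source == target:
        return True
    visited = {source}
    frontier = {source}
    hops = 0
    while frontier and (max_hops is None or hops < max_hops):
        # One map/reduce round advances the search frontier by one hop.
        frontier = reduce_phase(map_phase(frontier, adjacency), visited)
        if target in frontier:
            return True
        visited |= frontier
        hops += 1
    return False


# Hypothetical sample graph: a directed chain a -> b -> ... -> h plus an isolated edge.
edges = {
    "a": ["b"], "b": ["c"], "c": ["d"], "d": ["e"],
    "e": ["f"], "f": ["g"], "g": ["h"], "x": ["y"],
}

print(reachable(edges, "a", "d"))              # forward along the chain
print(reachable(edges, "d", "a"))              # against the edge direction
print(reachable(edges, "a", "g", max_hops=6))  # exactly six hops away
print(reachable(edges, "a", "h", max_hops=6))  # seven hops away
```

In a real MapReduce job each round would be a separate job over partitioned adjacency lists, and the reduce step would also be where duplicate discoveries of the same node are merged. The hop bound shows why a local, bounded-distance query is cheaper than an unbounded one: it caps the number of rounds.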



Article link: http://sikaile.net/kejilunwen/yysx/2159464.html
