天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當前位置:主頁 > 科技論文 > 計算機論文 >

MapReduce模型的數(shù)據(jù)分配策略研究

發(fā)布時間:2018-05-22 13:28

  本文選題:云計算 + Hadoop。 參考:《華中科技大學》2013年碩士論文


【摘要】:自2007年云計算誕生至今,它已經(jīng)逐漸成為國內(nèi)外IT界熱門的概念,得到了廣泛的關(guān)注。在當今互聯(lián)網(wǎng)高速發(fā)達的環(huán)境中,面對數(shù)據(jù)量的急劇增長,如何快速有效的對海量數(shù)據(jù)進行存儲和計算成為亟待解決的問題,這也是云計算誕生的原動力。但是對于云計算而言,它本身只是一種思維方式,雖然有硬件設(shè)施提供必要的環(huán)境,但是能夠支撐云計算思想的編程模型更加重要。由Google提出的MapReduce并行編程模型,為云計算海量數(shù)據(jù)的處理提供了軟件支持。 Hadoop以一種可靠、高效、可伸縮的方式工作,在短短幾年里成為了主流的開源云計算平臺,,但是Hadoop仍然是一個比較年輕的平臺,在很多地方有不夠完善之處,對其進行改進是十分必要的。通過對Hadoop平臺下的MapReduce并行編程模型進行深入研究,主要針對MapReduce并行編程模型在Map端輸出的中間數(shù)據(jù)分布不均衡現(xiàn)象提出解決方案,該方案的設(shè)計思路是用兩個階段MapReduce作業(yè)對上述問題進行處理,第一個MapReduce階段用于對源數(shù)據(jù)集進行并行抽樣,根據(jù)抽樣的結(jié)果估計數(shù)據(jù)信息,提出一種稱為LAB的分配策略,該分配策略對中間數(shù)據(jù)進行均衡分配;第二MapReduce階段按照上述數(shù)據(jù)分配策略執(zhí)行MapReduce作業(yè)。 通過實驗表明,該方案減少了作業(yè)運行時間,Reduce端輸入數(shù)據(jù)達到負載均衡,從而證明改進方案的可行性和其優(yōu)勢所在。該方案能夠充分利用計算資源,避免資源的浪費,提高了程序運行效率。
[Abstract]:Since the birth of cloud computing in 2007, it has gradually become a hot concept in IT field at home and abroad. With the rapid development of the Internet, how to store and compute the massive data quickly and effectively becomes an urgent problem in the face of the rapid growth of data, which is also the driving force of cloud computing. But for cloud computing, it is only a way of thinking. Although there are hardware facilities to provide the necessary environment, the programming model that can support cloud computing is more important. The MapReduce parallel programming model proposed by Google provides software support for cloud computing massive data processing. Hadoop, which works in a reliable, efficient and scalable way, has become the mainstream open source cloud computing platform in just a few years, but Hadoop is still a relatively young platform that is imperfect in many places. It is necessary to improve it. Through the in-depth study of MapReduce parallel programming model based on Hadoop platform, a solution is proposed to solve the problem of uneven distribution of intermediate data output from MapReduce parallel programming model in Map terminal. The design idea of the scheme is to deal with the above problems with two stage MapReduce jobs. The first stage of MapReduce is used to sample the source data set in parallel. According to the result of sampling, the data information is estimated, and an allocation strategy called LAB is proposed. The allocation strategy distributes the intermediate data evenly, and the second MapReduce stage executes the MapReduce job according to the above data allocation strategy. The experimental results show that this scheme can reduce the operation time and reduce the input data to achieve load balance, which proves the feasibility of the improved scheme and its advantages. The program can make full use of computing resources, avoid the waste of resources, and improve the efficiency of program operation.
【學位授予單位】:華中科技大學
【學位級別】:碩士
【學位授予年份】:2013
【分類號】:TP333;TP311.1

【參考文獻】

相關(guān)期刊論文 前3條

1 陳康;鄭緯民;;云計算:系統(tǒng)實例與研究現(xiàn)狀[J];軟件學報;2009年05期

2 李玉林;董晶;;基于Hadoop的MapReduce模型的研究與改進[J];計算機工程與設(shè)計;2012年08期

3 孫廣中;肖鋒;熊曦;;MapReduce模型的調(diào)度及容錯機制研究[J];微電子學與計算機;2007年09期



本文編號:1922251

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1922251.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶8dfab***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com
国产免费人成视频尤物| 国产午夜精品美女露脸视频| 99国产一区在线播放| 欧美又黑又粗大又硬又爽| 办公室丝袜高跟秘书国产| 欧美一区二区黑人在线| 日韩精品一区二区三区含羞含羞草| 老司机亚洲精品一区二区| 日本精品理论在线观看| 夜色福利久久精品福利| 亚洲精品一区三区三区| 成年男女午夜久久久精品| 国产丝袜美女诱惑一区二区| 国产成人精品一区二三区在线观看| 午夜精品久久久99热连载| 色哟哟精品一区二区三区| 高清一区二区三区不卡免费| 国产精品白丝一区二区| 少妇福利视频一区二区| 日韩夫妻午夜性生活视频| 91老熟妇嗷嗷叫太91| 日韩欧美一区二区久久婷婷 | 草草视频精品在线观看| 欧美日韩乱一区二区三区| 国产又粗又硬又长又爽的剧情| 亚洲欧美日本国产不卡 | 老富婆找帅哥按摩抠逼视频| 日韩精品视频一二三区| 亚洲丁香婷婷久久一区| 日本不卡一本二本三区| 久草热视频这里只有精品| 免费人妻精品一区二区三区久久久| 国产级别精品一区二区视频| 亚洲中文在线男人的天堂| 中文字幕人妻日本一区二区 | 国产色第一区不卡高清| 亚洲另类女同一二三区| 欧美精品女同一区二区| 久久精品色妇熟妇丰满人妻91| 欧美精品专区一区二区| 中文字幕人妻日本一区二区|