云環(huán)境下MapReduce容錯(cuò)技術(shù)的研究
[Abstract]:Cloud computing (Cloud Computing) has become one of the most important technologies in the computer industry today. With the rapid development of cloud technology, the form of data has gradually changed from traditional structured data (structured data) to semi-structured data (semi-structured data) and unstructured data (unstructureddata). At the same time, the scale of data has expanded in a large scale. Traditional database technology has been unable to cope with massive data, so how to deal with these big data (Big Data) has become a problem to be solved. So in 2004 Google put forward its solution, MapReduce, to meet the challenges posed by big data in the cloud age. Simply put, MapReduce is a programming model for batch parallelization of mass data. It not only solves the performance problem of processing massive data, but also simplifies the way for programmers to develop distributed parallel programs. More importantly, MapReduce solves the problems of extensibility (Scalability) and reliability (Reliability), which is the biggest advantage of MapReduce compared with traditional database. A variety of researches have been carried out around MapReduce as a new programming framework, among which the fault-tolerant ability of MapReduce has been one of the hotspots. The domestic and foreign research programs for fault tolerance can be summed up into the following two methods: reexecution and backup. The purpose of these schemes is to carry out the corresponding recovery operations after the failure is discovered, but if the failure situation is not perceived in time, the above schemes will not be able to play a full role. Therefore, this paper will study the fault-tolerant ability of MapReduce from a new point of view, that is, how to perceive the failure nodes in MapReduce more quickly and accurately. In order to solve this problem, this paper tries to put forward two kinds of ideas: adaptive overdue time and credit-based detection model. Adaptive overruns are designed to change the rigid and fixed outages in MapReduce clusters. In order to do this, the execution time of each job is estimated first, and then the overdue time is adaptive to the estimated execution time. At run time, if the JobTracker does not receive heartbeat information from a node within an adaptive timeframe, that node is considered invalid. The credit-based detection model assigns a credit value to each node and makes use of the reduce task to remotely obtain the action of map data failure and evaluate the reputation of the node in real time. The node is considered to be invalid if the creditworthiness value of the node attenuates to the preset lower limit due to too many failed actions. A large number of experimental data show that the two schemes proposed in this paper are obviously superior to the original Hadoop cluster. When there are node failures in the cluster, compared with the original scheme, the time of finding the failure can be greatly reduced by the scheme in this paper. In addition, it can be seen from the comparative experiments of the two schemes that the adaptive extended time will be more inclined to the execution of short jobs, while the credit-based detection model is more suitable for the execution of large jobs. By using these two schemes, the existing fault-tolerant techniques can be better coordinated, and the Hadoop cluster has a better fault-tolerant capability not only to locate failures quickly, but also to recover quickly from failures. The main contribution of this paper is to propose two kinds of mechanisms: adaptive delay time and credit-based detection model, and at the same time broaden the research ideas of Hadoop fault tolerance.
【學(xué)位授予單位】:上海交通大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP302.8
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 王義強(qiáng);袁修華;馬明陽(yáng);胡艷娟;;基于神經(jīng)網(wǎng)絡(luò)的數(shù)控插補(bǔ)容錯(cuò)技術(shù)[J];農(nóng)業(yè)機(jī)械學(xué)報(bào);2011年07期
2 歐陽(yáng)城添;王曦;鄭劍;;自適應(yīng)一致表決算法[J];計(jì)算機(jī)科學(xué);2011年07期
3 柳燕煌;黃立勤;;云計(jì)算環(huán)境的并行支持向量機(jī)[J];南陽(yáng)理工學(xué)院學(xué)報(bào);2011年02期
4 鄭啟龍;汪睿;王向前;;HPMR內(nèi)存管理模塊優(yōu)化設(shè)計(jì)[J];計(jì)算機(jī)系統(tǒng)應(yīng)用;2011年08期
5 寧新建;;航空火控計(jì)算機(jī)的容錯(cuò)技術(shù)分析[J];計(jì)算機(jī)與網(wǎng)絡(luò);2010年17期
6 李虎;鄒鵬;賈焰;周斌;;一種基于MapReduce的分布式文本數(shù)據(jù)過(guò)濾模型研究[J];信息網(wǎng)絡(luò)安全;2011年09期
7 李遠(yuǎn)方;鄧世昆;聞?dòng)癖?韓月陽(yáng);;Hadoop-MapReduce下的PageRank矩陣分塊算法[J];計(jì)算機(jī)技術(shù)與發(fā)展;2011年08期
8 李s,
本文編號(hào):2202608
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2202608.html