分布式存儲系統(tǒng)上的RS糾刪碼研究與應(yīng)用
[Abstract]:With the popularity of computers, intelligent devices, and the rapid development of Internet technology. All kinds of data are in geometric order of magnitude, which brings more challenges to storage system. Data storage security has become an urgent problem to be solved in current storage system. At present, distributed storage is an effective way for people to deal with large amount of data storage. There are two ways to ensure data security in distributed storage system: multiple replica technology and erasure code technology. Multi-replica technology is simple and easy to implement. It only needs to backup one data several times and store it separately. The three-copy strategy is the most common. In order to obtain more stable data security, multi-replica technology can only be achieved by increasing the number of replicas, and the problem is that the storage cost is increasing exponentially. In order to solve the problem of high storage cost of multi-copy technology, erasure code technology, which is used to solve the problem of data loss in communication system, is introduced to the storage system. Erasure code technology can solve the problem of high storage cost and guarantee the same or higher data security ability as many copies. While erasure code technology solves the problem that the storage cost of multi-copy technology is too high, a new problem arises: the consumption of system resources and the number of I / O in recovering data after data loss are increased greatly. For this purpose, this paper begins with RS erasure codes, analyzes the coding equations and characteristics of RS erasure codes, and combines the advantages of array codes and LDPC codes, and proposes an improved LRC. code based on RS erasure codes. The definition of LRC and the fault tolerance analysis of LRC are given, the reliability of LRC is analyzed by Markov model, and the construction matrix of coding equation and the variation of coding parameters are analyzed comparatively. In order to apply the coding idea of LRC, based on the open source distributed storage system (HDFS), the system architecture and data placement strategy of HDFS storage system are analyzed and understood. Data reconfiguration process and communication verification mechanism are discussed in this paper. The design idea of implementing LRC on HDFS is put forward. Finally, through three groups of comparative experiments, it is concluded that under the condition of similar storage cost, LRC saves nearly half the time cost of RS coding in decoding time; When the coding parameters are changed, the encoding and decoding performance of LRC will not change significantly, and it can provide more choice of parameters. In the coding matrix, the coding equation based on Cauchy matrix and the coding equation based on Van der Mon matrix have similar performance.
【學位授予單位】:成都理工大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP333
【參考文獻】
相關(guān)期刊論文 前1條
1 羅象宏;舒繼武;;存儲系統(tǒng)中的糾刪碼研究綜述[J];計算機研究與發(fā)展;2012年01期
相關(guān)博士學位論文 前3條
1 謝平;RAID-6編碼布局及重構(gòu)優(yōu)化研究[D];華中科技大學;2015年
2 劉衛(wèi)平;網(wǎng)絡(luò)存儲中的數(shù)據(jù)容錯與容災(zāi)技術(shù)研究[D];西北工業(yè)大學;2006年
3 萬武南;分布式安全存儲系統(tǒng)糾刪碼技術(shù)的研究[D];中國科學院研究生院(成都計算機應(yīng)用研究所);2006年
相關(guān)碩士學位論文 前6條
1 梁先海;糾刪碼存儲集群的數(shù)據(jù)重構(gòu)優(yōu)化技術(shù)研究[D];華中科技大學;2015年
2 王敬軒;分布式文件系統(tǒng)存儲效率優(yōu)化研究[D];華中科技大學;2013年
3 楊明;基于LDPC碼的分布式容災(zāi)系統(tǒng)及其性能研究[D];哈爾濱工程大學;2012年
4 張世樂;面向大數(shù)據(jù)塊的快速多容錯編碼研究[D];復旦大學;2010年
5 金奎;基于分布式存儲系統(tǒng)的數(shù)據(jù)安全傳輸?shù)脑O(shè)計與實現(xiàn)[D];哈爾濱工業(yè)大學;2009年
6 姜英豪;基于RS和Chord的分布式存儲系統(tǒng)的設(shè)計[D];哈爾濱工業(yè)大學;2008年
,本文編號:2324572
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2324572.html