Research on Data Compression Technology in Cloud Storage
Topic: cloud storage + data compression. Source: Yunnan University, master's thesis, 2013
[Abstract]: In recent years, the emergence of cloud storage has had a profound impact on the traditional storage field and has become a focus of attention in both industry and academia. Most academic research on storage, however, concentrates on bandwidth, security, and infrastructure, and rarely considers the cloud storage environment as a whole. In particular, little attention has been paid to how data compression affects load balancing when a cloud storage system handles massive amounts of data. In addition, research on data compression often trades space for lower time complexity, which makes the data lossy on decompression, so the consistency of the data before and after compression cannot be guaranteed. Finally, many open-source private cloud storage systems exist, and choosing a flexible, customizable solution suited to private deployment is another major problem.

To address these problems, this thesis analyzes and studies data compression technology, taking dictionary coding and the algorithms derived from it (LZ77, LZ78, and LZW) as the objects of study and analyzing their respective strengths and weaknesses. On this basis, it proposes an improved LZW algorithm that has natural advantages in load balancing and system scalability. While improving algorithmic efficiency and preserving quality of service for users, the improved LZW algorithm minimizes data redundancy so that as little useless information as possible is kept. Finally, a private cloud storage platform is implemented on this basis. The main contents and contributions of the thesis are as follows:

· It surveys the state of research on cloud storage and its data compression technology in China and abroad and, in light of the open problems, argues that making data compression the research focus is both reasonable and necessary.
· In view of the state of research in data compression, it proposes an improved LZW algorithm and analyzes its performance in detail. Experiments verify the algorithm's advantages in compression ratio, compression time, and compression quality, providing a reference for future research in this field.
· Based on the above research and the characteristics of private cloud storage deployments, it implements a private cloud storage system based on MongoDB, laying a foundation for users and research groups who want to build a private cloud storage environment.
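For orientation, the dictionary coding that the abstract builds on can be sketched as follows. This is the textbook LZW scheme, not the improved variant proposed in the thesis, whose details the abstract does not give. The function names `lzw_compress` and `lzw_decompress` are hypothetical, and codes are kept as plain Python integers rather than packed into a bit stream.

```python
from __future__ import annotations


def lzw_compress(data: bytes) -> list[int]:
    """Textbook LZW: emit one integer code per longest dictionary match."""
    # The dictionary starts with every single-byte string (codes 0..255).
    table = {bytes([i]): i for i in range(256)}
    next_code = 256
    current = b""
    codes: list[int] = []
    for value in data:
        candidate = current + bytes([value])
        if candidate in table:
            # Keep extending the match while it is still in the dictionary.
            current = candidate
        else:
            # Emit the longest known match and learn the new, longer string.
            codes.append(table[current])
            table[candidate] = next_code
            next_code += 1
            current = bytes([value])
    if current:
        codes.append(table[current])
    return codes


def lzw_decompress(codes: list[int]) -> bytes:
    """Rebuild the same dictionary on the fly and recover the input exactly."""
    if not codes:
        return b""
    table = {i: bytes([i]) for i in range(256)}
    next_code = 256
    if codes[0] not in table:
        raise ValueError(f"invalid first LZW code: {codes[0]}")
    previous = table[codes[0]]
    out = [previous]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        elif code == next_code:
            # Special case: the code refers to the entry being defined now.
            entry = previous + previous[:1]
        else:
            raise ValueError(f"invalid LZW code: {code}")
        out.append(entry)
        table[next_code] = previous + entry[:1]
        next_code += 1
        previous = entry
    return b"".join(out)


if __name__ == "__main__":
    sample = b"TOBEORNOTTOBEORTOBEORNOT"
    codes = lzw_compress(sample)
    # Lossless round trip: decompression restores the original bytes.
    assert lzw_decompress(codes) == sample
    print(len(sample), "bytes ->", len(codes), "codes")
```

The round-trip check illustrates the consistency property the abstract is concerned with: decompressing the code sequence yields exactly the original bytes. A real implementation would also bound the dictionary size and pack codes into a fixed or variable bit width, and those choices are where LZW variants typically differ.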
[Degree-granting institution]: Yunnan University
[Degree level]: Master's
[Year degree conferred]: 2013
[Classification number]: TP333