Design and Optimization of a Cloud Storage System for Massive Numbers of Users
[Abstract]: With the development of information technology, the spread of mobile networks, and the rapid growth in the volume of data held by individual users, demand for cloud storage services has risen quickly. Many well-known companies have invested in developing and operating personal cloud storage services, including Google, Microsoft, Dropbox, Lenovo, Kingsoft, Huawei, Baidu, and telecom operators. Most of these products are built by customizing Hadoop HDFS as the underlying file system. Open-source Hadoop HDFS is an active research area because of its well-designed architecture and its scalability, availability, reliability, fault tolerance, low cost, and performance. However, HDFS still has many unsolved problems: the single-point bottleneck of the NameNode, weak handling of small files, no reference-based handling of redundant files, no load balancing at the user layer, file failure, weak system security, and no data encryption or sharing authorization mechanism. These shortcomings can be addressed in two ways. The first is to modify the HDFS source code, that is, to improve it from the inside. The second is to add a service layer on top of HDFS that takes over part of the functionality and so simplifies what HDFS itself has to do. The first approach requires major changes to HDFS: the engineering effort is large, the difficulty is high, backward compatibility across HDFS versions is lost, and problems such as the single-point bottleneck, resumable (breakpoint) transfer, and file encryption and authorization still cannot be solved effectively.
The second approach has few constraints, is less difficult, requires less engineering effort, and is compatible with various HDFS versions. More importantly, a cloud storage system built this way leaves considerable room for improvement, can solve the problems above, and is highly extensible, so this approach is the one adopted here. This paper analyzes the existing HDFS architecture and, on that basis, builds a cloud storage system architecture for massive numbers of users. The architecture offers optimized solutions to many of the problems in HDFS and ensures data security and user privacy protection. The main innovations are as follows: 1. A cloud storage system architecture for massive numbers of users, based on HDFS, is proposed and its advantages are analyzed: it effectively relieves the single-point bottleneck, strengthens the security and extensibility of the system, supports multiple access protocols, is compatible with multiple HDFS versions, and so on. 2. A complete system security mechanism is proposed. First, a client login verification method that can withstand a Trojan-infected environment is proposed to strengthen user account security. Second, a file encryption and hierarchical authorization management method is proposed, which keeps user data safe and makes it easy to grant and revoke file authorizations. Third, an access control strategy suited to cloud storage services is given. Analysis shows that the mechanism not only improves system security but also provides user privacy protection and secure access control. Combined with SSL/TLS for encrypted communication, the overall security of the system is greatly enhanced. 3. A load-balancing scheduling mechanism among the application servers is given.
Load balancing of user-layer access requests is achieved, and application servers can join or leave the cluster, which avoids a single point of failure. Balanced scheduling of access requests and cache management on the application servers work together to improve system performance and load capacity effectively. 4. Optimization schemes are proposed that target the characteristics of HDFS and of massive numbers of users, for example: adding resumable (breakpoint) transfer, packing small files for storage, handling redundant large files by reference, adding a cache on the application servers, and mapping the file container structure to the HDFS structure. Compared with the original HDFS system, the method proposed in this paper increases the system ...
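As a rough illustration of two of the optimizations listed in point 4 (packing small files for storage and storing duplicate content once by reference), the following Python sketch may help. It is not the thesis's implementation: the class `PackedStore`, its method names, the use of SHA-256 to detect duplicates, and the in-memory `bytearray` standing in for a container file on HDFS are all assumptions made for this example.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class PackedStore:
    """Toy in-memory stand-in for a service layer in front of HDFS (hypothetical)."""
    container: bytearray = field(default_factory=bytearray)  # one large "container file"
    index: dict = field(default_factory=dict)     # digest -> (offset, length) in container
    refcount: dict = field(default_factory=dict)  # digest -> number of names pointing at it
    names: dict = field(default_factory=dict)     # (user, path) -> digest

    def put(self, user: str, path: str, data: bytes) -> str:
        """Store data under (user, path); identical content is stored only once."""
        digest = hashlib.sha256(data).hexdigest()
        old = self.names.get((user, path))
        if old == digest:
            return digest                  # same content already stored under this name
        if old is not None:
            self.refcount[old] -= 1        # overwriting: drop the old reference
        if digest not in self.index:
            # New content: append it to the container and remember where it lives.
            self.index[digest] = (len(self.container), len(data))
            self.container += data
            self.refcount[digest] = 0
        self.refcount[digest] += 1
        self.names[(user, path)] = digest
        return digest

    def get(self, user: str, path: str) -> bytes:
        """Read a file back by slicing its byte range out of the container."""
        offset, length = self.index[self.names[(user, path)]]
        return bytes(self.container[offset:offset + length])

    def delete(self, user: str, path: str) -> None:
        """Remove one name; reclaiming unreferenced bytes is left out of this sketch."""
        digest = self.names.pop((user, path))
        self.refcount[digest] -= 1


if __name__ == "__main__":
    store = PackedStore()
    store.put("alice", "/a.txt", b"hello")
    store.put("bob", "/copy.txt", b"hello")   # duplicate content, stored by reference
    store.put("bob", "/b.txt", b"world!")
    print(len(store.container), store.get("bob", "/copy.txt"))
```

In this sketch many small files share one container, so the underlying file system would see a single large object rather than many tiny ones, and a second upload of the same bytes only adds a reference. A real system would also need compaction of unreferenced ranges and persistent index storage, neither of which is shown here.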
[Degree-granting institution]: East China Normal University
[Degree level]: Master's
[Year degree granted]: 2013
[Classification number]: TP333; TP309