基于Hadoop的文件同步存儲(chǔ)系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)
[Abstract]:In the age of cloud computing, with the widespread use of network terminal devices and the further popularization of Internet technology, data storage and backup technology has become closely related to individual life and the operation of organizations. Enterprises and individuals are faced with the management problems of massive data. The development of cloud storage and related technologies has brought innovation to the field of data storage. Cloud-based online storage system can provide users with permanent, scalable, convenient and inexpensive data storage and backup services. At present, the more mature domestic storage service products are Jinshan Express, Huawei Netdisk and so on. They all provide stable data storage and file synchronization, but there are also some problems. Firstly, the monitoring function of file system provided by client is not perfect, secondly, the efficiency of file data synchronization is low in some cases. In addition, some products do not provide secure transmission of data or classified data transfer for multiple synchronization events; finally, the existing products do not provide encrypted storage of client and server data. The optimization of cloud storage platform that supports data storage is also a problem that vendors should strive to solve to provide data synchronization storage services based on cloud storage. From the point of view of online synchronous storage service consumer, this paper summarizes the main functions and existing problems of current synchronous storage service products, and starts from the requirements and problems. The key technology of file synchronization storage system based on cloud storage is studied deeply. A cloud storage background based on hadoop is designed and implemented. A file synchronization storage system based on Rsync synchronization algorithm is designed and implemented. The main work of this paper includes: analyzing the advantages and disadvantages of the same kind of products at home and abroad, clarifying the needs of the system users; Using the open source jpathwatch class library to monitor the change of virtual disk in the client, the real-time trigger and notification function of different kinds of synchronous events is realized, and the monitoring of file movement and file renaming is added. Through the classification of synchronous events, the classification of different events, especially the update of file contents and the continuation of events, is realized. A synchronization protocol based on Rsync algorithm is designed to reduce the amount of data transmission between communication parties and improve the synchronization efficiency. According to different synchronization tasks, the optimal data transmission mode is designed, the encrypted data transmission is realized by using HTTPS, and the Hadoop-based cloud storage is used to store the data in the background. In this paper, the hierarchical modularization method is used to design and implement the system, and in the last two chapters of the paper, the functional modules of the system are tested and analyzed, and the research results and the extensible functions of the system are summarized. Finally, the future work is prospected.
【學(xué)位授予單位】:電子科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP333
【參考文獻(xiàn)】
相關(guān)期刊論文 前9條
1 楊亞平,李偉琴;基于SSL的數(shù)據(jù)安全傳輸系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[J];北京航空航天大學(xué)學(xué)報(bào);2001年04期
2 鄧波濤;;基于Java的系統(tǒng)網(wǎng)絡(luò)編程研究[J];電腦知識(shí)與技術(shù);2011年15期
3 林雪云;利用SSL實(shí)現(xiàn)數(shù)據(jù)傳輸安全[J];福建電腦;2005年10期
4 魏興國(guó);;HTTP和HTTPS協(xié)議安全性分析[J];程序員;2007年07期
5 趙斌,劉長(zhǎng)起,戴英俠;Windows操作系統(tǒng)的文件操作監(jiān)控技術(shù)[J];計(jì)算機(jī)工程與應(yīng)用;2004年31期
6 劉貝;湯斌;;云存儲(chǔ)原理及發(fā)展趨勢(shì)[J];科技信息;2011年05期
7 孟彥;侯整風(fēng);;基于SSL/TLS的安全文件傳輸系統(tǒng)[J];計(jì)算機(jī)技術(shù)與發(fā)展;2006年05期
8 谷慶華;李成貴;;Java多線程技術(shù)在網(wǎng)絡(luò)通信系統(tǒng)中的應(yīng)用[J];西安外事學(xué)院學(xué)報(bào);2007年04期
9 周可;王樺;李春花;;云存儲(chǔ)技術(shù)及其應(yīng)用[J];中興通訊技術(shù);2010年04期
相關(guān)碩士學(xué)位論文 前1條
1 李貞;基于Rsync算法的遠(yuǎn)程文件同步系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[D];北京郵電大學(xué);2010年
本文編號(hào):2435552
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2435552.html