面向海量存儲(chǔ)系統(tǒng)的分層存儲(chǔ)技術(shù)研究
發(fā)布時(shí)間:2018-12-21 09:35
【摘要】:隨著數(shù)據(jù)信息量的快速增長(zhǎng),一方面迫切需要有大容量的海量存儲(chǔ)系統(tǒng)來(lái)存儲(chǔ)這些海量數(shù)據(jù)信息,但是另一方面,由于在海量存儲(chǔ)系統(tǒng)中往往會(huì)出現(xiàn)只有小部分活躍的數(shù)據(jù)文件經(jīng)常被訪(fǎng)問(wèn),而有相當(dāng)大比例的數(shù)據(jù)文件被訪(fǎng)問(wèn)的次數(shù)比較少的情況,因此必須對(duì)海量存儲(chǔ)系統(tǒng)中的數(shù)據(jù)文件進(jìn)行分層存儲(chǔ)管理。 分層存儲(chǔ)的主要技術(shù)思想是:將活躍數(shù)據(jù)文件存放在高性能且價(jià)格較昂貴的存儲(chǔ)設(shè)備上,而將不活躍的文件存放在容量大但價(jià)格低廉的存儲(chǔ)設(shè)備中。但是目前的分層存儲(chǔ)技術(shù)還存在一些問(wèn)題,如:典型的文件價(jià)值判定方法僅僅考慮數(shù)據(jù)本身的固有屬性以及歷史訪(fǎng)問(wèn)頻率,很少考慮文件的訪(fǎng)問(wèn)模式,也沒(méi)有考慮將文件存放在存儲(chǔ)系統(tǒng)的哪個(gè)層次來(lái)加快存儲(chǔ)系統(tǒng)的訪(fǎng)問(wèn)速度;另外由于文件的活躍度也經(jīng)常是隨著業(yè)務(wù)的變化而變化,所以也亟待提出更好的動(dòng)態(tài)遷移策略來(lái)優(yōu)化存儲(chǔ)系統(tǒng)的整體性能。 針對(duì)以上問(wèn)題,本文對(duì)海量存儲(chǔ)系統(tǒng)中數(shù)據(jù)文件的分層管理技術(shù)進(jìn)行了較深入的分析和研究,所做的主要研究工作和取得的研究成果如下: 1、分析和研究了海量存儲(chǔ)系統(tǒng)中文件訪(fǎng)問(wèn)模式和不同存儲(chǔ)設(shè)備性能差異,提出了一種自動(dòng)分層存儲(chǔ)系統(tǒng)(ATSS:Automated Tiered Storage System)的結(jié)構(gòu)。 2、提出了一種基于訪(fǎng)問(wèn)模式與存儲(chǔ)性能相匹配的文件價(jià)值判定方法(FVPSP:FileValuation based on access Pattern and Storage Performance)。該方法的思想是將訪(fǎng)問(wèn)頻率較高或者隨機(jī)讀寫(xiě)比例較高的數(shù)據(jù)文件放在固態(tài)硬盤(pán)SSD上,而將訪(fǎng)問(wèn)頻率較低或者順序讀寫(xiě)比例較高的數(shù)據(jù)文件存放在機(jī)械硬盤(pán)上。 3、針對(duì)文件活躍程度不斷變化的存儲(chǔ)系統(tǒng),,提出了一種基于文件活躍度自適應(yīng)調(diào)整時(shí)間間隔的動(dòng)態(tài)遷移策略,該策略減少了遷移過(guò)程中遷移模塊對(duì)正常業(yè)務(wù)的影響。 本文對(duì)提出的文件價(jià)值判定方法和動(dòng)態(tài)遷移策略進(jìn)行了性能測(cè)試。實(shí)驗(yàn)結(jié)果表明在順序讀寫(xiě)比例較高的工作負(fù)載中,F(xiàn)VPSP文件價(jià)值判定方法能夠取得較好的優(yōu)化效果;另外,自適應(yīng)調(diào)整時(shí)間間隔的動(dòng)態(tài)遷移策略與固定時(shí)間間隔的遷移策略相比,能夠在不影響性能的條件下有效地減少文件的遷移次數(shù)。因此本文的研究結(jié)果對(duì)海量存儲(chǔ)系統(tǒng)中數(shù)據(jù)文件的分層管理技術(shù)研究有一定的參考價(jià)值。
[Abstract]:With the rapid growth of data information, on the one hand, there is an urgent need for massive storage system to store these massive data information, but on the other hand, Because in mass storage systems, only a small number of active data files are frequently accessed, while a large proportion of data files are accessed less frequently. Therefore, data files in mass storage system must be managed by hierarchical storage. The main technical idea of hierarchical storage is to store active data files on high performance and expensive storage devices while inactive files are stored in large but inexpensive storage devices. However, there are still some problems in the current hierarchical storage technology, such as: the typical method for determining the value of a file only considers the inherent attributes of the data and the historical access frequency, and rarely considers the access mode of the file. It also does not consider which level of storage system to store files in order to speed up the access speed of storage system; In addition, because the activity of files often changes with the change of business, it is urgent to propose a better dynamic migration strategy to optimize the overall performance of storage system. Aiming at the above problems, this paper makes a deep analysis and research on the hierarchical management technology of data files in mass storage system. The main research work and research results are as follows: 1. This paper analyzes and studies the file access mode and the performance differences of different storage devices in mass storage system, and proposes an automatic hierarchical storage system (ATSS:Automated Tiered Storage System) structure. 2. A file value determination method (FVPSP:FileValuation based on access Pattern and Storage Performance).) based on the matching of access mode and storage performance is proposed. The idea of this method is that the data files with high access frequency or high random reading / writing ratio are placed on the solid state hard disk (SSD), while the data files with low access frequency or high proportion of sequential reading and writing are stored on the mechanical hard disk. 3. For the storage system with changing file activity, a dynamic migration strategy based on adaptive adjusting time interval of file activity is proposed, which reduces the influence of migration module on normal business during migration. This paper tests the performance of the proposed document value determination method and dynamic migration strategy. The experimental results show that the value determination method of FVPSP file can achieve better optimization effect in the workload with high proportion of sequential reading and writing. In addition, the dynamic migration strategy with adaptive time interval can effectively reduce the number of file migrations without affecting the performance compared with that of the fixed time interval migration strategy. Therefore, the results of this paper have a certain reference value for the research of hierarchical management of data files in mass storage system.
【學(xué)位授予單位】:華南理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類(lèi)號(hào)】:TP333
本文編號(hào):2388732
[Abstract]:With the rapid growth of data information, on the one hand, there is an urgent need for massive storage system to store these massive data information, but on the other hand, Because in mass storage systems, only a small number of active data files are frequently accessed, while a large proportion of data files are accessed less frequently. Therefore, data files in mass storage system must be managed by hierarchical storage. The main technical idea of hierarchical storage is to store active data files on high performance and expensive storage devices while inactive files are stored in large but inexpensive storage devices. However, there are still some problems in the current hierarchical storage technology, such as: the typical method for determining the value of a file only considers the inherent attributes of the data and the historical access frequency, and rarely considers the access mode of the file. It also does not consider which level of storage system to store files in order to speed up the access speed of storage system; In addition, because the activity of files often changes with the change of business, it is urgent to propose a better dynamic migration strategy to optimize the overall performance of storage system. Aiming at the above problems, this paper makes a deep analysis and research on the hierarchical management technology of data files in mass storage system. The main research work and research results are as follows: 1. This paper analyzes and studies the file access mode and the performance differences of different storage devices in mass storage system, and proposes an automatic hierarchical storage system (ATSS:Automated Tiered Storage System) structure. 2. A file value determination method (FVPSP:FileValuation based on access Pattern and Storage Performance).) based on the matching of access mode and storage performance is proposed. The idea of this method is that the data files with high access frequency or high random reading / writing ratio are placed on the solid state hard disk (SSD), while the data files with low access frequency or high proportion of sequential reading and writing are stored on the mechanical hard disk. 3. For the storage system with changing file activity, a dynamic migration strategy based on adaptive adjusting time interval of file activity is proposed, which reduces the influence of migration module on normal business during migration. This paper tests the performance of the proposed document value determination method and dynamic migration strategy. The experimental results show that the value determination method of FVPSP file can achieve better optimization effect in the workload with high proportion of sequential reading and writing. In addition, the dynamic migration strategy with adaptive time interval can effectively reduce the number of file migrations without affecting the performance compared with that of the fixed time interval migration strategy. Therefore, the results of this paper have a certain reference value for the research of hierarchical management of data files in mass storage system.
【學(xué)位授予單位】:華南理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類(lèi)號(hào)】:TP333
【參考文獻(xiàn)】
相關(guān)期刊論文 前1條
1 王印紅;郭相坤;;信息生命周期理論在企業(yè)數(shù)據(jù)中心的應(yīng)用研究[J];中國(guó)管理信息化(綜合版);2007年03期
相關(guān)碩士學(xué)位論文 前1條
1 宋麗娜;海量存儲(chǔ)系統(tǒng)中數(shù)據(jù)分級(jí)存儲(chǔ)的關(guān)鍵技術(shù)研究[D];國(guó)防科學(xué)技術(shù)大學(xué);2011年
本文編號(hào):2388732
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2388732.html
最近更新
教材專(zhuān)著