DOA下的數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)存儲(chǔ)策略研究
本文選題:DOA + 元數(shù)據(jù)存儲(chǔ) ; 參考:《成都理工大學(xué)》2013年碩士論文
【摘要】:伴隨國(guó)家信息化建設(shè)的快速發(fā)展,各種各類數(shù)據(jù)信息不斷涌現(xiàn),時(shí)時(shí)刻刻為人們的日常生活生產(chǎn)提供信息服務(wù)。Internet的全世界普及,使數(shù)據(jù)每一時(shí)刻都在網(wǎng)絡(luò)中流動(dòng),使數(shù)據(jù)散落在不同的數(shù)據(jù)存儲(chǔ)系統(tǒng)與設(shè)備中。為實(shí)現(xiàn)數(shù)據(jù)在客戶端匯聚,G/S模式應(yīng)運(yùn)而生。它采用“請(qǐng)求(request)-聚合(Polymerization)-服務(wù)(Service)"機(jī)制對(duì)所有的數(shù)據(jù)信息服務(wù)過(guò)程進(jìn)行組織與管理,在客戶端實(shí)現(xiàn)數(shù)據(jù)信息匯聚和服務(wù)聚合。G/S模式解決了數(shù)據(jù)的匯聚問(wèn)題,但是如何管理各種數(shù)據(jù),G/S模式并沒(méi)有提出有效的解決方法。為了能夠更快更好的獲取數(shù)據(jù),為G/S模式服務(wù),成都理工大學(xué)教授苗放老師提出了以數(shù)據(jù)為中心,以標(biāo)識(shí)為主線的面向數(shù)據(jù)的體系結(jié)構(gòu)——DOA,運(yùn)用數(shù)據(jù)的元數(shù)據(jù)實(shí)現(xiàn)對(duì)數(shù)據(jù)的統(tǒng)一管理。要管理各種類型數(shù)據(jù),首先要有相關(guān)數(shù)據(jù)的元數(shù)據(jù),所以數(shù)據(jù)的元數(shù)據(jù)存儲(chǔ)問(wèn)題,成為了DOA首要考慮的問(wèn)題。DOA要求數(shù)據(jù)注冊(cè)中心能夠快速提供數(shù)據(jù)的元數(shù)據(jù)信息,如何存儲(chǔ)才能快速提供元數(shù)據(jù),成為DOA下元數(shù)據(jù)管理數(shù)據(jù)的瓶頸,也是本文研究的問(wèn)題,解決DOA下的數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)快速獲取的存儲(chǔ)問(wèn)題,成為當(dāng)務(wù)之急。本文首先對(duì)G/S模式和DOA進(jìn)行了全面概述,分析了它們的層次結(jié)構(gòu)與對(duì)數(shù)據(jù)匯聚和管理的作用,促使了對(duì)元數(shù)據(jù)存儲(chǔ)策略的研究。通過(guò)對(duì)元數(shù)據(jù)標(biāo)準(zhǔn)和開發(fā)設(shè)計(jì)框架基礎(chǔ)知識(shí)的概述,為設(shè)計(jì)了元數(shù)據(jù)存儲(chǔ)規(guī)范準(zhǔn)備了基礎(chǔ)知識(shí)。本文的工作如下:(1)研究數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)存儲(chǔ)策略由于DOA使用元數(shù)據(jù)統(tǒng)一管理各種數(shù)據(jù),所以元數(shù)據(jù)的獲取速度決定了DOA的性能。因而,如何存儲(chǔ)元數(shù)據(jù)成為元數(shù)據(jù)快速獲取的關(guān)鍵。在分析研究文件系統(tǒng)方式、LDAP目錄存儲(chǔ)方式、各種關(guān)系數(shù)據(jù)庫(kù)存儲(chǔ)方式和內(nèi)存數(shù)據(jù)存儲(chǔ)方式的存儲(chǔ)優(yōu)缺點(diǎn)的基礎(chǔ)上,對(duì)使用何種存儲(chǔ)方式才能使數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)被快速獲取進(jìn)行了分析研究。并依據(jù)研究成果,設(shè)計(jì)了一種能夠?qū)崿F(xiàn)元數(shù)據(jù)快速獲取的元數(shù)據(jù)存儲(chǔ)結(jié)構(gòu)模型。(2)研究元數(shù)據(jù)存儲(chǔ)的相關(guān)算法基于內(nèi)存容量的大小和元數(shù)據(jù)被訪問(wèn)頻率的問(wèn)題,研究為內(nèi)存提供存儲(chǔ)元數(shù)據(jù)的相關(guān)算法。研究了一種基于代價(jià)函數(shù)的緩存替換算法,為數(shù)據(jù)注冊(cè)中心在緩存中動(dòng)態(tài)存儲(chǔ)最近有可能被經(jīng)常訪問(wèn)的元數(shù)據(jù)。同時(shí)進(jìn)一步研究了在緩存中選取高訪問(wèn)頻率元數(shù)據(jù)表的算法,以實(shí)現(xiàn)通常訪問(wèn)的元數(shù)據(jù)信息存入內(nèi)存數(shù)據(jù)庫(kù)。通過(guò)這兩種算法,把訪問(wèn)頻率高的元數(shù)據(jù)存入內(nèi)存,提高了數(shù)據(jù)注冊(cè)中心對(duì)元數(shù)據(jù)的獲取速度。本文的創(chuàng)新點(diǎn)與研究成果有:(1)提出了一種DOA下的數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)快速獲取存儲(chǔ)策略。本元數(shù)據(jù)存儲(chǔ)策略,依據(jù)DOA下的元數(shù)據(jù)統(tǒng)一管理數(shù)據(jù)的要求,利用關(guān)系數(shù)據(jù)庫(kù)的存儲(chǔ)容量、內(nèi)存數(shù)據(jù)庫(kù)的訪問(wèn)效率和緩存動(dòng)態(tài)存儲(chǔ)作用,通過(guò)基于代價(jià)的緩存替換算法實(shí)現(xiàn)在緩存中動(dòng)態(tài)存儲(chǔ)最近有可能被經(jīng)常訪問(wèn)的元數(shù)據(jù),以提高元數(shù)據(jù)的獲取速度與積累元數(shù)據(jù)的被訪問(wèn)頻率;再通過(guò)基于代價(jià)函數(shù)的緩存選取算法,選取緩存中訪問(wèn)頻率最高的元數(shù)據(jù)表存入內(nèi)存數(shù)據(jù)庫(kù),完成元數(shù)據(jù)的內(nèi)存數(shù)據(jù)庫(kù)存儲(chǔ)。在此基礎(chǔ)上,設(shè)計(jì)了DOA下的數(shù)據(jù)注冊(cè)中心的元數(shù)據(jù)存儲(chǔ)結(jié)構(gòu)模型,并依據(jù)此模型設(shè)計(jì)了DOA下的數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)存儲(chǔ)系統(tǒng)。(2)提出了一種數(shù)據(jù)注冊(cè)中心的替換緩存中訪問(wèn)權(quán)值最低元數(shù)據(jù)表的算法。通過(guò)對(duì)現(xiàn)存的幾種常用緩存替換算法分析研究,從函數(shù)的角度,利用緩存數(shù)據(jù)表大小、訪問(wèn)次數(shù)、訪問(wèn)時(shí)間和獲取緩存數(shù)據(jù)表的CPU時(shí)間,設(shè)計(jì)了一種基于代價(jià)函數(shù)的緩存替換算法,實(shí)現(xiàn)最近有可能被經(jīng)常訪問(wèn)的數(shù)據(jù)表在緩存中的存儲(chǔ)。(3)提出了一種數(shù)據(jù)注冊(cè)中心的選取高訪問(wèn)頻率元數(shù)據(jù)表存入內(nèi)存數(shù)據(jù)庫(kù)的算法。在對(duì)現(xiàn)存緩存替換算法研究的基礎(chǔ)上,從函數(shù)的角度,利用緩存數(shù)據(jù)表大小、訪問(wèn)次數(shù)、訪問(wèn)時(shí)間和訪問(wèn)密度,設(shè)計(jì)了一種基于代價(jià)函數(shù)的緩存數(shù)據(jù)表選取算法,實(shí)現(xiàn)高訪問(wèn)頻率元數(shù)據(jù)表從緩存到內(nèi)存數(shù)據(jù)庫(kù)的存儲(chǔ)。
[Abstract]:With the rapid development of national information construction, various kinds of data and information are constantly emerging. The information service of.Internet is popularized all the time for people's daily life. The data will flow in the network at every moment, so that data are scattered in different data storage and storage systems and equipment. G/S mode comes into being. It uses "request" - aggregation (Polymerization) - Service (Service) "mechanism to organize and manage all the data information service processes. The data aggregation problem is solved by data aggregation and service aggregation.G/S mode in the client, but how to manage all kinds of data, G/S mode is not In order to get the data faster and better, to serve the G/S mode faster and better, Professor Miao Fang of Chengdu University of Technology put forward a data centric, identification oriented data oriented architecture, DOA, using data metadata to realize the unified management of the logarithm data. First, to manage various types of data, first In order to have metadata of relevant data, the problem of metadata storage of data has become the primary concern of DOA..DOA requires that data registration center can provide metadata information quickly, how to store metadata quickly, become the bottleneck of metadata management data under DOA, and the problem of this paper, to solve DOA The fast acquisition of data registry metadata has become a top priority. This paper first gives a comprehensive overview of the G/S model and DOA, analyzes their hierarchical structure and the function of data convergence and management, and promotes the research of metadata storage strategy. The basic knowledge is prepared for the design of metadata storage specification. The work of this paper is as follows: (1) study the metadata storage strategy of data registration center because DOA uses metadata to manage all kinds of data, so the speed of metadata acquisition determines the performance of DOA. Therefore, how to store metadata is the key to the rapid acquisition of metadata. On the basis of analyzing the methods of file system, the storage mode of LDAP directory, the storage mode of all kinds of relational databases and the memory of memory data storage methods, this paper analyzes and studies what kind of storage methods used to make the metadata of the data registration center be obtained quickly. And a kind of realization is designed based on the research results. The metadata storage structure model is quickly acquired. (2) the related algorithms of metadata storage are based on the size of memory capacity and the frequency of the metadata being accessed, and the related algorithms for storing metadata for memory are studied. A caching replacement algorithm based on cost function is studied, which is used to move the data registry in the cache. State storage of metadata that may be frequently accessed recently. At the same time, the algorithm of selecting high access frequency metadata in the cache is further studied to save the commonly accessed metadata information into the memory database. Through these two algorithms, the metadata with high frequency of access is stored in the memory, and the data registry is improved to the metadata. The innovation points and research results of this paper are as follows: (1) a fast acquisition strategy for data registration center metadata under DOA is proposed. This metadata storage strategy is based on the requirements of unified management of data under the metadata under DOA, using the storage capacity of the relational database, the access efficiency of the memory database and the dynamic storage of the cache. Use the cost based cache replacement algorithm to dynamically store the most frequently accessed metadata that may be frequently accessed in the cache to improve the speed of metadata acquisition and the access frequency of the accumulated metadata; then, the cache selection algorithm based on the cost function is used to select the metadata with the highest access frequency in the cache and save the memory. On the basis of this, the metadata storage structure model of the data registration center under DOA is designed, and the data registration center metadata storage system under DOA is designed based on this model. (2) a calculation of the minimum metadata table of access weight in the replacement caching of the data registration center is proposed. By analyzing several existing common cache replacement algorithms, from the point of view of function, using the size of the cached data table, the number of access, access time and the CPU time of obtaining the cached data table, a cache replacement algorithm based on the cost function is designed to realize the storage of the data table that may be frequently accessed recently in the cache. (3) an algorithm for selecting the high access frequency metadata table to be stored in the memory database is proposed. On the basis of the existing cache replacement algorithm, a cost function based caching data table is designed from the function point of view, using the size of the cached data table, access times, access time and access density. The algorithm implements the storage of high access frequency metadata tables from cache to memory database.
【學(xué)位授予單位】:成都理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:TP333
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 于海燕,周順平;空間元數(shù)據(jù)系統(tǒng)的研究與設(shè)計(jì)[J];現(xiàn)代計(jì)算機(jī)(專業(yè)版);2002年09期
2 呂秋培,解素芳,李新利,盧曉慧,任鳳仙,于慧敏,劉彩霞,陳曉;關(guān)于元數(shù)據(jù)及其應(yīng)用[J];檔案學(xué)通訊;2003年03期
3 王繼周,李成名,林宗堅(jiān);面向共享的空間元數(shù)據(jù)管理框架研究與設(shè)計(jì)[J];國(guó)土資源遙感;2003年03期
4 李雙文;;論元數(shù)據(jù)的概念、層次和作用[J];云南檔案;2009年09期
5 楊茹;張愛文;;黑龍江省交通數(shù)字化中元數(shù)據(jù)創(chuàng)建問(wèn)題研究[J];黑龍江工程學(xué)院學(xué)報(bào)(自然科學(xué)版);2010年03期
6 曹樹金,馬利霞;描述教育資源的元數(shù)據(jù)標(biāo)準(zhǔn)[J];大學(xué)圖書館學(xué)報(bào);2004年02期
7 熊志剛,徐梅林;基于元數(shù)據(jù)及語(yǔ)意網(wǎng)的學(xué)習(xí)技術(shù)探討[J];開放教育研究;2004年05期
8 袁平,韓景潤(rùn),黨海飛;空間元數(shù)據(jù)自動(dòng)生成技術(shù)研究[J];地理信息世界;2005年01期
9 李曉霞;;元數(shù)據(jù)在空間信息網(wǎng)上發(fā)布中的實(shí)現(xiàn)方法與應(yīng)用[J];計(jì)算機(jī)時(shí)代;2006年04期
10 傅海洋;黃俊;劉曉晨;;物流元數(shù)據(jù)及其管理研究[J];物流科技;2008年10期
相關(guān)會(huì)議論文 前10條
1 趙培翔;張銘;楊冬青;唐世渭;;數(shù)字化文檔元數(shù)據(jù)的自動(dòng)提取[A];第二十屆全國(guó)數(shù)據(jù)庫(kù)學(xué)術(shù)會(huì)議論文集(研究報(bào)告篇)[C];2003年
2 陳勝鵬;邱文;;基于元數(shù)據(jù)的空間地理信息管理系統(tǒng)整合的研究與實(shí)現(xiàn)[A];中國(guó)測(cè)繪學(xué)會(huì)第九次全國(guó)會(huì)員代表大會(huì)暨學(xué)會(huì)成立50周年紀(jì)念大會(huì)論文集[C];2009年
3 萬(wàn)甜甜;劉強(qiáng);董海良;;西南資源環(huán)境信息元數(shù)據(jù)研究[A];第四屆海峽兩岸GIS發(fā)展研討會(huì)暨中國(guó)GIS協(xié)會(huì)第十屆年會(huì)論文集[C];2006年
4 白云;高國(guó)勇;;基于Web的城市地下管線元數(shù)據(jù)共享機(jī)制的研究[A];吉林省測(cè)繪學(xué)會(huì)2008年學(xué)術(shù)年會(huì)論文集(上)[C];2008年
5 殷寧欣;張衛(wèi)平;張濤;;空間信息系統(tǒng)中的模型方法元數(shù)據(jù)標(biāo)準(zhǔn)研究[A];吉林省測(cè)繪學(xué)會(huì)2008年學(xué)術(shù)年會(huì)論文集(上)[C];2008年
6 韓先培;趙軍;;基于Wikipedia的語(yǔ)義元數(shù)據(jù)生成[A];第四屆全國(guó)信息檢索與內(nèi)容安全學(xué)術(shù)會(huì)議論文集(上)[C];2008年
7 王蜀安;張銘;王愛華;楊冬青;;構(gòu)建基于OAI協(xié)議的可互操作數(shù)字圖書館元數(shù)據(jù)服務(wù)[A];第十九屆全國(guó)數(shù)據(jù)庫(kù)學(xué)術(shù)會(huì)議論文集(技術(shù)報(bào)告篇)[C];2002年
8 廖軼;黃磊;袁芳;;元數(shù)據(jù)在高校數(shù)據(jù)中心的應(yīng)用探索[A];中國(guó)高等教育學(xué)會(huì)教育信息化分會(huì)第十二次學(xué)術(shù)年會(huì)論文集[C];2014年
9 徐佳沅;;規(guī)劃信息數(shù)據(jù)庫(kù)的元數(shù)據(jù)集成管理研究[A];廣東省測(cè)繪學(xué)會(huì)第九次會(huì)員代表大會(huì)暨學(xué)術(shù)交流會(huì)論文集[C];2010年
10 陳X;李心科;;基于可擴(kuò)展數(shù)據(jù)清理框架的元數(shù)據(jù)的研究[A];計(jì)算機(jī)技術(shù)與應(yīng)用進(jìn)展·2007——全國(guó)第18屆計(jì)算機(jī)技術(shù)與應(yīng)用(CACIS)學(xué)術(shù)會(huì)議論文集[C];2007年
相關(guān)博士學(xué)位論文 前5條
1 劉海學(xué);基于語(yǔ)義標(biāo)注的元數(shù)據(jù)自動(dòng)構(gòu)建及其相關(guān)技術(shù)研究[D];華東師范大學(xué);2010年
2 符海芳;網(wǎng)絡(luò)地理信息組織和提取的研究及其在農(nóng)業(yè)信息中的應(yīng)用[D];中南大學(xué);2003年
3 辜寄蓉;基于元數(shù)據(jù)的綜合數(shù)據(jù)管理與信息共享[D];成都理工大學(xué);2003年
4 徐昊;科技知識(shí)對(duì)象的語(yǔ)義模式研究[D];吉林大學(xué);2013年
5 王霞;衛(wèi)生統(tǒng)計(jì)調(diào)查元數(shù)據(jù)概念模型的研究[D];第四軍醫(yī)大學(xué);2006年
相關(guān)碩士學(xué)位論文 前10條
1 安靜;基于元數(shù)據(jù)動(dòng)態(tài)獲取的數(shù)據(jù)質(zhì)量檢查技術(shù)[D];中國(guó)地質(zhì)大學(xué)(北京);2015年
2 李欣;交通物流信息平臺(tái)元數(shù)據(jù)研究與應(yīng)用[D];長(zhǎng)安大學(xué);2015年
3 施偉;高性能分布式文件系統(tǒng)元數(shù)據(jù)服務(wù)設(shè)計(jì)與實(shí)現(xiàn)[D];電子科技大學(xué);2014年
4 呂鵬程;一個(gè)分布式文件系統(tǒng)元數(shù)據(jù)服務(wù)器設(shè)計(jì)與實(shí)現(xiàn)[D];電子科技大學(xué);2015年
5 趙卓;禽肉質(zhì)量安全追溯元數(shù)據(jù)研究[D];河北科技師范學(xué)院;2014年
6 秦超;本體元數(shù)據(jù)設(shè)計(jì)、提取及應(yīng)用[D];南京大學(xué);2014年
7 白雷;云存儲(chǔ)中獨(dú)立元數(shù)據(jù)的設(shè)計(jì)與實(shí)現(xiàn)[D];西安電子科技大學(xué);2014年
8 袁進(jìn)俊;DOA下分布式DRC的元數(shù)據(jù)分級(jí)存儲(chǔ)模型研究[D];成都理工大學(xué);2015年
9 張雷;DOA下的數(shù)據(jù)注冊(cè)中心元數(shù)據(jù)存儲(chǔ)策略研究[D];成都理工大學(xué);2013年
10 高小明;國(guó)家基礎(chǔ)地理信息動(dòng)態(tài)元數(shù)據(jù)管理系統(tǒng)的研究與實(shí)現(xiàn)[D];武漢大學(xué);2005年
,本文編號(hào):1912781
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1912781.html