基于HBase的醫(yī)療衛(wèi)生數(shù)據(jù)中心構(gòu)建與異構(gòu)數(shù)據(jù)庫同步研究
發(fā)布時間:2018-05-19 23:02
本文選題:醫(yī)療衛(wèi)生 + HBase。 參考:《電子科技大學(xué)》2013年碩士論文
【摘要】:隨著社會信息化的飛速發(fā)展,,國家衛(wèi)生部就推進(jìn)醫(yī)療衛(wèi)生行業(yè)信息化建設(shè)作出專門的強(qiáng)調(diào)。本文則以省衛(wèi)生廳的“區(qū)域醫(yī)療監(jiān)管平臺”課題為研究基礎(chǔ),針對醫(yī)療衛(wèi)生行業(yè)的信息孤島、信息協(xié)同共享及監(jiān)管困難等難題,提出構(gòu)建醫(yī)療衛(wèi)生信息數(shù)據(jù)中心,并把所有醫(yī)療行業(yè)的數(shù)據(jù)同步到數(shù)據(jù)中心的設(shè)計思想來解決這些難題。在此過程中就需要解決醫(yī)療衛(wèi)生數(shù)據(jù)中心的構(gòu)建、異構(gòu)關(guān)系數(shù)據(jù)庫到數(shù)據(jù)中心的同步和數(shù)據(jù)中心效率問題就構(gòu)成本文研究內(nèi)容。 通過深入的研究分析醫(yī)療衛(wèi)生信息數(shù)據(jù)中心規(guī)模及相關(guān)需求問題。在對比和分析關(guān)系數(shù)據(jù)庫與新型的NoSQL數(shù)據(jù)庫的差異后提出運用NoSQL數(shù)據(jù)庫來構(gòu)建,通過分析幾種NoSQL數(shù)據(jù)庫特點確定以HBase構(gòu)建醫(yī)療衛(wèi)生信息數(shù)據(jù)中心。為確定數(shù)據(jù)模型,提出以關(guān)系數(shù)據(jù)庫的E-R建模基礎(chǔ)提出HBase的C-O-R建模思想,然后綜合分析衛(wèi)生部頒布醫(yī)療衛(wèi)生元數(shù)據(jù)標(biāo)準(zhǔn)和部分醫(yī)療衛(wèi)生機(jī)構(gòu)的實際情況,實現(xiàn)醫(yī)療衛(wèi)生信息的HBase數(shù)據(jù)中心構(gòu)建。 為實現(xiàn)把各醫(yī)療機(jī)構(gòu)的異構(gòu)關(guān)系數(shù)據(jù)庫中的數(shù)據(jù)透明無差異的同步到數(shù)據(jù)中心,提出以下方案。首先,在數(shù)據(jù)格式上提出異構(gòu)數(shù)據(jù)庫產(chǎn)生的異構(gòu)數(shù)據(jù)進(jìn)行標(biāo)準(zhǔn)化的XML和JSON通用數(shù)據(jù)格式封裝實現(xiàn)屏蔽異構(gòu)數(shù)據(jù)庫的數(shù)據(jù)差異;其次是傳輸協(xié)議簡單及通用,采用SOA架構(gòu)設(shè)計思想,提出運用Web Service方法實現(xiàn)數(shù)據(jù)同步傳輸;在異構(gòu)數(shù)據(jù)庫的增量數(shù)據(jù)獲取方面揉合時間戳、觸發(fā)器及日志法為一體的方法;最后,提出通用前置機(jī)設(shè)計實現(xiàn)對所有異構(gòu)數(shù)據(jù)庫的讀取,實現(xiàn)采用XML文件對異構(gòu)數(shù)據(jù)庫的差異配置。 在構(gòu)建好HBase中心并擁有數(shù)據(jù)后,由于HBase對于數(shù)據(jù)訪問只有Row Key的定位及全表掃描兩種,為了提高復(fù)雜查找效率而提出HBase的列索引構(gòu)建。本文提出兩種索引設(shè)計方式:運用Row Key優(yōu)勢和構(gòu)建索引表。本文索引表的構(gòu)建是采用MySQL數(shù)據(jù)庫與HBase數(shù)據(jù)庫相結(jié)合的雙索引體系結(jié)構(gòu)。 最后經(jīng)過模擬測試分析,并同MySQL的相關(guān)測試性能進(jìn)行對比,驗證出本文的設(shè)計方法可以較好的完成醫(yī)療衛(wèi)生信息中心的構(gòu)建和異構(gòu)數(shù)據(jù)庫到數(shù)據(jù)中心的同步,實現(xiàn)醫(yī)療衛(wèi)生數(shù)據(jù)共享、監(jiān)管及業(yè)務(wù)協(xié)同。
[Abstract]:With the rapid development of social informatization, the Ministry of Health has made a special emphasis on promoting the construction of medical and health industry informatization. On the basis of the research of "Regional Medical Supervision platform" of the provincial health department, aiming at the problems of information isolated island, information sharing and supervision difficulties in the medical and health industry, this paper puts forward the construction of the medical and health information data center. And all medical industry data synchronization to the data center design ideas to solve these problems. In this process, it is necessary to solve the construction of medical and health data center. The synchronization between heterogeneous relational database and data center and the efficiency of data center constitute the content of this paper. Through in-depth research and analysis of medical and health information data center size and related needs. After comparing and analyzing the difference between the relational database and the new NoSQL database, the author put forward to use the NoSQL database to construct the medical and health information data center by analyzing the characteristics of several NoSQL databases. In order to determine the data model, this paper puts forward the C-O-R modeling idea of HBase based on E-R modeling of relational database, and then synthetically analyzes the actual situation of medical and health metadata standard promulgated by the Ministry of Health and some medical and health institutions. Construction of HBase data center for medical and health information. In order to synchronize the data in heterogeneous relational database of medical institutions to the data center transparently, the following scheme is put forward. First of all, in the data format, the paper puts forward the standard XML and JSON universal data format encapsulation of heterogeneous data generated by heterogeneous database to mask the data difference of heterogeneous database, and secondly, the transmission protocol is simple and universal. Adopting the idea of SOA architecture design, this paper puts forward the method of synchronizing data transmission using Web Service method, combining time stamp, trigger and log method in the incremental data acquisition of heterogeneous database. Finally, In this paper, a universal front computer is designed to read all heterogeneous databases, and the differential configuration of heterogeneous databases is realized by using XML file. After constructing the HBase center and having the data, the column index construction of HBase is proposed in order to improve the efficiency of complex lookup because HBase has only Row Key positioning and full table scanning for data access. This paper presents two methods of index design: using the advantage of Row Key and constructing index table. In this paper, the construction of index table is based on the combination of MySQL database and HBase database. Finally, through the simulation test analysis, and compared with the related test performance of MySQL, it is verified that the design method of this paper can better complete the construction of medical and health information center and the synchronization of heterogeneous database to data center. Achieve medical and health data sharing, supervision and business collaboration.
【學(xué)位授予單位】:電子科技大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2013
【分類號】:TP311.13;TP308
【參考文獻(xiàn)】
相關(guān)期刊論文 前6條
1 李建江;崔健;王聃;嚴(yán)林;黃義雙;;MapReduce并行編程模型研究綜述[J];電子學(xué)報;2011年11期
2 甄玉鋼;劉璐瑩;康建初;;基于XML的異構(gòu)數(shù)據(jù)庫集成系統(tǒng)構(gòu)架與開發(fā)[J];計算機(jī)工程;2006年02期
3 王玉標(biāo);饒錫如;何盼;;異構(gòu)環(huán)境下數(shù)據(jù)庫增量同步更新機(jī)制[J];計算機(jī)工程與設(shè)計;2011年03期
4 谷方舟;沈波;;JSON數(shù)據(jù)交換格式在異構(gòu)系統(tǒng)集成中的應(yīng)用研究[J];鐵路計算機(jī)應(yīng)用;2012年02期
5 彭想;陳敏;;基于區(qū)域的醫(yī)療衛(wèi)生數(shù)據(jù)共享與交換平臺[J];中國醫(yī)院院長;2008年01期
6 宗文紅;張濤;蔡佳慧;周洲;孔斌;葉強(qiáng);;基于區(qū)域衛(wèi)生信息平臺的探索與實踐[J];中國衛(wèi)生信息管理雜志;2012年04期
相關(guān)碩士學(xué)位論文 前1條
1 李寬;基于HDFS的分布式Namenode節(jié)點模型的研究[D];華南理工大學(xué);2011年
本文編號:1912064
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1912064.html
最近更新
教材專著