天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 科技論文 > 搜索引擎論文 >

基于都柏林核心(DC)的中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)研究

發(fā)布時(shí)間:2018-04-21 05:03

  本文選題:中醫(yī)文獻(xiàn)元數(shù)據(jù) + 中醫(yī)古籍; 參考:《中國(guó)中醫(yī)科學(xué)院》2013年碩士論文


【摘要】:中醫(yī)在數(shù)千年的發(fā)展長(zhǎng)河中,形成了異彩紛呈的醫(yī)學(xué)流派,留下了浩如煙海的中醫(yī)古今文獻(xiàn)。中醫(yī)作為珍貴的文化遺產(chǎn),中醫(yī)文獻(xiàn)起到了文化傳承的紐帶作用,記載著幾千年的醫(yī)家智慧、醫(yī)學(xué)經(jīng)驗(yàn)。面對(duì)龐大文獻(xiàn)資源,使用者需要高效率的檢索利用和知識(shí)發(fā)掘;文獻(xiàn)管理者要對(duì)其進(jìn)行分類整理、權(quán)利管理、資源評(píng)鑒、妥善保存。中醫(yī)文獻(xiàn)從其產(chǎn)生、傳遞、儲(chǔ)存、到最終消失的過(guò)程中有著收集、加工、利用等方面的諸多環(huán)節(jié),其生命周期中蘊(yùn)含著大量信息,這些在文獻(xiàn)形成和利用中經(jīng)歷的過(guò)去事實(shí)和文獻(xiàn)資源本身的內(nèi)容一樣都具有一定的記錄價(jià)值。人們致力于研究如何在海量中醫(yī)文獻(xiàn)中按特定需求進(jìn)行查詢檢索,并要盡量返回更為精確的結(jié)果。不該僅僅依靠傳統(tǒng)搜索引擎那樣只用關(guān)鍵詞機(jī)械匹配,而不考慮語(yǔ)義,機(jī)械式的查詢往往會(huì)返回空白、錯(cuò)誤或是不符合意圖的結(jié)果。元數(shù)據(jù)的應(yīng)用,對(duì)這個(gè)問題的解決起到了很大的推動(dòng)作用。 2008年10月,中國(guó)中醫(yī)科學(xué)院中醫(yī)藥信息研究所開始參與ISO/TC215傳統(tǒng)醫(yī)學(xué)信息標(biāo)準(zhǔn)化工作,對(duì)ISO/TC215傳統(tǒng)醫(yī)學(xué)信息標(biāo)準(zhǔn)化的動(dòng)態(tài)進(jìn)行了密切關(guān)注,并且在ISO/TC215中提交中醫(yī)藥信息國(guó)際標(biāo)準(zhǔn)提案“中醫(yī)數(shù)據(jù)集元數(shù)據(jù)標(biāo)準(zhǔn)”,10月份又將“中醫(yī)數(shù)據(jù)集元數(shù)據(jù)標(biāo)準(zhǔn)”改為“中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)”。ISO/TC215已于2011年11月對(duì)“中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)”提案啟動(dòng)投票程序。2012年5月ISO對(duì)“中醫(yī)藥文獻(xiàn)元數(shù)據(jù)”這項(xiàng)標(biāo)準(zhǔn)正式立項(xiàng)。這是我國(guó)中醫(yī)藥信息標(biāo)準(zhǔn)在ISO中首次立項(xiàng)。 本課題是針對(duì)已立項(xiàng)的中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)著重從的設(shè)計(jì)原則、制定方法切入,依據(jù)原則與方法建立一套完整的中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)體系。 首先完成了中醫(yī)藥標(biāo)準(zhǔn)發(fā)展的研究背景調(diào)查,調(diào)研國(guó)外已有的醫(yī)學(xué)元數(shù)據(jù),對(duì)國(guó)內(nèi)醫(yī)學(xué)元數(shù)據(jù)研究現(xiàn)狀在圖書情報(bào)和互聯(lián)網(wǎng)的范圍內(nèi)進(jìn)行調(diào)研。 描述信息資源的元數(shù)據(jù)有描述傳統(tǒng)印刷型文獻(xiàn)的MARC格式,也有描述網(wǎng)絡(luò)信息資源的DC元數(shù)據(jù),還有一種介于MARC和DC之間的第三種元數(shù)據(jù)——MODS;描述醫(yī)學(xué)信息的元數(shù)據(jù)有:ISO的健康信息學(xué)技術(shù)委員會(huì)研制的"ISO13119Health informatics-Clinical knowledge resources-Metadata(健康信息學(xué)-臨床知識(shí)資源-元數(shù)據(jù))”標(biāo)準(zhǔn)、Ohio LINK醫(yī)學(xué)元數(shù)據(jù)、美國(guó)Oregon Health Sciences University制定的醫(yī)學(xué)核心元數(shù)據(jù)MCM、法國(guó)Rouen University Hospital(RUH)1995年發(fā)起的基于質(zhì)量控制的主題網(wǎng)關(guān)項(xiàng)目:CISMeF等。已發(fā)布的這些醫(yī)學(xué)元數(shù)據(jù)標(biāo)準(zhǔn)都是很大程度的參考或復(fù)用了DC。 用學(xué)術(shù)聯(lián)機(jī)數(shù)據(jù)庫(kù)檢索和互聯(lián)網(wǎng)檢索相結(jié)合的方式,發(fā)現(xiàn)國(guó)內(nèi)在元數(shù)據(jù)領(lǐng)域已有較多研究,但涉及醫(yī)學(xué)領(lǐng)域的元數(shù)據(jù)研究非常少,而中醫(yī)領(lǐng)域更是鮮有問津。國(guó)內(nèi)缺乏權(quán)威部門牽頭并起草、正式發(fā)布的醫(yī)學(xué)元數(shù)據(jù)標(biāo)準(zhǔn),使中醫(yī)文獻(xiàn)共享缺乏有力支撐,因此本課題有一定研究與開發(fā)的必要性。 第二,從資源利用、保護(hù)等方面分析中醫(yī)文獻(xiàn)元數(shù)據(jù)標(biāo)準(zhǔn)的研究目的與意義,總結(jié)元數(shù)據(jù)的功能,分析基于DC設(shè)計(jì)新的元數(shù)據(jù)的原因。 中醫(yī)文獻(xiàn)收藏地點(diǎn)分散,現(xiàn)存1949年以前的12000多種中醫(yī)文獻(xiàn)目前分散保存在全國(guó)各專業(yè)圖書館,仍作為各館的鎮(zhèn)館之寶而束之高閣。學(xué)者們?cè)谖墨I(xiàn)整理研究各個(gè)工作環(huán)節(jié)上依然沿用手工作業(yè)的方式。隨著計(jì)算機(jī)技術(shù)應(yīng)用在文獻(xiàn)管理領(lǐng)域的延伸以及掃描技術(shù)的發(fā)展,文獻(xiàn)的電子化處理給讀者帶來(lái)極大的閱讀便利;古老的文獻(xiàn)在重建天日的同時(shí)能夠獲得很好的保護(hù)。國(guó)內(nèi)已有或規(guī)劃中的很多中醫(yī)文獻(xiàn)檢索平臺(tái)和數(shù)據(jù)庫(kù)。與文獻(xiàn)利用的信息技術(shù)的發(fā)展形成對(duì)比的是文獻(xiàn)利用理論支撐的相對(duì)滯后。元數(shù)據(jù)的標(biāo)準(zhǔn)化是文獻(xiàn)利用理論建設(shè)的重要環(huán)節(jié),中醫(yī)文獻(xiàn)元數(shù)據(jù)為中醫(yī)藥文獻(xiàn)資源的規(guī)范化描述奠定了基礎(chǔ),它有助于構(gòu)建明晰、周全、簡(jiǎn)單、易懂的文獻(xiàn)描述性記錄,能有效支持中醫(yī)藥文獻(xiàn)的收集、保管和利用,改善中醫(yī)藥文獻(xiàn)檢索的效果,對(duì)于中醫(yī)藥文獻(xiàn)資源的系統(tǒng)保護(hù)和深度利用具有重要意義。元數(shù)據(jù)基于DC設(shè)計(jì)可以避免MARC格式中大量繁瑣的定長(zhǎng)字段,使得編目界面變得簡(jiǎn)潔而直觀,無(wú)論是專業(yè)編目員還是非專業(yè)編目員,都可以參與文獻(xiàn)編目工作,這使編目工作更能適應(yīng)對(duì)龐大的網(wǎng)絡(luò)化信息資源的組織。 第三,設(shè)計(jì)元數(shù)據(jù)框架,分析元數(shù)據(jù)方案設(shè)計(jì)的通用原則和具體原則,規(guī)劃中醫(yī)文獻(xiàn)元數(shù)據(jù)的設(shè)計(jì)流程。 根據(jù)中醫(yī)文獻(xiàn)生命周期的各項(xiàng)活動(dòng)和描述角度的不同,將中醫(yī)文獻(xiàn)元數(shù)據(jù)劃分為7個(gè)元數(shù)據(jù)子集: (1)標(biāo)識(shí)信息子集:外部特征的基本信息,包括名稱,標(biāo)識(shí)符,創(chuàng)建者和出版者,等等。 (2)內(nèi)容信息子集:關(guān)于中醫(yī)文獻(xiàn)內(nèi)部特征的描述信息,包括描述,主題,等等。 (3)分發(fā)信息子集:關(guān)于用戶獲取和收藏文獻(xiàn)資源的信息。 (4)質(zhì)量信息子集:關(guān)于文獻(xiàn)資源保存狀態(tài)的質(zhì)量信息。 (5)限制信息子集:對(duì)資源和元數(shù)據(jù)獲取和使用的限制信息。 (6)維護(hù)信息子集:關(guān)于維護(hù)保養(yǎng)文獻(xiàn)資源的信息。 (7)關(guān)聯(lián)信息子集:提供了資源之間關(guān)聯(lián)關(guān)系的參考信息。 總結(jié)了設(shè)計(jì)元數(shù)據(jù)標(biāo)準(zhǔn)6條通用原則:(1)簡(jiǎn)單性與適用性原則;(2)專指度與通用性原則;(3)互操作性與易轉(zhuǎn)換性原則;(4)靈活性與可擴(kuò)展性原則;(5)用戶需求原則;(6)遵循現(xiàn)有標(biāo)準(zhǔn)原則。 除了通用原則,針對(duì)具體領(lǐng)域元數(shù)據(jù)的制定歸納了條具體原則:(1)資源分析原則(2)擴(kuò)展原則(元素?cái)U(kuò)展原則和修飾限定原則)(3).元素定義原則(4)置標(biāo)原則 第四,進(jìn)行本文中醫(yī)文獻(xiàn)元數(shù)據(jù)的相關(guān)資源分析,對(duì)著錄對(duì)象和著錄單位等提出了細(xì)節(jié)性的界定。 “文獻(xiàn)”采用廣義的定義;除中醫(yī)外,傳統(tǒng)醫(yī)學(xué)文獻(xiàn)也可適用于此元數(shù)據(jù);當(dāng)實(shí)體文獻(xiàn)資源數(shù)字化后,需對(duì)數(shù)字化文本或影像等格式的文獻(xiàn)資源以及實(shí)體本身屬性進(jìn)行著錄,二者結(jié)合不可分離;具體著錄單位要按實(shí)際需要確定。 第五,完成了中醫(yī)文獻(xiàn)元數(shù)據(jù)的元素集、元素定義及著錄規(guī)則的具體描述,并用摘要和字典兩種形式進(jìn)行呈現(xiàn)。 元素集及其限定詞的摘要展示于下表: 中醫(yī)文獻(xiàn)元數(shù)據(jù)保留了DC的元數(shù)據(jù)元素集,又包括中醫(yī)藥領(lǐng)域的特征元素。 重用DC元數(shù)據(jù)元素,如題名(Title)、類型(Type)、創(chuàng)建者(Creator)、主題(Subject)、描述(Description)、日期(Date)、標(biāo)識(shí)符(Identifier)、語(yǔ)種(Language)、關(guān)聯(lián)(Relation)等; 根據(jù)中醫(yī)藥領(lǐng)域特性,對(duì)DC元數(shù)據(jù)元素進(jìn)行細(xì)化,例如將DC中的題名(Title)進(jìn)一步細(xì)化為版心題名(Title on the Fore-edge)、內(nèi)封題名(Title on the Inside Cover)、書衣題名(Title on the Book Cover)、卷端題名(Title on the First Page of Text)等; 添加具有中醫(yī)藥特色的元數(shù)據(jù)元素,例如歷代醫(yī)家、醫(yī)學(xué)流派等等。 第六,選擇合適的網(wǎng)絡(luò)描述語(yǔ)言作為本元數(shù)據(jù)的置標(biāo)語(yǔ)言,實(shí)現(xiàn)元數(shù)據(jù)的網(wǎng)絡(luò)應(yīng)用功能。 RDF (Resource Description Framework),即資源描述框架,是一種用于描述Web資源的標(biāo)記語(yǔ)言。RDF使用XML語(yǔ)法和RDF Schema (RDFS)來(lái)將元數(shù)據(jù)描述成為數(shù)據(jù)模型。RDF三元組數(shù)據(jù)模型包括的三種對(duì)象類型: ●資源(Resource)。RDF編碼中描述的所有事物都稱為資源。 ●屬性(Property)。屬性是用來(lái)描述資源的外部特征、內(nèi)容說(shuō)明或資源間相互關(guān)系。 ●陳述(Statement)。陳述是用特定模式的語(yǔ)句將資源的屬性及其值表達(dá)出來(lái)。陳述語(yǔ)句可以和自然語(yǔ)言語(yǔ)句相對(duì)應(yīng),資源(Resource)對(duì)應(yīng)于自然語(yǔ)言中的主語(yǔ)(Subject),屬性(Property)對(duì)應(yīng)于謂語(yǔ)(Predicate),屬性值(Value)對(duì)應(yīng)于賓語(yǔ)(Object)。 第七,通過(guò)對(duì)比本元數(shù)據(jù)和國(guó)際權(quán)威元數(shù)據(jù)臨床知識(shí)資源元數(shù)據(jù)標(biāo)準(zhǔn)HICKR,討論本元數(shù)據(jù)的唯一性和不可替代性。 最后,總結(jié)本研究的主要工作,對(duì)中醫(yī)文獻(xiàn)元數(shù)據(jù)的應(yīng)用前景做出展望。
[Abstract]:TCM is a precious cultural heritage . Traditional Chinese medicine literature plays a role of cultural heritage , which records thousands of thousand years of wisdom and medical experience . Facing the huge literature resources , users need high - efficiency search and utilization and knowledge discovery ;
There are many links in the process of collection , processing and utilization in the process of the formation , transmission , storage and eventual disappearance of Chinese medical literature .

In October 2008 , the Chinese Medical Information Institute of Chinese Academy of Traditional Chinese Medicine began to participate in the standardization of traditional medical information of ISO / TC215 . It has paid close attention to the standardization of ISO / TC215 traditional medical information . In October 2011 , the International Standard of Chinese Medicine Data Collection Metadata Standard was changed to " Chinese Medical Document Metadata Standard . " ISO / TC215 started the voting procedure on the proposal of " Chinese Medical Document Metadata Standard " in November 2011 . This is the first entry of the standard of Chinese medicine information in ISO in May 2012 .

The subject is to set up a complete standard system of TCM literature metadata according to the principles and methods , aiming at the design principle and the method of establishing the standard of TCM literature metadata .

Firstly , the research background of the standard development of Chinese medicine is completed , and the existing medical metadata is researched , and the present situation of domestic medical metadata research is investigated in the range of book information and Internet .

The metadata of descriptive information resources describes the MARC format of traditional printed documents , DC metadata describing network information resources , and a third metadata _ MODS between MARC and DC . The metadata of medical information is : " ISO13119Health Science - Clinical Knowledge resources - Metadata " developed by ISO ' s health informatics technology committee , Ohio LINK medical metadata , quality - controlled theme gateway project initiated in 1995 by Rouen University Hospital ( RUH ) , etc . These medical metadata standards have been published to a large extent with reference to or multiplexed DC .

With the combination of academic online database retrieval and Internet search , it has been found that there are many researches in the field of metadata , but the research on metadata in medical field is very few , and the field of Chinese medicine is more and more intensive . There is no authoritative department in the country and drafting and officially releasing medical metadata standard , which makes the sharing of TCM literature lack of strong support . Therefore , the subject has some research and development necessity .

Second , from the aspects of resource utilization , protection and so on , the research purpose and significance of the metadata standard of TCM literature are analyzed , the function of metadata is summarized , and the reason of the new metadata based on DC design is analyzed .

There are more than 12,000 traditional Chinese medicine documents which were dispersed in various professional libraries throughout the country in 1949 , and still serve as the treasure of the library of the various museums . The scholars still use manual operation in the research of the literature . With the development of the computer technology in the field of document management and the development of scanning technology , the electronic processing of the documents brings great convenience to the readers ;
The standardization of metadata is the important link of literature utilization theory . The standardization of metadata is the important link of literature utilization theory .

Thirdly , the metadata framework is designed , the general principles and specific principles of metadata design are analyzed , and the design flow of the metadata of TCM literature is planned .

According to the various activities and description angles of the life cycle of TCM literature , the metadata of TCM literature is divided into 7 metadata subsets :

( 1 ) Identification information subset : basic information of external features , including name , identifier , creator and publisher , etc .

( 2 ) subset of content information : description information about the internal characteristics of the traditional Chinese medicine literature , including description , subject , and so on .

( 3 ) Distribution information subset : information about user acquisition and collection of document resources .

( 4 ) subset of quality information : quality information about the preservation status of document resources .

( 5 ) Restriction information subset : restriction information about the acquisition and use of resources and metadata .

( 6 ) Maintenance information subset : information about maintenance document resources .

( 7 ) Correlation information subset : provides reference information of the relationship between resources .

The general principles of design metadata standard are summarized as follows : ( 1 ) the principle of simplicity and applicability ;
( 2 ) the principle of specificity and universality ; ( 3 ) the principle of interoperability and accessibility ;
( 4 ) Principle of flexibility and scalability ;
( 5 ) User requirement principle ;
( 6 ) Compliance with existing standard principles .

In addition to general principles , specific principles are summarized for the formulation of metadata in specific areas : ( 1 ) the principle of resource analysis ( 2 ) ( 2 ) the principle of expansion ( element extension principle and modification definition principle ) ( 3 ) . Element definition principle ( 4 ) setting principle

Fourthly , the related resource analysis of the literature metadata of the traditional Chinese medicine is carried out , and the detailed definition is put forward for the description object and the description unit , etc .

" Literature " is defined in a broad sense ;
Besides traditional Chinese medicine , the traditional medical literature can also be applied to this metadata ;
When the entity document resources are digitized , the document resources in the format such as digital text or image and the attribute of the entity itself need to be recorded , and the combination of the two entities is not separable ;
The specific directory units shall be determined according to the actual needs .

Fifth , the element set , the element definition and the description of the well - known rules of the TCM literature metadata are completed , and presented in two forms of abstract and dictionary .

The summary of the element set and its qualifier is shown in the following table :

TCM literature metadata preserves the metadata element sets of DC , and also includes the feature elements in the field of traditional Chinese medicine .

Reuse of DC metadata elements , such as Title , Type , Creator , Subject , Description , Date , Identifier , Language , Relation , and so on ;


According to the characteristics of the traditional Chinese medicine field , the DC metadata elements are thinned , such as title on the title - edge , Title on the Inside Cover , Title on the Book Cover , Title on the First Page of Text , and the like ;


Add metadata elements with traditional Chinese medicine characteristics , such as medical experts , medical schools and so on .

and sixthly , selecting an appropriate network description language as a markup language of the metadata , and realizing the network application function of the metadata .

RDF ( Resource Description Framework ) , a resource description framework , is a markup language for describing Web resources . RDF uses XML syntax and RDF Schema ( RDFS ) to describe metadata as a data model . The RDF triple data model includes three object types :

Resource . All things described in RDF coding are called resources .

Property . An attribute is used to describe the external features , content descriptions , or inter - resource relationships of a resource .

A statement is a statement that expresses the property of a resource and its value with a specific schema statement . A statement statement can correspond to a natural language statement . A resource corresponds to a subject in a natural language , and the Property corresponds to a predicate , and the attribute value corresponds to object .

Seventh , the uniqueness and non - substitutability of this meta - data are discussed by comparing the metadata of this metadata and the metadata standard of the international authoritative metadata clinical knowledge resource metadata .

Finally , the main work of this study is summarized , and the prospect of the application of TCM literature metadata is forecasted .

【學(xué)位授予單位】:中國(guó)中醫(yī)科學(xué)院
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:R-05

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 任磊;譚躍生;;基于RDF元數(shù)據(jù)的網(wǎng)格資源統(tǒng)一描述方法[J];內(nèi)蒙古科技大學(xué)學(xué)報(bào);2009年02期

2 肖瓏,陳凌,馮項(xiàng)云,馮英;中文元數(shù)據(jù)標(biāo)準(zhǔn)框架及其應(yīng)用[J];大學(xué)圖書館學(xué)報(bào);2001年05期

3 金毅,王紹平;元數(shù)據(jù)在電子化學(xué)位論文中的應(yīng)用探討[J];大學(xué)圖書館學(xué)報(bào);2002年02期

4 李鵬云,陳奕;試論MARC元數(shù)據(jù)向DC都柏林核心元數(shù)據(jù)的轉(zhuǎn)換[J];新世紀(jì)圖書館;2005年02期

5 呂精巧;宋智忠;郭兆紅;;網(wǎng)絡(luò)環(huán)境下數(shù)字圖書館的安全問題研究[J];科技情報(bào)開發(fā)與經(jīng)濟(jì);2009年22期

6 王偉;;近年來(lái)我國(guó)DC元數(shù)據(jù)研究文獻(xiàn)綜述[J];圖書館理論與實(shí)踐;2007年05期

7 馬珉;元數(shù)據(jù)——組織網(wǎng)上信息資源的基本格式[J];情報(bào)科學(xué);2002年04期

8 李慧;元數(shù)據(jù)在數(shù)字圖書館中的應(yīng)用[J];情報(bào)理論與實(shí)踐;2001年01期

9 王漢元;置標(biāo)語(yǔ)言以及SGML、HTML和XML的關(guān)系[J];情報(bào)雜志;2005年03期

10 陶蘭,楊睿,陳沖,孫曉明;面向語(yǔ)義Web的RDF數(shù)據(jù)處理和應(yīng)用[J];深圳大學(xué)學(xué)報(bào);2005年04期

相關(guān)碩士學(xué)位論文 前1條

1 劉振華;視頻文件元數(shù)據(jù)的設(shè)計(jì)與開發(fā)[D];山東大學(xué);2009年



本文編號(hào):1781021

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1781021.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶9cb15***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com