天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 軟件論文 >

民族節(jié)日領(lǐng)域本體的構(gòu)建及語義檢索模型研究

發(fā)布時(shí)間:2018-09-01 09:24
【摘要】:少數(shù)民族文化是民族發(fā)展過程中創(chuàng)造并傳承下來的精神財(cái)富,有著獨(dú)特的價(jià)值、豐富的內(nèi)涵和鮮明的民族特色,其中少數(shù)民族的傳統(tǒng)節(jié)日更是民族文化不可缺少的一部分。但由于現(xiàn)代社會的發(fā)展和主流文化的強(qiáng)勢沖擊、少數(shù)民族信息資源開發(fā)利用不足,傳統(tǒng)的信息傳播方式使民族風(fēng)俗、文化、宗教等具有民族特色的資源面臨著傳承中斷的困境。本文以潑水節(jié)為例,實(shí)現(xiàn)了對少數(shù)民族節(jié)日領(lǐng)域本體的半自動構(gòu)建,并對民族節(jié)日領(lǐng)域語義檢索模型進(jìn)行研究,利用本體技術(shù)的優(yōu)勢為后續(xù)的少數(shù)民族文化傳承、保護(hù)和宣傳提供了技術(shù)基礎(chǔ)。本文主要工作如下:1.本文總結(jié)國內(nèi)外本體半自動構(gòu)建的現(xiàn)狀,探討領(lǐng)域本體的半自動構(gòu)建的模式方法,提出一種民族節(jié)日領(lǐng)域本體的半自動構(gòu)建方法,并構(gòu)建了領(lǐng)域初始本體,通過專家指導(dǎo)和查詢大量文獻(xiàn)資料構(gòu)造了民族節(jié)日領(lǐng)域詞典用于文本分詞。2.結(jié)合領(lǐng)域需求,通過網(wǎng)絡(luò)爬蟲技術(shù)獲取文本,利用SVM文本分類技術(shù)獲取民族節(jié)日領(lǐng)域中潑水節(jié)相關(guān)文本。在文本預(yù)處理的分詞階段使用本文構(gòu)造的領(lǐng)域詞典并對特征選擇的卡方檢驗(yàn)方法和權(quán)重計(jì)算的TF-IDF方法做出改進(jìn),提高了分類的準(zhǔn)確性。3.對文本分類后獲取到的領(lǐng)域相關(guān)文本進(jìn)行概念和關(guān)系的提取,在概念提取階段使用基于統(tǒng)計(jì)的方法獲取領(lǐng)域概念集,在關(guān)系提取階段以詞法特征為基礎(chǔ),結(jié)合依存句法分析技術(shù),利用基于樹結(jié)構(gòu)思想的SVM對概念關(guān)系進(jìn)行提取,用Jena API和Protégé共同完成本體擴(kuò)展、修正和概念關(guān)系細(xì)化工作,形成最終領(lǐng)域本體。4.為驗(yàn)證領(lǐng)域本體的實(shí)用性,對民族節(jié)日領(lǐng)域本體的檢索應(yīng)用提出了詳細(xì)的設(shè)計(jì)和構(gòu)思,并構(gòu)建了初始的語義檢索試驗(yàn)?zāi)P?為本體的后續(xù)應(yīng)用提供理論基礎(chǔ)和可行性研究。
[Abstract]:Minority culture is the spiritual wealth created and passed down in the process of national development, which has unique value, rich connotation and distinct national characteristics, among which the traditional festival of ethnic minorities is an indispensable part of national culture. However, due to the development of modern society and the strong impact of mainstream culture, the exploitation and utilization of the information resources of ethnic minorities are insufficient, and the traditional ways of information dissemination make the national customs, culture, religion and other resources with national characteristics faced with the dilemma of inheritance and interruption. This paper takes the water splashing festival as an example, realizes the semi-automatic construction of the domain ontology of the minority festival, studies the semantic retrieval model of the national festival field, and makes use of the advantages of ontology technology to carry on the subsequent minority culture inheritance. Protection and publicity provide the technical basis. The main work of this paper is as follows: 1. This paper summarizes the present situation of ontology semi-automatic construction at home and abroad, discusses the mode method of domain ontology semi-automatic construction, puts forward a semi-automatic construction method of national festival domain ontology, and constructs domain initial ontology. Through expert guidance and inquiry a large number of literature materials to construct a national festival field dictionary for text participle. 2. According to the requirements of the field, the text is obtained by the web crawler technology and the text classification technology of SVM is used to obtain the relevant text of the water splashing festival in the field of national festivals. In the word segmentation stage of text preprocessing, the domain dictionary constructed in this paper is used and the chi-square test method of feature selection and the TF-IDF method of weight calculation are improved to improve the accuracy of classification. 3. The concepts and relationships of domain related texts are extracted after text classification. In the phase of concept extraction, a statistical method is used to obtain domain concept sets, and in the phase of relation extraction, lexical features are used as the basis. Combined with dependency syntax analysis technology, SVM based on tree structure is used to extract concept relation, and Jena API and Prot 茅 g 茅 are used to complete ontology extension, revise and refine concept relation, and form final domain ontology. 4. In order to verify the practicability of domain ontology, this paper presents a detailed design and conception for the retrieval application of national festival domain ontology, and constructs an initial semantic retrieval experimental model, which provides a theoretical basis and feasibility study for the subsequent application of ontology.
【學(xué)位授予單位】:云南師范大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 王超;李書琴;肖紅;;基于文獻(xiàn)的農(nóng)業(yè)領(lǐng)域本體自動構(gòu)建方法研究[J];計(jì)算機(jī)應(yīng)用與軟件;2014年08期

2 鐘偉金;;基于共現(xiàn)詞網(wǎng)改造的領(lǐng)域本體自動構(gòu)建模型研究[J];情報(bào)理論與實(shí)踐;2014年01期

3 田維;郭劍毅;余正濤;線巖團(tuán);王炎冰;;結(jié)合FCA與Jena的領(lǐng)域本體半自動構(gòu)建方法研究[J];計(jì)算機(jī)工程與科學(xué);2013年03期

4 路永和;李焰鋒;;改進(jìn)TF-IDF算法的文本特征項(xiàng)權(quán)值計(jì)算方法[J];圖書情報(bào)工作;2013年03期

5 谷俊;;中文專利本體半自動構(gòu)建系統(tǒng)設(shè)計(jì)[J];圖書情報(bào)工作;2013年03期

6 支麗平;王恒山;;基于多Agent的大規(guī)模領(lǐng)域本體的自動化構(gòu)建方法[J];情報(bào)學(xué)報(bào);2012年08期

7 金鑫;;面向Web信息資源的領(lǐng)域本體模型自動構(gòu)建機(jī)制的研究[J];計(jì)算機(jī)科學(xué);2012年06期

8 尚新麗;;國外本體構(gòu)建方法比較分析[J];圖書情報(bào)工作;2012年04期

9 劉寧;李冠宇;邵彬;;Jena2推理機(jī)制的研究[J];微計(jì)算機(jī)信息;2010年33期

10 陽小蘭;錢程;趙海廷;;Web文本預(yù)處理技術(shù)探析[J];電腦知識與技術(shù);2010年29期

相關(guān)博士學(xué)位論文 前2條

1 王進(jìn);基于本體的語義信息檢索研究[D];中國科學(xué)技術(shù)大學(xué);2006年

2 程勇;基于本體的不確定性知識管理研究[D];中國科學(xué)院研究生院(計(jì)算技術(shù)研究所);2005年

相關(guān)碩士學(xué)位論文 前10條

1 劉Z,

本文編號:2216762


資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2216762.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶205e1***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請E-mail郵箱bigeng88@qq.com