當(dāng)前位置：主頁(yè) > 碩博論文 > 信息類(lèi)碩士論文 >

面向中文知識(shí)圖譜的數(shù)據(jù)重組與應(yīng)用

發(fā)布時(shí)間：2018-09-10 18:30

【摘要】：隨著語(yǔ)義網(wǎng)的快速發(fā)展,越來(lái)越多基于RDF的圖譜數(shù)據(jù)被發(fā)布到萬(wàn)維網(wǎng)上,組成了開(kāi)放鏈接數(shù)據(jù)(Linking Open Data)。一般來(lái)說(shuō),這些開(kāi)放數(shù)據(jù)提供SPARQL查詢(xún)服務(wù)和關(guān)鍵詞訪問(wèn)服務(wù)。實(shí)際上,相當(dāng)一部分用戶(hù)在訪問(wèn)的時(shí)候會(huì)選擇關(guān)鍵詞訪問(wèn),這些訪問(wèn)行為同時(shí)也被記錄在服務(wù)器的日志中。盡管用戶(hù)期望進(jìn)行表達(dá)能力更強(qiáng)的查詢(xún),SPARQL的復(fù)雜性和對(duì)所要查詢(xún)圖譜的不了解,往往會(huì)使得用戶(hù)很難獲得理想的查詢(xún)結(jié)果。除了RDF這種語(yǔ)義網(wǎng)數(shù)據(jù)交換的標(biāo)準(zhǔn)格式之外,隨著No SQL的興起和發(fā)展,基于屬性圖數(shù)據(jù)的查詢(xún)和存儲(chǔ)也得到越來(lái)越多的關(guān)注和研究。雖然有一部分基于屬性圖的評(píng)測(cè)標(biāo)準(zhǔn)已經(jīng)發(fā)布并且應(yīng)用到實(shí)際的場(chǎng)景中,但仍然缺乏被廣泛認(rèn)可的衡量綜合性能的評(píng)測(cè)基準(zhǔn)。因此,如何更好地組織和使用迄今為止積累下來(lái)的海量基于RDF的語(yǔ)義數(shù)據(jù),已經(jīng)成為語(yǔ)義網(wǎng)領(lǐng)域一個(gè)開(kāi)放性的問(wèn)題。畢業(yè)設(shè)計(jì)正是從這個(gè)背景出發(fā),提出了一個(gè)針對(duì)RDF的圖譜上SPARQL查詢(xún)推薦的框架和一個(gè)利用RDF數(shù)據(jù)對(duì)屬性圖進(jìn)行基準(zhǔn)評(píng)測(cè)的方法。具體來(lái)說(shuō),首先是提出來(lái)一個(gè)針對(duì)SPARQL查詢(xún)進(jìn)行推薦的框架。該框架是通過(guò)分析知識(shí)圖譜的訪問(wèn)日志,挖掘得到用戶(hù)查詢(xún)的偏好情況,并結(jié)合用戶(hù)的原始SPARQL查詢(xún)語(yǔ)句,推薦合適查詢(xún)語(yǔ)句。Zhishi.me上的實(shí)驗(yàn)結(jié)果表明推薦后的查詢(xún)語(yǔ)句能返回具有更好可讀性的查詢(xún)結(jié)果,能幫助用戶(hù)更好地使用SPARQL語(yǔ)句來(lái)遍歷知識(shí)圖譜。除此之外,本文還提出了一個(gè)利用已有RDF數(shù)據(jù)集來(lái)生成評(píng)測(cè)屬性圖的存儲(chǔ)的評(píng)測(cè)基準(zhǔn)。該方法先將RDF的數(shù)據(jù)模型轉(zhuǎn)換為屬性圖的數(shù)據(jù)模型,并通過(guò)分析訪問(wèn)日志來(lái)生成相應(yīng)查詢(xún)語(yǔ)句集。基于Zhishi.me的數(shù)據(jù)集實(shí)現(xiàn)了該評(píng)測(cè)基準(zhǔn),并對(duì)Neo4j和Titan這兩個(gè)目前最流行的支持屬性圖存儲(chǔ)的數(shù)據(jù)庫(kù)進(jìn)行了充分的評(píng)測(cè),為用戶(hù)選擇使用合適的數(shù)據(jù)庫(kù)提供了可靠的參考依據(jù)。
[Abstract]:With the rapid development of semantic web, more and more map data based on RDF have been published on the world wide web, forming open link data (Linking Open Data). In general, this open data provides SPARQL query services and keyword access services. In fact, quite a number of users choose keyword access when accessing, and these access behaviors are also recorded in the server log. Although the complexity of Sparql and the lack of understanding of the query graph, it is difficult for users to obtain ideal query results. In addition to RDF, the standard format of semantic web data exchange, with the rise and development of No SQL, query and storage based on attribute graph data have been paid more and more attention and research. Although some attribute graph-based metrics have been published and applied to actual scenarios, there is still a lack of widely accepted benchmarks to measure comprehensive performance. Therefore, how to better organize and use the accumulated mass of semantic data based on RDF has become an open problem in the field of semantic Web. It is against this background that the graduation project proposes a framework for SPARQL query and recommendation on the RDF graph and a method for benchmarking attribute diagrams using RDF data. Specifically, the first is to propose a framework for the recommendation of SPARQL queries. By analyzing the access log of the knowledge map, the framework can mine the preference of the user query, and combine with the original SPARQL query statement of the user. The experimental results on the recommended query statement. Zhishi.me show that the recommended query statement can return the query results with better readability and can help users to traverse the knowledge map better by using SPARQL sentences. In addition, this paper proposes a benchmark to generate the store of the attribute graph by using the existing RDF data set. Firstly, the data model of RDF is transformed into the data model of attribute graph, and the corresponding query statement set is generated by analyzing the access log. The data set based on Zhishi.me implements the benchmark, and gives a full evaluation of the two most popular databases, Neo4j and Titan, which support the storage of attribute diagrams, which provides a reliable reference for users to choose the appropriate database.
【學(xué)位授予單位】：上海交通大學(xué)
【學(xué)位級(jí)別】：碩士
【學(xué)位授予年份】：2015
【分類(lèi)號(hào)】：TP391.3

【相似文獻(xiàn)】

相關(guān)期刊論文前10條

1 曹起武;;用“對(duì)號(hào)入座”法開(kāi)展SQL查詢(xún)語(yǔ)句的教學(xué)[J];邢臺(tái)職業(yè)技術(shù)學(xué)院學(xué)報(bào);2010年05期

2 曾慶森,楊武;數(shù)據(jù)庫(kù)SQL查詢(xún)語(yǔ)句優(yōu)化方法的研究[J];電腦開(kāi)發(fā)與應(yīng)用;2001年02期

3 陳立明;SQL查詢(xún)語(yǔ)句優(yōu)化方法的研究[J];山西電子技術(shù);2002年04期

4 王振輝;吳廣茂;;SQL查詢(xún)語(yǔ)句優(yōu)化研究[J];計(jì)算機(jī)應(yīng)用;2005年S1期

5 楊波;薛錦云;;開(kāi)發(fā)等式比較SQL查詢(xún)語(yǔ)句的一種模型推理方法[J];計(jì)算機(jī)工程與應(yīng)用;2007年22期

6 王菲菲;;淺談SQL查詢(xún)語(yǔ)句的優(yōu)化方法[J];吉林華橋外國(guó)語(yǔ)學(xué)院學(xué)報(bào);2009年02期

7 蔡柳萍;;SQL查詢(xún)語(yǔ)句的優(yōu)化[J];經(jīng)營(yíng)管理者;2011年01期

8 楊姝;路遙;馬紅霞;;SQL查詢(xún)語(yǔ)句的優(yōu)化方法研究[J];硅谷;2011年02期

9 方瑞英;;淺析派生表在SQL查詢(xún)語(yǔ)句中的應(yīng)用[J];辦公自動(dòng)化;2013年04期

10 甄真;陳虎;張林亞;;列數(shù)據(jù)庫(kù)的SQL查詢(xún)語(yǔ)句編譯與優(yōu)化[J];計(jì)算機(jī)工程;2013年06期

相關(guān)會(huì)議論文前2條

1 陳新宇;楊冬青;唐世渭;陶艷瑰;崔宗軍;;基于受限漢語(yǔ)的數(shù)據(jù)庫(kù)查詢(xún)語(yǔ)句分析[A];第十六屆全國(guó)數(shù)據(jù)庫(kù)學(xué)術(shù)會(huì)議論文集[C];1999年

2 熊文新;宋柔;;信息檢索查詢(xún)語(yǔ)句的表述分析[A];第四屆全國(guó)語(yǔ)言文字應(yīng)用學(xué)術(shù)研討會(huì)論文集[C];2005年

相關(guān)重要報(bào)紙文章前2條

1 特約作者熾天使;不花錢(qián)拿IT認(rèn)證[N];電腦報(bào);2004年

2 河南張華貴;數(shù)據(jù)庫(kù)中參數(shù)化查詢(xún)的實(shí)現(xiàn)[N];電腦報(bào);2001年

相關(guān)碩士學(xué)位論文前7條

1 陳柏良;面向中文知識(shí)圖譜的數(shù)據(jù)重組與應(yīng)用[D];上海交通大學(xué);2015年

2 劉強(qiáng);面向查詢(xún)語(yǔ)句的擴(kuò)展過(guò)濾及權(quán)重計(jì)算研究[D];華中師范大學(xué);2013年

3 畢妲妮;查詢(xún)語(yǔ)句的概念分析及其在檢索中的應(yīng)用[D];上海交通大學(xué);2013年

4 張占英;關(guān)于數(shù)據(jù)庫(kù)漢語(yǔ)查詢(xún)語(yǔ)句中查詢(xún)信息的研究[D];河南大學(xué);2004年

5 王晶;非結(jié)構(gòu)化數(shù)據(jù)結(jié)構(gòu)化存儲(chǔ)中的查詢(xún)語(yǔ)句重寫(xiě)技術(shù)研究[D];華中科技大學(xué);2013年

6 李敏銘;基于JavaEE的數(shù)據(jù)庫(kù)輔導(dǎo)教學(xué)系統(tǒng)的設(shè)計(jì)和實(shí)現(xiàn)[D];電子科技大學(xué);2013年

7 朱素英;基于語(yǔ)音的圖書(shū)資料查詢(xún)漢語(yǔ)接口研究[D];國(guó)防科學(xué)技術(shù)大學(xué);2005年

，

本文編號(hào)：2235270

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會(huì)員下載

Download by Member

本文鏈接：http://sikaile.net/shoufeilunwen/xixikjs/2235270.html

上一篇：原位生長(zhǎng)寬禁帶異質(zhì)材料紫外探測(cè)器的研制
下一篇：基于借閱記錄的圖書(shū)個(gè)性化推薦方法研究與應(yīng)用

論文發(fā)表

·知網(wǎng)|萬(wàn)方|維普|龍?jiān)磡省級(jí)|國(guó)家級(jí)|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

面向中文知識(shí)圖譜的數(shù)據(jù)重組與應(yīng)用