天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 科技論文 > 軟件論文 >

數(shù)據(jù)空間中基于數(shù)據(jù)世系的關(guān)聯(lián)關(guān)系獲取方法研究

發(fā)布時(shí)間:2018-09-05 07:08
【摘要】:隨著信息技術(shù)的不斷發(fā)展,數(shù)據(jù)信息逐步呈現(xiàn)海量、多樣、非結(jié)構(gòu)化的特點(diǎn)。而傳統(tǒng)的數(shù)據(jù)庫(kù)技術(shù)已不能夠?qū)@些復(fù)雜數(shù)據(jù)進(jìn)行有效的管理,新的數(shù)據(jù)管理模式——數(shù)據(jù)空間應(yīng)運(yùn)而生,其不但可以支持文檔、Web等多種不同的異構(gòu)數(shù)據(jù)源,而且具有集成演化的特性,強(qiáng)調(diào)數(shù)據(jù)之間的關(guān)聯(lián)性及演化性。而專(zhuān)利文獻(xiàn)中含有豐富的結(jié)構(gòu)化信息及非結(jié)構(gòu)化信息,本文選取海量專(zhuān)利數(shù)據(jù)進(jìn)行分析,挖掘?qū)@g潛在的技術(shù)關(guān)聯(lián)關(guān)系并以此發(fā)現(xiàn)新穎專(zhuān)利。由于專(zhuān)利文獻(xiàn)中引文的缺失以及作者引用動(dòng)機(jī)難以判斷,因此,不能直接使用引用關(guān)系作為專(zhuān)利技術(shù)關(guān)聯(lián)的評(píng)價(jià)指標(biāo)。針對(duì)這一問(wèn)題,本文構(gòu)建了專(zhuān)利間綜合語(yǔ)義相似度模型,用以評(píng)估專(zhuān)利間的技術(shù)關(guān)聯(lián)。首先,根據(jù)專(zhuān)利文獻(xiàn)中包含的專(zhuān)利作者、IPC專(zhuān)利分類(lèi)號(hào)等結(jié)構(gòu)化信息分別構(gòu)建了專(zhuān)利作者相同關(guān)系矩陣WA和基于IPC專(zhuān)利分類(lèi)號(hào)共類(lèi)關(guān)系矩陣WC;然后,針對(duì)專(zhuān)利標(biāo)題、摘要、權(quán)利說(shuō)明書(shū)等文本信息構(gòu)建專(zhuān)利文本相似度矩陣Ws,最后,進(jìn)行多維融合構(gòu)建綜合語(yǔ)義相似度模型。接下來(lái),引入時(shí)序因素并結(jié)合專(zhuān)利間綜合語(yǔ)義相似度模型構(gòu)建專(zhuān)利世系關(guān)聯(lián)網(wǎng)絡(luò),根據(jù)專(zhuān)利數(shù)據(jù)世系分析相關(guān)技術(shù)的演化路徑,以此來(lái)對(duì)專(zhuān)利價(jià)值進(jìn)行評(píng)估,并挖掘新穎專(zhuān)利。首先利用專(zhuān)利世系關(guān)聯(lián)網(wǎng)絡(luò)中專(zhuān)利間潛在的直接或間接被引關(guān)系,綜合考量專(zhuān)利價(jià)值隨時(shí)間指數(shù)衰減因素及潛在的直接或間接被引的專(zhuān)利對(duì)專(zhuān)利價(jià)值的貢獻(xiàn)度,提出專(zhuān)利價(jià)值評(píng)估算法;由于新加入的專(zhuān)利對(duì)原有專(zhuān)利世系關(guān)聯(lián)網(wǎng)絡(luò)中的專(zhuān)利的價(jià)值影響,為節(jié)省大量重復(fù)計(jì)算的時(shí)間,最后提出專(zhuān)利價(jià)值動(dòng)態(tài)更新算法,當(dāng)在T+1時(shí)刻新加入的專(zhuān)利與原有T時(shí)刻的專(zhuān)利存在潛在技術(shù)關(guān)聯(lián)時(shí),其價(jià)值為所有的鄰接點(diǎn)的價(jià)值傳遞度之和,從而提高算法的計(jì)算效率。最后,使用專(zhuān)利數(shù)據(jù)集進(jìn)行相關(guān)實(shí)驗(yàn),經(jīng)實(shí)驗(yàn)結(jié)果對(duì)比分析驗(yàn)證了專(zhuān)利綜合語(yǔ)義相似度模型的準(zhǔn)確性以及專(zhuān)利價(jià)值動(dòng)態(tài)更新算法的高效性。
[Abstract]:With the continuous development of information technology, data information gradually presents the characteristics of mass, diversity, unstructured. However, the traditional database technology can not manage these complex data effectively, and a new data management model, data space, emerges as the times require, which can not only support many different heterogeneous data sources, such as document and Web, etc. Moreover, it has the characteristics of integration and evolution, emphasizing the relevance and evolution of data. The patent literature contains abundant structured information and unstructured information. This paper selects massive patent data to analyze the potential technological relationships between patents and find new patents. Due to the lack of citation in patent literature and the difficulty in judging the author's citation motivation, the citation relation cannot be directly used as the evaluation index of patent technology relevance. To solve this problem, a comprehensive semantic similarity model between patents is constructed to evaluate the technical association between patents. First of all, according to the structured information of patent author WA and WC; based on IPC patent classification number, the same relationship matrix WA and WC; are constructed respectively. The patent text similarity matrix (Ws,) is constructed with the text information such as the specification. Finally, the comprehensive semantic similarity model is constructed by multi-dimensional fusion. Then, the temporal factors are introduced and combined with the comprehensive semantic similarity model among patents to construct the patent lineage correlation network. According to the patent data lineage, the evolution path of the related technology is analyzed to evaluate the patent value and explore novel patents. Firstly, by using the potential direct or indirect citation relationship between patents in the related network of patent lineages, the factors of exponential decay of patent value over time and the contribution of potential direct or indirect cited patents to patent value are considered synthetically. Due to the influence of the new patent on the value of the patent in the original patent-related network, in order to save a lot of time of repeated calculation, a dynamic updating algorithm of patent value is put forward. When there is a potential technical correlation between the newly added patent and the original patent at T1, the value of the patent is the sum of the value transfer degrees of all adjacent points, thus improving the computational efficiency of the algorithm. Finally, the patent data set is used to carry on the related experiments, and the accuracy of the patent synthesis semantic similarity model and the efficiency of the patent value dynamic updating algorithm are verified by the comparison and analysis of the experimental results.
【學(xué)位授予單位】:哈爾濱工程大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2016
【分類(lèi)號(hào)】:TP391.1

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 馮嶺;彭智勇;劉斌;車(chē)敦仁;;一種基于潛在引用網(wǎng)絡(luò)的專(zhuān)利價(jià)值評(píng)估方法[J];計(jì)算機(jī)研究與發(fā)展;2015年03期

2 黃斌;黃魯成;吳菲菲;苗紅;;基于專(zhuān)利共類(lèi)的技術(shù)間關(guān)聯(lián)性評(píng)估[J];情報(bào)雜志;2015年02期

3 王鑫;趙蘊(yùn)華;高芳;;基于分類(lèi)號(hào)和引文的專(zhuān)利相似度測(cè)量方法研究[J];數(shù)字圖書(shū)館論壇;2015年01期

4 劉峰;吳瑞紅;徐川;呂學(xué)強(qiáng);;專(zhuān)利文獻(xiàn)中關(guān)鍵詞抽取方法的改進(jìn)[J];情報(bào)雜志;2014年12期

5 胡阿沛;張靜;張曉宇;;基于專(zhuān)利文獻(xiàn)的技術(shù)演化分析方法評(píng)述[J];現(xiàn)代情報(bào);2013年10期

6 張杰;劉美佳;翟東升;;基于專(zhuān)利共詞分析的RFID領(lǐng)域技術(shù)主題研究[J];科技管理研究;2013年10期

7 汪雪鋒;趙晨曉;衡曉帆;王有國(guó);張琪;;基于時(shí)間序列的關(guān)聯(lián)分析在技術(shù)監(jiān)測(cè)中的應(yīng)用研究[J];情報(bào)雜志;2013年04期

8 陳立新;梁立明;;技術(shù)領(lǐng)域的集成與整合研究——基于美國(guó)專(zhuān)利IPC的關(guān)聯(lián)分析[J];情報(bào)雜志;2013年01期

9 鐘華;鄧輝;;基于技術(shù)生命周期的專(zhuān)利組合判別研究[J];圖書(shū)情報(bào)工作;2012年18期

10 曾淑琴;吳揚(yáng)揚(yáng);;基于數(shù)據(jù)空間的數(shù)據(jù)源內(nèi)容關(guān)系發(fā)現(xiàn)機(jī)制[J];微型機(jī)與應(yīng)用;2012年14期

相關(guān)會(huì)議論文 前1條

1 張樹(shù)良;王金平;趙亞娟;;國(guó)際半導(dǎo)體照明材料專(zhuān)利技術(shù)發(fā)展態(tài)勢(shì)分析[A];第七屆中國(guó)功能材料及其應(yīng)用學(xué)術(shù)會(huì)議論文集(第4分冊(cè))[C];2010年

相關(guān)碩士學(xué)位論文 前4條

1 謝壽峰;基于專(zhuān)利分析的技術(shù)演變與預(yù)測(cè)研究[D];南京理工大學(xué);2014年

2 劉倩楠;基于專(zhuān)利引文網(wǎng)絡(luò)的技術(shù)演進(jìn)路徑識(shí)別研究[D];大連理工大學(xué);2010年

3 曹菲菲;基于內(nèi)容分析的專(zhuān)利挖掘技術(shù)研究[D];東北大學(xué);2008年

4 侯筱蓉;基于引文路徑分析的專(zhuān)利技術(shù)演進(jìn)圖研究[D];重慶大學(xué);2008年

,

本文編號(hào):2223553

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2223553.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶9902d***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com