基于領(lǐng)域特征和用戶查詢?nèi)拥腄eep Web數(shù)據(jù)源描述方法
[Abstract]:[Objective / meaning] data source description (also known as data source Digest) is one of the key problems in Deep Web integrated retrieval field. The quality of data source description directly affects the retrieval efficiency and effect of integrated retrieval system. This paper presents a data source description method based on domain features and user query sampling, with a view to being a non cooperative environment. It provides reference and reference for the application and research of resource integration. [method / process] this method is an off-line sampling method for heterogeneous and non cooperative data sources. By analyzing the data source and the subject attributes used in the query, the domain feature words set, the initial feature word set and the high frequency characteristic word set are constructed in turn, and the high frequency feature word query is finally obtained. Sample data source description information. Combined with popular CORI algorithm, this paper analyzes the correlation calculation method of user query and data source description based on inference network, and designs an integrated retrieval system based on Lemur tool set based on this method. The effectiveness of the above method is verified. [results / Conclusion] methods are in the aspect of recall and precision. Compared with other methods, this method has obvious cost advantages and practical value in automatic updating and operation management of sample data.
【作者單位】: 中國(guó)科學(xué)院文獻(xiàn)情報(bào)中心;中國(guó)科學(xué)院大學(xué);
【基金】:國(guó)家社會(huì)科學(xué)基金項(xiàng)目“基于開放獲取學(xué)術(shù)期刊的資源深度整合與揭示研究”(項(xiàng)目編號(hào):16BTQ025)研究成果之一
【分類號(hào)】:TP391.3
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 萬(wàn)春,劉麗莉;數(shù)據(jù)源的自動(dòng)生成[J];計(jì)算機(jī)時(shí)代;2001年09期
2 唐懿芳 ,牛力 ,張師超;多數(shù)據(jù)源挖掘中的模式合成技術(shù)[J];菏澤師專學(xué)報(bào);2002年02期
3 蔡璇;田忠和;;多數(shù)據(jù)源查詢的幾種優(yōu)化方法[J];計(jì)算機(jī)與數(shù)字工程;2006年07期
4 王穎;;分布式空間數(shù)據(jù)源的聯(lián)合查詢[J];計(jì)算機(jī)工程與設(shè)計(jì);2007年04期
5 胡鵬昱;趙朋朋;方巍;崔志明;;深網(wǎng)數(shù)據(jù)源質(zhì)量估計(jì)模型[J];計(jì)算機(jī)工程;2009年09期
6 孫宏旭;邢薇;馬立和;;動(dòng)態(tài)多數(shù)據(jù)源的研究與實(shí)現(xiàn)[J];電腦學(xué)習(xí);2010年03期
7 鄧松;萬(wàn)常選;劉喜平;廖國(guó)瓊;;基于用戶反饋的深網(wǎng)數(shù)據(jù)源選擇[J];小型微型計(jì)算機(jī)系統(tǒng);2012年11期
8 鄧松;萬(wàn)常選;吁亮;劉德喜;雷剛;王映龍;;非合作結(jié)構(gòu)化深網(wǎng)數(shù)據(jù)源摘要的動(dòng)態(tài)更新[J];微電子學(xué)與計(jì)算機(jī);2014年04期
9 黃克穎;高s,
本文編號(hào):2134606
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2134606.html