Research and Implementation of a Distributed Knowledge Search System
[Abstract]: The Internet contains a great deal of valuable information, and search engines are an important tool for retrieving it. Traditional search engines rely only on keyword matching to find relevant pages and rank them by some algorithm, without using the semantic information in the pages. As Internet technology develops and users demand more accurate search, traditional search engines can no longer keep up. Knowledge search emerged to address these shortcomings: it analyzes the intent behind a user's query and returns the relevant knowledge, which greatly improves the accuracy and relevance of the results. Because natural language processing is time-consuming, and because a growing knowledge base raises storage and security problems, this thesis combines knowledge search with a distributed framework, implementing a workflow framework, a distributed crawler, and a distributed knowledge extraction module. The processing flow of the distributed knowledge search system can be configured flexibly, and the efficiency of the single-machine and distributed systems is compared. A comparative experiment on an experimental distributed system of three machines shows that distributed knowledge extraction is nearly twice as efficient as the single-machine system, and that efficiency can improve further as the cluster is expanded. The distributed system can also provide better security.
[Degree-granting institution]: Beijing University of Posts and Telecommunications
[Degree level]: Master's
[Year degree granted]: 2013
[Classification number]: TP391.3