天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 軟件論文 >

面向精準(zhǔn)問答的數(shù)據(jù)處理的設(shè)計與實現(xiàn)

發(fā)布時間:2018-10-11 08:21
【摘要】:隨著網(wǎng)絡(luò)的迅速發(fā)展,使得互聯(lián)網(wǎng)上的信息越來越多,人們越來越不能從海量的信息中獲取對自己有用的信息。當(dāng)用戶使用搜索引擎進行搜索時,給出的結(jié)果往往是非常多的,用戶還需要去自己甄別,這就使得簡單的排列搜索結(jié)果并不能滿足用戶的需求。精準(zhǔn)問答的出現(xiàn)可以為用戶的搜索提供準(zhǔn)確的答案,省去了用戶自己去甄別的過程,為用戶提供更好的搜索體驗。精準(zhǔn)問答中最重要的就是展現(xiàn)給用戶的答案數(shù)據(jù),對數(shù)據(jù)的準(zhǔn)確性和實效性有很高的要求。所以提供的答案需要準(zhǔn)確的數(shù)據(jù)來做支撐,這就需要建立一個非常完善的數(shù)據(jù)處理流程來獲取精準(zhǔn)問答答案數(shù)據(jù)。本文首先論述對精準(zhǔn)問答數(shù)據(jù)處理的研究背景和意義,根據(jù)調(diào)研結(jié)果得出本文的研究內(nèi)容。接下來根據(jù)研究內(nèi)容對整個精準(zhǔn)問答數(shù)據(jù)處理中涉及到的關(guān)鍵技術(shù)和原理進行了詳細闡述,包括Web信息抽取技術(shù)、XML技術(shù)、流式計算平臺、搜索引擎建立索引等。然后對精準(zhǔn)問答數(shù)據(jù)處理進行了總體需求分析,并針對本文設(shè)計與實現(xiàn)的Web信息抽取、文檔拆分、樂隊成員生成三個部分做出了功能需求分析。接著對數(shù)據(jù)處理進行總體設(shè)計,給出了 Web信息抽取模塊、文檔拆分模塊、樂隊成員生成模塊的概要設(shè)計和各模塊詳細設(shè)計與實現(xiàn)細節(jié)。最后說明了測試環(huán)境,分別對三個模塊進行功能測試和性能測試,并對測試結(jié)果進行總結(jié)。
[Abstract]:With the rapid development of the Internet, more and more information on the Internet, people can not get useful information from the mass of information. When users use search engines to search, the results are often very many, and users still need to identify themselves, which makes the simple arrangement of search results can not meet the needs of users. The appearance of precise question and answer can provide the accurate answer for the user's search, obviate the process of the user's own discriminating, and provide the user with better search experience. The most important thing in the accurate question answering is to show the answer data to the user, which requires the accuracy and effectiveness of the data. Therefore, the answers are supported by accurate data, which requires the establishment of a very complete data processing process to obtain accurate question and answer data. This paper first discusses the research background and significance of accurate question and answer data processing. Then according to the research content, the key technologies and principles involved in the whole precise question and answer data processing are described in detail, including Web information extraction technology, XML technology, flow computing platform, search engine index and so on. Then, the general requirement analysis of precise question and answer data processing is carried out, and the functional requirements analysis is made for the three parts of Web information extraction, document splitting and band member generation, which are designed and implemented in this paper. Then the general design of the data processing is given, including the Web information extraction module, the document splitting module, the summary design of the band member generation module, and the detailed design and implementation details of each module. Finally, the test environment is described. The function and performance of the three modules are tested, and the test results are summarized.
【學(xué)位授予單位】:北京郵電大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2016
【分類號】:TP391.1;TP393.09

【參考文獻】

相關(guān)碩士學(xué)位論文 前1條

1 李猛;基于DOM的Web信息抽取技術(shù)的研究與實現(xiàn)[D];大連理工大學(xué);2008年

,

本文編號:2263495

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2263495.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶30268***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com