受限領(lǐng)域問答系統(tǒng)的研究與設(shè)計(jì)
發(fā)布時(shí)間:2018-02-23 02:22
本文關(guān)鍵詞: 問答系統(tǒng) FAQ 問題理解 相似度計(jì)算 出處:《內(nèi)蒙古大學(xué)》2012年碩士論文 論文類型:學(xué)位論文
【摘要】:隨著互聯(lián)網(wǎng)的發(fā)展和應(yīng)用,網(wǎng)上的信息迅速增長。人們希望能從海量的網(wǎng)絡(luò)內(nèi)容獲取自己所需要的信息。搜索引擎的出現(xiàn)從很大程度上解決了這個(gè)問題。人們只需輸入一些關(guān)鍵字,搜索引擎就會(huì)返回相關(guān)的網(wǎng)頁。但是面對(duì)繁多的網(wǎng)頁信息,用戶很難迅速找到自己所需的內(nèi)容。因此,為了滿足人們能夠更快速、準(zhǔn)確地獲取信息的愿望,自動(dòng)問答系統(tǒng)(automatic Question Answering System, QA)逐漸發(fā)展起來。 自動(dòng)問答系統(tǒng)允許用戶使用自然語言進(jìn)行提問,并針對(duì)問題返回一個(gè)簡潔準(zhǔn)確的答案。它綜合運(yùn)用多種自然語言處理技術(shù),是計(jì)算機(jī)應(yīng)用領(lǐng)域研究的熱點(diǎn)之一。目前,英文問答系統(tǒng)的研究已比較成熟,由于中文自然語言的復(fù)雜性,因此中文問答系統(tǒng)的研究還處于初步階段。本文研究的是受限領(lǐng)域內(nèi)的中文自動(dòng)問答系統(tǒng)。 本文根據(jù)計(jì)算機(jī)領(lǐng)域知識(shí)的特點(diǎn),研究設(shè)計(jì)了一個(gè)針對(duì)計(jì)算機(jī)網(wǎng)絡(luò)課程基于常問問題庫(FAQ)的中文問答系統(tǒng)。本系統(tǒng)主要研究了領(lǐng)域知識(shí)庫的構(gòu)建,問題理解,計(jì)算句子相似度算法等方面的內(nèi)容。在構(gòu)建領(lǐng)域知識(shí)庫部分,研究設(shè)計(jì)了課程知識(shí)點(diǎn)表結(jié)構(gòu)、FAQ存儲(chǔ)方式、對(duì)FAQ進(jìn)行預(yù)處理;問題理解部分,主要研究了中文分詞、關(guān)鍵詞提取和擴(kuò)展、問題分類方法等;句子相似度計(jì)算部分,采用了基于語義的相似度計(jì)算方法。并建立了相應(yīng)的問題測(cè)試集進(jìn)行試驗(yàn),文章最后介紹了整個(gè)自動(dòng)問答系統(tǒng)的實(shí)驗(yàn)結(jié)果及其評(píng)價(jià)。
[Abstract]:With the development and application of the Internet, Information on the Internet is growing rapidly. People want to get the information they need from a huge amount of online content. The emergence of search engines solves this problem to a large extent. People just need to enter some keywords. The search engine will return the relevant web pages. However, in the face of so many web pages, it is difficult for users to find the content they need quickly. Therefore, in order to satisfy people's desire to obtain information more quickly and accurately, Automatic Question Answering system (QA) is developing gradually. The automatic question answering system allows users to use natural language to ask questions and return a concise and accurate answer to the question. It synthetically uses a variety of natural language processing techniques and is one of the hotspots in the field of computer application. Due to the complexity of Chinese natural language, the study of Chinese question and answer system is still in its preliminary stage. According to the characteristics of computer domain knowledge, this paper studies and designs a Chinese question answering system for computer network courses based on FAQ. This system mainly studies the construction of domain knowledge base and the understanding of problems. In the part of constructing domain knowledge base, we design the structure of course knowledge point table and pre-process FAQ. In the part of problem understanding, we mainly study the Chinese word segmentation, and the main contents of this paper are as follows: (1) in the part of constructing domain knowledge base, we design the structure of course knowledge point table and preprocess FAQ. In the part of sentence similarity calculation, the semantic similarity calculation method is used, and the corresponding question test set is established for the experiment. Finally, the paper introduces the experimental results and evaluation of the whole automatic question and answer system.
【學(xué)位授予單位】:內(nèi)蒙古大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP391.1
【引證文獻(xiàn)】
相關(guān)碩士學(xué)位論文 前1條
1 陳琛;基于領(lǐng)域本體的腎病專家咨詢系統(tǒng)[D];南京郵電大學(xué);2013年
,本文編號(hào):1525893
本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1525893.html
最近更新
教材專著