Research and Application of Visually Assisted Scientific Literature Reading

Published: 2018-03-30 18:46

Topic: document visualization　Entry point: text summarization techniques　Source: Tianjin University, master's thesis, 2016

[Abstract]: In recent years, the number of published scientific papers has grown rapidly, and researchers need to read more and more literature. How to read a scientific paper quickly and effectively has become an important research question. A scientific paper is usually the distillation of a research project and contains many arguments and findings, which makes it hard for readers to grasp its core arguments in a short time. In addition, understanding the important references related to a paper greatly helps in understanding the paper itself. However, a paper's citation network is a complex structure, and readers who search it for related work can easily get lost. Two further questions are therefore worth studying: how to quickly find the most important and most relevant references among many, and how to avoid getting lost in the multidimensional document space while reading. This thesis applies text visual analytics to the reading of scientific literature. To address the problems above, it studies the relationships and characteristics of scientific literature within citation networks and, building on existing research and techniques in document analysis, proposes a text summarization technique based on reading purpose to extract the key sentences of a paper. It also uses the LDA (Latent Dirichlet Allocation) topic model to analyze the topics of scientific literature. In addition, the thesis proposes a visually assisted document reading system based on text summarization and citation relationships. The system extracts the important sentences of a paper with the summarization technique and displays them through multi-scale visualization, so that readers can locate the core content while reading. It extracts the core topics of the references with the topic model and offers several visualization designs, including word clouds, treemaps, and radial diagrams, to show those topics and their relationship to the paper being read. It also records user behavior throughout the reading process, so that users stay focused on their reading purpose and do not get lost. We describe the use of the system and its interactions in a concrete usage scenario and conduct a user study to verify its usability. The results show that the proposed system is scalable and provides a good user experience. Finally, in the case study, analyzing the reading paths of different users reveals many different reading patterns. Future work will model and analyze user reading behavior and use the reading data to make reading recommendations.
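The abstract does not specify how the reading-purpose-based summarization works. As a rough illustration of the general idea only, the following Python sketch ranks sentences by word frequency and gives extra weight to terms that describe the reader's purpose. The function names, the stop-word list, and the `boost` parameter are all assumptions made for this example, and the scoring is a generic frequency heuristic, not the method of the thesis.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption, not from the thesis).
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "are",
              "we", "for", "on", "with", "this", "that", "it"}


def tokenize(text):
    """Lowercase the text and return its alphabetic, non-stop-word tokens."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def extract_key_sentences(text, purpose_terms, top_k=2, boost=3.0):
    """Return the top_k highest-scoring sentences in document order.

    A sentence's score is the mean document frequency of its tokens,
    where tokens listed in purpose_terms count `boost` times as much.
    """
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(tokenize(text))
    purpose = {t.lower() for t in purpose_terms}

    def score(sentence):
        tokens = tokenize(sentence)
        if not tokens:
            return 0.0
        total = sum(freq[t] * (boost if t in purpose else 1.0) for t in tokens)
        return total / len(tokens)  # mean, so long sentences are not favored

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:top_k])]


if __name__ == "__main__":
    sample = ("Topic models group words into topics. "
              "The weather was nice yesterday. "
              "Topic models help readers find topics in references.")
    for sentence in extract_key_sentences(sample, ["topics"], top_k=2):
        print(sentence)
```

With the sample text, the two sentences about topic models outrank the off-topic weather sentence. A real system would need a proper tokenizer and stop-word list for the target language, and likely a stronger relevance model than raw frequency.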
[Degree-granting institution]: Tianjin University
[Degree level]: Master's
[Year degree granted]: 2016
[Classification number]: TP391.1

Article ID: 1687170

Article link: http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/1687170.html
