天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當前位置:主頁 > 論文百科 > 論文查重 >

基于篇章結構的抄襲論文識別系統(tǒng)的研究與實現(xiàn)

發(fā)布時間:2018-08-08 13:49
【摘要】: 目前,剽竊已經(jīng)是一個日益嚴重的問題。隨著數(shù)字化圖書館、互聯(lián)網(wǎng)的普及和迅速發(fā)展,大量的以數(shù)字形式存在的資源使剽竊變得更加容易,尤其是學生和學術研究人員,他們通過網(wǎng)絡搜索工具很容易就可以找到與課題研究相關的內容。特別是近幾年來,抄襲、一稿多投等一系列的剽竊事件屢見報端,其問題的嚴重性越來越引起人們的重視。要杜絕此類現(xiàn)象、凈化學術氛圍,除了要加強對學生的教育、制定相應的法律法規(guī)外,建立有效的抄襲識別系統(tǒng)已經(jīng)刻不容緩。 總結現(xiàn)有的抄襲論文檢測技術和系統(tǒng),存在幾個缺陷:第一,現(xiàn)有的原型系統(tǒng)對于目前較普遍的一篇論文剽竊多篇論文的剽竊方式?jīng)]有做出分析研究。第二,現(xiàn)有的原型系統(tǒng)在檢測過程中大都沒有加入篇章結構相似度的計算,即便考慮了篇章結構的特征也并不全面或者存在著不合理因素。第三,對于已經(jīng)發(fā)生剽竊行為的文檔,現(xiàn)有的原型系統(tǒng)沒有給出相應抄襲類型的判別,對于十分明顯的抄襲類型,不能快速、準確地捕獲。因此,本文研究了現(xiàn)有的復制檢測技術,同時分析了抄襲論文具備的特征,最后采用類似COPS的數(shù)字指紋方法識別學術論文中的完全抄襲和部分抄襲;采用基于篇章信息的詞頻統(tǒng)計方法識別隱式抄襲,并對改進前后的方法利用P-R和MAP指標進行了實驗對比。
[Abstract]:Plagiarism is now an increasingly serious problem. With the popularity and rapid development of digital libraries and the Internet, plagiarism has become easier to plagiarism, especially for students and academic researchers, who can easily find content related to research through web search tools. In particular, in recent years, a series of plagiarism, plagiarism, multiple plagiarism and other plagiarism events have been repeated, and the seriousness of the problems has attracted more and more attention. In order to eliminate such phenomena and purify the academic atmosphere, it is very urgent to establish an effective plagiarism identification system in addition to strengthening the education of students and formulating relevant laws and regulations.
There are several defects in the existing plagiarism detection technology and system. First, the existing prototype system has not made an analysis on the plagiarism method of plagiarizing a number of papers. Second, the existing prototype system has not included the calculation of the similarity of the text structure in the detection process, even if it is considered. The characteristics of the text structure are not comprehensive or unreasonable. Third, for the documents that have been plagiarized, the existing prototype system does not give the identification of the corresponding plagiarism types. For the very obvious type of plagiarism, it can not be quickly and accurately captured. The characteristics of the plagiarism are analyzed. Finally, the complete plagiarism and partial plagiarism in the academic papers are identified by the digital fingerprint method similar to COPS. The word frequency statistics based on text information is used to identify the hidden plagiarism, and the experimental comparison of the improved methods using the P-R and MAP indexes is carried out.
【學位授予單位】:東北師范大學
【學位級別】:碩士
【學位授予年份】:2009
【分類號】:TP311.52

【引證文獻】

相關碩士學位論文 前1條

1 王森;基于主題樹的自上而下文本復制檢測研究[D];大連理工大學;2010年

,

本文編號:2171978

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/wenshubaike/gzzj/2171978.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權申明:資料由用戶56f40***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com