基于參考譯文圖的機器翻譯質(zhì)量評價方法研究

發(fā)布時間：2018-03-25 08:06

本文選題：機器翻譯　切入點：自動評價　出處：《南京大學》2017年碩士論文

【摘要】：隨著科技日新月異的發(fā)展,機器翻譯技術(shù)得到了越來越多人的關(guān)注,其理論研究以及實際應(yīng)用都已成為機器學習領(lǐng)域的一大熱點。與人工翻譯相比,機器翻譯在翻譯效率上有較明顯的提高,且代價較低,所以自動化翻譯技術(shù)(機器翻譯技術(shù))的研究已經(jīng)成為一個非常重要的研究方向。隨著機器翻譯技術(shù)的飛速發(fā)展,機器翻譯質(zhì)量評價這一研究課題應(yīng)運而生,對于機器翻譯系統(tǒng)產(chǎn)生的翻譯結(jié)果,我們需要衡量其翻譯質(zhì)量如何。完成機器翻譯本身并不是最終目的,我們希望知道機器譯文能夠在多大程度上幫助到人們,同時,對機器譯文質(zhì)量有大致的評估,也有助于學習得到一個更好的機器翻譯系統(tǒng),這兩者相輔相成。對于機器譯文的評測,傳統(tǒng)的方式是進行人工評價,但是這種方式由于其耗時、代價昂貴且結(jié)果不可重現(xiàn),現(xiàn)已被自動評測方法部分取代。當前主流的自動評測方法,采用的都是相同的策略:將機器譯文與一個或是多個人工譯文(參考譯文)進行比較,并且認為一個與參考譯文非常相似的機器譯文肯定比那些與參考譯文有較大差異的譯文更準確。但是,由于語義和表達方式的多樣性,源語言中的一個句子可能有很多不同的正確翻譯結(jié)果(即翻譯多樣性),而參考譯文是有限的,這就會導致在一些情況下,自動評測方法不能準確的對譯文質(zhì)量有一個評估,同時這一問題也限制了機器翻譯系統(tǒng)的翻譯能力。在本文中,我們針對目前的自動評測方法在翻譯多樣性方面的的缺陷做出了改進。由于人工翻譯的昂貴代價,我們能夠獲取到的參考譯文是有限的,因此,本文提出了一種從有限的資源中獲取到更多信息的自動方法。首先,我們探索了不同的評價方法以及選擇不同的參考譯文對機器翻譯系統(tǒng)的影響;其次,對已有的參考譯文進行拓展,對于部分存在多個參考譯文的數(shù)據(jù)集,通過構(gòu)建一個參考譯文圖以獲取更多的信息,對于大部分只存在一個參考譯文的數(shù)據(jù)集,利用外部單語資源英文復述表,對當前的參考譯文進行拓展,得到一個含有更加豐富信息的參考譯文圖;最后,通過從參考譯文圖中尋找最優(yōu)路徑以提高效率,并將該自動方法應(yīng)用到已有的自動評測方法中,幫助提升機器翻譯系統(tǒng)本身的翻譯能力。實驗結(jié)果表明,本文提出的從有限的資源中獲取到更多信息的自動方法能夠更快地獲取到更加豐富且多樣的參考譯文信息。對于多個參考譯文的拓展,能夠?qū)C器譯文作出更加準確的評價;對于單個參考譯文的拓展,不僅能夠更準確地評價系統(tǒng)譯文,同時還減小了使用不同譯文帶來的系統(tǒng)性能的差異;將該方法應(yīng)用于翻譯系統(tǒng)的訓練過程,有效的提高了翻譯系統(tǒng)能力。
[Abstract]:With the rapid development of science and technology, machine translation technology has been paid more and more attention. Its theoretical research and practical application have become a hot spot in the field of machine learning. The efficiency of machine translation has been improved obviously and the cost is low. Therefore, the research of automatic translation technology (machine translation technology) has become a very important research direction. With the rapid development of machine translation technology, The quality evaluation of machine translation has come into being. We need to measure the translation quality of the machine translation system, and it is not the ultimate goal to complete the machine translation. We want to know how much machine translation can help people, and at the same time, to have a general assessment of the quality of machine translation, and to learn about a better machine translation system. The traditional method of evaluating machine translations is manual evaluation, but because of its time-consuming, expensive and unrepeatable results, Has been partially replaced by automated evaluation methods. The current mainstream automated evaluation methods use the same strategy: compare machine translations to one or more manual translations (reference translations), And think that a machine translation that is very similar to the reference translation is certainly more accurate than those that differ greatly from the reference translation. However, due to the diversity of semantics and expressions, A sentence in the source language may have many different correct translation results (i.e., translation diversity, and the reference translation is limited, which can lead in some cases to an accurate assessment of the quality of the translation by automatic evaluation methods. At the same time, this problem also limits the translation ability of machine translation systems. In this paper, we make improvements to the shortcomings of the current automated evaluation methods in terms of translation diversity. The number of reference translations we can obtain is limited, so this paper proposes an automatic method for obtaining more information from limited resources. We explore the influence of different evaluation methods and the selection of different reference translations on the machine translation system. Secondly, we extend the existing reference translations to some data sets with multiple reference translations. By constructing a reference translation map to obtain more information, for most of the data sets where there is only one reference translation, the current reference translation can be extended by using the external monolingual resource English retelling table. A reference translation map with more information is obtained. Finally, by searching for the optimal path from the reference translation map to improve efficiency, the automatic method is applied to the existing automatic evaluation methods. Help to improve the translation capabilities of the machine translation system itself. The automatic method of obtaining more information from limited resources in this paper can obtain more abundant and diverse reference translation information more quickly. For the expansion of multiple reference translations, the machine translation can be evaluated more accurately. For the expansion of a single reference translation, it can not only evaluate the system translation more accurately, but also reduce the system performance difference caused by using different translation, and apply this method to the training process of translation system. Effectively improve the translation system ability.
【學位授予單位】：南京大學
【學位級別】：碩士
【學位授予年份】：2017
【分類號】：H085

【相似文獻】

相關(guān)期刊論文前10條

1 吳倩;謝旭升;;譯文質(zhì)量與譯者傳統(tǒng)角色的顛覆[J];伊犁師范學院學報(社會科學版);2011年02期

2 肖安法;;論譯文質(zhì)量的檢驗[J];常州工學院學報(社科版);2013年04期

3 徐盛桓;譯文質(zhì)量評估的數(shù)學模型[J];華南師范大學學報(社會科學版);1987年04期

4 熱外都拉,李培環(huán);對一些譯文的意見[J];語言與翻譯;1989年01期

5 李立達;談“雅”及譯文的“雅”處理[J];內(nèi)江師范學院學報;1990年01期

6 曾謝巖;談?wù)剣H會議通知的譯文質(zhì)量[J];中國科技翻譯;1993年02期

7 劉憲軍;地方外宣資料譯文質(zhì)量亟待提高[J];中國科技翻譯;1998年02期

8 姚雪梅;;文學譯文的翻譯策略分析:如何關(guān)照譯文的讀者[J];芒種;2012年13期

9 劉精香;一篇珠圓玉潤的譯文——淺評How to Mark a Book的漢譯[J];大學英語;1993年04期

10 韓子滿;過猶不及——淺論譯文的歸化問題[J];外語教學;2000年02期

相關(guān)會議論文前9條

1 葉立林;;譯文質(zhì)量的主觀評價[A];貴州省翻譯工作者協(xié)會第6屆會員代表大會暨2007年翻譯學術(shù)年會論文集[C];2007年

2 侯敏;侯敞;;漢英機譯系統(tǒng)譯文質(zhì)量的評測[A];世紀之交的中國應(yīng)用語言學研究——第二屆全國語言文字應(yīng)用學術(shù)研討會論文集[C];1998年

3 朱憲超;;十年翻譯路(2000-2010)——談?wù)勎覍Ψg工作的幾點認識[A];全國首屆翻譯碩士(MTI)教育與翻譯產(chǎn)業(yè)研討會論文集[C];2009年

4 周玲;;理解對譯者意味著什么[A];國際交流學院科研論文集（第四期）[C];1997年

5 董振東;董強;;機譯研究中的一些誤區(qū)——從譯文質(zhì)量分析看[A];自然語言理解與機器翻譯——全國第六屆計算語言學聯(lián)合學術(shù)會議論文集[C];2001年

6 陳絳;;評《中國的現(xiàn)代化》的名詞翻譯——兼及若干譯本的譯名問題[A];近代中國（第二輯）[C];1991年

7 陳可;;論高語速語境下中英同傳的簡約處理[A];中國翻譯學學科建設(shè)高層論壇摘要[C];2013年

8 張霄軍;楊云峰;;英語否定句的四種機譯質(zhì)量評析[A];機器翻譯研究進展——2002年全國機器翻譯研討會論文集[C];2002年

9 靳寧;賈德江;;再論關(guān)聯(lián)理論對翻譯的解釋力[A];中國英漢語比較研究會第七次全國學術(shù)研討會論文集[C];2006年

相關(guān)重要報紙文章前9條

1 馮威;譯著質(zhì)量：一個死角？[N];中國新聞出版報;2005年

2 林曉琴福建師范大學外國語學院;生態(tài)翻譯學:“三位一體”，譯者主導[N];中國社會科學報;2012年

3 蔣一帆;郵集翻譯的質(zhì)量標準[N];中國集郵報;2009年

4 張衛(wèi)族;《杰克·韋爾奇自傳》翻譯硬傷多得令人憂慮[N];中國新聞出版報;2003年

5 王鳴陽;談科技翻譯質(zhì)量[N];中華讀書報;2002年

6 本報記者田小滿;學者解析當下學術(shù)翻譯癥結(jié)[N];中國社會科學報;2011年

7 葛崇岳;也說文學翻譯信、達、雅[N];安徽日報;2004年

8 勿罔;翻譯大獎外行勝內(nèi)行[N];中華讀書報;2001年

9 林會敏（書評人）;譯海里的夜行燈塔[N];中國圖書商報;2006年

相關(guān)碩士學位論文前10條

1 徐娟;“南京市電子地圖地名翻譯”項目審校報告[D];南京理工大學;2015年

2 王淼;傳播學視角下的《易經(jīng)》翻譯研究[D];華北電力大學;2015年

3 梁瑾璇;漢語博物館文本英譯問題及其對策研究[D];山西大學;2015年

4 奎地;論計算機輔助譯文質(zhì)量控制[D];青島科技大學;2016年

5 丁國慶;視點轉(zhuǎn)換手法在《鹿王》視譯實踐中的運用[D];吉林大學;2016年

6 林新靖;英語節(jié)奏視角下的漢英筆譯效果研究[D];北京外國語大學;2016年

7 徐夢云;漢英同傳中脫離源語語言外殼對譯文質(zhì)量的影響[D];北京外國語大學;2016年

8 馬瑞雪;不同耳機佩戴方式對英漢同傳質(zhì)量影響的比較研究[D];北京外國語大學;2016年

9 李瑩;論發(fā)言人部分脫稿對英中帶稿和無稿同傳的影響[D];北京外國語大學;2016年

10 王威;融合檢索技術(shù)的譯文推薦系統(tǒng)的研究與實現(xiàn)[D];北京工業(yè)大學;2016年

，

本文編號：1662246

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/wenyilunwen/yuyanyishu/1662246.html

上一篇：基于語料庫的現(xiàn)代漢語連詞“而且”與“并且”研究
下一篇：關(guān)于計算機輔助普通話水平測試偏差復審問題

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于參考譯文圖的機器翻譯質(zhì)量評價方法研究