構建集成系統(tǒng)：基于規(guī)則與統(tǒng)計數(shù)據(jù)的機器翻譯

發(fā)布時間：2018-07-08 07:43

本文選題：機器翻譯 + 人工智能�。� 參考：《廣東商學院》2012年碩士論文

【摘要】：機器翻譯的發(fā)展從最早提出理論設想到今天已經(jīng)經(jīng)歷了六十多年的歷史了。如今主流的機器翻譯算法主要分成兩大陣營：基于規(guī)則以及基于統(tǒng)計數(shù)據(jù)的機器翻譯�；谝�(guī)則的機器翻譯核心是依賴于預先人工設置的語法規(guī)則模塊作為語法分析的憑據(jù)；而對于基于統(tǒng)計數(shù)據(jù)的機器翻譯來說，翻譯的核心就是網(wǎng)絡爬蟲的文件掃描歸類機制，以及該機制所創(chuàng)建的動態(tài)參考數(shù)據(jù)庫。也就是說，，基于規(guī)則的機器翻譯是模塊性系統(tǒng)，而基于統(tǒng)計數(shù)據(jù)的機器翻譯是基于過程類系統(tǒng)。本論文從喬姆斯基語法的視角下闡述基于規(guī)則的機器翻譯系統(tǒng)獨特的模塊化處理優(yōu)勢以及在具體自然語言處理上的不足，并從奈達對翻譯過程的理論的視角下分析基于統(tǒng)計數(shù)據(jù)的機器翻譯系統(tǒng)的過程優(yōu)勢以及語法分析不穩(wěn)定的劣勢。本論文通過結合基于規(guī)則的翻譯系統(tǒng)的“圖書館”和“語法分析器”以及基于統(tǒng)計數(shù)據(jù)的翻譯系統(tǒng)的“爬蟲”（也稱漫游）機制來建立一個集成模塊優(yōu)勢和過程優(yōu)勢的系統(tǒng)，通過將圖書館的語法機制融入奈達的翻譯步驟來解決基于統(tǒng)計數(shù)據(jù)翻譯系統(tǒng)中的語法分析不確定性，彌補前者在自然語言處理上的不足以及后者在語法分析上的薄弱。本文最后勾勒了機器發(fā)展將來以圖書館和語法分析器為借鑒，以爬蟲建立后備資料數(shù)據(jù)的趨勢，并在系統(tǒng)和接口硬件上集成的趨勢以及展望
[Abstract]:The development of machine translation has gone through more than 60 years since it was first proposed. Nowadays, the mainstream machine translation algorithms are divided into two main camps: rule-based and statistical-based machine translation. The core of rule-based machine translation is to rely on the pre-set grammar rules module as the credentials of syntax analysis. For machine translation based on statistics, the core of translation is the file scanning and categorization mechanism of the web crawler. And the dynamic reference database created by this mechanism. That is to say, rule-based machine translation is a modular system, while statistical data based machine translation is a process-based system. From the perspective of Chomsky's grammar, this thesis expounds the unique modular processing advantages of rule-based machine translation system and its shortcomings in specific natural language processing. From the perspective of Nida's theory of translation process, this paper analyzes the process advantages of machine translation system based on statistical data and the disadvantage of unstable grammar analysis. In this paper, we combine the "library" and "parser" of the rule-based translation system and the "crawler" mechanism of the statistical data based translation system to establish a system that integrates the advantages of modules and processes. By incorporating the grammatical mechanism of library into Nida's translation steps, the uncertainty of grammatical analysis in the statistical data translation system is solved, and the deficiency of the former in natural language processing and the weakness in grammatical analysis of the latter are remedied. Finally, this paper outlines the trend of machine development in the future, using library and parser as reference, using reptile to build backup data, and integrating system and interface hardware.
【學位授予單位】：廣東商學院
【學位級別】：碩士
【學位授予年份】：2012
【分類號】：H085

【相似文獻】

相關期刊論文前10條

1 彭述初;;機器翻譯學科發(fā)展綜述[J];華中科技大學學報(社會科學版);2006年02期

2 魏任;;布拉格國際數(shù)理語言學和機器翻譯座談會[J];當代語言學;1965年06期

3 王宗炎;;日本機器翻譯研究近況[J];當代語言學;1984年01期

4 黃寧夏;英漢機譯系統(tǒng)語言比較功能處理水平分析[J];漳州師范學院學報(哲學社會科學版);2004年01期

5 魯孝賢;;機器翻譯語義排歧的方法[J];中國科技翻譯;2007年04期

6 潘娣;;機器翻譯的利弊以及發(fā)展前景[J];安徽文學(下半月);2008年09期

7 梊善;“語言學譯S狻痹又

本文編號：2106716

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/wenyilunwen/hanyulw/2106716.html

上一篇：內蒙古俄羅斯語概況
下一篇：中韓被動句比較研究

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

構建集成系統(tǒng)：基于規(guī)則與統(tǒng)計數(shù)據(jù)的機器翻譯