構(gòu)建集成系統(tǒng):基于規(guī)則與統(tǒng)計(jì)數(shù)據(jù)的機(jī)器翻譯
發(fā)布時(shí)間:2018-07-08 07:43
本文選題:機(jī)器翻譯 + 人工智能��; 參考:《廣東商學(xué)院》2012年碩士論文
【摘要】:機(jī)器翻譯的發(fā)展從最早提出理論設(shè)想到今天已經(jīng)經(jīng)歷了六十多年的歷史了。如今主流的機(jī)器翻譯算法主要分成兩大陣營(yíng):基于規(guī)則以及基于統(tǒng)計(jì)數(shù)據(jù)的機(jī)器翻譯�;谝�(guī)則的機(jī)器翻譯核心是依賴于預(yù)先人工設(shè)置的語(yǔ)法規(guī)則模塊作為語(yǔ)法分析的憑據(jù);而對(duì)于基于統(tǒng)計(jì)數(shù)據(jù)的機(jī)器翻譯來(lái)說(shuō),翻譯的核心就是網(wǎng)絡(luò)爬蟲(chóng)的文件掃描歸類機(jī)制,以及該機(jī)制所創(chuàng)建的動(dòng)態(tài)參考數(shù)據(jù)庫(kù)。也就是說(shuō),,基于規(guī)則的機(jī)器翻譯是模塊性系統(tǒng),而基于統(tǒng)計(jì)數(shù)據(jù)的機(jī)器翻譯是基于過(guò)程類系統(tǒng)。 本論文從喬姆斯基語(yǔ)法的視角下闡述基于規(guī)則的機(jī)器翻譯系統(tǒng)獨(dú)特的模塊化處理優(yōu)勢(shì)以及在具體自然語(yǔ)言處理上的不足,并從奈達(dá)對(duì)翻譯過(guò)程的理論的視角下分析基于統(tǒng)計(jì)數(shù)據(jù)的機(jī)器翻譯系統(tǒng)的過(guò)程優(yōu)勢(shì)以及語(yǔ)法分析不穩(wěn)定的劣勢(shì)。本論文通過(guò)結(jié)合基于規(guī)則的翻譯系統(tǒng)的“圖書(shū)館”和“語(yǔ)法分析器”以及基于統(tǒng)計(jì)數(shù)據(jù)的翻譯系統(tǒng)的“爬蟲(chóng)”(也稱漫游)機(jī)制來(lái)建立一個(gè)集成模塊優(yōu)勢(shì)和過(guò)程優(yōu)勢(shì)的系統(tǒng),通過(guò)將圖書(shū)館的語(yǔ)法機(jī)制融入奈達(dá)的翻譯步驟來(lái)解決基于統(tǒng)計(jì)數(shù)據(jù)翻譯系統(tǒng)中的語(yǔ)法分析不確定性,彌補(bǔ)前者在自然語(yǔ)言處理上的不足以及后者在語(yǔ)法分析上的薄弱。本文最后勾勒了機(jī)器發(fā)展將來(lái)以圖書(shū)館和語(yǔ)法分析器為借鑒,以爬蟲(chóng)建立后備資料數(shù)據(jù)的趨勢(shì),并在系統(tǒng)和接口硬件上集成的趨勢(shì)以及展望
[Abstract]:The development of machine translation has gone through more than 60 years since it was first proposed. Nowadays, the mainstream machine translation algorithms are divided into two main camps: rule-based and statistical-based machine translation. The core of rule-based machine translation is to rely on the pre-set grammar rules module as the credentials of syntax analysis. For machine translation based on statistics, the core of translation is the file scanning and categorization mechanism of the web crawler. And the dynamic reference database created by this mechanism. That is to say, rule-based machine translation is a modular system, while statistical data based machine translation is a process-based system. From the perspective of Chomsky's grammar, this thesis expounds the unique modular processing advantages of rule-based machine translation system and its shortcomings in specific natural language processing. From the perspective of Nida's theory of translation process, this paper analyzes the process advantages of machine translation system based on statistical data and the disadvantage of unstable grammar analysis. In this paper, we combine the "library" and "parser" of the rule-based translation system and the "crawler" mechanism of the statistical data based translation system to establish a system that integrates the advantages of modules and processes. By incorporating the grammatical mechanism of library into Nida's translation steps, the uncertainty of grammatical analysis in the statistical data translation system is solved, and the deficiency of the former in natural language processing and the weakness in grammatical analysis of the latter are remedied. Finally, this paper outlines the trend of machine development in the future, using library and parser as reference, using reptile to build backup data, and integrating system and interface hardware.
【學(xué)位授予單位】:廣東商學(xué)院
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:H085
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 彭述初;;機(jī)器翻譯學(xué)科發(fā)展綜述[J];華中科技大學(xué)學(xué)報(bào)(社會(huì)科學(xué)版);2006年02期
2 魏任;;布拉格國(guó)際數(shù)理語(yǔ)言學(xué)和機(jī)器翻譯座談會(huì)[J];當(dāng)代語(yǔ)言學(xué);1965年06期
3 王宗炎;;日本機(jī)器翻譯研究近況[J];當(dāng)代語(yǔ)言學(xué);1984年01期
4 黃寧夏;英漢機(jī)譯系統(tǒng)語(yǔ)言比較功能處理水平分析[J];漳州師范學(xué)院學(xué)報(bào)(哲學(xué)社會(huì)科學(xué)版);2004年01期
5 魯孝賢;;機(jī)器翻譯語(yǔ)義排歧的方法[J];中國(guó)科技翻譯;2007年04期
6 潘娣;;機(jī)器翻譯的利弊以及發(fā)展前景[J];安徽文學(xué)(下半月);2008年09期
7 梊善;“語(yǔ)言學(xué)譯S狻痹又
本文編號(hào):2106716
本文鏈接:http://sikaile.net/wenyilunwen/hanyulw/2106716.html
最近更新
教材專著