一種降低迭代運算誤差的FMA結(jié)構(gòu)研究

發(fā)布時間：2018-05-04 18:20

本文選題：浮點融合乘加 + 迭代運算��；參考：《國防科學(xué)技術(shù)大學(xué)》2012年碩士論文

【摘要】：在新一代面向應(yīng)用的多核微處理器的設(shè)計中，設(shè)計者引入可以顯著提高浮點運算性能的融合乘加部件（Fused Multiply-Add，F(xiàn)MA）。與浮點乘法、浮點加法獨立執(zhí)行所不同，融合乘加部件在執(zhí)行過程中只有一次舍入操作，降低了運算結(jié)果的誤差。但是，迭代運算的誤差隨著運算次數(shù)的增加而顯著增大，從而嚴(yán)重影響運算結(jié)果的可靠程度。本文在分析傳統(tǒng)融合乘加結(jié)構(gòu)（T-FMA）的基礎(chǔ)上研究改進(jìn)的降低迭代運算誤差的FMA結(jié)構(gòu)（IM-FMA）。本文所取得的主要研究成果包括以下幾個方面： 1、研究了當(dāng)前FMA的常見算法和結(jié)構(gòu)以及在大規(guī)模數(shù)值計算中運算誤差所造成的影響，分析了浮點數(shù)據(jù)系統(tǒng)的分布特點——規(guī)格化數(shù)據(jù)的唯一性和有限區(qū)間內(nèi)的均勻性、計算推導(dǎo)出了不同數(shù)據(jù)格式在對數(shù)分布下的MRRE和ARRE，綜述了IEEE754-2008標(biāo)準(zhǔn)所規(guī)定的二進(jìn)制數(shù)據(jù)格式、異常處理以及誤差分析的基礎(chǔ)知識。 2、建立了適用于浮點迭代運算誤差分析的數(shù)學(xué)模型，分析推導(dǎo)了FMA部件在執(zhí)行迭代運算過程中產(chǎn)生誤差的決定因素——運算次數(shù)n和舍入單元u。在運算次數(shù)相同的情況下，舍入單元越小迭代運算的誤差越小，所以降低迭代運算誤差的根本方法在于降低舍入單元。 3、提出了一種降低迭代運算誤差的FMA結(jié)構(gòu)（IM-FMA）。本文摒棄了以往在執(zhí)行單次運算和迭代運算過程中都只保留有限位寬（53位）尾數(shù)的做法，增加了從規(guī)格化移位操作之后、舍入操作之前的中間結(jié)果（106位）到加數(shù)的反饋通路，并通過特定的選擇模塊來區(qū)別參與單次運算模式和迭代運算模式的加數(shù)來源。同時，研究了IM-FMA結(jié)構(gòu)中關(guān)鍵子模塊——乘法模塊、加數(shù)對階移位和前導(dǎo)零預(yù)測所采用的算法、結(jié)構(gòu)和邏輯電路實現(xiàn)方法。 4、驗證測試了所實現(xiàn)的T-FMA和IM-FMA，然后執(zhí)行兩個迭代測試序列分別得到T-FMA和IM-FMA的運算結(jié)果。在測試序列1下T-FMA運算結(jié)果的絕對誤差為252100、相對誤差為1.1102E-14，，IM-FMA運算結(jié)果的絕對誤差為0、相對誤差也為0；在測試序列2下T-FMA的絕對誤差為(2521)100、相對誤差為1.1102E-14，IM-FMA的絕對誤差為100、相對誤差為2.4652E-30，所以IM-FMA與T-FMA相比在降低迭代運算誤差方面是有效的。
[Abstract]:In the design of a new generation of application-oriented multi-core microprocessors, the designer introduces the fused multiplicative addition component (Fused Multiply-Add-FMAA), which can significantly improve the performance of floating-point computing. Unlike floating-point multiplication, floating-point addition performs independently, the fused multiplicative unit rounding only once in the execution process, thus reducing the error of the result. However, the error of iterative operation increases significantly with the increase of the number of operations, which seriously affects the reliability of the results. In this paper, based on the analysis of the traditional fusion multiplication and addition structure (T-FMA), an improved FMA structure to reduce the error of iterative operation is studied. The main research results of this paper include the following aspects: 1. The common algorithms and structures of FMA and the influence of operational errors in large-scale numerical computation are studied, and the distribution characteristics of floating point data system are analyzed, such as the uniqueness of normalized data and the uniformity in finite interval. The MRRE and ARREs of different data formats under logarithmic distribution are derived, and the basic knowledge of binary data format, exception handling and error analysis in IEEE754-2008 standard is summarized. 2. A mathematical model for the error analysis of floating-point iterative operation is established, and the determinants of errors in the iterative operation of FMA are analyzed and deduced, which are the number of operations n and the rounding unit u. In the case of the same number of operations, the smaller the rounding unit, the smaller the error of iterative operation, so the fundamental method to reduce the error of iterative operation is to reduce the rounding unit. 3. A FMA structure to reduce the error of iterative operation is proposed. In this paper, we abandon the practice of retaining only the finite bit width of 53 bits) Mantissa in the process of single operation and iterative operation, and increase the feedback path from the intermediate result of 106 bits rounding before the operation after the normalized shift operation to the addition. The addition sources of single operation mode and iterative operation mode are distinguished by specific selection modules. At the same time, the algorithm, structure and realization method of logic circuit are studied for the key sub-module of IM-FMA, multiplication module, addition pair shift and leading zero prediction. 4. The implemented T-FMA and IM-FMA are verified and tested, and then two iterative test sequences are executed to get the results of T-FMA and IM-FMA. Under test sequence 1, the absolute error of T-FMA operation is 252100, the relative error is 1.1102E-14 IM-FMA, the relative error is 0, and the absolute error of T-FMA under test sequence 2 is 2521 100, the relative error is 1.1102E-14IM-FMA, the absolute error is 1.1102E-14IM-FMA. The relative error is 2.4652E-30, so IM-FMA is effective in reducing iterative error compared with T-FMA.
【學(xué)位授予單位】：國防科學(xué)技術(shù)大學(xué)
【學(xué)位級別】：碩士
【學(xué)位授予年份】：2012
【分類號】：TP332

【共引文獻(xiàn)】

相關(guān)期刊論文前10條

1 許盛;高明倫;李偉;婁孝祥;;一種可重用MCU核測試平臺的設(shè)計[J];電子測量技術(shù);2008年01期

2 許盛;易浩勇;;異步FIFO在USB2.0控制器中的應(yīng)用[J];電子測量技術(shù);2009年08期

3 魏芳;劉志軍;馬克杰;;基于Verilog HDL的異步FIFO設(shè)計與實現(xiàn)[J];電子技術(shù)應(yīng)用;2006年07期

4 張騁;蔡惠智;何國建;;基于CPCI總線多DSP系統(tǒng)的高速主機接口設(shè)計[J];電子技術(shù)應(yīng)用;2007年12期

5 畢衛(wèi)紅;楊焰文;張燕君;;分布式光纖溫度傳感系統(tǒng)中數(shù)據(jù)處理的FPGA設(shè)計[J];電子技術(shù)應(yīng)用;2008年09期

6 周匯;俞軍;程君俠;華霞;;浮點乘法驗證中的一種邊界點測試方法[J];復(fù)旦學(xué)報(自然科學(xué)版);2008年01期

7 何清平;劉佐濂;江建鈞;;Verilog語言綜合問題研究[J];廣州大學(xué)學(xué)報(自然科學(xué)版);2006年05期

8 王洋;劉衛(wèi)東;于崗;;基于AMBA APB總線NandFlash控制器的設(shè)計研究[J];電子設(shè)計工程;2011年20期

9 丁世勇;譚文文;李桂英;;基于FPGA的以太網(wǎng)MAC控制器的設(shè)計與實現(xiàn)[J];電子設(shè)計工程;2011年21期

10 鄒連英;余峰;;Xilinx DDR2 IP核控制器設(shè)計[J];艦船電子工程;2012年06期

相關(guān)會議論文前1條

1 張均軍;葉星寧;;一種充電管理芯片的可測性設(shè)計[A];四川省電子學(xué)會半導(dǎo)體與集成技術(shù)專委會2006年度學(xué)術(shù)年會論文集[C];2006年

相關(guān)碩士學(xué)位論文前10條

1 劉仍穩(wěn);ASIP的流水線以及參量化設(shè)計[D];西安電子科技大學(xué);2011年

2 張欣舒;基于FPGA的實時影像監(jiān)控系統(tǒng)的研究與實現(xiàn)[D];杭州電子科技大學(xué);2011年

3 張建龍;安全SD卡SoC芯片的SPI接口設(shè)計與實現(xiàn)[D];遼寧大學(xué);2011年

4 邱曉光;AVS視頻解碼芯片功能驗證平臺的研究與實現(xiàn)[D];山東大學(xué);2011年

5 馬成海;兼容AMBA4的DDRn SDRAM控制器IP核的前端設(shè)計[D];山東大學(xué);2011年

6 李風(fēng)志;基于OpenRISC 1200的AVS解碼器軟件控制器的實現(xiàn)[D];山東大學(xué);2011年

7 趙靜;基于FPGA的數(shù)字頻譜分析儀的設(shè)計與實現(xiàn)[D];山東大學(xué);2011年

8 王曉暉;SDH網(wǎng)絡(luò)分析儀發(fā)送端電路的設(shè)計與實現(xiàn)[D];杭州電子科技大學(xué);2009年

9 李萌;邊界掃描測試與故障診斷系統(tǒng)開發(fā)[D];哈爾濱工業(yè)大學(xué);2010年

10 高健博;以太網(wǎng)二層交換芯片的MAC地址交換器的設(shè)計與實現(xiàn)[D];哈爾濱工業(yè)大學(xué);2011年

本文編號：1844079

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1844079.html

上一篇：基于改進(jìn)粒子群優(yōu)化算法的虛擬機遷移選擇策略研究
下一篇：一種基于連續(xù)頁面歸并回收的旁路轉(zhuǎn)換緩沖器

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

一種降低迭代運算誤差的FMA結(jié)構(gòu)研究