天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

YHFT-DX寄存器文件的設(shè)計(jì)與實(shí)現(xiàn)

發(fā)布時(shí)間:2018-06-15 08:01

  本文選題:寄存器文件 + 全定制設(shè)計(jì); 參考:《國(guó)防科學(xué)技術(shù)大學(xué)》2012年碩士論文


【摘要】:YHFT-DX是采用65納米工藝自主研發(fā)的高頻、高性能32位定點(diǎn)超長(zhǎng)指令字?jǐn)?shù)字信號(hào)處理器,其中寄存器文件是該處理器的性能瓶頸和設(shè)計(jì)難點(diǎn)之一。本文根據(jù)YHFT-DX的總體結(jié)構(gòu)和性能要求,確定了對(duì)該多端口寄存器文件進(jìn)行全定制設(shè)計(jì)的技術(shù)路線,設(shè)計(jì)實(shí)現(xiàn)了一款13讀9寫、支持64位長(zhǎng)型數(shù)據(jù)的32×32位寄存器文件,并對(duì)寄存器文件進(jìn)行了可測(cè)性設(shè)計(jì)和低功耗設(shè)計(jì)。所設(shè)計(jì)的寄存器文件版圖面積為266×302μm2,在YHFT-DX數(shù)字信號(hào)處理器芯片中得到了應(yīng)用,流片后的芯片測(cè)試結(jié)果表明:典型條件下隨機(jī)讀寫的平均功耗為8mW,,最差條件下工作頻率可以達(dá)到800MHz,達(dá)到了設(shè)計(jì)目標(biāo)。 本文的主要貢獻(xiàn)和創(chuàng)新點(diǎn)集中體現(xiàn)在以下幾個(gè)方面: 1.對(duì)YHFT-DX寄存器文件進(jìn)行了功能設(shè)計(jì)、時(shí)序設(shè)計(jì)以及結(jié)構(gòu)設(shè)計(jì),確定了定向通路機(jī)制,避免了寫后讀數(shù)據(jù)相關(guān)。根據(jù)長(zhǎng)型數(shù)據(jù)訪問特點(diǎn),采用端口復(fù)用、分體布局技術(shù)在寄存器內(nèi)部把端口數(shù)目從13讀9寫減少為10讀6寫,將存儲(chǔ)陣列中端口數(shù)目和譯碼器數(shù)目減少了6個(gè),使版圖面積減少了22%。 2.采用全掃描設(shè)計(jì)方法來(lái)增加寄存器文件的可測(cè)試性,從而實(shí)現(xiàn)寄存器文件可觀察性、可控制性等可測(cè)試性設(shè)計(jì)目標(biāo),并從以下方面體現(xiàn)其可測(cè)性:從任一寫端口向任一寄存器寫入數(shù)據(jù)可觀察;從任一讀端口向任一寄存器文件讀數(shù)據(jù),或者任一讀端口通過定向通路向任一寫端口讀數(shù)據(jù)可觀察;輸入端口的控制信號(hào)實(shí)現(xiàn)對(duì)寫地址、寫使能、寫數(shù)據(jù)和讀數(shù)據(jù)的輸入和輸出工作狀態(tài)可控制。 3.采用邏輯優(yōu)化、操作數(shù)隔離、門控時(shí)鐘、混合閾值、多級(jí)譯碼、電路轉(zhuǎn)換等多種低功耗設(shè)計(jì)技術(shù),降低了動(dòng)態(tài)功耗和漏流功耗。典型條件下隨機(jī)讀寫平均功耗為8mW。 4.采用結(jié)構(gòu)化版圖設(shè)計(jì)減少了版圖面積。同時(shí)加入可測(cè)試設(shè)計(jì)后,通過改變電路結(jié)構(gòu)并運(yùn)用結(jié)構(gòu)化版圖設(shè)計(jì)方法,使得寄存器文件版圖面積比傳統(tǒng)版圖設(shè)計(jì)方法減少了15%。通過更優(yōu)的電路結(jié)構(gòu),提高了寄存器的性能,在譯碼、存儲(chǔ)和定向通路中使用了低閾值技術(shù)降低了延時(shí),頻率在最差條件下可以達(dá)到800MHz。
[Abstract]:YHFT-DX is a high frequency, high performance 32 bit fixed point ultra long instruction word digital signal processor developed by 65 nm process. Register file is one of the performance bottlenecks and design difficulties of the processor. According to the overall structure and performance requirements of YHFT-DX, the technical route of fully customizing the multi-port register file is determined in this paper. A 32 脳 32 bit register file with 13 read and 9 writes and supporting 64 bit long data is designed and implemented. The register file is designed for testability and low power consumption. The designed register file has a layout area of 266 脳 302 渭 m ~ 2, which has been applied in YHFT-DX digital signal processor chip. The chip test results after streaming show that the average power consumption of random reading and writing is 8 MW under typical conditions, and the working frequency can reach 800 MHz under the worst condition, which achieves the design goal. The main contributions and innovations of this paper are embodied in the following aspects: 1. The function design, timing design and structure design of the YHFT-DX register file are carried out, and the directional path mechanism is determined to avoid the post-write data correlation. According to the characteristics of long data access, port multiplexing is adopted. The split layout technique reduces the number of ports from 13 read 9 write to 10 read 6 write in the register, and reduces the number of ports and decoders in the memory array by 6. Reduce the layout area by 22.2. The full scan design method is used to increase the testability of register files, so as to realize the testability design goals of register files, such as observability, controllability, etc. The testability can be observed by writing data from any write port to any register, reading data from any read port to any register file, or reading data from any read port to any write port through a directional path. Input port control signal to write address, write enable, write data and read data input and output working state can be controlled. 3. Low power design techniques such as logic optimization, Operand isolation, gated clock, hybrid threshold, multistage decoding and circuit conversion are used to reduce dynamic power consumption and leakage power consumption. Under typical conditions, the average power consumption of random reading and writing is 8 MW. 4. Structural layout design reduces layout area. After adding testability design, by changing the circuit structure and using the structural layout design method, the register file layout area is reduced by 15% compared with the traditional layout design method. The performance of the register is improved by better circuit structure, and the delay is reduced by using low threshold technique in decoding, storage and orientation paths, and the frequency can reach 800MHz under the worst conditions.
【學(xué)位授予單位】:國(guó)防科學(xué)技術(shù)大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP332

【參考文獻(xiàn)】

相關(guān)期刊論文 前3條

1 琚小明;姚慶棟;史冊(cè);洪享;周莉;;一種新的減少媒體處理器中寄存器文件復(fù)雜度的方法[J];電路與系統(tǒng)學(xué)報(bào);2006年01期

2 簡(jiǎn)貴胄,葛寧,馮重熙;靜態(tài)時(shí)序分析方法的基本原理和應(yīng)用[J];計(jì)算機(jī)工程與應(yīng)用;2002年14期

3 溫璞;楊學(xué)軍;;V-PIM中低功耗分體多端口向量寄存器文件設(shè)計(jì)[J];計(jì)算機(jī)工程與應(yīng)用;2006年04期



本文編號(hào):2021315

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2021315.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶b9747***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com
最近中文字幕高清中文字幕无| 精品人妻一区二区三区四在线| 国产精品国三级国产专不卡| 色偷偷亚洲女人天堂观看| 日韩无套内射免费精品| 台湾综合熟女一区二区| 夫妻激情视频一区二区三区| 东北女人的逼操的舒服吗| 欧美大粗爽一区二区三区| 日韩和欧美的一区二区三区| 女人精品内射国产99| 国产精品日韩欧美第一页| 国产丝袜极品黑色高跟鞋| 日系韩系还是欧美久久| 加勒比日本欧美在线观看| 风间中文字幕亚洲一区| 日本高清中文精品在线不卡| 在线日韩中文字幕一区 | 日本精品理论在线观看| 五月婷婷欧美中文字幕| 日本三区不卡高清更新二区| 亚洲精品国产精品日韩| 久七久精品视频黄色的| 欧洲一级片一区二区三区| 好吊妞视频这里有精品| 国产不卡的视频在线观看| 国产精品欧美一区二区三区不卡| 微拍一区二区三区福利| 人妻少妇av中文字幕乱码高清| 色婷婷日本视频在线观看| 熟女体下毛荫荫黑森林自拍| 日韩精品视频免费观看| 伊人天堂午夜精品草草网| 免费福利午夜在线观看| 亚洲日本中文字幕视频在线观看 | 国产性色精品福利在线观看| a久久天堂国产毛片精品| 欧美高潮喷吹一区二区| 国产高清一区二区白浆| 日本精品最新字幕视频播放| 内用黄老外示儒术出处|