高性價(jià)比DSP中指令Cache的研究與設(shè)計(jì)
[Abstract]:System throughput is an important measure of processor performance, which is often limited by the capabilities of CPU. Studies have shown that the price difference between the fast memory chip and the next fastest speed level can be as high as 50% or 100%, but the memory can only get an additional 20% speed. Unfortunately, the speed of the CPU cannot be increased accordingly. So in order to develop a kind of DSP processor with high performance-price ratio, the design of Cache has become a key factor to improve the performance of processor system. The research of Cache has become a hot topic in the development of DSP processor. In this paper, the design of "two-stage buffer" architecture is presented for 32-bit fixed-point DSP processor AXP32 (the main frequency of AXP32 can be up to 150 MHz, peripheral frequency up to 40 MHz),). In order to improve the speed of CPU to refer to external memory, an instruction Cache, is inserted between CPU and external memory, which is called "first stage cache". The design capacity of the instruction Cache is 512B, the block size is 16B, and the image rule is the direct image. According to the basic flow of digital IC design, the instruction Cache is divided into two modules: data part and control part. The data part mainly realizes the function of searching and comparing, while the control part realizes the next operation of instruction Cache according to the lookup result. The design of the "second level cache" is mainly based on the external SPI FLASH,. In order to speed up the communication between the DSP kernel and the external FLASH, an asynchronous FIFO. is inserted between the SPI peripheral and the DSP kernel. The asynchronous FIFO uses Graycode count to synchronize the read / write pointer, and gives a detailed circuit design and description for the logic of the read-write address and the judgment logic of the empty full flag. In the design of this subject, we use Verilog programming language to realize the design of this subject at RTL level, and use the NC-Verilog simulation tool of Cadence to simulate the function of instruction Cache and asynchronous FIFO. After the simulation is passed, Synopsys's Design Compiler synthesis tool is used to optimize the code design. The results are as follows: the total area of instruction Cache is 0.351 mm2, the power consumption is 38.85 mW, the maximum clock frequency is 150 MHz; asynchronous FIFO, the total area is 0.011 mm2, the power consumption is 452.95 渭 W, and the read clock frequency is up to 100 MHz..
【學(xué)位授予單位】:湘潭大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP332
【參考文獻(xiàn)】
相關(guān)期刊論文 前5條
1 程光偉;劉大偉;;基于FPGA的異步FIFO的研究和設(shè)計(jì)[J];工業(yè)儀表與自動(dòng)化裝置;2013年02期
2 王曉燕;楊先文;陳海民;;密碼嵌入式處理器中高速緩存的研究與設(shè)計(jì)[J];計(jì)算機(jī)工程與設(shè)計(jì);2012年08期
3 鄔春學(xué);華樂;;異步FIFO控制器的設(shè)計(jì)[J];微型機(jī)與應(yīng)用;2012年04期
4 楊曉剛;屈凌翔;張樹丹;;DSP中指令Cache的低功耗設(shè)計(jì)[J];計(jì)算機(jī)工程與應(yīng)用;2011年32期
5 李紅橋;肖建青;張洵穎;龔龍慶;;流水線處理器中Cache模塊的設(shè)計(jì)[J];科學(xué)技術(shù)與工程;2010年32期
相關(guān)碩士學(xué)位論文 前10條
1 朱志強(qiáng);基于28nm工藝的低功耗觸發(fā)器設(shè)計(jì)及優(yōu)化[D];安徽大學(xué);2015年
2 張波;基于SOC異步FIFO的設(shè)計(jì)與形式驗(yàn)證[D];西安電子科技大學(xué);2015年
3 歐陽邦見;多核X-DSPX共享存儲(chǔ)部件的設(shè)計(jì)與實(shí)現(xiàn)[D];國防科學(xué)技術(shù)大學(xué);2013年
4 袁瀟;DSP處理器中數(shù)據(jù)Cache的設(shè)計(jì)和驗(yàn)證[D];西安電子科技大學(xué);2013年
5 江滔;一種COFDM無線視頻傳輸系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[D];華中科技大學(xué);2013年
6 付波;基于dsp的兩級(jí)cache低功耗研究與實(shí)現(xiàn)[D];南昌大學(xué);2012年
7 李樺林;多核SoC中高帶寬片外數(shù)據(jù)交換與處理技術(shù)研究[D];合肥工業(yè)大學(xué);2012年
8 徐文昌;SoC調(diào)試跟蹤系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[D];西安電子科技大學(xué);2012年
9 李帥;基于Vera語言的GPS/北斗Ⅱ雙;鶐酒尿(yàn)證方法及實(shí)現(xiàn)[D];暨南大學(xué);2011年
10 劉彬;異步FIFO的設(shè)計(jì)與形式化驗(yàn)證[D];國防科學(xué)技術(shù)大學(xué);2011年
,本文編號(hào):2358411
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2358411.html