片上多核同步單元的研究實現(xiàn)及片間擴展
[Abstract]:With the development of application requirements and chip manufacturing technology, more processor and memory resources can be integrated on a single chip, and the on-chip system is gradually developed from a single-core structure to a multi-core structure. The emergence of multi-core architecture not only improves performance, but also puts forward higher requirements for inter-core synchronization mechanism. To give full play to the processing power of each processor core in the multi-core chip, we need the support of efficient synchronization mechanism. X-DSP is a self-designed architecture and instruction set structure for high-performance multi-core DSP, developed by our university. It is mainly used in the field of signal and image processing. Multiple DSP cores and global cache, are integrated into the chip to communicate with high speed out of chip via PCIE interface. The multi-core architecture supports the parallel execution of multiple tasks, and the data communication among the tasks requires an efficient synchronization mechanism to ensure the correctness and efficiency of the execution. Based on the system structure of X-DSP, this paper uses distributed hardware synchronization unit to realize multi-core synchronization. At the same time, in order to make the off-chip processor core participate in the synchronization effectively, the interface extension based on PCIE is completed, and the PCIE-NI bridge is designed and implemented. The main contents and contributions of this paper are as follows: (1) the hardware synchronization scheme and the software synchronization scheme are analyzed and compared, and the hardware synchronization mechanism based on lock and fence is determined. By reducing the influence of synchronous operation on the normal memory access behavior, the synchronization efficiency is improved. (2) considering the architecture characteristics of X-DSP, a distributed hardware synchronization unit including hardware lock and fence is designed. Among them, the hardware lock has two working modes: the rotation lock and the queue rotation lock, which can effectively reduce the number of requests for lock acquisition. The hardware fence is released by broadcast, thus reducing the network hot issues caused by the serial release of the traditional fence. (3) the PCIE-NI transfer bridge is designed, and the AXI standard interface is realized. The protocol transfer between PBUS and DBI interface and NI interface designed by X-DSP makes the core of off-chip processor participate in synchronization effectively and realize data sharing between chip and chip. (4) Module level verification is completed based on hierarchical verification methodology. The system level verification and the joint test between the hardware synchronization unit and the PCIE-NI bridge are completed in the full chip system environment. The results of logic synthesis show that the design of this paper can meet the performance requirements.
【學(xué)位授予單位】:國防科學(xué)技術(shù)大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2015
【分類號】:TP332
【參考文獻】
相關(guān)期刊論文 前6條
1 陳書明;萬江華;魯建壯;劉仲;孫海燕;孫永節(jié);劉衡竹;劉祥遠;李振濤;徐毅;陳小文;;YHFT-QDSP:High-Performance Heterogeneous Multi-Core DSP[J];Journal of Computer Science & Technology;2010年02期
2 顏建峰;吳寧;;基于PCI總線的DMA高速數(shù)據(jù)傳輸系統(tǒng)[J];電子科技大學(xué)學(xué)報;2007年05期
3 Mick Posner;;快速實現(xiàn)基于AMBA 3 AXI協(xié)議的設(shè)計[J];電子設(shè)計應(yīng)用;2007年01期
4 蔣周良;權(quán)進國;林孝康;;AMBA總線新一代標準AXI分析和應(yīng)用[J];微計算機信息;2006年29期
5 汪東,馬劍武,陳書明;基于Gray碼的異步FIFO接口技術(shù)及其應(yīng)用[J];計算機工程與科學(xué);2005年01期
6 胡偉武,,夏培肅;順序一致共享存儲系統(tǒng)中的亂序執(zhí)行技術(shù)──基本理論[J];計算機學(xué)報;1997年06期
相關(guān)博士學(xué)位論文 前1條
1 賈小敏;多核處理器片上Cache訪問行為分析與優(yōu)化機制研究[D];國防科學(xué)技術(shù)大學(xué);2011年
相關(guān)碩士學(xué)位論文 前4條
1 梁天永;IP集成方案研究與DFI-AXI總線橋的設(shè)計[D];華南理工大學(xué);2010年
2 黃穎然;基于覆蓋率驗證方法的IP核測試平臺設(shè)計[D];西安電子科技大學(xué);2009年
3 黃冕;X處理器存儲一致性模型的研究與實現(xiàn)[D];國防科學(xué)技術(shù)大學(xué);2008年
4 陳石坤;多核處理器中CACHE一致性協(xié)議研究和實現(xiàn)[D];國防科學(xué)技術(shù)大學(xué);2005年
本文編號:2416136
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2416136.html