高性能硬件加速器的實(shí)現(xiàn)
[Abstract]:In modern complex digital signal processing, with the increasing complexity of the algorithm and the amount of data to be processed, the general-purpose processor has been unable to meet the requirements of high-speed real-time data processing in some specific applications. Heterogeneous multi-core systems can assign different computing tasks to different processor cores for parallel processing, accelerate task execution, provide a more efficient and flexible processing mechanism, and meet the needs of various applications. Hardware accelerators can improve the speed of scientific computation for specific applications. Therefore, the architecture of heterogeneous multicore system with hardware accelerator has emerged as the times require. Some multicore processors perform accelerated operations by integrating specialized accelerators for certain applications, but their flexibility is not high. With the advent of reconfigurable technology, the application of reconfigurable technology to hardware accelerators can make up for the shortcomings of performance and flexibility in general computing and software computing, and provide a better solution for complex high-speed signal processing. According to the development trend, this paper studies reconfigurable computing technology, hardware accelerator and how to integrate hardware accelerator in heterogeneous multi-core system. In this paper, the following three aspects are studied: (1) according to the requirements of the application target, the application characteristics of high-density computing are analyzed, and the characteristics of matrix algorithm are analyzed, and the high degree of parallelism is analyzed. The matrix calculation method which can improve the system performance effectively, and the algorithm of matrix operation type is optimized according to the application target and application platform. A hybrid granularity parallel matrix inversion method with in-situ substitution is obtained. (2) based on the optimization algorithm and structure, the hardware architecture of the optimized matrix inversion algorithm is proposed. A reconfigurable high performance hardware accelerator for heterogeneous multi-core systems is designed. The hardware accelerator is mainly used for matrix operations in the field of high density computing. Especially, the inverse operation of matrix can efficiently perform the inversion of real matrix with single precision of 2n within order 128. (3) based on Xilinx V6 FPGA, the experimental verification and performance analysis of the designed hardware accelerator are carried out. The integration of the hardware accelerator in heterogeneous multicore system is introduced, and the high performance of the designed accelerator is verified.
【學(xué)位授予單位】:合肥工業(yè)大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP332
【參考文獻(xiàn)】
相關(guān)期刊論文 前7條
1 劉仲;田希;陳磊;;支持原位計(jì)算的高效三角矩陣乘法向量化方法[J];國防科技大學(xué)學(xué)報(bào);2014年06期
2 張啟英;劉亞剛;張淑艷;朱娟;;基于FPGA的硬件加速器設(shè)計(jì)的研究與應(yīng)用[J];計(jì)算機(jī)光盤軟件與應(yīng)用;2013年17期
3 許芳;席毅;陳虹;靳偉偉;;基于FPGA/Nios-Ⅱ的矩陣運(yùn)算硬件加速器設(shè)計(jì)[J];電子測量與儀器學(xué)報(bào);2011年04期
4 周杰;陳嘯洋;趙建勛;竇勇;;大矩陣QR分解的FPGA設(shè)計(jì)與實(shí)現(xiàn)[J];計(jì)算機(jī)工程與科學(xué);2010年10期
5 蘇濤,莊德靖,吳順君;一種SAR成像快速算法及其并行實(shí)現(xiàn)[J];西安電子科技大學(xué)學(xué)報(bào);2005年01期
6 譚道盛,溫啟愚;矩陣的任意分塊求逆及其應(yīng)用[J];四川大學(xué)學(xué)報(bào)(自然科學(xué)版);1999年01期
7 徐蘭;;復(fù)數(shù)矩陣的快速Givens變換[J];華東師范大學(xué)學(xué)報(bào)(自然科學(xué)版);1988年03期
相關(guān)博士學(xué)位論文 前5條
1 李東生;基于高密度計(jì)算的多核芯片設(shè)計(jì)關(guān)鍵技術(shù)研究[D];合肥工業(yè)大學(xué);2012年
2 王超;異構(gòu)多核可重構(gòu)片上系統(tǒng)關(guān)鍵技術(shù)研究[D];中國科學(xué)技術(shù)大學(xué);2011年
3 鄔貴明;FPGA矩陣計(jì)算并行算法與結(jié)構(gòu)[D];國防科學(xué)技術(shù)大學(xué);2011年
4 谷曉忱;并行蒙特卡羅計(jì)算硬件加速器的關(guān)鍵技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2010年
5 宋宇鯤;動態(tài)可重構(gòu)協(xié)處理器研究[D];合肥工業(yè)大學(xué);2006年
相關(guān)碩士學(xué)位論文 前6條
1 郭磊;矩陣運(yùn)算的硬件加速技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2010年
2 邵儀;基于FPGA的矩陣運(yùn)算固化實(shí)現(xiàn)技術(shù)研究[D];解放軍信息工程大學(xué);2010年
3 李本齋;PowerPC下H.264運(yùn)動估計(jì)硬件加速器研究[D];合肥工業(yè)大學(xué);2010年
4 陳迎春;DReNoC:基于片上網(wǎng)絡(luò)的動態(tài)可重構(gòu)計(jì)算系統(tǒng)研究與實(shí)現(xiàn)[D];合肥工業(yè)大學(xué);2010年
5 何瑩瑩;基于二維網(wǎng)格NoC的矩陣求逆加速實(shí)現(xiàn)[D];合肥工業(yè)大學(xué);2010年
6 林皓;基于FPGA的矩陣運(yùn)算實(shí)現(xiàn)[D];南京理工大學(xué);2007年
,本文編號:2280291
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2280291.html