三維處理器中計算資源動態(tài)共享技術(shù)研究

發(fā)布時間：2018-07-05 06:07

本文選題：多核 + 硅通孔　；參考：《國防科學技術(shù)大學》2012年碩士論文

【摘要】：隨著半導(dǎo)體工藝尺寸步入深亞微米和納米級別，尺寸的縮減逐漸接近物理極限。通過提升主頻來提高處理器性能的發(fā)展方向已經(jīng)停滯，微處理器體系結(jié)構(gòu)研究和實現(xiàn)轉(zhuǎn)移到采用多核與眾核結(jié)構(gòu)提高性能的道路上。但多核與眾核結(jié)構(gòu)并未解決功耗墻、存儲墻、片上互連延遲增加等問題，并使之進一步惡化，阻礙了微處理器性能的進一步提高。三維處理器通過采用三維集成電路技術(shù)使硅片與硅片上下直接堆疊，從而以較低成本獲得更多硅片資源，進而可以集成更多緩存來解決存儲墻問題。連接不同堆疊層次硅片的硅通孔具有低延遲、高通信帶寬等特性，可以用來解決納米工藝全局連線的問題。傳統(tǒng)多處理器和多計算機上的負載不均衡問題在三維多核處理器中也存在。由于三維處理器資源眾多，，部分核由于負載重導(dǎo)致部分資源成為性能瓶頸，而同一時刻其他核計算資源處于閑置狀態(tài)的場景更加常見。針對傳統(tǒng)二維多處理器中核間共享資源面臨的共享資源粒度粗、互連延遲長、可擴展性差等問題，本文提出了一種新的資源交叉的三維處理器結(jié)構(gòu)3DDRS，該結(jié)構(gòu)考慮了功耗與散熱平衡，同時支持計算資源動態(tài)共享。針對未來三維處理器更多層次堆疊和多種數(shù)量核心共享資源的需求，提出了3DDRS結(jié)構(gòu)針對多核三維處理器的擴展原則，同時提出了計算資源共享的關(guān)鍵技術(shù)。由于多任務(wù)執(zhí)行模式的一致性，本文在同時多線程模擬器SMTSIM的基礎(chǔ)上擴展3DDRS的性能模擬工具3DDRS-SIM。在3DDRS-SIM中重點實現(xiàn)了3DDRS結(jié)構(gòu)中動態(tài)共享計算資源的原理，并支持對多堆疊層次的資源共享的性能模擬。使用3DDRS-SIM對3DDRS結(jié)構(gòu)面臨計算密集型單任務(wù)和多任務(wù)應(yīng)用的性能進行了評估。實驗結(jié)果表明，基于多種計算資源共享的3DDRS結(jié)構(gòu)，使三維處理器的單線程最高性能平均提升23%，整體性能平均提升12%。三維處理器結(jié)構(gòu)3DDRS中多個堆疊層次計算資源的共享技術(shù)，能夠動態(tài)適應(yīng)負載的資源需求，提升簡單眾核設(shè)計下單線程應(yīng)用計算的性能和處理器的整體性能，是未來三維處理器設(shè)計的良好備選。
[Abstract]:As semiconductor process size steps into deep sub-micron and nano-scale, the size reduction is gradually approaching the physical limit. The development direction of improving processor performance by raising the main frequency has been stalled. The research and implementation of microprocessor architecture has been shifted to the path of improving performance by adopting multi-core and multi-core architecture. However, the multi-core and multi-core architecture has not solved the problems of power wall, storage wall, on-chip interconnect delay and so on, which further worsen and hinder the further improvement of microprocessor performance. By using 3D integrated circuit technology to stack up and down the silicon chip directly, the 3D processor can acquire more silicon chip resources at lower cost, and then integrate more buffers to solve the storage wall problem. The silicon through holes connected to different stacked layers of silicon have the characteristics of low delay and high communication bandwidth, which can be used to solve the problem of global connection in nanotechnology. Traditional multi-processor and multi-computer load imbalance problems also exist in three-dimensional multi-core processors. Due to the large number of 3D processor resources, part of the core due to heavy load leads to part of the performance bottleneck, while other computing resources at the same time is more common in the idle state scenario. Aiming at the problems of coarse-grained shared resources, long interconnect delay, poor scalability and so on, the traditional two-dimensional multi-processor system is faced with shared resources among cores. In this paper, a new 3D processor architecture, 3DDRSs, is proposed, which takes into account the balance between power consumption and heat dissipation, and supports dynamic sharing of computing resources. In order to meet the need of multilevel stacking and multiple core sharing resources in future 3D processors, the expansion principle of 3DDRS architecture for multi-core 3D processors is proposed, and the key technology of computing resource sharing is also presented. Because of the consistency of multitask execution mode, this paper extends the 3DDRS performance simulation tool 3DDRS-SIMs based on the simultaneous multithreading simulator SMTSIM. The principle of dynamically sharing computing resources in 3DDRS architecture is implemented in 3DDRS-SIM, and the performance simulation of multi-stack level resource sharing is supported. The performance of 3D DDRS architecture facing computationally intensive single-task and multi-task applications is evaluated using 3DDRS-SIM. The experimental results show that based on the 3DDRS architecture of multi-computing resource sharing, the single thread maximum performance of 3D processor is increased by 23 per thread on average, and the overall performance is increased by an average of 12. The sharing technology of multi-layer computing resources in 3D processor architecture 3DDRS can dynamically adapt to the resource requirements of the load and improve the performance of the simple multi-core design send order thread application computing performance and the overall performance of the processor. It is a good candidate for future 3D processor design.
【學位授予單位】：國防科學技術(shù)大學
【學位級別】：碩士
【學位授予年份】：2012
【分類號】：TP332

【相似文獻】

相關(guān)期刊論文前10條

1 C.A.(Al)Dennis ,陳瑞源 ,力康;公用信號處理器的應(yīng)用和設(shè)計[J];系統(tǒng)工程與電子技術(shù);1987年06期

2 Robert Cravotta;;可配置處理器應(yīng)用日趨紅火[J];電子設(shè)計技術(shù);2003年11期

3 劉磊;鄒候文;唐屹;;一種可編程安全處理器體系結(jié)構(gòu)的研究與實現(xiàn)[J];廣州大學學報(自然科學版);2006年04期

4 張錚;趙榮彩;顏峻;邰銘;陳科;;網(wǎng)絡(luò)處理器體系結(jié)構(gòu)和應(yīng)用綜述[J];信息工程大學學報;2006年04期

5 張怡,孫志剛;基于IPSec的下一代高性能安全處理器的體系結(jié)構(gòu)[J];國防科技大學學報;2003年02期

6 岳虹;戴葵;王志英;;一種面向數(shù)字信號處理的嵌入式處理器體系結(jié)構(gòu)設(shè)計[J];計算機工程與科學;2006年10期

7 許珊琳;;適合嵌入應(yīng)用的嵌入式處理器[J];中國集成電路;2009年02期

8 張磊;王穎;陳云霽;徐志偉;張立新;;可重塑處理器:用戶可定義的加速器中處理器架構(gòu)[J];網(wǎng)絡(luò)新媒體技術(shù);2012年06期

9 Robert Cravotta;;一個處理器能兼顧控制與信號處理嗎?[J];電子設(shè)計技術(shù);2002年07期

10 朱丹;李暾;郭陽;李思昆;;微處理器體系結(jié)構(gòu)級測試程序自動生成技術(shù)[J];軟件學報;2005年12期

相關(guān)會議論文前3條

1 宋緋;劉曉寧;;DSP/MCU結(jié)構(gòu)的新型處理器[A];第九屆全國青年通信學術(shù)會議論文集[C];2004年

2 趙秋平;楊燦群;王鋒;;LBM算法在Cell處理器上的實現(xiàn)和優(yōu)化[A];2008'中國信息技術(shù)與應(yīng)用學術(shù)論壇論文集（二）[C];2008年

3 周巍;孫冰;戰(zhàn)立明;呂建華;王國仁;于戈;;基于DOM模型的XML查詢處理器的設(shè)計與實現(xiàn)[A];第十八屆全國數(shù)據(jù)庫學術(shù)會議論文集（研究報告篇）[C];2001年

相關(guān)重要報紙文章前10條

1 ;處理器上演多核大戲[N];計算機世界;2005年

2 心元;PC“心臟”的搏擊[N];計算機世界;2004年

3 清華大學微處理器與SoC技術(shù)研究中心王海霞汪東升;顛覆傳統(tǒng)理念[N];計算機世界;2005年

4 清華大學微處理器與SoC技術(shù)研究中心汪東升王海霞張悠慧李兆麟;CMP 開啟處理器效能時代[N];計算機世界;2005年

5 江蘇 netfan;體現(xiàn)速度與性能[N];電腦報;2004年

6 四川王毅;變革進行時[N];電腦報;2004年

7 清華大學微處理器與SoC技術(shù)研究中心汪東升;多核技術(shù)天地廣闊[N];計算機世界;2006年

8 本報記者李獻王皓;2002年服務(wù)器四大景觀[N];計算機世界;2003年

9 ;MontaVista Linux 2．1跨平臺[N];中國計算機報;2002年

10 ;CPU技術(shù)進步牛氣沖天[N];計算機世界;2004年

相關(guān)博士學位論文前10條

1 魏繼增;可配置可擴展處理器關(guān)鍵問題研究[D];天津大學;2010年

2 霍文捷;嵌入式處理器安全運行機制的研究與設(shè)計[D];華中科技大學;2010年

3 從明;類數(shù)據(jù)流驅(qū)動的分片式處理器體系結(jié)構(gòu)[D];中國科學技術(shù)大學;2009年

4 徐光;分片式流處理器體系結(jié)構(gòu)[D];中國科學技術(shù)大學;2010年

5 李勇;異步數(shù)據(jù)觸發(fā)微處理器體系結(jié)構(gòu)關(guān)鍵技術(shù)研究與實現(xiàn)[D];國防科學技術(shù)大學;2007年

6 任永青;邏輯核動態(tài)可重構(gòu)的眾核處理器體系結(jié)構(gòu)[D];中國科學技術(shù)大學;2010年

7 黎鐵軍;嵌入式流媒體處理器體系結(jié)構(gòu)技術(shù)研究[D];國防科學技術(shù)大學;2005年

8 黃海林;高可靠處理器體系結(jié)構(gòu)研究[D];中國科學院研究生院（計算技術(shù)研究所）;2006年

9 劉光輝;高效處理器容錯技術(shù)研究與實現(xiàn)[D];國防科學技術(shù)大學;2013年

10 溫璞;面向科學計算的PIM體系結(jié)構(gòu)技術(shù)研究[D];國防科學技術(shù)大學;2007年

相關(guān)碩士學位論文前10條

1 曾斌;分片式處理器體系結(jié)構(gòu)上的超塊優(yōu)化技術(shù)[D];中國科學技術(shù)大學;2009年

2 黃冕;X處理器存儲一致性模型的研究與實現(xiàn)[D];國防科學技術(shù)大學;2008年

3 趙燦明;分片式處理器上激進執(zhí)行模型分析[D];中國科學技術(shù)大學;2009年

4 劉晉汾;處理器描述語言的研究與應(yīng)用[D];解放軍信息工程大學;2011年

5 劉子揚;基于虛擬計算群的眾核處理器動態(tài)在線任務(wù)調(diào)度算法研究[D];上海交通大學;2013年

6 邸志雄;多核包處理器數(shù)據(jù)控制總線技術(shù)研究[D];西安電子科技大學;2010年

7 方紅霞;基于指令的處理器時延測試產(chǎn)生方法[D];中國科學院研究生院（計算技術(shù)研究所）;2005年

8 黎寶峰;嵌入式DSP處理器的設(shè)計與驗證[D];湖南大學;2003年

9 鐘松延;可配置可擴展處理器編譯器設(shè)計[D];天津大學;2012年

10 董亞卓;循環(huán)陣列處理器體系結(jié)構(gòu)的關(guān)鍵技術(shù)研究與實現(xiàn)[D];國防科學技術(shù)大學;2004年

本文編號：2099232

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2099232.html

上一篇：一種新型高速1553B總線控制器的應(yīng)用驗證
下一篇：UNIX服務(wù)器集中監(jiān)控系統(tǒng)的設(shè)計與實現(xiàn)

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

三維處理器中計算資源動態(tài)共享技術(shù)研究