基于Hadoop-GPU的RBM云計算實現(xiàn)
發(fā)布時間:2018-07-23 12:07
【摘要】:針對受限波爾茲曼機處理大數(shù)據(jù)時存在的訓練緩慢問題,在Hadoop云計算平臺和GPU并行加速的基礎上設計了基于Hadoop-GPU框架的的RBM加速計算實現(xiàn)方法.通過對MapReduce機制和RBM訓練過程的分析,將RBM訓練分割為采用Map端實現(xiàn)吉布斯采樣,Reduce端實現(xiàn)參數(shù)更新,并通過GPU實現(xiàn)運算并行加速的方法組合.最后通過MNIST手寫數(shù)字識別集實驗證明,在大規(guī)模數(shù)據(jù)下,Hadoop-GPU平臺對RBM的訓練具有良好的可行性,加速比達到20以上,并且隨著數(shù)據(jù)規(guī)模的增加,加速比呈現(xiàn)更為顯著的增長.
[Abstract]:Aiming at the problem of slow training of constrained Boltzmann machine in big data processing, a RBM accelerated computing implementation method based on Hadoop-GPU framework is designed on the basis of Hadoop cloud computing platform and GPU parallel acceleration. Based on the analysis of MapReduce mechanism and RBM training process, the RBM training is divided into two parts: the Gibbs sampling and reduce end is used to realize the parameter updating and the GPU is used to realize the method combination of parallel computation acceleration. Finally, the experiment of MNIST handwritten numeral recognition set shows that the Hadoop-GPU platform is feasible for RBM training under large scale data, and the speedup ratio is more than 20, and the speedup increases more significantly with the increase of data scale.
【作者單位】: 海軍航空工程學院信息融合研究所;山西美佳礦業(yè)裝備有限公司;
【基金】:國家自然科學基金(61032001)
【分類號】:TP332
[Abstract]:Aiming at the problem of slow training of constrained Boltzmann machine in big data processing, a RBM accelerated computing implementation method based on Hadoop-GPU framework is designed on the basis of Hadoop cloud computing platform and GPU parallel acceleration. Based on the analysis of MapReduce mechanism and RBM training process, the RBM training is divided into two parts: the Gibbs sampling and reduce end is used to realize the parameter updating and the GPU is used to realize the method combination of parallel computation acceleration. Finally, the experiment of MNIST handwritten numeral recognition set shows that the Hadoop-GPU platform is feasible for RBM training under large scale data, and the speedup ratio is more than 20, and the speedup increases more significantly with the increase of data scale.
【作者單位】: 海軍航空工程學院信息融合研究所;山西美佳礦業(yè)裝備有限公司;
【基金】:國家自然科學基金(61032001)
【分類號】:TP332
【參考文獻】
相關期刊論文 前2條
1 江務學;張t,
本文編號:2139363
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2139363.html
最近更新
教材專著