基于并行計算的交互式數(shù)據(jù)挖掘和可視化系統(tǒng)
[Abstract]:With the development of information technology, the amount of data increases explosively. The traditional data mining technology based on CPU can not deal with such a huge amount of data efficiently. In addition, the human brain is easier to recognize color and geometry for boring numbers. Using data visualization technology, data mining results can be more naturally and intuitively presented in the operation interface, which can better meet the needs of users. But at present, the traditional data visualization tools used in data mining can only draw 2D or 3D graphics, and lack of interactivity. This paper presents an interactive data mining and visualization system based on parallel computing. In this paper, the classical data stream mining algorithm is optimized by using GPU (Graphics Processing Unit) programming method. The traditional data mining technology based on CPU adopts serial data processing method, which can not meet the needs of multiple computer resources running at the same time. When the amount of data is large, the number of iterations will be many, the memory requirement will be large, the processing speed will be very slow and the efficiency will be low. The GPU programming method uses the parallel way to process the data. The multiple threads run independently and simultaneously, so the operation efficiency is very high, so it is more suitable to deal with a large amount of data. Aiming at the data independence and data dependence in big data, this paper optimizes the clustering algorithm K-Means and the connected area marking algorithm (Connected Component Labeling,CCL by using GPU programming technology, and completes the mining analysis of big data. In this paper, an interactive method of data visualization is proposed. In order to realize the visualization of data, we use the software development kit of DirectX to transform the original data set or data mining result into vertex, line, surface, color and other graphics. The multi-dimensional model is built by using various clear graphic functions provided in the software development toolkit, and the final visualization results are rendered. In addition, we also create a graphical user interface (GUI),) which can change the clustering parameters according to their different requirements and get the visualization results that meet their needs. Based on the above algorithm, the energy consumption data generated by air conditioning operation are experimented in this paper, and the traditional algorithm is optimized by using GPU programming method, which not only realizes the clustering analysis of the data, The experimental data show that the speed of the system is greatly improved and the operation efficiency is higher when the system is used to deal with the huge amount of data. In addition, we use the software development kit of DirectX to represent the abstract data mining results as concrete four-dimensional three-dimensional graphics and images, and users can change the visual view of the visual results and the K value of clustering through keyboard operation. Get the results you want to meet the real needs of users.
【學位授予單位】:北方工業(yè)大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP311.13
【參考文獻】
相關期刊論文 前10條
1 權國龍;馮園園;馮仰存;顧小清;;面向知識的可視化技術分析與觀察[J];遠程教育雜志;2016年01期
2 鄧仲華;劉偉偉;陸穎雋;;基于云計算的大數(shù)據(jù)挖掘內(nèi)涵及解決方案研究[J];情報理論與實踐;2015年07期
3 Yang Ju;Heping Xie;Zemin Zheng;Jinbo Lu;Lingtao Mao;Feng Gao;Ruidong Peng;;Visualization of the complex structure and stress field inside rock by means of 3D printing technology[J];Chinese Science Bulletin;2014年36期
4 Yufeng Zhao;Qi Xie;Liyun He;Baoyan Liu;Kun Li;Xiang Zhang;Wenjing Bai;Lin Luo;Xianghong Jing;Ruili Huo;;Comparsion analysis of data mining models applied to clinical research in Traditional Chinese Medicine[J];Journal of Traditional Chinese Medicine;2014年05期
5 潘巍;李戰(zhàn)懷;;大數(shù)據(jù)環(huán)境下并行計算模型的研究進展[J];華東師范大學學報(自然科學版);2014年05期
6 Amani Tahat;Jordi Marti;Ali Khwaldeh;Kaher Tahat;;Pattern recognition and data mining software based on artificial neural networks applied to proton transfer in aqueous environments[J];Chinese Physics B;2014年04期
7 Chenyang Ge;Zuoxun Hou;Huimin Yao;Nanning Zheng;Wenzhe Zhao;;A new implementation of image-processing engine for 3D visualization and stereo video stream display[J];Chinese Science Bulletin;2014年Z1期
8 Zhen Chen;Fuye Han;Junwei Cao;Xin Jiang;Shuo Chen;;Cloud Computing-Based Forensic Analysis for Collaborative Network Security Management System[J];Tsinghua Science and Technology;2013年01期
9 孫大為;常桂然;高尚;靳立忠;王興偉;;Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments[J];Journal of Computer Science & Technology;2012年02期
10 牛東曉;王永利;馬小勇;;Optimization of support vector machine power load forecasting model based on data mining and Lyapunov exponents[J];Journal of Central South University of Technology;2010年02期
相關博士學位論文 前4條
1 李秋虹;基于MapReduce的大規(guī)模數(shù)據(jù)挖掘技術研究[D];復旦大學;2013年
2 周勇;基于并行計算的數(shù)據(jù)流處理方法研究[D];大連理工大學;2013年
3 張小慶;基于云計算環(huán)境的資源提供優(yōu)化方法研究[D];武漢理工大學;2013年
4 任永功;面向聚類的數(shù)據(jù)可視化方法及相關技術研究[D];東北大學;2006年
相關碩士學位論文 前1條
1 王莉;基于Hadoop的大數(shù)據(jù)平臺數(shù)據(jù)挖掘云服務研究[D];長江大學;2016年
,本文編號:2228283
本文鏈接:http://sikaile.net/shoufeilunwen/xixikjs/2228283.html