大規(guī)模社會網(wǎng)絡(luò)中影響最大化問題高效處理技術(shù)研究
[Abstract]:In recent years, with the rapid development of the Internet and the Web2.0 technology, the social network, as a bridge for communication with the real human world, has become an important medium and platform for interactive communication, knowledge sharing and information dissemination. Among them, the purpose of maximizing the problem is to find the most influential set of nodes in the social network, which is the key problem in the field of social network analysis, and has wide application in many important scenarios, such as marketing, advertisement release, public opinion warning, water quality monitoring, epidemic monitoring and the like. And therefore has high research value and application value. The development and deployment of many impact-maximizing application strategies are very sensitive to the algorithm's solution time, so the efficient solution algorithm is the core of the current research and influence maximizing algorithm in the academic and industry. The existing research results are mainly focused on some greedy algorithms and heuristic algorithms. On the other hand, the data of the present social network is massive, the data coupling degree is high, the network structure changes dynamically, In the face of the large-scale social network, the existing algorithms have exposed many problems which are difficult to overcome: the parallelizable question of the calculation of the influence value of the nodes in the first and the social networks The existing work is focused on reducing the complexity of the algorithm, and the existing parallel computing architecture is not fully utilized to speed up the problem. A large number of node-affected values in the actual social network can be executed by the parallel computing architecture. So, in the aspect of the parallelism of the mining algorithm, the execution speed of the influence maximizing algorithm is still greatly improved. the second, the existing effect maximization algorithm does not take full account of the social network node distribution The distribution of the nodes in the social network obeys the typical power law. However, most of the existing greedy algorithms are used to calculate the influence value of all the nodes in a precise way, which leads to the high computational complexity of the large-degree nodes and becomes a bottle to be executed by the algorithm. The Dynamic Change of the Structure of the Third and the Social Network Problem: The existing research on maximizing the problem is focused on the static network; when the network changes dynamically, it is necessary to re-compute the influence value of the node for the whole network, resulting in a large amount of redundant calculation, which can lead to the performance of the large-scale social network In view of the above-mentioned technical bottleneck, this paper systematically studies the high-efficiency processing technology of the problem of maximizing the influence of social network, from the following aspects: Open study: The algorithm complexity is high for the existing method, resulting in the running time In this paper, based on the heterogeneous parallel computing framework of CPU and GPU, this paper designs and implements a high-degree-of-parallelism impact-maximizing algorithm, BUTA, and advances in to the GPU architecture. In this paper, the hierarchical dependency of nodes in the social network is analyzed, and the calculation of the influence of the nodes is found. On the basis of this, a self-base layer-by-layer scanning method, BUTA. BUTA, is designed. On the one hand, the algorithm complexity can be greatly reduced while the accuracy of the algorithm is guaranteed. On the other hand, BUTA makes full use of the hierarchical distribution of the nodes and calculates the nodes with high degree of parallelism. In order to make the BUTA algorithm more suitable for the heterogeneous parallel computing framework of the CPU + GPU, the paper designs three optimization methods: K-layer combination, data recombination and combined access, which are used to reduce the running-time branch, reduce the number of visits and the increase. This paper presents a sampling estimation algorithm based on Monte-Carlo theory, which is based on the Monte-Carlo theory, which has not fully utilized the distribution characteristics of the social network nodes. The calculation efficiency is improved. The distribution characteristics of the nodes in the social network are modeled and mined, and the problem of the long-time calculation time of the large-degree nodes is solved. In this paper, the Monte-Carlo theory is introduced, and a kind of node influence value estimation is designed. Method ESMCE. In the process of sampling, the ESMCE algorithm designs a sampling section that is guided by the power law index. In this paper, based on the difference between the estimation error and the precision requirement, a method for predicting the number of subsequent sampling nodes based on the gray-scale prediction model is presented in this paper to improve the accuracy of the algorithm until the sampling error is satisfied by multi-iterative sampling. In order to solve the problem of the low efficiency of the existing algorithm caused by the dynamic change of the social network topology, this paper designs an incremental influence to the maximum. In this paper, the evolution of the social network topology is deeply analyzed, and the topology change of the social network is found to meet the principle of priority and the degree of most influential nodes It is obviously larger than the common node. Based on the above-mentioned findings, this paper designs a kind of effect based on the local theory the invention designs a pruning strategy based on the influence change amount of the node and the most influential node information corresponding to the original network, the pre-ordering node set is sorted, so that the complexity of the dynamic social network influence maximization solution is greatly reduced, The time of program execution is reduced. In view of the fact that the current content distribution method ignores the social information such as the user-related relation and the geographical position in the social network, the problem that the user's access delay is high is caused, and a kind of problem based on the influence is designed in this paper. The content distribution method SCORE is different from the existing content distribution method. The SCORE method makes full use of the user information in the social network, and proposes a cache content selection strategy based on the influence maximization algorithm to quickly and accurately locate the future. In order to minimize the access delay, the SCORE method designs a K-MEANS clustering algorithm and a weighted spherical average computing party based on the association relationship and the geographic location information between the users in order to minimize the access delay the edge server selection policy of the method thereby pre-distributing the key content to the edge server closest to the potential access user, so as to respond to the user's request nearby. The experiment results show that the SCORE method can greatly reduce the user access To sum up, this paper puts forward a valid solution for the efficient processing technology of the problem of maximizing the influence of social network, and it is made on the real data set In this paper, the effectiveness of the proposed algorithm is verified, and the research and practical tools for maximizing the influence of the social network
【學(xué)位授予單位】:國防科學(xué)技術(shù)大學(xué)
【學(xué)位級別】:博士
【學(xué)位授予年份】:2013
【分類號】:TP393.02;TP301.6
【參考文獻(xiàn)】
相關(guān)期刊論文 前4條
1 郎君;秦兵;宋巍;劉龍;劉挺;李生;;基于社會網(wǎng)絡(luò)的人名檢索結(jié)果重名消解[J];計算機(jī)學(xué)報;2009年07期
2 田家堂;王軼彤;馮小軍;;一種新型的社會網(wǎng)絡(luò)影響最大化算法[J];計算機(jī)學(xué)報;2011年10期
3 劉思峰;灰色系統(tǒng)理論的產(chǎn)生與發(fā)展[J];南京航空航天大學(xué)學(xué)報;2004年02期
4 何清華;張世琦;;社會網(wǎng)絡(luò)分析發(fā)展與工程應(yīng)用研究綜述[J];建設(shè)監(jiān)理;2012年02期
相關(guān)博士學(xué)位論文 前5條
1 韓毅;社會網(wǎng)絡(luò)分析與挖掘的若干關(guān)鍵問題研究[D];國防科學(xué)技術(shù)大學(xué);2011年
2 謝乃明;灰色系統(tǒng)建模技術(shù)研究[D];南京航空航天大學(xué);2008年
3 高霖;社會網(wǎng)絡(luò)動態(tài)性及網(wǎng)絡(luò)環(huán)境中的分布式搜索策略研究[D];中國科學(xué)技術(shù)大學(xué);2009年
4 萬懷宇;社會網(wǎng)絡(luò)中基于鏈接的分類問題研究[D];北京交通大學(xué);2012年
5 曾波;灰色預(yù)測建模技術(shù)研究[D];南京航空航天大學(xué);2012年
本文編號:2504761
本文鏈接:http://sikaile.net/wenyilunwen/guanggaoshejilunwen/2504761.html