云環(huán)境下基于多目標(biāo)優(yōu)化算法的微博意見領(lǐng)袖挖掘
[Abstract]:In recent years, with the popularity of the Internet, people's way of communication has also undergone earth-shaking changes. As a new kind of network information communication media, Weibo has been pursued by many netizens, its timeliness and fragmentation can make information widely spread in a very short time, so the effective control of information becomes very important. As soon as someone deliberately spreads false information or endangers social security, the consequences will be very serious. So, it is of great significance to excavate and monitor the opinion leaders who play a dominant role in the network information dissemination. At present, there are statistical method, cluster analysis method and social network analysis method based on SNA for opinion leader mining. These methods have their own characteristics. However, in the face of the massive Weibo data generated by 33.1 billion Weibo netizens, They don't show good handling power. In this paper, taking Weibo user attribute as the foothold, combining its multi-attribute features with multi-objective optimization problem, this paper proposes to introduce Skyline query into the mining of Weibo opinion leader. Skyline query is a kind of method to solve multi-objective optimization problem. Facing the huge amount of Weibo data, this paper introduces MapReduce framework, the key technology of Skyline, and implements the BNL block nested loop algorithm and SFS sorting filtering algorithm in Skyline computing under this programming framework, so that it has better performance when dealing with massive data. Then the evaluation model of opinion leader is established, and two indexes of user influence and user participation are put forward to evaluate the opinion leader of Weibo. The weight of each attribute is determined by AHP, and the formula of opinion leader is given at last. In the mining experiment stage, the Hadoop cluster environment is built, the Weibo crawler is designed to obtain Weibo data, the data is processed by parallel SFS algorithm, and the processing results are applied to the opinion leader model for calculation. Finally, the author compares the opinion leaders excavated in this paper with the official popular users of Sina Weibo, and the results show that the opinion leaders are distributed in a relatively wide range of areas by the method of this paper. To some extent avoided Sina Weibo official user ranking entertainment figures in the big phenomenon. Therefore, the idea of this paper can be used as a solution to deal with massive, high-dimensional data, which provides a possible choice for the mining of Weibo opinion leaders.
【學(xué)位授予單位】:北京郵電大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TP393.092
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 李名亮;;現(xiàn)實與隱憂:微博意見領(lǐng)袖的話語權(quán)力[J];今傳媒;2012年05期
2 桑亮;許正林;;微博意見領(lǐng)袖的形成機(jī)制及其影響[J];當(dāng)代傳播;2011年03期
3 肖宇;許煒;夏霖;;一種基于情感傾向分析的網(wǎng)絡(luò)團(tuán)體意見領(lǐng)袖識別算法[J];計算機(jī)科學(xué);2012年02期
4 王玨;曾劍平;周葆華;吳承榮;;基于聚類分析的網(wǎng)絡(luò)論壇意見領(lǐng)袖發(fā)現(xiàn)方法[J];計算機(jī)工程;2011年05期
5 王甲生;吳曉平;廖巍;陳永強(qiáng);;改進(jìn)的加權(quán)復(fù)雜網(wǎng)絡(luò)節(jié)點重要度評估方法[J];計算機(jī)工程;2012年10期
6 肖宇;許煒;夏霖;;網(wǎng)絡(luò)社區(qū)中的意見領(lǐng)袖特征分析[J];計算機(jī)工程與科學(xué);2011年01期
7 馬寧;劉怡君;;基于超網(wǎng)絡(luò)的輿論領(lǐng)袖識別應(yīng)用研究[J];中國科學(xué)院院刊;2012年05期
8 張萌;余芬芬;;微博意見領(lǐng)袖對網(wǎng)絡(luò)輿論的影響[J];新聞世界;2012年07期
9 肖宇;許煒;張晨;何丹丹;;社交網(wǎng)絡(luò)中用戶區(qū)域影響力評估算法研究[J];微電子學(xué)與計算機(jī);2012年07期
10 王平;謝耘耕;;突發(fā)公共事件中微博意見領(lǐng)袖的實證研究——以“溫州動車事故”為例[J];現(xiàn)代傳播(中國傳媒大學(xué)學(xué)報);2012年03期
,本文編號:2455648
本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2455648.html