社交信息傳播時序預(yù)測算法
[Abstract]:The increasingly popular social networks provide a wide range of data bases and application scenarios for the prediction of information dissemination. The research of information dissemination prediction is based on the known information dissemination process, using methods to predict the trend of social information in the future, in order to understand the whole process of information dissemination in advance. With the help of information dissemination and prediction method, network companies can better provide personalized recommendation services for users and government departments to take timely and effective public opinion control and guidance. The research of information dissemination prediction involves many fields, such as large-scale data parallel processing, social network topology analysis and text content analysis, which attracts big data and cloud computing. The attention of scholars in the fields of complex networks and natural language processing. Information dissemination prediction is an important research direction in social networks. Recent research methods can be divided into graph and non-graph methods. Most non-graph methods use infectious disease model and classification model, and seldom consider the clustering characteristics of social time series. In the clustering based time series prediction algorithm (CTP), each cluster centroid is regarded as a kind of propagation pattern, so the prediction can be realized by classifying the nearest neighbor propagation pattern of the prediction object. That is, CTP takes the nearest neighbor clustering centroid of the predicted object as its prediction result. Therefore, the prediction performance of CTP depends on the fit between the prediction object and its nearest clustering centroid. The higher the fitting degree is, the better the prediction performance of CTP is. By analyzing the physical meaning of the scaling distance, it is observed that the scaling distance can better measure the similarity between time series. This paper holds that the nearest neighbor centroid based on the scaling distance of the predicted object may be more suitable for the prediction object to obtain higher prediction performance. However, there is a lack of research on the effect of scaling distance on the prediction of CTP. Therefore, based on CTP and zoom distance, this paper proposes a scalable clustering based time series prediction algorithm S-CTP. The improved S-CTP takes the nearest neighbor clustering centroid of the predicted object as the prediction result to improve its fitting degree with the predicted object. The experimental results on the prediction performance. Twitter and phrase datasets show that S-CTP improves the generalization performance of CTP. In CTP, the similarity between some nearest neighbor clustering members and predictive objects is higher, and the other part is lower, which leads to lower prediction performance of CTP. In order to solve the problem of low prediction performance of CTP, a time series prediction algorithm D-CTP based on piecewise clustering is proposed based on the characteristics of CTP and time series segmentation. In order to select the cluster members most similar to the prediction object, the improved D-CTP always takes the prediction object as the cluster centroid and then refines the cluster centroid in the known length time series of the predicted object and the predicted length time series. Similar to S-CTP 's proposal, In this paper, based on D-CTP and zoom distance, a series prediction algorithm based on scalable piecewise clustering. Twitter and phrase data sets are proposed. The experimental results show that the time series prediction algorithm based on S-CTP is based on both zooming distance and segment clustering. The generalization performance of CTP is improved in one step.
【學(xué)位授予單位】:西南交通大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP393.09
【參考文獻】
相關(guān)期刊論文 前10條
1 游新年;劉群;;基于傳染病模型的微博信息傳播預(yù)測研究[J];計算機應(yīng)用與軟件;2016年05期
2 李洋;陳毅恒;劉挺;;微博信息傳播預(yù)測研究綜述[J];軟件學(xué)報;2016年02期
3 周雪峰;徐恪;張藍珊;張賽;;社交網(wǎng)絡(luò)的傳播測量與時間序列聚類分析[J];小型微型計算機系統(tǒng);2015年07期
4 孔慶超;毛文吉;;基于動態(tài)演化的討論帖流行度預(yù)測[J];軟件學(xué)報;2014年12期
5 曹玖新;吳江林;石偉;劉波;鄭嘯;羅軍舟;;新浪微博網(wǎng)信息傳播分析與預(yù)測[J];計算機學(xué)報;2014年04期
6 毛佳昕;劉奕群;張敏;馬少平;;基于用戶行為的微博用戶社會影響力分析[J];計算機學(xué)報;2014年04期
7 王昊;李義萍;馮卓楠;馮鈴;;流行病模型在微博轉(zhuǎn)發(fā)預(yù)測中的應(yīng)用(英文)[J];中國通信;2013年03期
8 易成岐;鮑媛媛;薛一波;姜京池;;新浪微博的大規(guī)模信息傳播規(guī)律研究[J];計算機科學(xué)與探索;2013年06期
9 韓忠明;陳妮;樂嘉錦;段大高;孫踐知;;面向熱點話題時間序列的有效聚類算法研究[J];計算機學(xué)報;2012年11期
10 張賽;徐恪;李海濤;;微博類社交網(wǎng)絡(luò)中信息傳播的測量與分析[J];西安交通大學(xué)學(xué)報;2013年02期
,本文編號:2239087
本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2239087.html