融合內(nèi)容及行為的虛假評論檢測方法研究
[Abstract]:With the development of the Internet, especially the rapid development of electronic commerce, more and more consumers prefer to shop online, and it is more and more easy for consumers to comment on the products they buy. These product reviews provide valuable information resources for manufacturers and potential consumers. Due to the existence of some interest relations, there may be some false or false content, these false comments to a certain extent affect the reference value of comment information, thus misleading consumers, so it is particularly important to detect false comments. The most basic comment information is the content information of the comment. It is very important to mine the content information of the comment and detect the false comment by using the content information of the comment; in addition, it is very important to mine the behavior of the reviewer. Identifying false comments by discovering abnormal behavior patterns also plays an important role. This paper focuses on product and service reviews, focusing on the detection of false comments based on the content of comments, and the detection of false comments based on the behavior of reviewers. The key issues of detecting false comments such as comment content and reviewer behavior are studied. The main works are as follows: (1) A method of false comment detection based on comment content is proposed. In this method, a motif based on affective dependency is constructed, which is used by topic-opposite sentiment dependency model, TOSDM), to extract the subject information of comments and their corresponding emotional information, and then combines the subject and emotional information of comments. Finally, a supervised learning classifier is used to detect false comments. (2) A method of false comment detection based on reviewer's behavior is proposed. The method firstly selects 10 dimensions to reflect the behavior of the reviewer according to the comment data, and normalizes the feature of each dimension. Then, the clustering matrix is constructed according to the characteristics of each comment, and the K-means algorithm is improved by using F statistics. Finally, the degree of each cluster deviating from the whole comment data set is calculated, and the abnormal cluster is determined according to the threshold. Thus, the detection of false comments is realized. (3) A semi-supervised detection method of false comments is proposed, which combines the content of comments with the behavior of the reviewers. The method firstly extracts the content features of comments and the behavioral features of reviewers. Then, with the help of Co-Training 's semi-supervised learning idea, the two kinds of features are regarded as independent views, and the classifiers are constructed using the two independent features. The unlabeled samples with high confidence are selected. Finally, the training model is updated with these selected samples to improve the classifier effect. (4) A prototype system of false comment detection is designed and implemented. It is convenient to further study the detection method of false comment.
【學(xué)位授予單位】:昆明理工大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TP393.08
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 譚文堂;朱洪;葛斌;李芳芳;肖衛(wèi)東;;垃圾評論自動過濾方法[J];國防科技大學(xué)學(xué)報(bào);2012年05期
2 曾雪強(qiáng),王明文,陳素芬;一種基于潛在語義結(jié)構(gòu)的文本分類模型[J];華南理工大學(xué)學(xué)報(bào)(自然科學(xué)版);2004年S1期
3 孫升蕓;田萱;;產(chǎn)品垃圾評論檢測研究綜述[J];計(jì)算機(jī)科學(xué);2011年S1期
4 邱云飛;王建坤;邵良杉;劉大有;;基于用戶行為的產(chǎn)品垃圾評論者檢測研究[J];計(jì)算機(jī)工程;2012年11期
5 魏小娟;李翠平;陳紅;;Co-Training——內(nèi)容和鏈接的Web Spam檢測方法[J];計(jì)算機(jī)科學(xué)與探索;2010年10期
6 解曉敏;李云;;最小最大模塊化網(wǎng)絡(luò)中基于聚類的數(shù)據(jù)劃分方法研究[J];南京大學(xué)學(xué)報(bào)(自然科學(xué)版);2012年02期
7 宋海霞;嚴(yán)馨;余正濤;石林賓;蘇斐;;基于自適應(yīng)聚類的虛假評論檢測[J];南京大學(xué)學(xué)報(bào)(自然科學(xué)版);2013年04期
8 張倩;瞿有利;;用于網(wǎng)絡(luò)評論分析的主題-對立情感挖掘模型[J];計(jì)算機(jī)科學(xué)與探索;2013年07期
9 周志華;;基于分歧的半監(jiān)督學(xué)習(xí)[J];自動化學(xué)報(bào);2013年11期
10 趙妍妍;秦兵;劉挺;;文本情感分析[J];軟件學(xué)報(bào);2010年08期
相關(guān)博士學(xué)位論文 前1條
1 李方濤;基于產(chǎn)品評論的情感分析研究[D];清華大學(xué);2011年
,本文編號:2185124
本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2185124.html