天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于句子結(jié)構(gòu)的中文微博情緒分析系統(tǒng)

發(fā)布時(shí)間:2018-07-11 14:06

  本文選題:中文微博 + 中文分詞。 參考:《大連理工大學(xué)》2014年碩士論文


【摘要】:隨著互聯(lián)網(wǎng)的高速發(fā)展以及移動(dòng)終端的普及,社交網(wǎng)絡(luò)對(duì)人們生活的影響日益增強(qiáng)。隨著微博這種便捷并且具有極強(qiáng)即時(shí)性的社交網(wǎng)絡(luò)漸漸進(jìn)入網(wǎng)民的生活。越來(lái)越多的人會(huì)選擇在微博上分享、獲取信息,交流情感與觀點(diǎn)。由于微博還具有極強(qiáng)的原創(chuàng)性以及其貼近大眾的特點(diǎn)。通過(guò)對(duì)微博這種短文本進(jìn)行情緒分析,可以實(shí)現(xiàn)輿情監(jiān)控等許多功能。 情緒分析的含義是在情感極性傾向分析的基礎(chǔ)上進(jìn)行更細(xì)粒度的情緒分類。本文所設(shè)計(jì)的系統(tǒng)主要將情緒分類為憤怒、厭惡、恐懼、高興、喜好、悲傷和驚訝七種情緒分類。通過(guò)分析中文微博的自身特點(diǎn),其與英文微博的區(qū)別以及與傳統(tǒng)書面語(yǔ)的差異,設(shè)計(jì)了本系統(tǒng)。數(shù)據(jù)來(lái)源為新浪微博。通過(guò)調(diào)用新浪微博官方提供的API接口獲取一定數(shù)量的微博,提取出其中的微博內(nèi)容、地域來(lái)源、終端來(lái)源等基本信息。在將微博內(nèi)容進(jìn)行去冗余標(biāo)點(diǎn)等預(yù)處理后,利用中科院提供的開源分詞系統(tǒng)以及哈工大的句子結(jié)構(gòu)劃分系統(tǒng)得到分詞結(jié)果以及句子結(jié)構(gòu)劃分結(jié)果。最后,利用句子結(jié)構(gòu)以及微博的話題、情感詞庫(kù)以及否定和程度副詞詞庫(kù)進(jìn)行量化計(jì)算出微博中對(duì)于關(guān)鍵詞的情感細(xì)粒度分類結(jié)果,進(jìn)而通過(guò)對(duì)憤怒、厭惡、恐懼、高興、喜好、悲傷和驚訝七種情緒的極性劃分,得到微博的情感極性傾向分析結(jié)果存入MySQL數(shù)據(jù)庫(kù)。 利用jsp和tomcat,將MySQL數(shù)據(jù)庫(kù)中的分析結(jié)果,以折線圖、柱狀圖和餅狀圖的方式展現(xiàn)出來(lái)。并且用戶可以根據(jù)地域來(lái)源、終端來(lái)源以及時(shí)間等基本信息分別查看關(guān)鍵詞情感極性傾向。對(duì)于注冊(cè)用戶,可以在成功登陸后查看關(guān)鍵詞的情緒分析結(jié)果。
[Abstract]:With the rapid development of the Internet and the popularity of mobile terminals, the social network has a growing influence on people's life. With the convenient and extremely instant social network of micro-blog, the social network has gradually entered the life of Internet users. More and more people will choose to share on micro-blog, obtain information, exchange feelings and views. Because micro-blog is also It has strong originality and its close to the public characteristics. By analyzing the short text of micro-blog, we can achieve many functions such as public opinion monitoring and so on.
The meaning of emotional analysis is to carry out a more fine-grained emotion classification based on the analysis of emotional polarity. The system designed in this paper mainly classifications of emotion into seven kinds of emotional classifications: anger, disgust, fear, delight, preference, sadness and surprise. By analyzing the self characteristics of Chinese micro-blog, the difference between the Chinese and English micro-blog and the traditional book are analyzed. This system is designed. The data source is Sina micro-blog. A certain amount of micro-blog is obtained by calling the API interface provided by Sina micro-blog to extract basic information such as micro-blog content, geographical source, terminal source and so on. The word system and the sentence structure division system of Kazakhstan get the result of the word segmentation and the result of the sentence structure division. Finally, using the sentence structure and the topic of micro-blog, the emotional lexicon and the negative and degree adverb thesaurus to quantify the result of the fine grain classification of the key words in micro-blog, and then through the anger and disgust, The polarity of fear, joy, preference, sadness and surprise are divided into seven kinds of emotions. The result of micro-blog's polar polarity analysis is stored in MySQL database.
Using JSP and tomcat, the analysis results in the MySQL database are displayed in the way of line diagram, bar graph and pie chart. And users can view the keyword emotional polarity according to local sources, terminal sources and time and other basic information. For registered users, they can view the emotional points of key words after successful landing. Analysis the result.
【學(xué)位授予單位】:大連理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2014
【分類號(hào)】:TP393.092;TP391.1

【參考文獻(xiàn)】

相關(guān)期刊論文 前9條

1 孫茂松,鄒嘉彥;漢語(yǔ)自動(dòng)分詞研究評(píng)述[J];當(dāng)代語(yǔ)言學(xué);2001年01期

2 孫鐵利;劉延吉;;中文分詞技術(shù)的研究現(xiàn)狀與困難[J];信息技術(shù);2009年07期

3 劉群,張華平,俞鴻魁,程學(xué)旗;基于層疊隱馬模型的漢語(yǔ)詞法分析[J];計(jì)算機(jī)研究與發(fā)展;2004年08期

4 魏椺;向陽(yáng);陳千;;中文文本情感分析綜述[J];計(jì)算機(jī)應(yīng)用;2011年12期

5 張華平,劉群;基于N-最短路徑方法的中文詞語(yǔ)粗分模型[J];中文信息學(xué)報(bào);2002年05期

6 周勝臣;瞿文婷;石英子;施詢之;孫韻辰;;中文微博情感分析研究綜述[J];計(jì)算機(jī)應(yīng)用與軟件;2013年03期

7 趙妍妍;秦兵;劉挺;;文本情感分析[J];軟件學(xué)報(bào);2010年08期

8 張春霞,郝天永;漢語(yǔ)自動(dòng)分詞的研究現(xiàn)狀與困難[J];系統(tǒng)仿真學(xué)報(bào);2005年01期

9 朱明;郭春生;;隱馬爾可夫模型及其最新應(yīng)用與發(fā)展[J];計(jì)算機(jī)系統(tǒng)應(yīng)用;2010年07期

,

本文編號(hào):2115448

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2115448.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶fc4b3***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com