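Design and Implementation of a Burst Topic Detection System Based on Chinese Microblogs
Published: 2018-06-02 19:30
Topic: Chinese microblogs + burst topic detection. Source: Beijing University of Posts and Telecommunications, master's thesis, 2014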
[Abstract]: With the continued growth of social network services (SNS), microblogging has become an indispensable part of daily life for many people in China. A microblog is a broadcast-style social networking platform for sharing short, real-time messages through a one-way follow mechanism. As a form of social media, its short-text, real-time, social, and media characteristics greatly shorten the time between a message's publication and its wide diffusion, which favors rapid information spread. Microblogs are now a main place where online public opinion erupts and gathers. Effectively detecting burst topics in microblogs therefore has strong practical value for ordinary users, businesses, and government departments alike.
Building on a survey of existing research on burst topic detection, this thesis designs and implements a burst topic detection system based on Sina Weibo, the Emerging Topic Detection System (ETD). ETD collects user and post data from Sina Weibo in real time and improves the completeness and consistency of the data set as far as possible. It uses a novel burst topic detection model that can more accurately detect burst topics in large volumes of posts. Finally, it presents the detected burst topics in a Web front end using recent data visualization techniques.
The thesis first briefly introduces the theoretical and technical background of burst topic detection. It then describes in detail the requirements analysis for ETD and presents the overall system design. Next, it describes the detailed design and implementation of each subsystem: the data collection subsystem, the burst topic detection subsystem, and the burst topic visualization subsystem. Unit tests and integration tests of the whole system then show that it meets its intended design goals. The thesis closes with a summary of the full text, an outlook on future work, and a summary of the author's work and results during graduate study.
[Degree-granting institution]: Beijing University of Posts and Telecommunications
[Degree level]: Master's
[Year degree granted]: 2014
[Classification number]: TP393.092
Article ID: 1969922
Article link: http://sikaile.net/guanlilunwen/ydhl/1969922.html