Phrase-based hashtag recommendation for microblog posts
本文選題:recommendation + topic; 參考:《Science China(Information Sciences)》2017年01期
【摘要】:In microblogs, authors use hashtags to mark keywords or topics. These manually labeled tags can be used to benefit various live social media applications(e.g., microblog retrieval, classification). However,because only a small portion of microblogs contain hashtags, recommending hashtags for use in microblogs are a worthwhile exercise. In addition, human inference often relies on the intrinsic grouping of words into phrases.However, existing work uses only unigrams to model corpora. In this work, we propose a novel phrase-based topical translation model to address this problem. We use the bag-of-phrases model to better capture the underlying topics of posted microblogs. We regard the phrases and hashtags in a microblog as two different languages that are talking about the same thing. Thus, the hashtag recommendation task can be viewed as a translation process from phrases to hashtags. To handle the topical information of microblogs, the proposed model regards translation probability as being topic specific. We test the methods on data collected from realworld microblogging services. The results demonstrate that the proposed method outperforms state-of-the-art methods that use the unigram model.
[Abstract]:In microblogs, authors use hashtags to mark keywords or topics. These manually labeled tags can be used to benefit various live social media applications(e.g., microblog retrieval, classification). However,because only a small portion of microblogs contain hashtags, recommending hashtags for use in microblogs are a worthwhile exercise. In addition, human inference often relies on the intrinsic grouping of words into phrases.However, existing work uses only unigrams to model corpora. In this work, we propose a novel phrase-based topical translation model to address this problem. We use the bag-of-phrases model to better capture the underlying topics of posted microblogs. We regard the phrases and hashtags in a microblog as two different languages that are talking about the same thing. Thus, the hashtag recommendation task can be viewed as a translation process from phrases to hashtags. To handle the topical information of microblogs, the proposed model regards translation probability as being topic specific. We test the methods on data collected from realworld microblogging services. The results demonstrate that the proposed method outperforms state-of-the-art methods that use the unigram model.
【作者單位】: Shanghai
【基金】:funded by National Natural Science Foundation of China (Grant Nos. 61473092, 61472088, 61532011) National High Technology Research and Development Program of China (Grant No. 2015AA015408) Shanghai Science and Technology Development Funds (Grant Nos. 13dz2260200, 13511504300)
【分類號】:TP391.3
【相似文獻】
相關(guān)期刊論文 前4條
1 ;Small-world patterns in Chinese phrase networks[J];Chinese Science Bulletin;2005年03期
2 蘇勁松;劉洋;劉群;董槐林;;基于圖的統(tǒng)計機器翻譯詞匯化調(diào)序模型(英文)[J];中國通信;2014年05期
3 陳毅東;史曉東;;Improving Phrase-Based Statistical Machine Translation Models by Incorporating Syntax-Based Language Models[J];Journal of Donghua University(English Edition);2010年02期
4 ;[J];;年期
相關(guān)會議論文 前2條
1 ;RESEARCH ON THE PROFESSIONAL TRAINING OF PHYSICAL EDUCATION TEACHERS[A];第五屆全國青年體育科學(xué)學(xué)術(shù)會議、第二屆中國體育博士高層論壇論文集[C];2008年
2 ;A Synchronous Grammar for Statistical MachineTranslation[A];機器翻譯研究進展——第四屆全國機器翻譯研討會論文集[C];2008年
相關(guān)碩士學(xué)位論文 前3條
1 曾嘉鳴;漢語主語省略機制及其法譯研究[D];廣東外語外貿(mào)大學(xué);2015年
2 張元;法漢翻譯中的否定現(xiàn)象[D];外交學(xué)院;2009年
3 楊寧;漢語助詞在外向型漢法詞典中的處理[D];廣東外語外貿(mào)大學(xué);2003年
,本文編號:1891280
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/1891280.html