天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 科技論文 > 軟件論文 >

基于矩陣分解和子模最大化的微博新聞?wù)椒?/H1>
發(fā)布時(shí)間:2018-09-12 19:51
【摘要】:針對(duì)面向微博的中文新聞?wù)闹饕魬?zhàn),提出了一種將矩陣分解與子模最大化相結(jié)合的新聞自動(dòng)摘要方法。該方法首先利用正交矩陣分解模型得到新聞文本潛語(yǔ)義向量,解決了短文本信息稀疏問(wèn)題,并使投影方向近似正交以減少冗余;然后從相關(guān)性和多樣性等方面評(píng)估新聞?wù)Z句集合,該評(píng)估函數(shù)由多個(gè)單調(diào)子模函數(shù)和一個(gè)評(píng)估語(yǔ)句不相似度的非子模函數(shù)組成;最后設(shè)計(jì)貪心算法生成最終摘要。在NLPCC2015數(shù)據(jù)集上的實(shí)驗(yàn)結(jié)果表明,該方法能有效提高面向微博的新聞自動(dòng)摘要質(zhì)量,ROUGE得分超過(guò)其他基線系統(tǒng)。
[Abstract]:Aiming at the main challenge of Weibo's Chinese news abstract, this paper proposes an automatic news digest method which combines matrix decomposition with submodule maximization. Firstly, the latent semantic vector of news text is obtained by using orthogonal matrix decomposition model, which solves the problem of sparse information in short text, and makes the projection direction approximate orthogonal to reduce redundancy. Then the set of news statements is evaluated from the aspects of correlation and diversity. The evaluation function is composed of several monotonic submodules and a non-submodule function to evaluate the dissimilarity of statements. Finally, a greedy algorithm is designed to generate the final summary. The experimental results on the NLPCC2015 dataset show that this method can effectively improve the quality of automatic news abstracts for Weibo and the score of group is higher than that of other baseline systems.
【作者單位】: 武漢大學(xué)計(jì)算機(jī)學(xué)院;
【基金】:國(guó)家社科重大招標(biāo)計(jì)劃資助項(xiàng)目(11&ZD189) 國(guó)家自然科學(xué)基金面上資助項(xiàng)目(61373108)
【分類號(hào)】:TP391.1

【相似文獻(xiàn)】

相關(guān)碩士學(xué)位論文 前1條

1 高亞奇;基于判別特征回歸的子模優(yōu)化跟蹤算法[D];大連理工大學(xué);2016年

,

本文編號(hào):2240056


本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2240056.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶13b41***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com