基于情感分類的產(chǎn)品評(píng)論垂直搜索引擎的研究
[Abstract]:With the continuous development of Internet technology and the rising of e-commerce, BBSs, blogs, Weibo constantly emerge, the online interaction between merchants and buyers is becoming more and more frequent. More and more buyers post product reviews on the Internet after using the products, the number of comments is increasing, the comments themselves are more colloquial and unstructured. It is time-consuming and laborious for potential buyers to pick out the information they care about from a large number of product reviews when they make decisions on the supply and demand relationship in the market, and it is one-sided and lagging. So search engines play an important role in the Internet today. Powerful search engines like Baidu and Google are aimed at different fields and different kinds of general search engines. In a particular area of product review, however, appears to be inadequate. Therefore, it is necessary to research and develop a vertical search engine with emotion classification for product reviews. Based on the current research situation at home and abroad, this paper makes a further study on the identification of evaluation objects, the identification of evaluation phrases, the collocation identification between evaluation objects and evaluation phrases, and the emotional orientation of evaluation phrases in Chinese product review texts. The main work is as follows: (1) the candidate set of evaluation object is obtained by using part of speech sequence in the method of identifying evaluation object, and the concept and algorithm of integrity and stability of evaluation object are put forward to filter the noise of evaluation object. Using the cooccurrence rule of evaluation object and evaluation phrase and the frequency of the evaluation object appearing in the whole comment text or the whole corpus, the confidence degree of the evaluation object is sorted, and the evaluation object is extracted. (2) the conjunctive dictionary is selected. The dictionary of affective words, the dictionary of degree words and the dictionary of negative words are perfected to identify the evaluation phrases and analyze the affective tendency of the evaluation phrases. Through the eight features of the relationship between the evaluation object and the evaluation phrase, support vector machine is used to identify the collocation relationship between the evaluation object and the evaluation phrase. Finally, the emotional tendency of the whole review text is judged. (3) based on the emotional tendency of the Chinese product review text, a vertical search engine is constructed by using the popular SSH framework MySQL database and open source software package Lucene. Users can easily and quickly query their own interested information. A vertical search engine with emotion classification is constructed through the above research, which enables merchants and potential customers to quickly and accurately find useful information for themselves from a vast number of review articles, which has certain commercial value. The research method of emotion classification of Chinese text has certain academic value.
【學(xué)位授予單位】:湖南工業(yè)大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP391.3
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 顧鵬堯;;讓搜索引擎更好地服務(wù)于教育教學(xué)[J];科學(xué)24小時(shí);2003年Z1期
2 陳新顏;垂直搜索引擎辨析[J];現(xiàn)代情報(bào);2004年09期
3 胡文勝;;垂直搜索助號(hào)碼百事通與商務(wù)領(lǐng)航[J];每周電腦報(bào);2006年32期
4 胡潔;丁寧;關(guān)靜;曹福年;張磊;;基于“PUBMED+PDF”的醫(yī)學(xué)垂直搜索引擎的實(shí)踐[J];信息系統(tǒng)工程;2009年05期
5 一林;;垂直搜索:前進(jìn)路上的喜與憂[J];互聯(lián)網(wǎng)天地;2010年02期
6 牟思;;基于垂直搜索引擎的學(xué)校網(wǎng)站的研究與建設(shè)[J];中國(guó)教育技術(shù)裝備;2011年21期
7 田野;垂直搜索火熱為哪般[J];中國(guó)計(jì)算機(jī)用戶;2005年37期
8 胡文勝;;垂直搜索助號(hào)碼百事通與商務(wù)領(lǐng)航[J];每周電腦報(bào);2006年31期
9 邊凱;;你會(huì)搜索嗎?[J];中國(guó)計(jì)算機(jī)用戶;2007年23期
10 宿建光;;指點(diǎn)通:移動(dòng)垂直搜索的創(chuàng)新者[J];通信世界;2007年03期
相關(guān)會(huì)議論文 前10條
1 王上;于海;王鉦旋;;Deep Web垂直搜索引擎設(shè)計(jì)與實(shí)現(xiàn)[A];第26屆中國(guó)數(shù)據(jù)庫(kù)學(xué)術(shù)會(huì)議論文集(B輯)[C];2009年
2 林歡歡;王文杰;史忠植;;移動(dòng)環(huán)境下垂直搜索引擎[A];第三屆全國(guó)信息檢索與內(nèi)容安全學(xué)術(shù)會(huì)議論文集[C];2007年
3 王旭;杜軍平;;質(zhì)檢總局互聯(lián)網(wǎng)輿情監(jiān)控系統(tǒng)中聚焦爬蟲的研究[A];中國(guó)電子學(xué)會(huì)第十七屆信息論學(xué)術(shù)年會(huì)論文集[C];2010年
4 趙[
本文編號(hào):2166049
本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2166049.html