基于內(nèi)容的農(nóng)業(yè)網(wǎng)絡(luò)信息可信度評估方法研究
[Abstract]:With the popularization of network technology, information technology has been developed rapidly, and agriculture is gradually realizing agricultural informatization in the process of social informatization. The main body of agriculture is farmers. In the service of agricultural information, it is impossible for farmers to judge the true reliability of all kinds of information in the network because of the problems of low level of knowledge and culture and weak economic ability. In view of these problems in the process of agricultural information service, this paper studies how to evaluate the credibility of agricultural network information. The main work includes: (1) aiming at the problem that the traditional TF-IDF topic extraction method does not consider the location of words on the web page, a TF-IDF method based on word position weight is proposed to extract agricultural web information. The experimental results show that the proposed method is more accurate than the traditional TF-IDF method, and the extraction effect is ideal. (2) aiming at the problem that the search engine does not consider its credibility in the stage of obtaining candidate web pages, a content-based method for evaluating the credibility of agricultural network information is proposed. This paper mainly constructs an index system with four levels of credibility evaluation index: the first layer judges the authority of the web page, aiming at the problem that there is no authoritative classification and quantification standard of the web page at present, we define a weighting table of the authority degree of the website. It has a good effect on differentiating the authority of different web pages. The second layer judges the timeliness of the web page, and puts forward a new method to establish the specific time attenuation function by the date of the publication of the network information content, which can better reflect the influence of the timeliness on the credibility of the agricultural network information. The third layer judges the relevance of the web page, and generates the word frequency vector of each candidate page by introducing the VSM model, and calculates the correlation degree between the content of the candidate page and the keyword. The fourth layer judges the influence of the web page and introduces the, Page View value and Time on Page value of the website PR value in combination with the two aspects of the web page link and user behavior, which can well quantify the size of the influence of the web page. (3) different topics are set to reflect the relationship between the number of query words and the relevance of the topic. The results show that the average value of the topic relevance of the candidate pages is 77.4, and the result is the best; (4) search engine natural sort, lack of correlation index sort and content-based evaluation method are established respectively to verify the credibility of candidate web pages. The distribution of reliability value of natural ranking is large, and the ranking of lack of correlation index ranks some information which is independent of subject content in the front position. The ranking of the methods in this paper filters out the highly reliable web pages related to the subject content and can be provided to the users first. It shows that the evaluation method based on the content in this paper is effective and practical in evaluating the credibility of agricultural web information.
【學(xué)位授予單位】:湖南農(nóng)業(yè)大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2015
【分類號】:S126
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 冀俊忠;張玲玲;吳晨生;吳金源;;基于知識語義權(quán)重特征的樸素貝葉斯情感分類算法[J];北京工業(yè)大學(xué)學(xué)報(bào);2014年12期
2 胡堰;彭啟民;胡曉惠;;一種基于隱語義概率模型的個性化Web服務(wù)推薦方法[J];計(jì)算機(jī)研究與發(fā)展;2014年08期
3 徐靜;楊小平;柳增;;基于內(nèi)容信任的Web信息可信度驗(yàn)證方法研究[J];北京理工大學(xué)學(xué)報(bào);2014年07期
4 楊博;陳賀昌;朱冠宇;趙學(xué)華;;基于超鏈接多樣性分析的新型網(wǎng)頁排名算法[J];計(jì)算機(jī)學(xué)報(bào);2014年04期
5 卓志宏;;一種基于語義信息的主題相關(guān)性判別模型[J];計(jì)算機(jī)與現(xiàn)代化;2013年09期
6 馬海波;楊楠;于新興;;用戶差別化和主題敏感的PageRank算法[J];大連交通大學(xué)學(xué)報(bào);2013年04期
7 黃f^;俞建家;;基于分類排名的網(wǎng)站可信度分析[J];福州大學(xué)學(xué)報(bào)(自然科學(xué)版);2013年01期
8 丁世飛;齊丙娟;譚紅艷;;支持向量機(jī)理論與算法研究綜述[J];電子科技大學(xué)學(xué)報(bào);2011年01期
9 艾靜;王仲遠(yuǎn);孟小峰;;C-Rank:一種Deep Web數(shù)據(jù)記錄可信度評估方法[J];計(jì)算機(jī)科學(xué)與探索;2009年06期
10 鞠時光;呂霞;王];;基于時間鏈接分析的頁面排序優(yōu)化算法[J];計(jì)算機(jī)應(yīng)用研究;2009年07期
,本文編號:2382007
本文鏈接:http://sikaile.net/kejilunwen/nykj/2382007.html