Research on a Classifier Selection Ensemble Method Based on the Confusion Matrix

Published: 2018-04-29 23:12

Topics: multiple classifier system + selective ensemble; Source: Henan Polytechnic University, master's thesis, 2016

[Abstract]: Ensemble learning is an important research direction in machine learning. It improves classification performance by training multiple individual classifiers and combining them into a multiple classifier system. However, as computing technology advances and data volumes grow, more and more classifiers take part in the ensemble. On the one hand, the computational cost rises rapidly; on the other hand, the diversity among the classifiers shrinks, which hurts ensemble accuracy, whereas an effective ensemble system needs member classifiers that are both accurate and diverse. Studies have shown that building the ensemble from a subset of the trained base classifiers may work better than using all of them. Selecting highly diverse classifiers from a large pool of base classifiers as representatives has therefore become a research trend in ensemble learning and deserves further study. This thesis first introduces the background and significance of multi-classifier ensembles and summarizes the state of ensemble learning research in China and abroad. It then introduces the concept of ensemble learning and two classical ensemble algorithms, Bagging and Boosting, and lists six combination rules, including the product rule and the sum rule. Next, it introduces the concept of diversity measures and the commonly used measurement formulas. Finally, it proposes a new selective method for multiple classifiers: the confusion matrices of all base classifiers are constructed and used as the data objects of a clustering algorithm; based on how the samples are distributed across the clusters, a certain number of classifiers are selected as representatives to form a new set of classifiers to be combined; and the method is then applied to the Bagging training process. To verify the feasibility of the method, experiments were carried out on UCI datasets. The results obtained by applying the method within Bagging training were compared with those of the original Bagging algorithm, and the comparison shows that the method can effectively improve the accuracy of the ensemble system. Different combination rules were also used for the ensemble, and the results were analyzed.
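The selection step described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation. The abstract does not state which clustering algorithm is used or how representatives are picked from each cluster, so the sketch assumes plain k-means over row-normalised, flattened confusion matrices and keeps the most accurate member of each cluster. All function names (`confusion_vector`, `kmeans`, `select_representatives`, `majority_vote`) and the toy data are hypothetical. The base classifiers' predictions on a validation set are taken as given, for example from Bagging.

```python
from collections import Counter


def confusion_vector(y_true, y_pred, labels):
    """Flatten a row-normalised confusion matrix (rows: true, cols: predicted)."""
    index = {lab: i for i, lab in enumerate(labels)}
    k = len(labels)
    counts = [[0] * k for _ in range(k)]
    for t, p in zip(y_true, y_pred):
        counts[index[t]][index[p]] += 1
    vec = []
    for row in counts:
        total = sum(row) or 1  # guard against classes absent from y_true
        vec.extend(c / total for c in row)
    return vec


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=20):
    """Plain k-means; farthest-point initialisation keeps it deterministic."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    assign = None
    for _ in range(iters):
        new_assign = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points]
        if new_assign == assign:
            break  # assignments stopped changing
        assign = new_assign
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def select_representatives(y_true, predictions, labels, n_clusters):
    """Cluster classifiers by confusion matrix; return one index per cluster."""
    vectors = [confusion_vector(y_true, p, labels) for p in predictions]
    assign = kmeans(vectors, n_clusters)
    chosen = []
    for j in range(n_clusters):
        members = [i for i, a in enumerate(assign) if a == j]
        if members:
            # Assumed rule (not from the thesis): keep the most accurate member.
            chosen.append(max(members, key=lambda i: accuracy(y_true, predictions[i])))
    return sorted(chosen)


def majority_vote(predictions):
    """Combine per-classifier label lists by plurality vote on each sample."""
    return [Counter(col).most_common(1)[0][0] for col in zip(*predictions)]


# Toy validation labels and the predictions of four hypothetical base classifiers.
y_val = [0, 0, 0, 1, 1, 1]
preds = [
    [0, 0, 0, 1, 1, 0],  # confuses class 1 with class 0 once
    [0, 0, 0, 1, 0, 0],  # confuses class 1 with class 0 twice
    [0, 0, 1, 1, 1, 1],  # confuses class 0 with class 1 once
    [0, 1, 1, 1, 1, 1],  # confuses class 0 with class 1 twice
]
selected = select_representatives(y_val, preds, labels=[0, 1], n_clusters=2)
print(selected)  # [0, 2]
```

Classifiers that make the same kind of mistake end up in the same cluster, so keeping one member per cluster preserves diversity while shrinking the ensemble. The selected classifiers' outputs can then be combined with `majority_vote` or with any of the other combination rules the thesis lists.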
[Degree-granting institution]: Henan Polytechnic University
[Degree level]: Master's
[Year degree conferred]: 2016
[Classification number]: TP181;TP311.13

