天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當前位置:主頁 > 科技論文 > 數學論文 >

線性回歸模型中響應值的選取對二分類問題的影響

發(fā)布時間:2018-05-29 04:07

  本文選題:二分類問題 + 線性回歸模型 ; 參考:《華北電力大學(北京)》2016年碩士論文


【摘要】:本文主要在多元線性回歸模型下,研究了不同響應值以及不同的臨界值的選取對兩個總體分類問題的影響。首先我們取判別規(guī)則中的臨界值為響應值的均值及中點,并在這兩種情況下,分別討論了不同響應值的選取對平衡及不平衡數據二分類問題的影響。同時,我們將判別規(guī)則中的臨界值取為響應值的均值,并將響應變量賦值為三組不同的值,這時得到的判別結果與經典判別分析方法如:距離判別法、Bayes判別法對比分析,找到它們之間的聯(lián)系及優(yōu)缺點。此外,我們還使響應值取定,并探討用三種臨界值得到的三種判別規(guī)則對數據分類判別,依據錯判概率最小原則,選出最合適的臨界值。在理論研究的基礎上,我們用r語言以及5-fold Cross-Validation準則,對響應變量分別取三組值,并將臨界值賦值為響應值的均值的三種情況下,對平衡、不平衡模擬數據及真實數據WDBC進行分析,得到了與文章理論相符的模擬結果。另外,我們還對響應變量分別賦為三組不同的值,臨界值分別取0或響應值的均值或響應值的中點的九種情況,將它們所對應的錯判概率進行了程序模擬,得到了與理論證明一致的模擬結果,而且找到了這9種情況之間的聯(lián)系,并選出了使得錯判率較小的臨界值,以便更好地對新的數據分類。
[Abstract]:In this paper, the effects of different response values and different critical values on the two population classification problems are studied under the multivariate linear regression model. First, we take the critical value in the discriminant rule as the mean and the middle point of the response value, and in these two cases, we discuss the influence of the selection of different response values on the two-classification problem of equilibrium and unbalanced data, respectively. At the same time, the critical value in the discriminant rule is taken as the mean value of the response value, and the response variable is assigned to three groups of different values. The result obtained is compared with the classical discriminant analysis method such as the distance discriminant method and Bayes discriminant method. Find out the relationship between them and their advantages and disadvantages. In addition, we also determine the response value, and discuss the classification of data by using the three kinds of critical values, and select the most appropriate critical value according to the principle of minimum misjudgment probability. On the basis of theoretical research, we use r language and 5-fold Cross-Validation criterion to take three sets of values for response variables, and assign the critical value to the three cases of mean value of response value. The unbalanced simulation data and the real data are analyzed by WDBC, and the simulation results are in agreement with the theory of the paper. In addition, the response variables are assigned to three groups of different values, and the critical values are taken as nine cases of the mean value of the response value or the midpoint of the response value, respectively, and the corresponding misjudgment probability is simulated by the program. The simulation results consistent with the theoretical proof are obtained, and the relationship between the nine cases is found, and the critical value which makes the error rate smaller is selected to better classify the new data.
【學位授予單位】:華北電力大學(北京)
【學位級別】:碩士
【學位授予年份】:2016
【分類號】:O212.1

【相似文獻】

相關碩士學位論文 前1條

1 楊巖麗;線性回歸模型中響應值的選取對二分類問題的影響[D];華北電力大學(北京);2016年

,

本文編號:1949424

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/yysx/1949424.html


Copyright(c)文論論文網All Rights Reserved | 網站地圖 |

版權申明:資料由用戶4b566***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com