

Posted: 2016-09-09 09:06

Keywords of this article: Statistical Correlation and K-Means Based Distinguishable Gene Subset Selection Algorithms. Compiled and published by 筆耕文化傳播.


Statistical Correlation and K-Means Based Distinguishable Gene Subset Selection Algorithms


XIE Juan-Ying, GAO Hong-Chao (School of Computer Science, Shaanxi Normal University, Xi'an 710062, China)


Abstract: Selecting a small subset of distinguishable genes from high-dimensional, small-sample cancer gene expression data, where a dataset holds few samples but tens of thousands of genes, is a hard problem. This paper proposes novel hybrid gene selection algorithms based on statistical correlation and K-means. First, the Pearson correlation coefficient and the Wilcoxon rank-sum test are each used to measure the correlation between every gene and the class label; the least relevant genes are filtered out, and roughly 10 percent of the genes, those most strongly correlated with the class label, are kept as the pre-selected gene subset. Next, K-means groups the highly correlated genes of the pre-selected subset into the same cluster, an SVM classifier is trained, and the weight of each gene is computed from the coefficients of that classifier. One representative gene is then chosen from each cluster: either the gene with the largest weight or, under a roulette-wheel strategy, the gene that receives the most votes. The representatives of all clusters form the distinguishable gene subset. To verify their effectiveness, the proposed algorithms are compared with Random, which picks each cluster's representative gene at random, with Guyon's classic gene selection algorithm SVM-RFE, and with SVM-SFS, a previously studied gene selection algorithm that uses sequential forward search. Results averaged over 200 runs on several classic gene expression datasets show that the proposed algorithms select gene subsets with very good discriminative power, and that classifiers built on these subsets achieve very good classification performance.
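The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a binary class label, uses only the Pearson filter, standardizes each gene before K-means so that Euclidean distance tracks correlation, and takes the absolute value of a linear SVM coefficient as the gene weight. The function names, the `keep_fraction` and `n_clusters` parameters, and the synthetic data are all hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC


def pearson_scores(X, y):
    """Absolute Pearson correlation between each gene (column of X) and the label."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    denom[denom == 0] = np.inf  # constant genes get a score of 0
    return np.abs(Xc.T @ yc) / denom


def select_genes(X, y, keep_fraction=0.10, n_clusters=5, seed=0):
    """Filter, cluster, weight, and pick one representative gene per cluster."""
    # Stage 1 (filter): keep the genes most correlated with the class label.
    scores = pearson_scores(X, y.astype(float))
    n_keep = max(n_clusters, int(round(keep_fraction * X.shape[1])))
    preselected = np.argsort(scores)[::-1][:n_keep]

    # Stage 2: cluster the pre-selected genes (one row per gene).
    # Assumption: genes are standardized so similar profiles land together.
    G = X[:, preselected].T
    G = (G - G.mean(axis=1, keepdims=True)) / (G.std(axis=1, keepdims=True) + 1e-12)
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(G)

    # Stage 3: train a linear SVM; assumption: gene weight = |coefficient|.
    svm = SVC(kernel="linear").fit(X[:, preselected], y)
    weights = np.abs(svm.coef_).ravel()

    # Stage 4: the heaviest gene of each cluster represents that cluster.
    chosen = []
    for c in range(n_clusters):
        members = np.flatnonzero(labels == c)
        if members.size:
            chosen.append(preselected[members[np.argmax(weights[members])]])
    return np.array(sorted(chosen))


# Synthetic stand-in for a gene expression matrix: 40 samples, 200 genes.
rng = np.random.default_rng(0)
y = np.array([0] * 20 + [1] * 20)
X = rng.normal(size=(40, 200))
X[:, :10] += 2.0 * y[:, None]  # make the first 10 genes informative
selected = select_genes(X, y, keep_fraction=0.10, n_clusters=5)
print(selected)
```

Swapping the filter for a rank-based one would only change stage 1; for example, `scipy.stats.ranksums` could score each gene by comparing its expression values in the two classes.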

Keywords: distinguishable gene subset selection; Pearson correlation coefficient; Wilcoxon rank-sum test; K-means clustering; statistical correlation; Filter algorithms; Wrapper algorithms
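The roulette-wheel variant mentioned in the abstract, in which each cluster's representative is the gene with the most votes, might look like the sketch below. The abstract does not say how many times the wheel is spun or how ties are broken, so `n_spins`, the seed, and the gene identifiers and weights are assumptions made purely for illustration.

```python
import random
from collections import Counter


def roulette_vote(genes, weights, n_spins=100, seed=None):
    """Spin a weight-proportional roulette wheel n_spins times and
    return the gene that collects the most votes."""
    rng = random.Random(seed)
    # Each spin picks one gene with probability proportional to its weight.
    picks = rng.choices(genes, weights=weights, k=n_spins)
    votes = Counter(picks)
    return votes.most_common(1)[0][0]


cluster_genes = ["g17", "g42", "g88"]  # hypothetical gene identifiers
cluster_weights = [0.2, 1.5, 0.4]      # hypothetical SVM-derived weights
representative = roulette_vote(cluster_genes, cluster_weights, n_spins=100, seed=0)
print(representative)
```

Compared with always taking the heaviest gene, voting leaves a small chance for a lower-weight gene to be chosen, and that chance shrinks as the number of spins grows.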

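Funding: National Natural Science Foundation of China (31372250); Fundamental Research Funds for the Central Universities (GK201102007); Shaanxi Provincial Science and Technology Research Project (2013K12-03-24)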

 

 





Article ID: 112108


Article link: http://sikaile.net/kejilunwen/jiyingongcheng/112108.html

