天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 醫(yī)學(xué)論文 > 腫瘤論文 >

基于張量分解的癌癥亞型分析算法的研究

發(fā)布時(shí)間:2018-10-30 12:06
【摘要】:通過形態(tài)學(xué)或所屬組織器官命名的癌癥并不準(zhǔn)確,癌癥的臨床治療需要更精確的亞型才能對(duì)癥下藥和靶向治療。通過對(duì)基因芯片數(shù)據(jù)如m RNA、mi RNA、DNA、蛋白質(zhì)等數(shù)據(jù)的分析能發(fā)現(xiàn)和識(shí)別出更準(zhǔn)確的癌癥亞型。整合多源基因組數(shù)據(jù)不僅能夠發(fā)現(xiàn)腫瘤與基因組數(shù)據(jù)的關(guān)系,而且可以發(fā)現(xiàn)各基因數(shù)據(jù)之間對(duì)腫瘤的協(xié)同共作用關(guān)系。綜合考慮不同基因數(shù)據(jù),在不丟失信息的前提下分析不同數(shù)據(jù)相互之間的共享結(jié)構(gòu)是分析癌癥亞型的難點(diǎn)。本文使用多維陣列的張量結(jié)構(gòu)來整合多源基因組數(shù)據(jù),不經(jīng)過中間數(shù)據(jù)轉(zhuǎn)換,保留的原始單一基因數(shù)據(jù)的特有信息,同時(shí)挖掘不同基因數(shù)據(jù)之間的協(xié)同致病模式。本文介紹了張量模型的原理和框架,在基于乳腺癌的基因表達(dá)譜數(shù)據(jù)和DNA甲基化數(shù)據(jù)上構(gòu)建了張量模型,構(gòu)建的方法是對(duì)預(yù)處理的芯片數(shù)據(jù)做差異表達(dá)分析,有明顯差異的基因在張量中置位1或者保留原芯片值。表達(dá)正常或沒有明顯差異的基因則稀疏化為0。這樣基因表達(dá)譜數(shù)據(jù)和甲基化數(shù)據(jù)就整合為一個(gè)三維張量。在現(xiàn)有的CP-ARP分解算法的基礎(chǔ)上,本文針對(duì)基因芯片數(shù)據(jù)高維度小樣本的數(shù)據(jù)特征和基因功能差異表達(dá)和表達(dá)水平正常的兩極化特征,引入了非負(fù)和稀疏性限制條件,優(yōu)化了CP分解模型。改進(jìn)的模型使用基于隨機(jī)梯度下降的ALS優(yōu)化方法,在計(jì)算性能上有所提升。使用改進(jìn)的分解方法在與已經(jīng)驗(yàn)證的乳腺癌五種亞型對(duì)比結(jié)果證明了張量分解模型在癌癥分型應(yīng)用上的有效性。通過對(duì)癌癥分型的結(jié)果分析,驗(yàn)證了Her2這種臨床已證明存在的亞型。從平均輪廓系數(shù)和生存分析等角度證明了算法的性能和所分亞型的有效性。證實(shí)了本文提出的方法在癌癥的分型以及癌癥診斷治療上能提供一定的參考和借鑒。
[Abstract]:Cancer named by morphology or tissue or organ is not accurate. The clinical treatment of cancer requires more precise subtypes in order to get the right medicine and target treatment. More accurate cancer subtypes can be identified by analyzing microarray data such as m RNA,mi RNA,DNA, protein. The integration of multi-source genomic data can not only find the relationship between tumor and genomic data, but also find the synergistic co-action relationship between gene data and tumor. Considering different gene data and analyzing the shared structure of different data without losing information, it is difficult to analyze cancer subtype. In this paper, the Zhang Liang structure of multi-dimensional array is used to integrate the multi-source genomic data, and the unique information of the original single gene data is preserved without intermediate data conversion, and the cooperative pathogenicity patterns among different genetic data are also mined. In this paper, the principle and framework of Zhang Liang model are introduced, and then, on the basis of gene expression profile data and DNA methylation data of breast cancer, Zhang Liang model is constructed. The method is to analyze the differential expression of pre-processed microarray data. There are significant differences in the gene in Zhang Liang to place 1 or to retain the original chip value. Genes that express normal or no significant differences are sparse to 0. In this way, the gene expression profile data and methylation data are integrated into a three-dimensional Zhang Liang. Based on the existing CP-ARP decomposition algorithms, this paper introduces non-negative and sparse constraints for the data characteristics of high-dimensional small samples of gene chip data and the polarimetric characteristics of normal expression and expression level of gene functional differences. The CP decomposition model is optimized. The improved model uses the ALS optimization method based on stochastic gradient descent to improve the computational performance. The application of Zhang Liang decomposition model in cancer classification was proved by using the improved decomposition method in comparison with the five subtypes of breast cancer. Her2, a clinically proven subtype, was verified by analysis of cancer typing results. The performance of the algorithm and the validity of the subtype are proved from the point of view of average contour coefficient and survival analysis. It is confirmed that the proposed method can provide some reference for the classification of cancer and the diagnosis and treatment of cancer.
【學(xué)位授予單位】:哈爾濱工業(yè)大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:R73-3

【參考文獻(xiàn)】

相關(guān)期刊論文 前4條

1 李澤,包雷,黃英武,孫之榮;基于基因表達(dá)譜的腫瘤分型和特征基因選取[J];生物物理學(xué)報(bào);2002年04期

2 田振軍,張志琪,唐量,郭進(jìn),劉健;應(yīng)用cDNA微矩陣基因芯片篩選運(yùn)動(dòng)性心肌肥大相關(guān)基因的初步研究[J];中國(guó)運(yùn)動(dòng)醫(yī)學(xué)雜志;2002年02期

3 何志巍,姚開泰;DNA微陣列(或芯片)技術(shù)原理及應(yīng)用[J];生物化學(xué)與生物物理進(jìn)展;1999年05期

4 王升啟;基因芯片技術(shù)及應(yīng)用研究進(jìn)展[J];生物工程進(jìn)展;1999年04期

相關(guān)博士學(xué)位論文 前1條

1 郭煒煒;基于張量表示的多維信息處理方法研究[D];國(guó)防科學(xué)技術(shù)大學(xué);2014年

相關(guān)碩士學(xué)位論文 前3條

1 詹勇;基于主題模型和混合模型的微博客交叉話題發(fā)現(xiàn)研究[D];西南交通大學(xué);2013年

2 韓斌;基于內(nèi)容的超像素合并及其在圖像分割中的應(yīng)用[D];上海交通大學(xué);2013年

3 李寅;基于張量分解的視覺顯著性算法研究[D];上海交通大學(xué);2011年

,

本文編號(hào):2299963

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/yixuelunwen/zlx/2299963.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶2044e***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com