天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于拓撲二級結(jié)構(gòu)和閱讀框識別不同細胞器基因組的非編碼RNA

發(fā)布時間:2018-01-09 22:04

  本文關(guān)鍵詞:基于拓撲二級結(jié)構(gòu)和閱讀框識別不同細胞器基因組的非編碼RNA 出處:《內(nèi)蒙古大學》2016年博士論文 論文類型:學位論文


  更多相關(guān)文章: 細胞基因組非編碼RNA 開放閱讀框 拓撲二級結(jié)構(gòu) microRNA基因簇 功能和通路富集分析


【摘要】:隨著功能基因組學的飛速發(fā)展,對非編碼轉(zhuǎn)錄產(chǎn)物功能的研究引起越來越多人的關(guān)注。人類基因組中超過98%的序列為非蛋白質(zhì)編碼DNA,這些非編碼DNA序列大部分會轉(zhuǎn)錄成RNA,并且直接以RNA的形式發(fā)揮功能,參與基因的轉(zhuǎn)錄調(diào)控、蛋白質(zhì)的翻譯等重要的生物學功能。而且近年來人們發(fā)現(xiàn)非編碼RNA往往與疾病、DNA損傷修復(fù)、植物的應(yīng)激反應(yīng)有關(guān)。隨著RNA數(shù)據(jù)的積累,細胞核基因組編碼的短鏈或長鏈非編碼RNA(Non-coding RNA,ncRNA)被確認對線粒體的功能以及線粒體動力學有一定的調(diào)節(jié)作用。許多ncRNAs一方面調(diào)控與細胞器功能相關(guān)的核基因,同時也與線粒體的形態(tài)、新陳代謝,線粒體白噬以及與線粒體有關(guān)的凋亡有關(guān)。然而,人們對遺傳信息在不同細胞器間以ncRNAs形式傳遞的了解長期以來仍然有限。因此,理解細胞器之間的ncRNAs信息傳遞顯得非常重要。隨著ncRNA研究在細胞器基因組水平的日益深入,發(fā)現(xiàn)對不同細胞器基因組轉(zhuǎn)錄的ncRNA的識別有助于進一步了解不同細胞器基因組ncRNA的功能。本研究在細胞器基因組水平對不同細胞器基因組轉(zhuǎn)錄的ncRNAs的注釋問題進行了系統(tǒng)的研究,包括構(gòu)建細胞器基因組水平不同定位信息的非編碼RNAs數(shù)據(jù)集、結(jié)合ncRNA序列和結(jié)構(gòu)特征提取有效的特征參數(shù)并對參數(shù)進行優(yōu)化、預(yù)測算法的建立以及算法的推廣等。積累的組學數(shù)據(jù)所呈現(xiàn)的基因水平的復(fù)雜性很難從蛋白質(zhì)編碼基因數(shù)量的角度加以解釋,為此,有人認為長期以來被認為是垃圾的、缺乏蛋白質(zhì)編碼能力的ncRNA的調(diào)節(jié)作用可以解釋這種現(xiàn)象。其中,微小RNAs (microRNAs, miRNAs)和小干涉RNAs (siRNAs)一致被認為在生物的調(diào)控功能中扮演重要角色。本論文最后,以microRNA為例,研究非編碼RNA與靶基因?qū)θ橄侔┑陌l(fā)生、發(fā)展的調(diào)控作用。考慮到多個microRNA對靶基因存在協(xié)同調(diào)控作用,我們選取在乳腺癌中起抑癌作用的miR-17-92基因簇及其2個旁系同源基因簇轉(zhuǎn)錄的1 5種microRNA序列及其共調(diào)控靶基因作為研究對象,對基因簇轉(zhuǎn)錄的microRNA序列特征及其共調(diào)控靶基因在乳腺正常組織和乳腺癌組織中的調(diào)控作用進行研究。論文主要的研究內(nèi)容如下:一、我們首次從NONCOGING v3.0數(shù)據(jù)庫中收集并整理出有細胞器基因組注釋的ncRNA序列,并對序列長度分布進行分析?紤]到序列相似性對預(yù)測的影響,進一步采用Cd-hit軟件構(gòu)建了序列相似性在80%以下的數(shù)據(jù)集ncRNA_361 dataset。從最簡單的堿基物理化學特性出發(fā),討論不同細胞器基因組轉(zhuǎn)錄的ncRNA序列的理化特性。在此基礎(chǔ)上進一步考慮閱讀框下的n-mer組分偏好,結(jié)構(gòu)-序列模式下三聯(lián)體組分,以及簡并密碼子偏好。通過深入探討無閱讀框與閱讀框?qū)ψR別不同細胞器基因組轉(zhuǎn)錄的ncRNA序列的影響,發(fā)現(xiàn)最優(yōu)閱讀框為第一閱讀框。二、考慮到ncRNA的結(jié)構(gòu)信息更能反映ncRNA執(zhí)行功能時的空間構(gòu)象,而保守模體反映了序列在長期進化過程中的壓力。我們首次提取ncRNA序列的拓撲二級結(jié)構(gòu)特征和保守模體作為在細胞器基因組水平識別ncRNA的特征參量。特征的融合不可避免會帶來維數(shù)的增加,我們結(jié)合前人的經(jīng)驗,提出兩種不同的降維方法:一是特征的降維映射,二是基于mRMR的增加特征選擇(IFS)的方法,即選取最優(yōu)特征子集。結(jié)合目前較為流行的離散增量算法(increment of diversity classifier, ID)、K緊鄰算法(K-nearest neighbor classifier, KNN)以及支持向量機算法(support vector machine, SVM)提出多算法的融合:改進的離散量結(jié)合K緊鄰算法(the improved K-minimum increment of diversity classifier, iK-MID)、高效的平均K緊鄰算法(the improved K-nearest neighbor classifier, iKNN)以及離散增量結(jié)合支持向量機算法(the increment of diversity combining support vector machine,ID-SVM)。最后,通過不同算法之間的相互比較,探索更加有效的細胞器基因組ncRNA識別的理論模型。三、應(yīng)用生物信息學手段,對特定miRNA基因簇(hsa-miR-17-92基因簇)及其旁系同源基因簇轉(zhuǎn)錄的miRNA序列特征及其共調(diào)控靶基因在乳腺不同組織的表達水平進行了研究,并利用反饋機制簡單解釋該miRNA對下游基因的調(diào)控機制,為生物學實驗提供了有一定研究意義和價值的線索。
[Abstract]:With the rapid development of functional genomics, research on non encoding transcription function has attracted more and more attention. More than 98% of the sequence in the human genome is a non protein encoding DNA, these non DNA sequences encoding most transcribed into RNA, and directly in the form of RNA function, participate in gene transcription regulation, biological function protein translation and so on. But in recent years it was found that non encoding RNA are often associated with the disease, DNA damage repair, the stress response of plants. With the RNA data, the nuclear genome encoding short or long chain non encoding RNA (Non-coding RNA ncRNA) was identified on mitochondrial function and regulation of mitochondrial dynamics effect of regulation of nuclear gene related to cell function of many ncRNAs on the one hand, and the morphology of mitochondria, mitochondrial autophagy and The new supersedes the old., and line Particles related to apoptosis. However, the people of the genetic information in the form of ncRNAs in different organelles of understanding between a long time is still limited. Therefore, understanding the organelle ncRNAs information transmission is very important. With the increasingly ncRNA research in organelle genome level further, found that the identification of different organelle genome transcription the ncRNA helps to further understand the different organelle genomes ncRNA function. This study makes a systematic research on the organelle genome level annotation on different organelle genome transcription of ncRNAs, encoding non RNAs data including the construction of organelle genomes with different levels of positioning information set, combined with ncRNA sequence and structure feature extraction the effective feature parameters and optimizes the parameters prediction algorithm is established and the algorithm for the promotion. The data of gene water accumulation group Flat complexity is very difficult from the amount of protein encoding genes explain, therefore, some people think that has long been regarded as waste, lack of regulation of protein encoding ability of ncRNA can explain this phenomenon. Among them, the small RNAs (microRNAs, miRNAs) and small interference RNAs (siRNAs) was found to play an important role in the regulation of biological function. At the end of the thesis, taking microRNA as an example, study the non occurrence of encoding RNA and target gene for breast cancer, regulation of development. Considering the multiple microRNA synergistic regulation of target genes, we selected in breast cancer within 15 microRNA sequence of miR-17-92 gene cluster of tumor suppressor role and 2 paralogous gene clusters and co transcriptional regulation of target genes as the research object, the sequence characteristics of microRNA gene clusters and co transcriptional regulation of target genes in normal tissues and breast Regulation of breast cancer tissues were studied. The main research contents of this paper are as follows: first, we for the first time from the NONCOGING V3.0 database to collect and sort out the ncRNA sequence of organelle genome annotation, and the sequence length distribution were analyzed. Considering the influence of sequence similarity to predict, further uses the Cd-hit software to build the sequence the similarity in the 80% following data sets ncRNA_361 dataset. base starting from the physical and chemical characteristics of the most simple, to discuss the physicochemical properties of different ncRNA sequences of organelle genome transcription. On the basis of further consideration of reading n-mer group box the preference structure sequence mode three CIS components, as well as degenerate codon preference No. Through discussing the reading frame and reading frame influence ncRNA sequence recognition of different organelle genome transcription, find the optimal reading frame for the first reading frame two. And considering the structure information of ncRNA could reflect the spatial conformation of ncRNA when executing the function, and the conserved motif sequence reflects the pressure in the long process of evolution. We first extract two features of ncRNA sequences and topological motifs as organelles in the genome level general characteristic parameters of ncRNA. Don't increase the feature integration will inevitably bring dimension, we combined with previous experience, put forward two kinds of different reduction methods: one is the reduction characteristics of the two is to increase the feature selection based on mRMR (IFS) method, which selects the optimal feature subset. Combined with the discrete incremental algorithm popular (increment of diversity classifier. ID K (K-nearest), close to the neighbor classifier KNN algorithm) and the support vector machine algorithm (support vector machine, SVM) proposed fusion algorithm: discrete combined with improved K close to the count (the improved K-minimum increment of by diversity classifier, iK-MID), the average K efficient algorithm (the improved K-nearest neighbor near classifier, iKNN) and combined with the discrete incremental support vector machine algorithm (the increment of diversity combining support vector machine, ID-SVM). Finally, through the comparison between different methods, explore the theoretical model of cell genomic ncRNA recognition more effectively. Three, application of bioinformatics to specific miRNA gene cluster (hsa-miR-17-92 cluster) miRNA sequence characteristics and paralogous gene cluster and its transcriptional co regulation the expression level of target genes in different tissues of the breast were studied, and the feedback mechanism of the regulation mechanism of miRNA simple explanation on the downstream gene, provides certain research significance and valuable clues for biological experiments.

【學位授予單位】:內(nèi)蒙古大學
【學位級別】:博士
【學位授予年份】:2016
【分類號】:Q811.4

【參考文獻】

相關(guān)期刊論文 前10條

1 武成艷;李前忠;陳穎麗;;不同細胞器基因組轉(zhuǎn)錄的ncRNA的序列特征分析和識別[J];內(nèi)蒙古大學學報(自然科學版);2015年05期

2 孟琳;王天一;李曉曦;馬萍;;MicroRNA在乳腺癌中作用的研究進展[J];現(xiàn)代腫瘤醫(yī)學;2015年09期

3 李潔;秦性良;邵寧生;;MicroRNA及其靶基因的時空特異性與動態(tài)變化[J];生物化學與生物物理進展;2013年07期

4 葉靜;李明華;龍霞;萬匯涓;范昭;;乳腺癌MCF-7細胞的分子生物學特征[J];實用癌癥雜志;2012年02期

5 祁磊;苗俊英;;長非編碼RNA[J];生命的化學;2011年03期

6 陳潤生;;關(guān)于非編碼RNA研究的一些思考[J];生命科學;2010年07期

7 于紅;;表觀遺傳學:生物細胞非編碼RNA調(diào)控的研究進展[J];遺傳;2009年11期

8 崔彬;李娜;寧長申;張龍現(xiàn);菅復(fù)春;;基因序列在原蟲分子系統(tǒng)學中的應(yīng)用[J];中國病原生物學雜志;2008年03期

9 陳潤生;;與生物信息學相關(guān)的兩個前沿方向——非編碼基因和復(fù)雜生物網(wǎng)絡(luò)[J];生物物理學報;2007年04期

10 陳龍;李俐俐;;非編碼RNA及其功用[J];生物學教學;2007年07期



本文編號:1402682

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/shoufeilunwen/jckxbs/1402682.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶e5b4b***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com
国产亚洲欧美自拍中文自拍| 五月婷婷欧美中文字幕| 91欧美一区二区三区成人| 亚洲中文字幕熟女丝袜久久| 国产精品流白浆无遮挡| 麻豆在线观看一区二区| 免费大片黄在线观看日本| 不卡中文字幕在线免费看| 欧美丰满大屁股一区二区三区| 亚洲一区二区三区四区性色av| 一区二区在线激情视频| 成人精品国产亚洲av久久| 国产av精品高清一区二区三区| 国产精品午夜福利免费在线| 老司机激情五月天在线不卡 | 国产精品日韩精品一区| 国产中文字幕一二三区| 欧美三级精品在线观看| 亚洲熟妇中文字幕五十路| 日韩蜜桃一区二区三区| 色综合久久六月婷婷中文字幕| 美女极度色诱视频在线观看| 两性色午夜天堂免费视频| 中文字幕佐山爱一区二区免费| 东北女人的逼操的舒服吗| 熟妇久久人妻中文字幕| 国产成人精品一区在线观看| 欧美多人疯狂性战派对| 爱草草在线观看免费视频| 亚洲精品黄色片中文字幕 | 欧美黑人在线一区二区| 丰满熟女少妇一区二区三区| 午夜福利黄片免费观看| 国产成人综合亚洲欧美日韩| 日韩欧美在线看一卡一卡| 尹人大香蕉一级片免费看| 国产一区在线免费国产一区| 国产精品推荐在线一区| 亚洲香艳网久久五月婷婷| 东北老熟妇全程露脸被内射| 91精品视频免费播放|