基于質(zhì)譜血清多肽組譜圖的管理分析系統(tǒng)構(gòu)建與應(yīng)用研究
[Abstract]:In the post-genome era, great progress has been made in basic research and clinical application of proteomics with the completion of human and other model organism genome sequencing and important breakthroughs in mass spectrometry instruments and methods. Clinical proteomics involves a variety of data types. Serum polypeptide profiles (hemopeptide profiles) are one of the most important, and are based on non-gel systems in clinical proteomics applications. The basic principles of these proteomics are universal. Detection of blood by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS) or surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF/MS) The exact mass of the polypeptide group in the serum is then processed using bioinformatics. By comparing the differences between the peptide maps of disease and healthy controls, one can discover disease-specific proteins or polypeptides, and thus help to study the pathogenesis of disease at the protein level.
Hemopeptide mapping has broad application prospects in the fields of biomarker discovery, early diagnosis and individualized treatment. However, the following factors must be taken into account in the application of hemopeptide mapping in clinical research. First, the influence of sample selection on hemopeptide mapping is important for the patients and patients who need to be collected by clinical research institutes. Normal control group samples should take into account individual differences and individual differences. Individual differences between normal control groups include age, sex, race, family history and disease history. Disease patients'samples should contain complete disease subtypes, and the collected information should be as complete as possible to meet the needs of building mathematical models and validation. Secondly, the impact of sample collection on hemopeptide mapping technology is pre-analysis differences, including sample collection, storage and transportation due to environmental differences in the impact of samples, as these differences are generally not related to disease, may increase the complexity of finding disease-related differences in proteins or peptides, and ultimately affect blood Finally, the influence of instrumental analysis on hemopeptide mapping technology is discussed. MALDI-TOF/MS and SELDI-TOF/MS are the main mass spectrometers needed for hemopeptide mapping technology. Due to various factors in the process of mass spectrometry experiment, the original spectrum data produced by mass spectrometry contains a large number of noise signals, which must be pre-processed to remove. Interference.
In view of the characteristics of large number of variables and samples in hemopeptide map, facing such complex data, only through bioinformatics method can we identify a group of peptide peaks closely related to disease and discover the characteristic information related to disease in hemopeptide map. However, the existing data management and analysis tools can not meet the needs of the disease. For this reason, we combine clinical proteomics with bioinformatics to develop a management and analysis system based on mass spectrometry serum peptide profiles, BioSunMS. This system is based on ECLIPSE plug-in architecture, using JAVA. Language development has the characteristics of easy release and secondary development, friendly interface, cross-system platform, easy to manage clinical samples, mass spectrogram and mass spectrogram pretreatment and modeling analysis, so as to facilitate the relevant researchers to carry out disease classification and typing research conveniently and quickly. Finally, we based on lung cancer patients peptide. The sample classification and typing research of the graph illustrate the function of BioSunMS as an example.
1. blood peptide map database construction
Serum peptide profiles, samples and clinical information of normal persons and patients with various tumors (including lung cancer, liver cancer, breast cancer, rectal cancer, prostate cancer, leukemia, etc.) are stored in the hemopeptidase database. The database mainly contains sample sources, diagnostic methods, sample processing, mass spectrometry detection methods, and serum peptide mass spectrometry numbers. The database mainly provides the following important functions: serum peptide map inquiry, through the system, users can obtain the marker spectrum peaks of specific tumors and corresponding peptide sequences; various diseases blood peptide map data submission, through this system, researchers can collect disease blood peptide map data in their laboratory, Submitted to this database, thus enriching the types of diseases in the database; analysis of the disease information of the blood peptide map, the detection personnel will be directly obtained by the clinical blood peptide map query through this database, thus obtaining disease-related information.
Software development of data processing and analysis of blood peptide map 2.
In order to rapidly and accurately carry out tumor classification and typing research based on hemopeptide map data, a data processing and analysis module of hemopeptide map was developed. After statistical analysis of the processed data, the characteristic peaks can be found, the blood peptide map model can be established, and the blind samples can be discriminated.
3. tumor classification and typing based on blood peptide map data
With support vector machine (SVM), principal component analysis (PCA), genetic algorithm (GA), Na? Ve Bayes, partial least squares (PLS) and other commonly used statistical and machine learning methods as tools, tumor classification and typing module based on blood peptide map data was constructed, and model parameters were provided. The optimization function is convenient for relevant personnel to carry out tumor classification and typing research.
Establishment of 4. tumor characteristic blood peptide map model
The study was carried out in collaboration with the National Center for Instrumental Analysis (NIAA). In the previous work, NIAA has completed the collection of high resolution mass spectrometry (HRMS) data from 1000 healthy people and 2000 patients with lung cancer, liver cancer, breast cancer, rectal cancer, prostate cancer and leukemia. The blood peptide maps of 254 lung cancer patients and 257 normal controls were analyzed in the database. Firstly, we constructed the training set from the blood peptide maps of 150 lung cancer patients and 150 control samples. The rest 104 lung cancer patients and 107 normal control samples were used to construct the test set. Seventy-four characteristic peaks were screened out according to the standard of P 0.005. Based on these variables, we constructed the classification model of lung cancer hemopeptide map by SVM and validated it by test set. For test set, the accuracy, sensitivity and specificity of classification were 92.3%, 96.3% and 94.3% respectively. Based on the information of spectral peaks, an early diagnosis model of lung cancer based on mass spectrometric hemopeptide map was constructed, and the early diagnosis of lung cancer was preliminarily explored.
To sum up, a software named BioSunMS, which integrates the database management and analysis of serum peptide profiles of mass spectrometry, was constructed. The system was used to analyze the data of lung cancer hemopeptide profiles, and the early diagnosis model of lung cancer hemopeptide profiles was constructed, which provided bioinformatics support for the related research based on mass spectrometry hemopeptide profiles.
【學(xué)位授予單位】:中國人民解放軍軍事醫(yī)學(xué)科學(xué)院
【學(xué)位級(jí)別】:博士
【學(xué)位授予年份】:2009
【分類號(hào)】:R346
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 李向陽;張嘉保;何永聚;王景龍;;CD4~+T細(xì)胞表位預(yù)測及其應(yīng)用[J];安徽農(nóng)業(yè)科學(xué);2011年17期
2 毛向明;邢榮威;景曉瑋;周其趙;余慶鋒;郭文彬;武小強(qiáng);褚慶軍;馮春瓊;;弱精子癥相關(guān)基因的生物信息學(xué)研究[J];中華男科學(xué)雜志;2011年08期
3 沈霞;譚亞芳;劉清;;金銀花中綠原酸及其異構(gòu)體三維結(jié)構(gòu)的生物信息學(xué)研究[J];陜西中醫(yī);2011年07期
4 孫紅;殷作群;孫妍;丁瑜;;生物信息學(xué)在醫(yī)藥學(xué)領(lǐng)域中的應(yīng)用[J];醫(yī)學(xué)信息(上旬刊);2011年09期
5 丁克祥;董萍;韓晉云;楊永鵬;丁宇;丁振華;;神經(jīng)肽Y及其受體的生物信息學(xué)和醫(yī)學(xué)生理學(xué)的研究[J];國際老年醫(yī)學(xué)雜志;2010年03期
6 付芹芹;荊春霞;楊光;郭志云;孫小會(huì);王穗湘;李月琴;周天鴻;;微小隱孢子蟲腺苷酸激酶基因克隆及分析[J];中國公共衛(wèi)生;2011年07期
7 李江域;趙東升;王玉民;;GPU計(jì)算及其在生物醫(yī)學(xué)研究中的應(yīng)用[J];軍事醫(yī)學(xué);2011年08期
8 鄭輝;黃志剛;聞人慶;李洪義;;眼皮膚白化病患者酪氨酸酶基因突變的研究[J];中國應(yīng)用生理學(xué)雜志;2011年03期
9 朱文楠;習(xí)楊;呂湘;劉德培;;人類miRNA上游轉(zhuǎn)錄因子及下游靶基因的基因本體分析[J];中國微生態(tài)學(xué)雜志;2011年08期
10 崔穎;王芳;蘇建忠;劉洪波;張巖;史慶春;;醫(yī)學(xué)院校生物信息學(xué)專業(yè)《數(shù)據(jù)庫原理與技術(shù)》教學(xué)方法研究與實(shí)踐[J];數(shù)理醫(yī)藥學(xué)雜志;2011年04期
相關(guān)會(huì)議論文 前10條
1 李媛;崔尚金;李建偉;于康震;;分子生態(tài)學(xué)與生物信息學(xué)[A];中國畜牧獸醫(yī)學(xué)會(huì)禽病學(xué)分會(huì)第十一次學(xué)術(shù)研討會(huì)論文集[C];2002年
2 陸文聰;鈕冰;;基于數(shù)據(jù)挖掘的生物信息學(xué)研究進(jìn)展[A];中國化學(xué)會(huì)第27屆學(xué)術(shù)年會(huì)第15分會(huì)場摘要集[C];2010年
3 陳婷婷;郭婷婷;李林;安冬;;基于生物信息學(xué)的功能蛋白基因序列分類研究[A];2011年全國通信安全學(xué)術(shù)會(huì)議論文集[C];2011年
4 盧學(xué)春;楊波;朱宏麗;姚善謙;;采用生物信息學(xué)方法優(yōu)化依硫磷酸聯(lián)合方案治療MDS的應(yīng)用研究[A];中國科協(xié)海峽兩岸學(xué)術(shù)研討會(huì)——2008血液腫瘤論壇會(huì)議會(huì)編[C];2008年
5 阮林;何穎;鄒澤紅;傅意玲;陳惠芳;陶愛林;;外源蛋白過敏原性生物信息學(xué)評(píng)價(jià)[A];中華醫(yī)學(xué)會(huì)2010年全國變態(tài)反應(yīng)學(xué)術(shù)會(huì)議暨中歐變態(tài)反應(yīng)高峰論壇參會(huì)指南/論文匯編[C];2010年
6 馮文龍;趙清杰;;基于遺傳算法的DNA多序列比對(duì)問題[A];2007年中國智能自動(dòng)化會(huì)議論文集[C];2007年
7 康曉東;;生物信息學(xué)及其研究對(duì)象[A];2003年全國醫(yī)學(xué)影像技術(shù)學(xué)術(shù)會(huì)議論文匯編[C];2003年
8 王智宇;童強(qiáng)松;曾甫清;劉媛;顧朝輝;鄭麗端;蔡嘉斌;蔣國松;;小鼠睪丸特異性基因TSEG-4的克隆及表達(dá)分析[A];第十五屆全國泌尿外科學(xué)術(shù)會(huì)議論文集[C];2008年
9 朱云平;劉湘軍;魏麗萍;李亦學(xué);;肝臟蛋白質(zhì)組的生物信息學(xué)研究[A];中國蛋白質(zhì)組學(xué)第三屆學(xué)術(shù)大會(huì)論文摘要[C];2005年
10 孫琳琳;蔣繼志;;生物信息學(xué)及其在作物抗性基因研究中的應(yīng)用[A];中國植物病理學(xué)會(huì)2006年學(xué)術(shù)年會(huì)論文集[C];2006年
相關(guān)重要報(bào)紙文章 前10條
1 衣曉峰 喬蕤琳;哈醫(yī)大建立系列生物信息學(xué)研究方法[N];中國醫(yī)藥報(bào);2010年
2 記者 郭曉靜 通訊員 熊學(xué)莉;三醫(yī)大建起生物信息學(xué)數(shù)據(jù)庫[N];重慶日?qǐng)?bào);2003年
3 本報(bào)記者 白毅;生物信息學(xué)院士談[N];中國醫(yī)藥報(bào);2002年
4 中科院生物學(xué)部 張春霆;對(duì)生物信息學(xué)的展望[N];北京科技報(bào);2000年
5 中科院院士 吳e,
本文編號(hào):2184447
本文鏈接:http://sikaile.net/yixuelunwen/shiyanyixue/2184447.html