基于網(wǎng)格的蛋白質(zhì)結(jié)構(gòu)預(yù)測(cè)的并行實(shí)現(xiàn)
發(fā)布時(shí)間:2018-05-15 22:35
本文選題:蛋白質(zhì)折疊 + 遺傳算法; 參考:《武漢科技大學(xué)》2012年碩士論文
【摘要】:蛋白質(zhì)折疊結(jié)構(gòu)預(yù)測(cè)問(wèn)題是當(dāng)前生物學(xué)研究的一個(gè)熱點(diǎn)。由于其特殊的結(jié)構(gòu)和所使用的模型限制,利用NP問(wèn)題的求解來(lái)求其最小能量值從而推測(cè)出折疊結(jié)構(gòu)是這個(gè)問(wèn)題的研究方向之一。許多算法用來(lái)解決這個(gè)問(wèn)題,然而由于復(fù)雜的計(jì)算導(dǎo)致的時(shí)間的消耗代價(jià)昂貴。遺傳退火算法(GAA)是遺傳算法和模擬退火算法的結(jié)合,同時(shí)具備了遺傳算法全局搜索能力強(qiáng)和退火算法局部收斂快的優(yōu)勢(shì),因此成為蛋白質(zhì)折疊結(jié)構(gòu)預(yù)測(cè)問(wèn)題常用算法。 網(wǎng)格計(jì)算是一種分布式并行計(jì)算,它致力于利用網(wǎng)絡(luò)上的閑置資源來(lái)解決大規(guī)模計(jì)算問(wèn)題。網(wǎng)格并行系統(tǒng)的設(shè)計(jì)以網(wǎng)格中間件為基礎(chǔ),具備管理性強(qiáng)、安全性高、數(shù)據(jù)傳輸方便和擴(kuò)展性好的特點(diǎn),從計(jì)算的效率、設(shè)備的代價(jià)來(lái)看,適合大規(guī)模計(jì)算問(wèn)題的普遍研究;贛PI的網(wǎng)格編程接口是實(shí)現(xiàn)網(wǎng)格并行計(jì)算的一種應(yīng)用。 本文利用遺傳退火算法來(lái)解決蛋白質(zhì)折疊結(jié)構(gòu)預(yù)測(cè)問(wèn)題,將串行算法移植到并行的網(wǎng)格平臺(tái)中。在并行算法中,種群被分為多個(gè)子種群分布到子節(jié)點(diǎn)中各自進(jìn)行演化操作。同時(shí)根據(jù)實(shí)際情況對(duì)算法的算子(選擇、交叉、變異)進(jìn)行改進(jìn),,以求得更好的計(jì)算速度和算法效率。
[Abstract]:Protein folding structure prediction is a hot topic in current biological research. Because of its special structure and the limitation of the model used, the solution of NP problem is used to calculate its minimum energy value, and it is inferred that folding structure is one of the research directions of this problem. Many algorithms are used to solve this problem, but the time consumption due to complex computation is expensive. Genetic annealing algorithm (GA) is a combination of genetic algorithm and simulated annealing algorithm. It has the advantages of strong global search ability and fast local convergence of annealing algorithm, so it has become a common algorithm for protein folding structure prediction. Grid computing is a kind of distributed parallel computing, which is dedicated to solving large-scale computing problems by using idle resources on the network. The design of grid parallel system is based on grid middleware, which has the characteristics of strong management, high security, convenient data transmission and good expansibility. From the point of view of computing efficiency and equipment cost, it is suitable for the general research of large-scale computing problems. Grid programming interface based on MPI is an application to realize grid parallel computing. In this paper, the genetic annealing algorithm is used to solve the protein folding structure prediction problem, and the serial algorithm is transplanted to the parallel grid platform. In the parallel algorithm, the population is divided into multiple subpopulations and distributed to the child nodes to perform evolutionary operations. At the same time, the operators (selection, crossover, mutation) of the algorithm are improved according to the actual situation, in order to obtain better calculation speed and algorithm efficiency.
【學(xué)位授予單位】:武漢科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:Q51;TP338.6
本文編號(hào):1894264
本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1894264.html
最近更新
教材專著