云計(jì)算平臺下資源監(jiān)控與態(tài)勢評估方法研究
本文選題:OpenStack + 資源監(jiān)控 ; 參考:《西安電子科技大學(xué)》2014年碩士論文
【摘要】:如今,大數(shù)據(jù)時(shí)代已經(jīng)真正到來并逐漸對我們的生活產(chǎn)生了深遠(yuǎn)的影響。大數(shù)據(jù)時(shí)代中備受關(guān)注的是云計(jì)算領(lǐng)域的發(fā)展,通過人們對云計(jì)算概念的不斷探索與實(shí)踐,“云”的理念已經(jīng)對技術(shù)發(fā)展有了深刻而遠(yuǎn)大的影響,也逐漸引起各界的關(guān)注。2003年,谷歌發(fā)表了核心論文來闡述云計(jì)算理念;2006年,亞馬遜將云計(jì)算技術(shù)商業(yè)化;近些年,“云”已經(jīng)觸手可及,并可以作為一種服務(wù)深入我們的生活。這種環(huán)境下,催生了一些云計(jì)算平臺,OpenStack就是最富代表性的。論文首先整理了相關(guān)的基礎(chǔ)概念,包含云計(jì)算、云平臺及虛擬化的知識,同時(shí)對現(xiàn)在比較流行的云平臺進(jìn)行了介紹;其次,論文對比分析了幾種監(jiān)控技術(shù)在不同場景下的應(yīng)用;然后,論文對態(tài)勢評估的概念、特點(diǎn)及評估要素等做了詳細(xì)的闡述,同時(shí)對比了現(xiàn)有的態(tài)勢評估的方法;谝陨蟽(nèi)容,論文最終設(shè)計(jì)并實(shí)現(xiàn)了基于OpenStack云平臺的單發(fā)/單收多播拓?fù)浣Y(jié)構(gòu)的監(jiān)控系統(tǒng)部署方案,進(jìn)而,通過對監(jiān)控?cái)?shù)據(jù)的處理,提出了云平臺態(tài)勢評估的方法。關(guān)于云監(jiān)控系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn),論文主要通過改進(jìn)Ganglia默認(rèn)的多發(fā)/多收的多播拓?fù)浣Y(jié)構(gòu),設(shè)計(jì)單發(fā)/單收的拓?fù)浣Y(jié)構(gòu),避免所有節(jié)點(diǎn)都要接收其他節(jié)點(diǎn)的指標(biāo)數(shù)據(jù),進(jìn)而避免CPU資源不必要的浪費(fèi),消除大型集群的運(yùn)行開銷。同時(shí),每個(gè)集群的gmetad守護(hù)進(jìn)程使用RRD文件對資源數(shù)據(jù)信息進(jìn)行匯聚,并匯總至主控節(jié)點(diǎn),主控節(jié)點(diǎn)部署了利用Bootstrap、Highcharts、node.js等技術(shù)開發(fā)的UI模塊,一方面,為管理員提供查看平臺資源數(shù)據(jù)實(shí)時(shí)變化的平臺系統(tǒng);另一方面,為用戶提供任務(wù)提交及結(jié)果查看等入口。論文中對態(tài)勢評估方法的研究,主要目的是為了挖掘平臺產(chǎn)生的大量數(shù)據(jù)潛在的價(jià)值。由于云平臺的規(guī)模比較大,資源數(shù)據(jù)也比較復(fù)雜,論文所利用的態(tài)勢評估方法主要借鑒D-S證據(jù)推理法及決策判斷法,采用“點(diǎn)”和“段”的資源監(jiān)控?cái)?shù)據(jù)或日志信息對平臺的健康狀態(tài)及任務(wù)的運(yùn)行情況進(jìn)行評估及預(yù)測,同時(shí),也可以通過監(jiān)控?cái)?shù)據(jù)的變化趨勢來反應(yīng)任務(wù)本身的特點(diǎn),通過一段時(shí)間的任務(wù)執(zhí)行情況來判斷整個(gè)平臺或者某個(gè)集群的穩(wěn)定性,通過一些故障提示可以對平臺的態(tài)勢及故障原因進(jìn)行分析,進(jìn)而保證平臺問題及時(shí)得到解決。云監(jiān)控系統(tǒng)對于一個(gè)可靠的云平臺來說是必要的,并為平臺可用性提供支撐;態(tài)勢評估方法直接影響云平臺數(shù)據(jù)價(jià)值的最大化。最后,論文對所實(shí)現(xiàn)的系統(tǒng)進(jìn)行了功能測試及性能測試,系統(tǒng)所表現(xiàn)出的實(shí)時(shí)性和穩(wěn)定性都可以達(dá)到論文目標(biāo)。
[Abstract]:Today, the big data era has really arrived and gradually had a profound impact on our lives. In the era of big data, the development of cloud computing has attracted much attention. Through the continuous exploration and practice of cloud computing, the concept of "cloud" has had a profound and far-reaching impact on the development of technology, and has gradually aroused the attention of all circles. Google has published a core paper on cloud computing; Amazon commercialized cloud computing technology in 2006; and in recent years, the cloud has been within reach and can be used as a service in our lives. In this environment, some cloud computing platform is the most representative OpenStack. Firstly, the paper summarizes the basic concepts, including cloud computing, cloud platform and virtualization, and introduces the popular cloud platform. Secondly, the paper compares and analyzes the application of several monitoring technologies in different scenarios. Then, the concept, characteristics and evaluation elements of situation assessment are described in detail, and the existing situation assessment methods are compared. Based on the above content, the paper finally designs and implements the deployment scheme of the monitoring system based on OpenStack cloud platform, which is based on the topology of single send / single receive multicast. Furthermore, through the processing of monitoring data, the method of cloud platform situation assessment is put forward. As to the design and implementation of cloud monitoring system, this paper mainly improves ganglia's default multicast topology, designs single send / receive topology, and avoids all nodes receiving index data from other nodes. Then avoid unnecessary waste of CPU resources and eliminate the running overhead of large clusters. At the same time, the gmetad daemons of each cluster use the RRD file to aggregate the resource data information and aggregate it to the master control node, which deploys the UI modules developed using the technologies such as Bootstrapger, Highchartsnnode.js, etc., on the one hand, On the other hand, it provides users with access to task submission and result viewing. The main purpose of this paper is to explore the potential value of a large amount of data generated by the platform. Because of the large scale of cloud platform and the complexity of resource data, the situation assessment methods used in this paper mainly draw lessons from D-S evidence reasoning method and decision judgment method. The resource monitoring data or log information of "dot" and "segment" are used to evaluate and predict the health status of the platform and the operation of the task. At the same time, the characteristics of the task itself can be reflected by monitoring the changing trend of the data. The stability of the whole platform or a cluster can be judged by a period of time task execution, and the situation and the cause of the fault can be analyzed through some fault hints, so as to ensure that the platform problem can be solved in time. Cloud monitoring system is necessary for a reliable cloud platform and provides support for platform availability. Situation assessment method directly affects the maximum value of cloud platform data. Finally, the function and performance of the system are tested, and the real-time and stability of the system can reach the goal of the paper.
【學(xué)位授予單位】:西安電子科技大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TP393.08
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 李靜嚴(yán);;濟(jì)鋼能源管控中心水資源監(jiān)控與調(diào)度[J];電氣時(shí)代;2009年05期
2 任怡;張菁;陳紅;吳慶波;孔金珠;戴華東;管剛;;云應(yīng)用引擎的資源監(jiān)控和計(jì)費(fèi)機(jī)制研究[J];通信學(xué)報(bào);2012年S1期
3 羅建;;關(guān)注存儲資源管理[J];中國電信業(yè);2004年09期
4 陳剛;武永衛(wèi);柳佳;楊廣文;鄭緯民;;基于資源監(jiān)控的網(wǎng)格調(diào)度系統(tǒng)[J];華中科技大學(xué)學(xué)報(bào)(自然科學(xué)版);2007年S2期
5 張燕妮;肖峰;張建威;戴冶;;網(wǎng)絡(luò)系統(tǒng)資源監(jiān)控與應(yīng)用[J];網(wǎng)絡(luò)安全技術(shù)與應(yīng)用;2008年02期
6 余越;Unicengter TNG用于網(wǎng)絡(luò)資源監(jiān)控的原理與設(shè)計(jì)[J];上海鐵道科技;2001年01期
7 ;系統(tǒng)盡在我掌控 系統(tǒng)資源監(jiān)控軟件Winpulse[J];電腦采購周刊;2004年23期
8 彭洪;易昌善;黃巖渠;;面向業(yè)務(wù)的IT資源監(jiān)控系統(tǒng)設(shè)計(jì)[J];金融電子化;2008年03期
9 胡亮;車喜龍;;基于Nu-支持向量回歸的網(wǎng)格資源監(jiān)控與預(yù)測系統(tǒng)(英文)[J];自動(dòng)化學(xué)報(bào);2010年01期
10 方娟;張書杰;邸瑞華;黃河;;網(wǎng)格資源監(jiān)控關(guān)鍵技術(shù)的研究[J];計(jì)算機(jī)應(yīng)用與軟件;2005年12期
相關(guān)會(huì)議論文 前2條
1 李曉陽;丁峰;翟玉建;;基于GMA的網(wǎng)絡(luò)資源監(jiān)控技術(shù)的研究與實(shí)現(xiàn)[A];第十六屆全國青年通信學(xué)術(shù)會(huì)議論文集(上)[C];2011年
2 陳建香;劉歡;盧蓓蓉;;智慧校園中的地圖建設(shè)研究——基于資源管控的視角[A];中國高等教育學(xué)會(huì)教育信息化分會(huì)第十二次學(xué)術(shù)年會(huì)論文集[C];2014年
相關(guān)重要報(bào)紙文章 前5條
1 侯軍 摩卡軟件有限公司資深售前經(jīng)理 于尚民;IT業(yè)務(wù)融合需要深層監(jiān)控[N];通信產(chǎn)業(yè)報(bào);2009年
2 政務(wù)報(bào)道組;水利部力推國家水資源監(jiān)控能力建設(shè)項(xiàng)目建設(shè)管理工作[N];中國水利報(bào);2013年
3 通訊員 朱朝明;國家水資源監(jiān)控能力建設(shè)項(xiàng)目辦公室成立[N];中國水利報(bào);2012年
4 劉恒 通訊員 包楠;重慶聯(lián)通承建水資源監(jiān)控自動(dòng)化系統(tǒng)[N];人民郵電;2007年
5 記者 張強(qiáng);湖北億元投入建水資源在線監(jiān)控體系[N];中國水利報(bào);2014年
相關(guān)碩士學(xué)位論文 前10條
1 張仲妹;云計(jì)算環(huán)境下的資源監(jiān)控應(yīng)用研究[D];北方工業(yè)大學(xué);2013年
2 梁立新;工商網(wǎng)絡(luò)安全資源監(jiān)控管理平臺[D];四川大學(xué);2004年
3 文新宇;資源監(jiān)控與負(fù)載能力分析系統(tǒng)設(shè)計(jì)與實(shí)現(xiàn)[D];大連理工大學(xué);2013年
4 靳京;面向用戶的網(wǎng)格資源監(jiān)控服務(wù)系統(tǒng)的研究[D];燕山大學(xué);2006年
5 李玉凱;網(wǎng)格環(huán)境中資源監(jiān)控技術(shù)的研究[D];華北電力大學(xué)(河北);2007年
6 陳紅;云計(jì)算應(yīng)用引擎計(jì)費(fèi)機(jī)制研究與實(shí)現(xiàn)[D];國防科學(xué)技術(shù)大學(xué);2011年
7 田X;基于Intranet的資源監(jiān)控技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2006年
8 李曉陽;基于GMA的資源監(jiān)控技術(shù)的研究與實(shí)現(xiàn)[D];南京航空航天大學(xué);2010年
9 宋彩華;云資源監(jiān)控中的數(shù)據(jù)傳輸模型研究[D];鄭州大學(xué);2013年
10 宋雪飛;基于MA的策略網(wǎng)管中資源監(jiān)控的研究與設(shè)計(jì)[D];吉林大學(xué);2005年
,本文編號:2001726
本文鏈接:http://sikaile.net/guanlilunwen/ydhl/2001726.html