Design and Implementation of Distributed Storage Optimization for College Entrance Examination Data
[Abstract]: In recent years, rapid informatization across industries has led to explosive growth in industry data, and the college entrance examination is no exception. Each year's examination produces a huge volume of data, and how to store it quickly and efficiently is an important topic worth studying. Faced with data at the TB or even PB scale, traditional relational databases are increasingly unable to keep up. The emergence of large-scale data has been accompanied by many new data storage technologies, among which Google's GFS and Apache's HDFS are two typical distributed storage technologies for big data. HDFS, now a popular Apache project, allows enterprises to store large amounts of data in a distributed manner on clusters of inexpensive machines. However, HDFS stores files under the control of a single master node with multiple slave data nodes, and this arrangement is prone to a bottleneck at the master node. For the college entrance examination data studied in this paper, if HDFS is used for storage, then when a large number of candidates query their results online at the same time, requests from many different clients flood into the HDFS master node, which poses a great challenge for it. To address this problem, and based on an in-depth study and analysis of HDFS distributed storage technology, this paper proposes a distributed storage scheme combining HDFS and MongoDB to solve the master node bottleneck, so that the distributed storage of college entrance examination data is better optimized and candidates can query their results more efficiently.
Based on the above analysis, the main research work of this paper is as follows: (1) The background and significance of the topic are set out, and the state of development of the technologies applied in the thesis (distributed storage, college entrance examination informatization, and the Spark big data platform) is analyzed. (2) The master node bottleneck that arises when college entrance examination data is stored with HDFS is analyzed, and an optimized distributed storage scheme for this data based on HDFS and MongoDB is proposed. (3) According to the specific requirements of the college entrance examination authority, a requirements analysis of a score query system that uses the optimized storage scheme is carried out from the user, functional, and non-functional points of view. Based on this requirements analysis, the system architecture and system functions are analyzed in detail, and the system database and the HDFS and MongoDB distributed storage are designed in detail. (4) Based on the detailed design, the implementation of the system is given. The system's functions are tested with the black-box method, and its performance is tested in three respects: response time, throughput, and concurrency. Finally, the main research content of the paper is summarized and directions for future work are identified.
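As a rough illustration of the three performance measures named in item (4), the following minimal sketch computes mean response time, throughput, and peak concurrency from a list of request start and end timestamps. It is not the thesis's test harness: the `RequestSample` type, the `summarize` function, and the sample timings are hypothetical names and values introduced here only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RequestSample:
    """One observed request (hypothetical record; times in seconds)."""
    start: float
    end: float


def summarize(samples: List[RequestSample]) -> Dict[str, float]:
    """Compute mean response time, throughput, and peak concurrency."""
    if not samples:
        raise ValueError("at least one sample is required")

    # Response time: how long each request took, averaged over all requests.
    durations = [s.end - s.start for s in samples]
    mean_response_time = sum(durations) / len(durations)

    # Throughput: completed requests per second of wall-clock time.
    wall_time = max(s.end for s in samples) - min(s.start for s in samples)
    throughput = len(samples) / wall_time if wall_time > 0 else float("inf")

    # Peak concurrency: sweep over start (+1) and end (-1) events in time
    # order. At equal timestamps, ends sort before starts, so a request
    # that finishes exactly when another begins does not overlap with it.
    events = []
    for s in samples:
        events.append((s.start, 1))
        events.append((s.end, -1))
    events.sort(key=lambda e: (e[0], e[1]))

    current = peak = 0
    for _, delta in events:
        current += delta
        peak = max(peak, current)

    return {
        "mean_response_time": mean_response_time,
        "throughput": throughput,
        "peak_concurrency": float(peak),
    }


# Illustrative timings only; these are not measurements from the thesis.
example = [
    RequestSample(0.0, 1.0),
    RequestSample(0.5, 1.5),
    RequestSample(1.0, 2.0),
]
report = summarize(example)
```

In a real load test the timestamps would come from the client-side test tool, and percentiles would usually be reported alongside the mean, since a mean can hide a slow tail under heavy concurrent querying.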
[Degree-granting institution]: Shandong Normal University
[Degree level]: Master's
[Year degree granted]: 2017
[Classification number]: TP333