HBase數據庫的安全索引研究
[Abstract]:With the arrival of big data era, the traditional relational database can not meet the needs of massive data processing in terms of access rate and so on. Therefore, NoSQL database emerges as the times require. Compared with relational database, NoSQL database has shortcomings in security. HBASE distributed database is a kind of NoSQL database with perfect security mechanism, but the data files in HBase database are stored in plain text. This forms a certain security loophole. It is very meaningful to encrypt the data files in HBase database and construct the secure secondary index of ciphertext for HBase database, which makes the HBase ciphertext database have the ability of conditional query. According to the different data types of column data in HBase distributed database, numerical ciphertext retrieval and text ciphertext retrieval can be studied respectively. For encrypted text retrieval of numerical data in HBase distributed database, a search strategy based on improved secure R-tree is proposed. When constructing the R-tree secure index for the data of the data table of HBase, the dynamic characteristics of the R-tree are used to store the RowKey row in the proper location of the index table according to the set encryption policy, according to the column value needed to build the index of the data table. At the same time, the index table IndexTable corresponding to the data table DataTable of HBase is stored in HBase to ensure the use and maintenance of the index table. This method can well support the column equivalent retrieval and range retrieval. A secure inverted index-based retrieval strategy for Chinese text-based data retrieval in HBase distributed database is proposed in this paper. When constructing text-based ciphertext index, we combine the coprocessor framework of HBase, which is commonly used in building secondary index, and Lucene, an open source search engine toolkit, to encrypt word segmentation after standardizing the Column Family/Q ualifer-qualified column in HBase as text content. At the same time, the data integrity and availability are ensured by storing the relevant files in the Lucene database in the HBase database. This method can well support the equivalent query and substring query for columns. The prototype system of ciphertext retrieval based on HBase is designed and implemented, which includes user management module, encryption and decryption module, key management module, ciphertext index generation and retrieval module, and so on. The security index of HBASE makes full use of the existing fast retrieval ability of HBase, and the current research results still have some limitations on the support of the diversity of ciphertext data query, and the security index of HBASE has made full use of the fast retrieval ability of the existing HBASE. This direction still has great research space and significance.
【學位授予單位】:華中科技大學
【學位級別】:碩士
【學位授予年份】:2016
【分類號】:TP311.13;TP309
【參考文獻】
相關期刊論文 前10條
1 李文成;趙逢禹;;企業(yè)云存儲數據的加密與密文全文檢索研究[J];小型微型計算機系統(tǒng);2013年02期
2 曹夕;許力;陳蘭香;;云存儲系統(tǒng)中數據完整性驗證協議[J];計算機應用;2012年01期
3 鄒燕飛;于成尊;趙亮;;基于Lucene的文本搜索引擎的設計和實現[J];計算機與現代化;2011年09期
4 盧冬海;何先波;;淺析NoSQL數據庫[J];中國西部科技;2011年02期
5 鄭榕增;林世平;;基于Lucene的中文倒排索引技術的研究[J];計算機技術與發(fā)展;2010年03期
6 田秀彥;李忠;羅士美;鄧力強;;基于Struts+Hibernate+Spring的整合架構及其在Web開發(fā)中的應用[J];計算機與現代化;2008年11期
7 王靜疆;;搜索引擎評價指標體系比較研究[J];圖書情報工作;2008年10期
8 馮朝勝;秦志光;袁丁;;數據庫加密系統(tǒng)密鑰管理模塊的設計[J];電子科技大學學報;2007年05期
9 胡立春;武友新;張燁;姜曉東;;LDAP環(huán)境下的統(tǒng)一用戶管理系統(tǒng)的研究與實現[J];計算機工程與設計;2007年04期
10 管建和;甘劍峰;;基于Lucene全文檢索引擎的應用研究與實現[J];計算機工程與設計;2007年02期
相關碩士學位論文 前6條
1 付文靜;基于HBase的大數據存儲查詢技術研究[D];電子科技大學;2015年
2 蔡平;基于Hadoop的NoSQL數據庫安全研究[D];上海交通大學;2013年
3 關爾昱;基于數據庫外層加密的數據庫加密系統(tǒng)的設計與實現[D];北京郵電大學;2010年
4 宋賽;密文全文檢索系統(tǒng)的安全索引結構研究[D];華中科技大學;2009年
5 劉可;數據庫加密系統(tǒng)研究[D];合肥工業(yè)大學;2006年
6 余登峰;基于R樹的空間數據索引技術研究與實現[D];中國地質大學;2006年
,本文編號:2440504
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2440504.html