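TMS: A New Algorithm for Top-k Queries with Multi-Dimensional Selection on Massive Data
Published: 2018-03-18 06:26
Topic: TMS algorithm; Entry point: sorted lists; Source: Journal of Computer Research and Development, 2017, Issue 03; Paper type: journal article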
[Abstract]: In many applications, top-k is an important query type that returns a small number of tuples of interest to the user from a potentially huge data space. Top-k queries usually carry specified multi-dimensional selection conditions. Analysis shows that existing algorithms cannot efficiently process top-k queries with multi-dimensional selection on massive data. This paper proposes TMS (top-k with multi-dimensional selection), a sorted-list-based algorithm that efficiently computes top-k results with multi-dimensional selection on massive data. TMS uses a hierarchically structured grid over the selection attributes to partition the original table horizontally; the tuples of each partition are stored in a column-oriented layout, and the list for each measure attribute is sorted in descending order of its attribute values. Given a multi-dimensional selection condition, TMS uses the selection-attribute grid to determine the relevant grid cells, which reduces the number of tuples that must be read. A double-sorting method is proposed to evaluate the multi-dimensional selection progressively, and an effective pruning operation is proposed to discard candidate tuples that fail the selection condition or the score requirement. Experimental results show that TMS outperforms the existing algorithms.
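The pipeline the abstract describes (grid partitioning on the selection attributes, descending measure lists per partition, reading only the relevant cells, then pruning) can be illustrated with a minimal Python sketch. This is not the paper's implementation. The abstract does not specify the hierarchical grid, the double-sorting method, or the pruning rules, so the sketch substitutes a flat uniform grid and a plain threshold-style scan over the sorted lists. The names `build_grid`, `top_k`, and `width`, and the use of the sum of measure values as the score, are all assumptions made for illustration.

```python
import heapq
from collections import defaultdict
from itertools import product


def build_grid(rows, sel_attrs, measures, width):
    """Partition rows into uniform grid cells over the selection attributes.

    Each cell keeps one list per measure attribute, holding (value, row_id)
    pairs sorted in descending order (a stand-in for column-oriented storage).
    """
    cells = defaultdict(list)
    for rid, row in enumerate(rows):
        key = tuple(int(row[a] // width) for a in sel_attrs)
        cells[key].append(rid)
    return {
        key: {m: sorted(((rows[r][m], r) for r in rids), reverse=True)
              for m in measures}
        for key, rids in cells.items()
    }


def top_k(rows, grid, sel_attrs, measures, width, ranges, k):
    """Return the k best (score, row_id) pairs among rows inside `ranges`.

    `ranges` maps each selection attribute to an inclusive (lo, hi) interval.
    The score is assumed to be the sum of the measure values.
    """
    # Step 1: only grid cells that overlap the selection ranges are read.
    spans = [range(int(ranges[a][0] // width), int(ranges[a][1] // width) + 1)
             for a in sel_attrs]
    relevant = [grid[c] for c in product(*spans) if c in grid]
    if not relevant:
        return []
    # Step 2: one merged descending stream per measure attribute.
    streams = [heapq.merge(*(cell[m] for cell in relevant), reverse=True)
               for m in measures]
    best, seen = [], set()  # best is a min-heap of the current top k
    while True:
        threshold = 0  # upper bound on the score of any unseen row
        for stream in streams:
            item = next(stream, None)
            if item is None:  # every relevant row has been examined
                return sorted(best, reverse=True)
            value, rid = item
            threshold += value
            if rid in seen:
                continue
            seen.add(rid)
            row = rows[rid]
            # Step 3: discard candidates that fail the selection condition.
            if all(ranges[a][0] <= row[a] <= ranges[a][1] for a in sel_attrs):
                score = sum(row[m] for m in measures)
                if len(best) < k:
                    heapq.heappush(best, (score, rid))
                elif score > best[0][0]:
                    heapq.heapreplace(best, (score, rid))
        # Step 4: stop once no unseen row can beat the current k-th score.
        if len(best) == k and best[0][0] >= threshold:
            return sorted(best, reverse=True)
```

In this sketch the grid restricts which lists are opened at all, while the threshold test ends the scan early. The two correspond loosely to the tuple-reduction and pruning steps named in the abstract.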
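[Author affiliation]: School of Computer Science and Technology, Harbin Institute of Technology
[Funding]: National Basic Research Program of China (973 Program) (2012CB316200); National Natural Science Foundation of China (61502121, 61402130, 61272046, 61190115, 61173022, 61033015); Natural Science Foundation of Shandong Province (ZR2013FQ028); Shandong Province Major Science and Technology Project (2015ZDXX0210B02)
[CLC number]: TP311.13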
Article ID: 1628363
Article link: http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/1628363.html