A Two-Stage Method for Constructing User Regions of Interest with Support for Trajectory Privacy Protection
Topics: data clustering + spatial data; Source: Chinese Journal of Computers (《計算機學報》), 2017, No. 12
[Abstract]: To balance privacy protection against data availability in the open sharing of spatial big data, this paper analyzes spatio-temporal data along both the spatial and temporal dimensions and proposes a method for constructing user regions of interest that captures spatial, temporal, and group characteristics. The method has two stages. In the first stage, personal regions of interest are constructed: the trajectory data of m mobile users are preprocessed onto n sampling times and formalized as an m×n position matrix with an implicit temporal ordering; each user's row vector in the position matrix is then clustered, merged, and optimized according to indicators such as visit frequency, yielding several personal regions of interest for each user in different time periods. In the second stage, public regions of interest are built on the results of the first stage: location points representing each user's personal regions of interest are extracted according to a given selection scheme, and the points of all m users are clustered a second time, yielding several public regions of interest for all users at different time scales; time-tagged public regions of interest are then extracted as the application scenario requires. A comparison of the results on a public dataset with entity data from Baidu Maps shows that the constructed regions of interest are largely consistent with real-world functional areas. An application example shows that the constructed regions of interest can provide effective technical support for trajectory privacy protection in the open sharing of spatial big data.
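The two-stage procedure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the clustering algorithm, the selection scheme for representative points, or any thresholds, so everything concrete here is an assumption made for illustration: a greedy distance-threshold clustering stands in for the paper's clustering step, the cluster centroid stands in for the representative point, and `radius`, `min_visits`, and `min_users` are hypothetical parameters. The merging, optimization, and time-period steps are omitted.

```python
from math import hypot

def centroid(points):
    """Mean position of a list of (x, y) points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def cluster_points(points, radius):
    """Greedy clustering (an assumed stand-in, not the paper's algorithm):
    a point joins the first cluster whose centroid lies within `radius`,
    otherwise it starts a new cluster."""
    clusters = []
    for p in points:
        for c in clusters:
            cx, cy = centroid(c)
            if hypot(p[0] - cx, p[1] - cy) <= radius:
                c.append(p)
                break
        else:
            clusters.append([p])
    return clusters

def personal_regions(row, radius, min_visits):
    """Stage 1: cluster one user's row of the position matrix and keep
    clusters visited at least `min_visits` times, one centroid each."""
    return [centroid(c) for c in cluster_points(row, radius)
            if len(c) >= min_visits]

def public_regions(matrix, radius, min_visits, min_users):
    """Stage 2: pool the representative points of all users and cluster
    them again; keep clusters supported by at least `min_users` points."""
    reps = []
    for row in matrix:
        reps.extend(personal_regions(row, radius, min_visits))
    return [centroid(c) for c in cluster_points(reps, radius)
            if len(c) >= min_users]

# Toy m x n position matrix: m = 3 users, n = 4 sampling times.
matrix = [
    [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (0.0, 0.1)],
    [(0.2, 0.1), (0.1, 0.2), (9.0, 9.0), (0.2, 0.2)],
    [(5.0, 5.1), (5.1, 5.0), (5.0, 5.0), (0.0, 0.0)],
]
regions = public_regions(matrix, radius=0.5, min_visits=2, min_users=2)
print(regions)
```

In this toy input, users 1 and 2 each have a personal region near the origin and user 3 has one near (5, 5), so only the region near the origin is shared by at least two users and survives the second clustering. Releasing such pooled regions instead of individual sampled points is one way to read the privacy motivation stated in the abstract.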
[Affiliation]: School of Electronic and Information Engineering, Xi'an Jiaotong University; Shaanxi Province Key Laboratory of Computer Network
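[Funding]: Supported by the National Natural Science Foundation of China (61472316, 61502380), the Fundamental Research Funds for the Central Universities, Comprehensive Interdisciplinary Key Project (XKJC2014008), and the Shaanxi Province Major Basic Research Project (2016ZDJC-05)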
[CLC number]: TP309
Article ID: 1735736
Article link: http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/1735736.html