Location Big Data Analysis Based on Unsupervised Learning Techniques

Published: 2018-01-27 03:11

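Keywords: taxi; trajectory features; travel mode identification. Source: Zhejiang Sci-Tech University, 2017 master's thesis. Document type: degree thesis.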

[Abstract]: With social development and continued advances in science and technology, location-aware technologies such as mobile communication and sensing devices have generated large volumes of location data. Mining and analyzing these data to uncover latent useful information, and using it to plan urban construction and travel routes sensibly, will greatly advance the intelligence and informatization of modern society. Inferring the transportation mode people use from their trajectory data also helps researchers estimate vehicle ownership, income level, and occupation. This thesis presents several studies on location big data. First, taxi GPS trajectory data are analyzed. The factors that may introduce data errors are examined and the raw data are preprocessed accordingly, covering error analysis and the basis and methods of data processing. The origins and destinations of residents' trips are then identified and visualized on a map with ArcGIS. Based on the processed GPS data, the spatiotemporal distribution of residents' travel is analyzed, including trip volumes on workdays and rest days and the daily peak periods, and conclusions are drawn from the results. Cluster analysis is then applied to the taxi GPS data: a suitable clustering method is selected to divide the study area into traffic zones, and a travel OD (origin-destination) matrix is built. Second, a K-Means method based on spatiotemporal clustering is used to obtain the active center points of passenger pick-ups and drop-offs. Combined with ArcGIS spatial analysis tools, buffers are constructed to find the road segments where passengers can most easily hail a taxi, which resolves the problem of active center points falling off the road network. In addition, density-based clustering is used to study how hot spots of urban residents' travel are distributed, and comparison with actual conditions demonstrates the effectiveness of the method. Then, to improve the recognition rate for different travel modes, a travel mode identification model based on deep learning is proposed. Using the user GPS trajectory dataset collected by Microsoft Research Asia, temporal characteristics are used to extract the GPS trajectory segments that correspond to each travel mode, and the features of the different modes are analyzed. The effect of the number of iterations on the root mean square error and on network training time is considered, and a suitable number of iterations is chosen so that the network reaches its best training result in as short a time as possible. Finally, the method is compared experimentally with a BP neural network and an SVM. The results show that the proposed deep learning model identifies different travel modes effectively, with recognition accuracy clearly higher than that of the traditional methods, which demonstrates the feasibility of the model for travel mode identification.
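The zone partitioning and OD-matrix steps mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the thesis's actual algorithmic details, so everything below is an assumption made for illustration: plain K-Means (Lloyd's algorithm) on pick-up coordinates only, a deterministic farthest-first seeding, made-up trip data in local x/y kilometres, and the hypothetical helper names `farthest_first`, `nearest`, `kmeans`, and `od_matrix`. It is not the author's implementation.

```python
import math
from collections import Counter

def farthest_first(points, k):
    """Deterministic seeding: start at points[0], then repeatedly add the
    point farthest from the centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers

def nearest(p, centers):
    """Index of the center closest to p (Euclidean distance)."""
    return min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))

def kmeans(points, k, max_iters=100):
    """Lloyd's algorithm; returns (centers, labels)."""
    centers = farthest_first(points, k)
    for _ in range(max_iters):
        labels = [nearest(p, centers) for p in points]
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Mean of the members; an empty cluster keeps its old center.
            updated.append(tuple(sum(c) / len(members) for c in zip(*members))
                           if members else centers[j])
        if updated == centers:  # converged: the centers stopped moving
            break
        centers = updated
    return centers, [nearest(p, centers) for p in points]

def od_matrix(trips, centers):
    """Count trips per (origin zone, destination zone) pair, assigning each
    endpoint to the zone of its nearest cluster center."""
    return Counter((nearest(o, centers), nearest(d, centers)) for o, d in trips)

# Hypothetical trips as (pick-up, drop-off) pairs in local x/y kilometres.
trips = [
    ((0.0, 0.0), (10.0, 10.0)),
    ((0.0, 1.0), (10.0, 11.0)),
    ((1.0, 0.0), (0.0, 1.0)),
    ((10.0, 10.0), (1.0, 0.0)),
    ((10.0, 11.0), (11.0, 10.0)),
    ((11.0, 10.0), (0.0, 0.0)),
]
pickups = [origin for origin, _ in trips]
centers, labels = kmeans(pickups, k=2)
od = od_matrix(trips, centers)
print(centers)
print(labels)
print(dict(od))
```

Because the functions work on tuples of any length, a time-of-day value could be added as an extra, suitably scaled coordinate to approximate a spatiotemporal variant. How the thesis weights time against space is not stated in the abstract. Real GPS data would also need projecting from longitude/latitude to a metric coordinate system before Euclidean distances are meaningful.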
[Degree-granting institution]: Zhejiang Sci-Tech University
[Degree level]: Master's
[Year degree granted]: 2017
[Classification number]: TP311.13

Li Sifan; Location Big Data Analysis Based on Unsupervised Learning Techniques [D]; Zhejiang Sci-Tech University; 2017


Article ID: 1467398


Article link: http://sikaile.net/shoufeilunwen/xixikjs/1467398.html
