基于隨機森林算法的土壤圖斑分解
[Abstract]:The drawing method of polygon and the process of using long field survey and aerial picture interpretation determine that the efficiency of the traditional soil map is relatively low and time-consuming, and the precision of the traditional soil map is difficult to meet the development of modern science. The traditional soil map is mainly faced with the following problems. First, the size of the mapping scale often determines the size of the smallest plot. The larger the scale, the smaller the smallest map that can be expressed in the soil map, so the traditional soil map will ignore the small spots in the large plot because of the scale limitation during the drawing process. The space and attribute of the soil map are simplified. Secondly, the expression of the hand polygon also neglects the characteristics of the soil spatial gradient. The mutation of the polygon boundary leads to the mutation of the soil space and properties that have been changed continuously. Finally, based on the expert experience and manual drawing, it is very time-consuming and easy to produce people. However, the traditional soil map, which contains a large number of expert knowledge, is the valuable information left by the history, and still has important reference value for the present research. This paper takes the water river basin of huayuhe Town, Hong'an County, Huanggang City, Hubei Province as the research area, and combines the traditional soil map obtained by the National Second Soil Census. Some terrain data and multi spectral data are used in GIS platform and R language environment to excavate soil environmental knowledge, and use this model to decompose the original soil map in space, and get more detailed spatial distribution map of spatial information. The specific research steps are divided into following steps: 1) extraction and research area The initial environmental variables in this selection include soil parent material data, topographic data and multispectral data, using gradient, slope, terrain humidity index, curvature along the contour, horizontal curvature and horizontal curvature to extract normalized vegetation from multi spectral data. Index, normalized water index, the first principal component, deviation, information entropy, variance, mean value, and the dependent variable.2 used in the research of the parent material. The sampling point is designed with the weighted sampling pattern of the patch area to ensure that each spot has at least 10 samples, and the 6686 samples are finally determined. Boundary factor data and classification of sample data according to the parent material.3) environmental factors screening. In order to ensure mapping precision and efficiency, we need to eliminate a part of the factors that have low contribution to the model. This study uses the variable importance measure importance () function provided by the R language to determine the parameters of the.4 model. Two very important parameters, mtry and nTree, can be used to judge the.5) model through the calculation of the external error of the random forest model and the calculation of the model stability respectively. Using the Random Forest packet in the R language, the data are modeled and four groups of models under the four matrix units are obtained, and the four groups of models are used to study each grid position in the area. The environment factor information is voted to determine the soil type in each location by voting, and then the soil map of the area is obtained. The study shows that the whole soil map after the decomposition of the map is significantly increased in the number of spots compared with the traditional soil map, and the spatial distribution is more detailed, showing more details. In this study, we use the RF model to achieve a better expression on the classification problem. It shows that the knowledge of using the RF model to obtain the soil environmental relationship is true and credible. It can provide a efficient method for the fine digital soil mapping. In addition, the variable importance measure function provided by the random forest algorithm can be important to the variables. In order to delete the factor of small contribution to the model, it not only ensures the accuracy of the classification, but also greatly improves the efficiency of the calculation. It provides a reliable method and basis for the soil map decomposition in large area in the future.
【學(xué)位授予單位】:華中農(nóng)業(yè)大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:S159.9
【參考文獻】
相關(guān)期刊論文 前10條
1 劉雪琦;朱阿興;楊琳;繆亞敏;曾燦英;;土壤圖更新中基于土壤類型面積分級的訓(xùn)練樣點選擇方法[J];土壤學(xué)報;2017年01期
2 王茵茵;齊雁冰;陳洋;解飛;;基于多分辨率遙感數(shù)據(jù)與隨機森林算法的土壤有機質(zhì)預(yù)測研究[J];土壤學(xué)報;2016年02期
3 黃魏;羅云;汪善勤;陳家贏;韓宗偉;祁大成;;基于傳統(tǒng)土壤圖的土壤—環(huán)境關(guān)系獲取及推理制圖研究[J];土壤學(xué)報;2016年01期
4 郭澎濤;李茂芬;羅微;林清火;唐群鋒;劉志崴;;基于多源環(huán)境變量和隨機森林的橡膠園土壤全氮含量預(yù)測[J];農(nóng)業(yè)工程學(xué)報;2015年05期
5 趙北庚;;基于R語言randomForest包的隨機森林建模研究[J];計算機光盤軟件與應(yīng)用;2015年02期
6 韓宗偉;黃魏;羅云;張春弟;祁大成;;基于路網(wǎng)的土壤采樣布局優(yōu)化——模擬退火神經(jīng)網(wǎng)絡(luò)算法[J];應(yīng)用生態(tài)學(xué)報;2015年03期
7 楊琳;朱阿興;張淑杰;安藝明;;土壤制圖中多等級代表性采樣與分層隨機采樣的對比研究[J];土壤學(xué)報;2015年01期
8 寧亮亮;張曉麗;;基于紋理信息的Landsat-8影像植被分類初探[J];中南林業(yè)科技大學(xué)學(xué)報;2014年09期
9 韓宗偉;黃魏;張春弟;羅云;;基于土壤養(yǎng)分-景觀關(guān)系的土壤采樣布局合理性研究[J];華中農(nóng)業(yè)大學(xué)學(xué)報;2014年01期
10 張淑杰;朱阿興;劉京;楊琳;;基于樣點的數(shù)字土壤屬性制圖方法及樣點設(shè)計綜述[J];土壤;2012年06期
相關(guān)碩士學(xué)位論文 前2條
1 周銀;基于決策樹方法的縣級土壤數(shù)字制圖研究[D];浙江大學(xué);2011年
2 李杭燕;時間序列NDVI數(shù)據(jù)集重建方法研究[D];蘭州大學(xué);2010年
,本文編號:2146909
本文鏈接:http://sikaile.net/kejilunwen/nykj/2146909.html