基于監(jiān)督學(xué)習(xí)的開(kāi)源平臺(tái)軟件開(kāi)發(fā)行為研究
[Abstract]:Since the end of the 20th century, the booming open source software is gradually challenging the traditional proprietary software dominant software industry pattern, the emergence of gradually increasing open source software has a great impact on the market structure of the software industry. The distributed development model is gradually developing with the change of open source software development requirements, and the appearance of the drag-and-drop distributed development model leads to the development direction of a new distributed software development model. The research on the characteristics of development behavior in open source development is a hot topic in the field of software evolution, which can help developers to understand the law of software evolution more deeply and improve the existing software development process. As more and more developers are involved in open source software development, some code-managed platforms, such as GitHub and BitBucket, have gradually begun to provide appropriate support for distributed software development. When analyzing the development behavior on GitHub, it is necessary to deal with a large amount of loose data, and in order to obtain the depth value, it is often necessary to use intelligent and complex analysis, such as machine learning, and so on. In this paper, the open source projects based on drag-and-drop development model mounted on GitHub are analyzed, and the rules of development process turnover, external contribution acceptance and processing time of external contribution are found out. This paper analyzes the developer's development action behavior and constructs a prediction model according to the influence of different development behaviors on the final acceptance of the contribution to predict whether an external contribution can eventually be adopted. In the process of extracting behavior features, we consider adding history-based behavior features to effectively complement the set of features needed to construct the prediction model. In this paper, the prediction model is to solve the problem of classification of the final state of drag-and-drop requests, and a large-scale data supervised learning algorithm (support vector machine) will be used to realize the classification of large-scale data. In this paper, the performance of the selected prediction model will be compared, the selection of a suitable prediction model will be studied, and according to the existing SVM algorithm, there will be too much computation in the process of parameter optimization of the kernel function. Some problems such as learning performance and low recognition rate are improved. Finally, the prediction model for data adaptation is discussed. The innovative research contents of this paper are as follows: 1. This paper studies the acceptance strategy of drag-and-drop requests in open source systems. This paper selects and classifies the eigenvalues of GitHub massive data by machine learning common algorithm classifiers, considering the behavior characteristics of the test part and historical data. The feature set is effectively extended by introducing test coverage, human history successful submission request rate, and project historical success acceptance request rate factor into the feature set. 2. In order to improve the efficiency of grid search, this paper improves the exhaustive pattern of grid search algorithm and applies it to the construction of prediction model. A grid detection parameter selection algorithm (GDPS). Based on the combination of pattern search and grid search algorithm is proposed in this paper. The optimal parameter pairs of the SVM kernel function used to construct the prediction model are selected to improve the learning performance and the recognition rate of the SVM algorithm so as to obtain a prediction model with higher accuracy.
【學(xué)位授予單位】:哈爾濱工程大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2016
【分類號(hào)】:TP311.52
【參考文獻(xiàn)】
相關(guān)期刊論文 前5條
1 羅霖;;大規(guī)模機(jī)器學(xué)習(xí)問(wèn)題研究[J];艦船電子工程;2013年02期
2 袁霖;王懷民;尹剛;史殿習(xí);李翔;;開(kāi)源環(huán)境下開(kāi)發(fā)人員行為特征挖掘與分析[J];計(jì)算機(jī)學(xué)報(bào);2010年10期
3 顧昊;錢(qián)曉俊;梁洪亮;;開(kāi)源平臺(tái)下軟件管理技術(shù)的研究[J];計(jì)算機(jī)應(yīng)用研究;2007年08期
4 趙國(guó)棟;黃永中;;開(kāi)源軟件在高校的應(yīng)用與推廣策略研究[J];中國(guó)遠(yuǎn)程教育;2007年01期
5 伍恒,張衛(wèi)民,王靖;軟件的分布式協(xié)同開(kāi)發(fā)環(huán)境[J];吉首大學(xué)學(xué)報(bào)(自然科學(xué)版);2003年01期
相關(guān)博士學(xué)位論文 前3條
1 張利軍;大規(guī)模機(jī)器學(xué)習(xí)理論研究與應(yīng)用[D];浙江大學(xué);2012年
2 薛貞霞;支持向量機(jī)及半監(jiān)督學(xué)習(xí)中若干問(wèn)題的研究[D];西安電子科技大學(xué);2009年
3 王朝勇;支持向量機(jī)若干算法研究及應(yīng)用[D];吉林大學(xué);2008年
相關(guān)碩士學(xué)位論文 前3條
1 董亞?wèn)|;面向不平衡分類的邏輯回歸算法[D];鄭州大學(xué);2015年
2 徐奔;開(kāi)源軟件開(kāi)發(fā)人員行為特征的可視化挖掘[D];上海交通大學(xué);2013年
3 王梅;一種改進(jìn)的核函數(shù)參數(shù)選擇方法[D];西安科技大學(xué);2011年
,本文編號(hào):2455558
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2455558.html