面向數(shù)據(jù)挖掘的關(guān)系型領(lǐng)域知識(shí)融合方法研究
[Abstract]:Most of the existing data mining technologies are based on the original level. The corresponding mining methods are domainless knowledge fusion or the process of realizing knowledge discovery by integrating domain knowledge with the user's participation. However, there are hierarchical differences in data in practical application areas, some of which are raw, others that are closely related to others, And the proper combination or generalization granularity of these related data may better reveal its inherent law. Therefore, to make full use of domain knowledge related to raw data to guide the work of data mining, we can "observe and analyze the same problem from very different granularity", so as to obtain knowledge at a reasonable data level. Flexible conversion at different data levels, free commutation, no difficulty, this has become an important research topic. In view of the fact that a large number of data exist in the field of practical application, there is domain knowledge in the form of attribute extension or extension, and most of such domain knowledge appears in the form of relational tables. Therefore, this paper focuses on the representation of relational domain knowledge and its fusion with data mining research, so that knowledge discovery can be carried out automatically and effectively. The main work of this paper is as follows: (1) A structured representation model based on relational model domain knowledge (DKMRM (Domain Knowledge of Multi-Relations Model,DKMRM) is proposed. In the model, the relational model is used to map or project the domain knowledge of the related attributes in the data table, so as to form the contextual table of domain knowledge, and then form a complex multi-relational representation model. When mining for relational database system, some raw data can be generalized or exemplified to a reasonable level by using this model and necessary transformation strategy. (2) the research work of data mining based on DKMRM. A relational domain knowledge fusion method for data mining is proposed. Taking the classification problem as a practical case, the framework of classification mining method for integrating relational domain knowledge is established. In view of the limitations of traditional mining methods, the framework of this method effectively solves the problem of transfer source, transfer path and termination strategy. (3) A multi-relational classification mining algorithm CC-DKMR (Classification of Characters based on Domain Knowledge of Multi-Relations, based on attribute selection is proposed. CC-DKMR) and CS-DKMR (Classification of Sheets based on Domain Knowledge of Multi-Relations,CS-DKMR), a multi-relational classification mining algorithm based on relational table selection, to seek for mining patterns and flexible transformation mechanisms at different data granularity levels. Acquire more valuable knowledge from domain knowledge. Experimental results show that this method is effective. (4) A method for evaluating the fusion of domain knowledge in the evaluation stage of data mining is proposed to solve the "oracle" phenomenon in the algorithm (program) of data mining. It is difficult for traditional evaluation methods to be adaptive. Based on the metamorphosis testing technology, the method effectively utilizes domain knowledge, and carries out research and analysis on the specific cases of classification, association and clustering mining algorithm, and constructs the metamorphosis relation for the specific algorithm. The experimental results show that this method can effectively achieve the purpose of evaluation and is applicable to other fields.
【學(xué)位授予單位】:合肥工業(yè)大學(xué)
【學(xué)位級(jí)別】:博士
【學(xué)位授予年份】:2016
【分類號(hào)】:TP311.13
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 謝亮;張晶;胡學(xué)鋼;;主從關(guān)系數(shù)據(jù)庫(kù)中關(guān)聯(lián)規(guī)則挖掘算法研究[J];合肥工業(yè)大學(xué)學(xué)報(bào)(自然科學(xué)版);2009年05期
2 董國(guó)偉;徐寶文;陳林;聶長(zhǎng)海;王璐璐;;蛻變測(cè)試技術(shù)綜述[J];計(jì)算機(jī)科學(xué)與探索;2009年02期
3 彭珍;楊炳儒;李冬艷;侯偉;寧頂利;;多關(guān)系數(shù)據(jù)分類方法綜述[J];計(jì)算機(jī)工程與應(yīng)用;2008年34期
4 何軍;劉紅巖;杜小勇;;挖掘多關(guān)系關(guān)聯(lián)規(guī)則[J];軟件學(xué)報(bào);2007年11期
5 徐光美;楊炳儒;張偉;寧淑榮;;多關(guān)系數(shù)據(jù)挖掘方法研究[J];計(jì)算機(jī)應(yīng)用研究;2006年09期
6 李道國(guó);苗奪謙;杜偉林;;粒度計(jì)算在人工神經(jīng)網(wǎng)絡(luò)中的應(yīng)用[J];同濟(jì)大學(xué)學(xué)報(bào)(自然科學(xué)版);2006年07期
7 ;A Granular Computing Model Based on Tolerance relation[J];The Journal of China Universities of Posts and Telecommunications;2005年03期
8 朱靖波,陳文亮;基于領(lǐng)域知識(shí)的文本分類[J];東北大學(xué)學(xué)報(bào);2005年08期
9 吳鵬,施小純,唐江峻,林惠民,陳宗岳;關(guān)于蛻變測(cè)試和特殊用例測(cè)試的實(shí)例研究(英文)[J];軟件學(xué)報(bào);2005年07期
10 李道國(guó),苗奪謙,張紅云;粒度計(jì)算的理論、模型與方法[J];復(fù)旦學(xué)報(bào)(自然科學(xué)版);2004年05期
,本文編號(hào):2370556
本文鏈接:http://sikaile.net/shoufeilunwen/xxkjbs/2370556.html