基于SVM的初等數(shù)學問題自動分類的研究與應(yīng)用
[Abstract]:As we all know, with the rapid development of computer information technology, information technology has been applied in every aspect of our life. In the field of education, people's eyes have gradually shifted from offline tutoring and manual marking of examination papers to intelligent Internet education based on artificial intelligence. One of the important prerequisites for the realization of this new concept of mathematical education is to transform the text into natural language. In popular terms, it is to convert mathematical statements understood by human beings into pre-defined computer storage knowledge. To allow the computer to handle the next step. These processing mainly have the solution, as well as the whole flow judgment paper and so on. This premise can also be called natural language processing process. Classification is the main problem in the process of natural language processing. This paper is mainly divided into two parts. The first part is the participle of elementary mathematical problem text, as well as part of speech tagging and named entity recognition. In the second part, the paper classifies the text of elementary mathematics problem based on SVM, and then transforms it into the representation of computer reasoning according to different categories. In English, there is a space between each word, but Chinese is different, all characters are connected together, so the Chinese text should be partitioned. However, mathematical expressions contain more symbols with specific meanings, so the general participle method is not feasible. Therefore, it is necessary to construct a special participle for mathematical expression. Similarly, the entities expressed in mathematical language are different from those expressed in common language. The entities of common language are more time, place, name and so on. In mathematical expressions, the entities that contain important information are often mathematical nouns, such as triangles, equations and so on. Therefore, it is necessary to define a specific named entity for the primary mathematical direction and then extract it. In this paper, conditional random fields are used to label named entities. There are many types involved in elementary mathematics problems. In order to solve elementary mathematical problems automatically, the first thing to do is to classify the problems and then call the corresponding solving methods according to different categories. The text preprocessing of primary mathematical problem text tagged by named entity model includes deactivating words and establishing word bag model. In this paper, chi-square statistics are used to select text feature vectors. In this way, the feature vector can reduce the computational cost and maintain the classification accuracy by selecting dimensionality reduction. Finally, according to the method proposed in this paper, the support vector machine (SVM) is used to implement a system for extracting named entities from elementary mathematical problems and classifying them. The system can accurately label named entities and provide knowledge representation for later problem solving and so on. At the same time, effective topic classification can be used as inference pruning for later problem solving or marking.
【學位授予單位】:電子科技大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP391.1;O12
【參考文獻】
相關(guān)期刊論文 前10條
1 奚雪峰;周國棟;;面向自然語言處理的深度學習研究[J];自動化學報;2016年10期
2 何苑;郝夢巖;;基于自然語言處理的計算機專業(yè)數(shù)學課程教學研究[J];長治學院學報;2016年02期
3 邱均平;方國平;;基于知識圖譜的中外自然語言處理研究的對比分析[J];現(xiàn)代圖書情報技術(shù);2014年12期
4 李海艦;田躍新;李文杰;;互聯(lián)網(wǎng)思維與傳統(tǒng)企業(yè)再造[J];中國工業(yè)經(jīng)濟;2014年10期
5 王宇;邵洪雨;;基于主題詞提取的國內(nèi)自然語言處理研究現(xiàn)狀分析[J];情報科學;2013年03期
6 唐釗;;條件隨機場模型在中文人名識別中的研究與實現(xiàn)[J];現(xiàn)代計算機(專業(yè)版);2012年21期
7 楊皓東;江凌;李國俊;;國內(nèi)自然語言處理研究熱點分析——基于共詞分析[J];圖書情報工作;2011年10期
8 付年鈞;彭昌水;王慰;;中文分詞技術(shù)及其實現(xiàn)[J];軟件導刊;2011年01期
9 周穎;袁鶯;馬玉慧;任峗;;小學數(shù)學應(yīng)用題自動解答特征分析及研究路線[J];中國電化教育;2010年08期
10 李國臣;王瑞波;李濟洪;;基于條件隨機場模型的漢語功能塊自動標注[J];計算機研究與發(fā)展;2010年02期
相關(guān)博士學位論文 前2條
1 計峰;自然語言處理中序列標注模型的研究[D];復(fù)旦大學;2012年
2 魯松;自然語言處理中詞相關(guān)性知識無導獲取和均衡分類器構(gòu)建[D];中國科學院研究生院(計算技術(shù)研究所);2001年
相關(guān)碩士學位論文 前8條
1 張磊磊;基于Hadoop和SVM算法的中文文本分類的研究與實現(xiàn)[D];昆明理工大學;2015年
2 王綱;一種改進隱條件隨機場模型的行為識別方法[D];西安電子科技大學;2014年
3 王鵬;基于Lucene的中文分詞技術(shù)研究與實現(xiàn)[D];浙江工商大學;2014年
4 張碩果;基于條件隨機場模型的文本分類研究[D];重慶大學;2010年
5 毛玉才;基于語義網(wǎng)技術(shù)的語義檢索系統(tǒng)模型研究[D];黑龍江大學;2008年
6 王秋;淺析自然語言理解及其應(yīng)用[D];陜西師范大學;2008年
7 王宇寧;隱馬爾可夫模型在信息抽取中的應(yīng)用研究[D];大連理工大學;2007年
8 趙俊霞;中學數(shù)學教師專業(yè)知識的發(fā)展[D];東北師范大學;2006年
,本文編號:2298820
本文鏈接:http://sikaile.net/kejilunwen/yysx/2298820.html