可修改答案的CD-CAT的研究
發(fā)布時(shí)間:2018-03-26 21:05
本文選題:認(rèn)知診斷計(jì)算機(jī)化自適應(yīng) 切入點(diǎn):可修改答案 出處:《江西師范大學(xué)》2016年碩士論文
【摘要】:和以往的紙筆測(cè)驗(yàn)(Paper And Pencil Based Test,PP)相比計(jì)算機(jī)化自適應(yīng)測(cè)驗(yàn)(Computerized Adaptive Testing,CAT)根據(jù)被試的作答反應(yīng)自適應(yīng)地選擇題目,CAT既減少了測(cè)驗(yàn)的長度,并且顯著提高了測(cè)驗(yàn)的精度。認(rèn)知診斷計(jì)算機(jī)化自適應(yīng)測(cè)驗(yàn)(Cognitive Diagnostic Computerized Adaptive Testing,CD-CAT)是認(rèn)知診斷理論和計(jì)算機(jī)化自適應(yīng)測(cè)驗(yàn)的理論相結(jié)合的產(chǎn)物,它不僅具有CAT的特點(diǎn),同時(shí)還具有診斷的功能,CD-CAT旨在對(duì)個(gè)體的認(rèn)知過程、加工技能或知識(shí)結(jié)構(gòu)進(jìn)行診斷,從而為后續(xù)的補(bǔ)救性教學(xué)提供有效的借鑒,它更強(qiáng)調(diào)考察被試內(nèi)部的加工過程。然而,目前絕大多數(shù)CAT和CD-CAT不允許被試修改答案,研究者主要擔(dān)心修改答案會(huì)降低它們的有效性。允許修改答案符合被試一貫的測(cè)驗(yàn)習(xí)慣,修改之后的分?jǐn)?shù)更能反映被試真實(shí)的水平,從而能夠進(jìn)一步促進(jìn)CAT和CD-CAT在實(shí)際中的應(yīng)用。已有的研究主要從三個(gè)方面提出了可修改答案CAT的控制方法:一是測(cè)驗(yàn)設(shè)計(jì);二是改進(jìn)選題策略;三是建構(gòu)模型。Han(2013)提出的題目口袋法(Item Pocket,IP)是目前具有較好應(yīng)用前景的可修改答案的CAT(Reviewable CAT)控制方法,IP法的思路是計(jì)算機(jī)為被試提供了一種題目口袋選擇,即允許被試作答過程中,隨時(shí)可以把待修改的題目或者暫時(shí)想跳過的題目放入IP中,然后接著作答下一個(gè)題目,放入IP內(nèi)的題目不參與當(dāng)前能力估計(jì)。當(dāng)IP容量已滿后,被試需要替換一題才能再次放入。IP法的缺點(diǎn)是其容量不容易控制,容量過大將導(dǎo)致較大的估計(jì)誤差。本文在IP方法的基礎(chǔ)之上加以改進(jìn),提出了新計(jì)分的題目口袋法(Modified IP,MIP),即對(duì)放入IP內(nèi)修改的題目重新計(jì)分。與IP法相比,Stocking(1997)的設(shè)計(jì)對(duì)修改行為有較多的限制,Stocking設(shè)計(jì)1允許考生在答完所有題目后,返回修改固定數(shù)量的題目,修改后的作答并沒有體現(xiàn)在自適應(yīng)選題中;Stocking設(shè)計(jì)2是允許被試修改單獨(dú)限時(shí)題目單元內(nèi)的答案。在測(cè)驗(yàn)過程中將題目按照先后順序劃分為固定長度的題目單元,題目單元長度根據(jù)實(shí)際需要而規(guī)定。被試可以在單元內(nèi)對(duì)題目進(jìn)行檢查并修改,計(jì)算機(jī)根據(jù)被試當(dāng)前單元的作答來選擇下一個(gè)單元,提交答案后的單元不允許再次返回修改。與Stocking設(shè)計(jì)一相比,被試在Stocking設(shè)計(jì)二中對(duì)測(cè)驗(yàn)有更多的掌控,不管測(cè)驗(yàn)被分割為多少個(gè)小單元,被試還可以修改所有的題目,但設(shè)計(jì)一中只能修改固定數(shù)量的題目。另外設(shè)計(jì)二中被試修改單元內(nèi)答案會(huì)影響下一個(gè)單元的選擇,計(jì)算機(jī)會(huì)根據(jù)被試修改后的能力估計(jì)值選擇下一個(gè)單元。而設(shè)計(jì)一中修改題目之后的能力估計(jì)值并沒有體現(xiàn)在自適應(yīng)選題上。從這點(diǎn)來看設(shè)計(jì)二要比設(shè)計(jì)一更符合CAT的規(guī)則。以上幾種修改答案的設(shè)計(jì)方法在CAT的應(yīng)用中各有優(yōu)缺點(diǎn)(IP、MIP、Stocking設(shè)計(jì)1和Stocking設(shè)計(jì)2),CD-CAT是由CAT的進(jìn)一步發(fā)展而來,但兩者之間又有很大的區(qū)別,為了驗(yàn)證上述方法在可修改答案的CD-CAT(Reviewable Cognitive Diagnostic Computerized Adaptive Testing,RCD-CAT)的效果,模擬研究分別采用了DINA和R-RUM模型,假設(shè)被試知識(shí)狀態(tài)和題庫都服從均勻分布,模擬生成5000個(gè)被試,300容量的題庫,知識(shí)狀態(tài)的估計(jì)采用極大似然估計(jì)方法(Maximum Likelihood Estimation,MLE),屬性考察個(gè)數(shù)分別是5個(gè)和7個(gè),選題策略包括:Kullback Leibler(KL)、Posterior Weighted KL(PWKL)、Hybrid KL(HKL)和Modified Posterior-Weighted KL(MPWKL),測(cè)驗(yàn)長度分為10題和20題。通過通過蒙特卡洛模擬研究發(fā)現(xiàn):第一,與傳統(tǒng)不修改答案的CD-CAT相比,本文提到的RCD-CAT方法,可以在不損失診斷精度和題庫曝光率的基礎(chǔ)上,允許學(xué)生修改答案,這符合學(xué)生一般作答行為習(xí)慣,減少學(xué)生做答的負(fù)擔(dān)及焦慮程度,更易被大眾接受。第二,當(dāng)研究采用了DINA模型的時(shí)候,MIP法和IP法的效果沒有太大區(qū)別,結(jié)果表明MIP的效果依賴于被試的作答概率分布。第三,在所有的研究條件下,Stocking設(shè)計(jì)的模式判準(zhǔn)率要高于其他三種方法,其中,Stocking設(shè)計(jì)2的模式判準(zhǔn)率略微優(yōu)于Stocking設(shè)計(jì)1,結(jié)果表明Stocking設(shè)計(jì)在RCD-CAT的應(yīng)用中具有良好的前景。總之,RCD-CAT不僅符合被試一貫的測(cè)驗(yàn)習(xí)慣,并且通過修改答案有助于提高診斷的準(zhǔn)確率,進(jìn)一步而言本研究有助于進(jìn)一步為RCD-CAT和RCAT的研究和實(shí)踐應(yīng)用提供理論和方法支持。
[Abstract]:And the paper and pencil test (Paper And Pencil Based Test, PP) compared with the computerized adaptive test (Computerized Adaptive, Testing, CAT) according to the responses of subjects to choose the adaptive problem, CAT can not only reduce the test length, and improve the precision of the test. The computer adaptive test (Cognitive Diagnostic Computerized cognitive diagnosis Adaptive Testing, CD-CAT) is a product of the theory of cognitive diagnosis and computerized adaptive test combined with the theory, it not only has the characteristics of CAT, but also has the function of diagnosis, CD-CAT to the cognitive process of individual diagnosis, processing skills or knowledge structure, so as to provide effective reference for subsequent remedial teaching, it more emphasis on the internal process of study subjects. However, the vast majority of CAT and CD-CAT were not allowed to modify the answer, researchers worry about repair Change your answer will reduce their effectiveness. Allows you to modify the answer with subjects consistently test habit, after the modification of the scores can reflect the true level of the subjects, so as to further promote the application of CAT and CD-CAT in practice. The previous research mainly from three aspects put forward the control method can modify the answer: CAT is a test design; two is to improve the selection strategy; three is to construct the model of.Han (2013) put forward the topic of pocket (Item Pocket, IP) method is a good application prospect can be modified to answer CAT (Reviewable CAT) control method, IP method is the idea of computer provides a topic selection for pocket try, which allows participants to answer process, ready to take the title to be modified or temporarily want to skip the title in IP, and then answer the next question, put into IP subject is not involved in the current capacity when the IP volume estimation. After the full amount, the subjects need to replace the one to put the.IP back the disadvantage of its capacity is not easy to control, the capacity will lead to larger estimation error. This paper improved on the basis of IP method, put forward the new method of scoring title pocket (Modified IP, MIP), that is to put into IP modify title re score. Compared with IP, Stocking (1997) design are more restricted to modify the behavior of Stocking 1 in the design allows candidates to answer all questions, return to modify a fixed number of questions, answers are not reflected in the modified adaptive selection; Stocking design are allowed 2 subjects modification of separate units within the limit question answer. In the test process will be subject in accordance with the order is divided into fixed length unit title, title of unit length stipulated according to the actual needs. The subjects in unit of title can check and Modify the computer to select the next unit according to answer participants of the current unit, the unit is not allowed to submit the answer back again. And a modified Stocking design compared to the subjects in the Stocking design of the second test has more control, no matter how many tests are divided into small units, participants can also modify all the title, but a design can only modify the fixed number of topics. The second was another design modification unit the answer will be under the influence of a unit selection, the computer will according to the ability of the modified estimates to select the next unit. Design capacity after a modified subject is not reflected in the adaptive estimation topic. From this point of view to design a more than two design conform to the rules of CAT. The design methods of the above several changing answers have both advantages and disadvantages in the application of CAT (IP, MIP, Stocking and Stocking 1 2), CD-CAT is a further development of the CAT and come, but also there is the big difference between the two, in order to verify the method can modify the answer to CD-CAT (Reviewable Cognitive Diagnostic Computerized Adaptive Testing, RCD-CAT) the effect of simulated respectively using DINA and R-RUM model, hypothesis subjects knowledge and questions obey uniform distribution, simulation of 5000 subjects, 300 capacity questions, estimation method to estimate the state of knowledge of the maximum likelihood (Maximum Likelihood, Estimation, MLE), the number of properties investigated were 5 and 7, the strategies include: Kullback Leibler (KL), Posterior Weighted KL (PWKL). Hybrid KL (HKL) and Modified Posterior-Weighted KL (MPWKL), the test length is divided into 10 and 20 questions. Through the research by Monte Carlo simulation found that: first, compared with the traditional answer does not modify the CD-CAT, the RC The D-CAT method can, without loss of diagnostic accuracy and database exposure rate, allows the student to change the answer, which is consistent with the general answer student behavior, reduce the burden of students and the degree of anxiety and answer, more easily accepted by the public. In second, when the time of the DINA model, MIP method and IP method the effect is not much difference, the results show that the MIP effect depends on the probability distribution of the subjects to answer. Third, in all conditions, Stocking design pattern match ratio is higher than the other three methods, the design of Stocking 2 model accuracy slightly better than the Stocking 1, the results show that the Stocking design has a good prospect in the application of RCD-CAT. In short, RCD-CAT is not only in conformity with the subjects consistently test habits, and by modifying the answer is helpful to improve the accuracy of diagnosis, further this study helps to further RCD-CAT It provides theoretical and method support for the research and practical application of RCAT.
【學(xué)位授予單位】:江西師范大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2016
【分類號(hào)】:B841
【參考文獻(xiàn)】
相關(guān)期刊論文 前5條
1 林U,
本文編號(hào):1669575
本文鏈接:http://sikaile.net/shekelunwen/xinlixingwei/1669575.html
最近更新
教材專著