四参数Logistic模型和传统模型对被试作答拟合能力的比较研究被引量：7

A Comparison Study for the Four Parameter Logistic Model and Traditional Logistic Models

下载PDF

导出

摘要针对测验中高能力被试答错容易试题的睡眠现象,可使用四参数Logistic模型分析数据。研究选取了来自心理测验和成就测验的实际数据,分别采用传统模型和四参数Logistic模型进行拟合,对不同模型的拟合指标及参数估计结果进行比较。结果表明,四参数Logistic模型能够提高拟合程度,增强估计结果的准确性,有效纠正高能力被试能力被低估的现象。建议在必要时使用四参数Logistic模型进行数据分析。 High-ability test-takers may on occasion answer an easy question incorrectly,which is called sleeping phenomenon（Wright,1977）. In these situations,four parameter logistic model（4 PM）may be uniquely suited for characterizing the data. The 4 PM was proposed by Barton and Lord（1981）,which added the d parameter to allow upper asymptotes to be less than 1. 00. The more general formulation of the 4 PM（ Waller Reise,2010） suggestedd as an item-specific upper asymptote. Besides,a three parameter logistic model for reversed data（3 PMR） was discussed,which was suited for the situations with no guessing phenomenon but sleeping phenomenon. In the previous researches,the4 PM provided good fit for some psychological tests,such as MMPI and so on. However,for achievement tests,Barton and Lord in their earlier work found that the 4 PM failed to improve the likelihood or to significantly change any ability estimates for the datasets collected by ETS. Therefore,is it really inappropriate to use the 4 PM in achievement tests？ Moreover,most previous researches focused on the differences of parameter estimations based on simulated data. However,how often the sleeping phenomenon happen in real situations is still worth studying. In our research,we fitted seven models to the Taylor Manifest Anxiety Scale（TMA）and the large-scale Maths test. Meanwhile,the dataset of Maths tests was used to construct two different distributions：approximately normal distribution（ skewness is 0. 097） and negatively skewed distribution（ skewness is-0. 199）. The models compared were Rasch model,two parameter logistic model（2 PM）,three parameter logistic model（3 PM）,3 PM with reversing scores on each item（3 PM_R）,4 PM,4 PM with equal guessing parameters（4 PM_c） and 4 PM with equal d parameters（4 PM_d）. The R package sirt was used to estimate all the models in our study. In order to investigate the differences of these models,we computed：（1） the model fit index AIC,BIC;（2） the correlations of the item parameter estimations of the best fitted logistic model with d parameter and the second best model without d parameter,for all the items and after the easiest 5,10,and 10 items were deleted;（3） the correlations of the ability parameter estimations of the two models discussed in（2）,for all and the top 1000,500,300,200,100 respondents. The results indicated that（1）the Rasch model showed the worst fit for all the datasets. For TMA data,the 3 PMR showed the best fit,for the Maths tests,the 4 PM showed the best fit;（2） the difficulty parameters were quite similar inthe two compared models,however,there was lager difference between the discrimination parameters,the negatively skewed Standard Maths test data showed similar results;when the easiest items were deleted,the correlation of the discrimination parameters became larger,especially for the negatively skewed Standard Maths test;（3） the ability parameters of two compared models correlated highly across all groups of respondents,however,the correlations for the top 1000,500,300,200,100 groups were relatively small,especially for the top 100 respondents. In conclusion,the 4 PM is necessary in both psychological tests and achievement tests. For practitioners who should make a decision about whether to choose the 4 PM,the type of the tests,the purpose of the tests,and the complexity of the computation should be considered at the same time.

作者刘玥刘红云 Liu Yue;Liu Hongyun(School of Psychology, Beijing Normal University, Beijing 100875)

机构地区北京师范大学心理学部

出处《心理学探新》 CSSCI 北大核心 2018年第3期228-235,共8页 Psychological Exploration

关键词项目反应理论睡眠现象四参数Logistic模型 item response theory sleeping phenomenon four - parameter logistic model

分类号 B841.2 [哲学宗教—基础心理学]

引文网络
相关文献

参考文献3

1简小珠,戴海崎,彭春妹.IRT中Logistic模型的c、γ参数对能力估计的改善[J].心理学报,2007,39(4):737-746. 被引量：6
2简小珠,焦璨,Steven P.Reise,彭春妹.四参数模型对被试作答异常现象的拟合与纠正[J].心理科学进展,2010,18(3):537-544. 被引量：7
3简小珠,张敏强,彭春妹.四参数Logistic模型研究进展及其评析[J].心理学探新,2010,30(3):69-73. 被引量：8

二级参考文献24

1TIAN Maozai & CHEN Gemai School of Statistics, Renmin University of China, Beijing 100872, China and Center for Applied Statistics, Renmin University of China, Beijing 100872, China,Department of Mathematics and Statistics, University of Calgary, Canada.Hierarchical linear regression models for conditional quantiles[J].Science China Mathematics,2006,49(12):1800-1815. 被引量：20
2戴海崎,简小珠.被试作答的偶然性对IRT能力估计的影响研究[J].心理科学,2005,28(6):1433-1436. 被引量：6
3吴德福.四参数Logistic函数的简化求解[J].中华核医学杂志,1990,10:48-48.
4简小珠,戴海崎,彭春妹.IRT中Logistic模型的c、γ参数对能力估计的改善[J].心理学报,2007,39(4):737-746. 被引量：6
5Wright B D.Solving measurement problems with the research model.Journal of Educational Measurement,1977,14:97-116.
6Reise S P,Waller N G.How many IRT parameters does it take to model psychopathology items?Psychological Methods,2003,8(2):164-184.
7Hessen D J.A new class of parametric IRT models for dichotomous item scores.Journal of Applied Measurement,2004,5(4):385-397.
8McDonald R P.Non-linear factor analysis.Psychometric Monographs,1967:15.
9Barton M A,Lord F M.An upper asymptote for the three-parameter Logistic item response model.In:Research Bulletin.Princeton,NJ:Educational Testing Service,1981:81-20.
10Hambleton R K,Swaminathan H.Item response theory:Principles and applications.Boston:Kluwer-Nijhoff,1985:48-49.

共引文献14

1简小珠,焦璨,Steven P.Reise,彭春妹.四参数模型对被试作答异常现象的拟合与纠正[J].心理科学进展,2010,18(3):537-544. 被引量：7
2简小珠,张敏强,彭春妹.四参数Logistic模型研究进展及其评析[J].心理学探新,2010,30(3):69-73. 被引量：8
3王宇,帅斌,李季涛.基于生长曲线的中国铁路网生命周期判定[J].交通运输系统工程与信息,2015,15(1):23-29.
4简小珠,张敏强.IRT下猜测现象和失误现象的原因阐释与数学推导[J].考试研究,2015,11(4):56-60.
5简小珠,戴海琦.4参数GRM对猜测现象和失误现象的纠正[J].江西师范大学学报（自然科学版）,2016,40(2):140-144. 被引量：4
6简小珠,戴海琦.“CAT初始作答影响最终成绩”的模拟分析与纠正[J].心理学探新,2016,36(3):276-280. 被引量：1
7杨思亮.人格测验中的反应偏差及其处置[J].太原城市职业技术学院学报,2013(11):143-144.
8杨小露,张红,张春晖.历史遗址类旅游地的生命周期研究——以美国14家历史遗址公园为例[J].地理科学进展,2019,0(6):918-929. 被引量：3
9刘玥,刘红云.心理与教育测验中异常作答处理的新技术:混合模型方法[J].心理科学进展,2021,29(9):1696-1710. 被引量：1
10张玉柳,赵波,陶金洪.基于模糊认知诊断模型的学生认知状态研究[J].江西师范大学学报（自然科学版）,2021,45(5):452-459. 被引量：2

同被引文献47

1吴江,黄震方.旅游地生命周期曲线模拟的初步研究——Logistic曲线模型方法的应用[J].地理与地理信息科学,2004,20(5):91-94. 被引量：38
2杨效忠,陆林.旅游地生命周期研究的回顾和展望[J].人文地理,2004,19(5):5-10. 被引量：35
3保继刚,彭华.旅游地拓展开发研究——以丹霞山阳元石景区为例[J].地理科学,1995,15(1):63-70. 被引量：58
4保继刚.喀斯特洞穴旅游开发[J].地理学报,1995,50(4):353-359. 被引量：100
5戴海崎,简小珠.被试作答的偶然性对IRT能力估计的影响研究[J].心理科学,2005,28(6):1433-1436. 被引量：6
6刘声涛,戴海崎,周骏.新一代测验理论—认知诊断理论的源起与特征[J].心理学探新,2006,26(4):73-77. 被引量：50
7孙根年,薛刚.25年来秦俑馆旅游生命周期与结构变化研究[J].干旱区地理,2007,30(2):283-288. 被引量：26
8陆林.山岳型旅游地生命周期研究——安徽黄山、九华山实证分析[J].地理科学,1997,17(1):63-69. 被引量：131
9赵焕英,包金风.实时荧光定量PCR技术的原理及其应用研究进展[J].中国组织化学与细胞化学杂志,2007,16(4):492-497. 被引量：135
10陈青,丁树良.三参数等级反应模型及其信息函数的应用[J].考试研究,2009,5(2):77-84. 被引量：2

引证文献7

1侯红渠,邓玉林,樊云龙,吕雪飞,李晓琼.便携式实时荧光定量PCR仪数据处理方法研究[J].生命科学仪器,2023,21(5):1-7.
2杨小露,张红,张春晖.历史遗址类旅游地的生命周期研究——以美国14家历史遗址公园为例[J].地理科学进展,2019,0(6):918-929. 被引量：3
3张玉柳,赵波.深度学习视角下学习者模糊认知地图的构建与应用[J].现代教育技术,2021,31(11):37-45. 被引量：8
4张玉柳,赵波,陶金洪.基于模糊认知诊断模型的学生认知状态研究[J].江西师范大学学报（自然科学版）,2021,45(5):452-459. 被引量：2
5金英姿,王佶旻.四参数Logistic模型与双参数、三参数Logistic模型在语言测验中的拟合比较及睡眠现象检验--以来华留学生预科结业考试为例[J].中国考试,2022(8):57-65. 被引量：1
6童昊,喻晓锋,秦春影,彭亚风,钟小缘.多级计分测验中基于残差统计量的被试拟合研究[J].心理学报,2022,54(9):1126-1140. 被引量：1
7曾光,张玉玲,谢晓尧,黎瑞源.一种改进的4参数等级反应模型和应用[J].江西师范大学学报（自然科学版）,2023,47(2):124-132.

二级引证文献15

1卢嘉新,林雅情,王敏.庐山马尾水旅游发展路径探析[J].台湾农业探索,2020(3):54-60.
2梁永国,王立鑫,韩志勇.基于Logistic模型的沿海地区旅游地生命周期研究--以河北省为例[J].经济论坛,2021(12):70-81. 被引量：2
3SUN Yehong,YAO Cancan,CHEN Yuexin,SONG Yuxin,WANG Ying.Ecological Theory and Practice in Tourism Research in the New Era[J].Journal of Resources and Ecology,2022,13(1):142-160.
4罗生全,陈卓,张熙.基于增值评价的学生作业设计价值向度及优化策略[J].中国教育科学（中英文）,2022,5(4):83-93. 被引量：5
5廖正山,李曼丽.在线课程作业设计策略——基于八门在线课程样本的分析证据[J].开放教育研究,2022,28(5):79-92. 被引量：1
6秦春影,喻晓锋.多级属性Q矩阵的验证与估计[J].心理学报,2022,54(11):1403-1415.
7赖风,张曦文.基于增值评价的高校思政育人价值及集成创新[J].江苏高教,2023(4):113-119. 被引量：6
8曾光,张玉玲,谢晓尧,黎瑞源.一种改进的4参数等级反应模型和应用[J].江西师范大学学报（自然科学版）,2023,47(2):124-132.
9罗江华,张玉柳.多模态大模型驱动的学科知识图谱进化及教育应用[J].现代教育技术,2023,33(12):76-88. 被引量：13
10廖磊,李辉,杜庆庆,宋坤.认知地图视角下工贸企业安全生产责任制审计的研究[J].工业安全与环保,2023,49(S02):66-69.

1丁鹏勋.别再封建迷信了，“鬼压床”只是一种睡眠现象[J].党课,2017,0(18):78-79.
2冯智文.美国中小学成就测验的功能和作用[J].云南教育（视界）,2007,0(6X):47-48.
3杨宪华,金敏,郑林科.大学生生活应激源对心理健康影响的预测模型[J].中国健康心理学杂志,2018,26(5):775-778. 被引量：8
4高景林.洲际弹道导弹的水下部署[J].中国航天,1986(10):28-30.
5白影,李凤,杨毅,龚玲.动态脑电图对急性昏迷早期病人的预后评估[J].癫痫与神经电生理学杂志,2018,27(2):93-96. 被引量：2
6高精度无损检测设备(工业CT)[J].中国科技信息,2018(9):1-1.
7王爱霞,刘延锦,郭园丽,郭丽娜,董小方.心理一致感在脑卒中患者心理压力与抑郁间的中介作用[J].中华现代护理杂志,2018,24(18):2118-2122. 被引量：18
8贺腾飞,梁宝勇.大学生创新能力问卷的编制[J].现代预防医学,2018,45(11):2108-2112. 被引量：10
9杜赟鹏.现代心理测验中防止作假的方式[J].价值工程,2017,36(32):198-199.
10凌爱凡,陈骁阳.嵌入GARCH波动率估计的B1ack-Litterman投资组合模型[J].中国管理科学,2018,26(6):17-25. 被引量：7

心理学探新

2018年第3期

浏览历史

内容加载中请稍等...

四参数Logistic模型和传统模型对被试作答拟合能力的比较研究被引量：7

参考文献3

二级参考文献24

共引文献14

同被引文献47

引证文献7

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

四参数Logistic模型和传统模型对被试作答拟合能力的比较研究 被引量：7

参考文献3

二级参考文献24

共引文献14

同被引文献47

引证文献7

二级引证文献15

相关作者

相关机构

相关主题

浏览历史

四参数Logistic模型和传统模型对被试作答拟合能力的比较研究被引量：7