Reading ------- For the last class, I'd like you to read some material on the class of psychometric models known as "cognitive diagnosis models" (CDMs) or sometimes "diagnostic classification models (DCMs; same thing). ** Miscellaneous background: These models are basically two-layer Bayes nets with hidden nodes, that allow us to enumerate the KCs necessary to solve particular tasks, and allow us to make inferences about whether individuals do or don't possess those KCs based on their patterns of performance on specific sets of tasks. (We can also make inferences about how good or bad the tasks are at eliciting information about the KCs, which is also a useful psychometric activity.) The models are closely related to knowledge tracing models as discussed in Corbett, A. T., Anderson, J. R. and O'Brien, A. T. (1995). Student modeling in the ACT programming tutor. Chapter 2 in Nichols, P. D., Chipman, S. F. and Brennan, R. L. (eds.) (1995). Cognitively diagnostic assessment. Hillsdale, NJ: Lawrence Erlbaum Associates. CDMs are distinct from KT of the type described in this article mainly in that they allow for making inferences abour more than one skill at a time, from individual task performance. (of course several task performances must be looked at together to reduce the SE of the inference, as usual [more tasks --> more reliability!]). A paper that discusses these ideas fairly carefully, before they caught on in psychometrics, is VanLehn, K., Niu, Z., Siler, S. and Gertner, A. (1998). Student modeling from conventional test data: a Bayesian approach without priors. pp. 434--443 in Goetl, B. et al. (Eds.) (1998). Proceedings of the Intelligent Tutoring Systems Fourth International Conference, ITS 98. Berlin, Hiedelberg: Springer-Verlag. ** What I actually want you to read: Unfortunately I'm not aware of a single good, short discussion of CDMs. I'm going to suggest you look at several sources. Pdf's are in the "Reading" subdirectory for this lecture, unless otherwise indicated. * One that I like is Junker-sijtsma-apm-2001.pdf. But it really only makes sense to read up through p. 266, for an introduction. * de la Torre-dina-est-115-30-jebs.pdf is also pretty good, up through about p. 118. It focuses on the particular model, the DINA model, that we will consider in class. * For a broader view, rupp-and-templin-review-article.pdf is good, though rather pedantic. This paper was published with several discussions (pro and con), and the links are all here: http://www.tandfonline.com/toc/hmes20/6/4 Data Analysis ------------- There is a nice R package (CDM) for analyzing data with the DINA model. Unfortunately Stampfer's data is not coded in a way that makes DINA analysis very interesting. So instead we will look at some of the data from Koedinger & Nathan's (2004) DFA study of story problems. The following stuff is in the data subdirectory for today's lecture: DFA1&2-hao.csv -- the data! cdm.r -- the R code! Koedinger-Nathan-LS04.pdf -- the article! I think you have looked at Ken's article in class already but if not please skim it. Also, please have a look at, and try, the code in cdm.r. It's actually kind of neat. See you in class, -BJ