ABSTRACT

Inter-rater reliability models of assessment attribute variance in ratings to errors of measurement. Because such errors are random by nature, they are inherently inaccessible to modeling. Inter-rater reliability models fail to take into account that assessors frequently have very good, observation-based reasons for rating pilots one way rather than another. Most significantly, inter-rater reliability models treat assessment as a measurement issue.