The Impact of Score Distribution Shape on National Test Results: Evaluating the Accuracy of Estimating Item Information Function and Test Reliability for Students' Scores

Authors

DOI:

https://doi.org/10.35516/edu.v50i2%20-S1.998

Keywords:

Item response theory, item information function, test reliability, distribution shape of data

Abstract

Objectives: This study aimed to examine how the distribution of students' scores on national tests affects the accuracy of estimating the item information function and test reliability, utilizing the principles of Item Response Theory.

Methods: The study employed a descriptive comparative approach. A random sample of 12,000 male and female students from the eighth grade, who had taken the Arabic, English, Mathematics, and Science papers of The National Exam for Controlling the Quality of Education during the 2018/2019 academic year, was selected. Based on the skewness of their score distributions (positive, normal, negative), the students were divided into 12 groups, each containing 1,000 male and female students. The statistical program BILOG_MG3 was used to analyze the data, applying the Three-Parameter Model of Item Response Theory to extract the item information function and its standard error. Additionally, the theoretical and experimental reliability coefficients were determined based on the distribution of the data.

 Results: The findings revealed that scores with positive skewness had the highest mean standard error of the estimate for the item information function in national tests, and both the theoretical and empirical reliability of these tests were lower. Statistically significant differences were observed in the mean errors between data with a normal distribution and data with positive or negative skewness in the subjects of Arabic language, English language, and mathematics, while no significant differences were found in the Science subject.

Conclusions: Based on the study's results, it is recommended to consider the test information function as a criterion when designing national tests.

Downloads

Download data is not yet available.

References

Alwan, A. M., & Jasim, K. J. (2022). The Effect of the Difference in the Distribution of the Level of Ability that is Skeweded Positive for the Parameters of the Items of the Mental Ability Test According to the Item Response Theory. International Journal of Early Childhood Special Education, 14(1), 1150-1160.

Baker, F. B. (2001). The Basic Item Response Theory. ERIC.

Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. Holt, Rinehart and Winston, 6277 Sea Harbor Drive, Orlando, FL 32887.‏

Corder, W. & Foreman, I. (2014). Nonparametric statistics A Step-by-Step Approach. (2nd ed.). Wiley.

Doran, H. C. (2005). The information function for the one-parameter logistic model: Is it reliability. Educational and Psychological Measurement, 65(5), 665-675.

Embretson, S. E., & Reise, S. P. (2013). Item response theory. Psychology Press.‏

Feldt, L. S. (1980). A test of the hypothesis that Cronbach’s alpha reliability coefficient is the same for two tests administered to the same sample. Psychometrika, 45(1), 99–105.

Hambleton, R. K., & Jones, R. W. (1994). Item Parameter Estimation Errors and Their Influence on Test Information Functions. Applied Measurement in Education, 7(3), 171–186.

Hambleton, R. & Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Boston : Kluwer .

Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory (Vol. 2). Sage.‏

Lord, F.M. (1980). Application of Item Response Theory to Practical Testing Problems. Hillsdale, NJ: Erlbaum.

Ngwane, F. F (2021). Effects of Item Parameter Distributions on Test Information Function. International Journal of Mathematics And its Applications, 9(1),41–50.

Oosterwijk, P. R., van der Ark, L. A., & Sijtsma, K. (2019). Using confidence intervals for assessing reliability of real tests. Assessment, 26(7), 1207-1216.

Sass, D. A., Schmitt, T. A., & Walker, C. M. (2008). Estimating Non-Normal Latent Trait Distributions within Item Response Theory Using True and Estimated Item Parameters. Applied Measurement in Education, 21(1), 65–88.

Yuan, K.-H., & Bentler, P. M. (2002). On robustness of the normal-theory based asymptotic distributions of three reliability coefficient estimates. Psychometrika, 67, 251-259.

Zimowski, M. F., Muraki, E., Mislevy, R. J., & Bock, R. D. (2003). BILOG-MG [computer software]. Lincolnwood, IL: Scientific Software International.‏

Published

2023-08-30

How to Cite

Kreshan, A. S. M. ., & Al-Anati, J. M. M. . . (2023). The Impact of Score Distribution Shape on National Test Results: Evaluating the Accuracy of Estimating Item Information Function and Test Reliability for Students’ Scores. Dirasat: Educational Sciences, 50(2 -S1), 102–116. https://doi.org/10.35516/edu.v50i2 -S1.998
Received 2022-04-09
Accepted 2022-06-14
Published 2023-08-30