UPSI Digital Repository (UDRep)


Type :thesis
Subject :QA Mathematics
Main Author :Faiz Zulkifli
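Title :Pembangunan model regresi ordinal teori respons item teguh dalam meramal prestasi gred peperiksaan akhir pelajar [Development of a robust item response theory ordinal regression model for predicting students' final examination grade performance]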
Place of Production :Tanjong Malim
Publisher :Fakulti Sains dan Matematik
Year of Publication :2021
Corporate Name :Universiti Pendidikan Sultan Idris

Abstract :
This study aims to develop a robust item response theory (IRT; TRI in Malay) ordinal regression model for predicting students' final examination grade performance. The model was developed from an ordinal regression model, namely the cumulative odds model (MGK, from the Malay "model ganjil kumulatif"), and a systematic literature review. The MGK was modified by incorporating IRT and robust M-estimation (Huber and Tukey bisquare weights). The study sample consisted of 326 students from a public university in Malaysia who were enrolled in STEM-related courses. In addition, six experts in statistics were involved in validating the quality of the sample question items used. The data were analysed using descriptive analysis, Cohen's kappa index of agreement, factor analysis, Rasch measurement analysis, diagnostic plots and model fitting. The goodness of fit of the developed model was tested on real and simulated data. Monte Carlo simulations were run over four simulation factors (sample size, combination of difficulty levels, percentage of contamination and standard deviation of the outlying data), with performance measured by bias, root mean square error, coefficient of determination and the Lipsitz statistic. The findings show that the model incorporating multidimensional IRT fitted better than the base model: the Lipsitz statistic for MGK-TRI (522.78) was lower than that for MGK (549.94). In addition, the M-estimator with Tukey bisquare weights showed better robustness than both the Huber weights and the maximum likelihood estimator. In conclusion, this study succeeded in developing a prediction model for students' final examination grade performance that incorporates IRT and robust methods to overcome the problems of multicollinearity and the influence of outliers in ordinal regression models. The resulting model has theoretical and methodological implications and contributes to stakeholders in statistics and education, the Ministry of Higher Education Malaysia, universities and industry in predicting students' final examination grade performance.
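
The abstract names two M-estimator weighting schemes, Huber and Tukey bisquare, without stating their form. As a rough illustration only, the standard textbook weight functions can be sketched as below. The function names and the tuning constants (1.345 and 4.685, the conventional choices for about 95% efficiency under normal errors) are assumptions for this sketch; the record does not say which constants or implementation the thesis used.

```python
def huber_weight(r, c=1.345):
    """Huber weight for a standardized residual r.

    Residuals within [-c, c] keep full weight 1; larger residuals are
    down-weighted in proportion to c / |r| but never reach zero.
    The default c is an assumed conventional value, not taken from the thesis.
    """
    a = abs(r)
    return 1.0 if a <= c else c / a


def tukey_bisquare_weight(r, c=4.685):
    """Tukey bisquare (biweight) weight for a standardized residual r.

    The weight decays smoothly as (1 - (r/c)^2)^2 and is exactly zero
    once |r| >= c, so gross outliers are ignored entirely.
    The default c is an assumed conventional value, not taken from the thesis.
    """
    if abs(r) >= c:
        return 0.0
    u = r / c
    return (1.0 - u * u) ** 2


if __name__ == "__main__":
    # Print both weights for a few illustrative residuals.
    for r in (0.0, 1.0, 3.0, 6.0):
        print(r, huber_weight(r), tukey_bisquare_weight(r))
```

The difference in the tails is the usual explanation for the pattern the abstract reports: the bisquare weight rejects extreme residuals outright, while the Huber weight only shrinks their influence.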


Abd Mutalib, Z. (2018). UA Diberi Pilihan Laksana iCGPA. Berita Harian Online.

Retrieved from



Abdullah, A. H., Abidin, N. L. Z., & Ali, M. (2015). Analysis of Students’ Errors in

Solving Higher Order Thinking Skills (HOTS) Problems for the Topic of Fraction.

Asian Social Science, 11(21), 133–142.


Abdullah, A. H., Mokhtar, M., Halim, N. D. A., Ali, D. F., Tahir, L. M., & Kohar, U.

H. A. (2017). Mathematics Teachers’ Level of Knowledge and Practice on the

Implementation of Higher-Order Thinking Skills (HOTS). Eurasia Journal of

Mathematics, Science and Technology Education, 13(1), 3–17.


Abreu, M. N., Siqueira, A. L., Cardoso, C. S., & Caiaffa, W. T. (2008). Ordinal Logistic

Regression Models: Application in Quality of Life Studies. Cadernos de Saude

Publica, 24 Suppl 4, s581–s591.


Adams, R. ., Wu, M. ., Cloney, D., & Wilson, M. R. (2020). ConQuest. Retrieved July

14, 2020, from


Adedayo, A. A., & Ojo, O. O. (2018). Bayesian Method for Solving the Problem of

Multicollinearity in Regression. Afrika Statistika, 13(3), 1823–1834.


Adejo, O. W., & Connolly, T. (2018). Predicting Student Academic Performance Using

Multi-model Heterogeneous Ensemble Approach. Journal of Applied Research in

Higher Education, 10(1), 61–75.


Adnan, A., & Sugiarto, S. (2017). The Outlier Detection for Ordinal Data Using

Scalling Technique of Regression Coefficients. In IOP Conf. Series: Journal of

Physics: Conf. Series (Vol. 855, pp. 1–7).


Agresti, A. (1989). Tutorial on Modeling Ordered Categorical Response Data.

Psychological Bulletin, 105(2), 290–301.


Agresti, A. (2010). Analysis of Ordinal Categorical Data (2nd ed.). John Wiley & Sons,



Agresti, A. (2013). Categorical Data Analysis. Wiley-Interscience.


Agus, M., Penna, M. P., Peró-Cebollero, M., & Guàrdia-Olmos, J. (2016). Assessing

Probabilistic Reasoning in Verbal-numerical and Graphical-pictorial Formats: an

Evaluation of the Psychometric Properties of an Instrument. Eurasia Journal of

Mathematics, Science and Technology Education, 12(8), 2013–2038.


Akinoso, S. O. (2018). Mathematics Teachers Awareness of Teachable Moments in

Nigerian Classroom. Eurasia Journal of Mathematics, Science and Technology

Education, 14(2), 683–689.


Alhadlaq, A. M., Alshammari, O. F., Alsager, S. M., Neel, K. A. F., & Mohamed, A.

G. (2015). Ability of Admissions Criteria to Predict Early Academic Performance

Among Students of Health Science Colleges at King Saud University, Saudi

Arabia. Journal of Dental Education, 79(6), 665–670.


Al-khadher, M. M. A., & Albursan, I. S. (2017). Accuracy of Measurement in the

Classical and the Modern Test Theory : an Empirical Study on a Children

Intelligence Test Accuracy of Measurement in the Classical and the Modern Test

Theory : an Empirical Study on a Children Intelligence Test. International Journal

of Psychological Studies, 9(1), 71.


Al-Sheeb, B. A., Hamouda, A. M., & Abdella, G. M. (2019). Modeling of Student

Academic Achievement in Engineering Education Using Cognitive and Non-

Cognitive Factors. Journal of Applied Research in Higher Education, 11(2), 178–



Alzen, J. L., Langdon, L. S., & Otero, V. K. (2018). A Logistic Regression Investigation

of the Relationship Between the Learning Assistant Model and Failure Rates in

Introductory STEM Courses. International Journal of STEM Education, 5(1), 1–



Ananth, C. V, & Kleinbaum, D. G. (1997). Regression Models for Ordinal Responses :

A Review of Methods and Applications. International Journal of Epidermiology,

26(6), 1323–1333.


Anazifa, R. D., & Djukri. (2017). Project-Based Learning and Problem- Based

Learning: Are They Effective to Improve Student’s Thinking Skills? Jurnal

Pendidikan IPA Indonesia, 6(2), 346–355.


Anderson, J. A. (1984). Regression and Ordered Categorical Variables. Journal of the

Royal Statistical Society. Series B (Methodological). WileyRoyal Statistical



Anderson, L. W., Krathwohl, D. R., P.W., A., Cruikshank, K. A., Mayer, R. E., Pintrich,

P. R., … Wittrock, M. C. (2001). A Taxonomy for Learning, Teaching, and

Assessing: a Revision of Bloom’s Taxonomy of Educational Objectives. New

York: Longman.


Andrich, D. (1978). A rating formulation for ordered response categories.

Psychometrika, 43, 561-573.


Arco-tirado, J. L., Fernández-martín, F., Ramos-garcía, A. M., Littvay, L., & Villoria,

J. (2018). A Counterfactual Impact Evaluation of a Bilingual Program on Students’

Grade Point Average at a Spanish University. Evaluation and Program Planning,

68(February), 81–89.


Ari, E., & Yildiz, Z. (2014). Parallel Lines Assumption in Ordinal Logistic Regression

and Analysis Approaches. International Interdisciplinary Journal of Scientific

Research, 1(3), 8–23.


Arievitch, I. M. (2020). The Vision of Developmental Teaching and Learning and

Bloom’s Taxonomy of Educational Objectives. Learning, Culture and Social

Interaction, 25, 100274.


Artusi, R., Verderio, P., & Marubini, E. (2002). Bravais-pearson and Spearman

Correlation Coefficients: Meaning, Test of Hypothesis and Confidence Interval.

The International Journal of Biological Markers, 17(2), 148–151.


Ashenafi, M. M., Riccardi, G., & Ronchetti, M. (2015). Predicting Students’ Final

Exam Scores From Their Course Activities. In Proceedings - Frontiers in

Education Conference, FIE, 2014.


Asshaari, I., Othman, H., Bahaludin, H., Ismail, N. A., & Nopiah, Z. M. (2012).

Appraisal on Bloom’s Separation in Final Examination Question of Engineering

Mathematics Courses Using Rasch Measurement Model. Procedia - Social and

Behavioral Sciences, 60(2009), 172–178.


Athani, S. S., Kodli, S. A., Banavasi, M. N., & Hiremath, P. G. S. (2017). Student

Academic Performance and Social Behavior Predictor Using Data Mining

Techniques. In Proceedings - IEEE International Conference Computing

Communication Automation ICCCA 2017 ,170–174.


Auerbach, A. J. J., & Andrews, T. C. (2018). Pedagogical Knowledge for Activelearning

Instruction in Large Undergraduate Biology Courses: a Large-scale

Qualitative Investigation of Instructor Thinking. International Journal of STEM

Education, 5(1).


Ayers, E., & Junker, B. (2008). IRT Modeling of Tutor Performance to Predict End-of-

Year Exam Scores. Educational and Psychological Measurement, 68(6), 972–987.


Azizah, U., & Nasrudin, H. (2018). Development of Chemistry Instructional Materials

Based on Cooperative Group Investigation (CGI) to Empower Thinking Skills.

Journal of Physics: Conference Series, 1108(1).


Bäcklin, C. L., & Gustafsson, M. G. (2018). Developer-Friendly and Computationally

Efficient Predictive Modeling Without Information Leakage: the emil Package for

R. Journal of Statistical Software, 85(13).


Badri, M., Alnuaimi, A., Mohaidat, J., Al Rashedi, A., Yang, G., & Al Mazroui, K.

(2016). My Science Class and Expected Career Choices- a Structural Equation

Model of Determinants Involving Abu Dhabi High School Students. International

Journal of STEM Education, 3(1).


Bahrum, S., Wahid, N., & Ibrahim, N. (2017). Integration of STEM Education in

Malaysia and Why to STEAM. International Journal of Academic Research in

Business and Social Sciences, 7(6), 645–654.


Baily, C., Ryan, Q. X., Astolfi, C., & Pollock, S. J. (2017). Conceptual Assessment

Tool for Advanced Undergraduate Electrodynamics. Physical Review Physics

Education Research, 13(2), 1–10.


Baker, F. B., & Kim, S.-H. (2004). Item Response Theory: Parameter Estimation

Techniques (2nd ed.). Taylor & Francis Group.


Bal, C., Demir, S., & Aladag, C. H. (2016). A Comparison of Different Model Selection

Criteria for Forecasting EURO / USD Exchange Rates by Feed Forward Neural

Network. In Proceedings - International Journal of Computing, Communications

& Instrumentation Enggineering (IJCCIE), 3(2), 1–5.


Bana, M., & Ligas, M. (2014). Empirical Tests of Performance of Some M–estimators.

Geodesy and Cartography, 63(2), 127–146.


Barlybayev, A., Sharipbay, A., Ulyukova, G., Sabyrov, T., & Kuzenbayev, B. (2016).

Student’s Performance Evaluation by Fuzzy Logic. Procedia Computer Science,

102(August), 98–105.


Battauz, M. (2015). equateIRT : An R Package for IRT Test Equating . Journal of

Statistical Software, 68(7).


Baur, T., & Lukes, D. (2009). An Evaluation of the IRT Models Through Monte Carlo

Simulation. UW-L Journal of Undergraduate Research, (XII), 1–7.


Baygin, M., Yetis, H., Karakose, M., & Akin, E. (2016). An Effect Analysis of Industry

4.0 to Higher Education. In 2016 15th International Conference on Information

Technology Based Higher Education and Training (ITHET), 1–4.


Begg, A. (1997). Some Emerging Influences Underpinning Assessment in Statistics.

The Assessment Challenge in Statistics Education, 17–25.


Bellettini, C., Lonati, V., Malchiodi, D., Monga, M., Morpurgo, A., & Torelli, M.

(2015). How Challenging Are Bebras Tasks? An IRT Analysis Based on the

Performance of Italian Students. Annual Conference on Innovation and

Technology in Computer Science Education, ITiCSE, 2015-June, 27–32.


Bender, R., & Benner, A. (2000). Calculating Ordinal Regression Models in SAS and

S-Plus. Biometrical Journal, 42(6), 677–700.


Benešová, A., & Tupa, J. (2017). Requirements for Education and Qualification of

People in Industry 4.0. Procedia Manufacturing, 11, 2195–2202.


Berg, R. G. van den. (2020). SPSS Factor Analysis- Absolute Beginners Tutorial.

Retrieved July 14, 2020, from



Bernama (2018). UiTM Rombak iCGPA Supaya Lebih Mesra Pensyarah. Retrieved

April 22, 2020, from



Bianco, A. M., & Yohai, V. J. (1996). Robust Estimation in the Logistic Regression

Model, 17–34.


Binh, H. T., & Duy, B. T. (2017). Predicting Students’ Performance Based on Learning

Style by Using Artificial Neural Networks, In Proceedings - 2017 9th

International Conference on Knowledge and Systems Engineering (KSE), 48-53


Bloom, B. S. (1956). Taxonomy of Educational Objectives – Handbook 1 Cognitive

Domain. London: Longman.


Bock, R. D., & Aitkin, M. (1981). Marginal Maximum Likelihood Estimation of Item

Parameters: Application of an EM Algorithm. Psychometrika, 46(4), 443–459.


Bond, T., & Fox, C. M. (2015). Applying The Rasch Model Fundamental Measurement

in the Human Sciences (3rd ed.). New York: Routledge.


Bondell, H. D. (2008). A Characteristic Function Approach to the Biased Sampling

Model, With Application to Robust Logistic Regression. Journal of Statistical

Planning and Inference, 138, 742–755.


Bonsaksen, T., Brown, T., Lim, H. B., & Fong, K. (2017). Approaches to Studying

Predict Academic Performance in Undergraduate Occupational Therapy Students:

a Cross-cultural Study. BMC Medical Education, 17(1), 1–9.


Brassil, C. E., & Couch, B. A. (2019). Multiple-true-false Questions Reveal More

Thoroughly the Complexity of Student Thinking Than Multiple-choice Questions:

a Bayesian Item Response Model Comparison. International Journal of STEM

Education, 6(1).


Brazeal, K. R., Brown, T. L., & Couch, B. A. (2016). Characterizing Student

Perceptions of and Buy-in Toward Common Formative Assessment Techniques.

CBE Life Sciences Education, 15(4).


Brester, C., Rönkkö, M., Kolehmainen, M., Semenkin, E., Kauhanen, J., Tuomainen,

T.-P., … Ronkainen, K. (2018). Evolutionary Methods for Variable Selection in

the Epidemiological Modeling of Cardiovascular Diseases. BioData Mining,

11(1), 1–14.


Buergin, R. (2020). vcrpart: Tree-Based Varying Coefficient Regression for

Generalized Linear and Ordinal Mixed Models. Retrieved from https://CRAN.R


Bulut, O., & Sunbul, Ö. (2017). Monte Carlo Simulation Studies in Item Response

Theory with the R Programming Language. Journal of Measurement and

Evaluation in Education and Psychology, 8(3), 266–287.


Buniyamin, N., Mat, U. Bin, & Arshad, P. M. (2016). Educational Data Mining for

Prediction and Classification of Engineering Students Achievement. IEEE 7th

International Conference on Engineering Education, ICEED 2015, 49–53.


Butterworth, J., & Thwaites, G. (2013). Thinking Skills: Critical Thinking and Problem

Solving (2nd ed.). Cambridge: Cambridge University Press.


Cai, L. (2010). Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item

Factor Analysis. Journal of Educational and Behavioral Statistics, 35(3), 307–



Capuano, A. W. (2012). Constrained Ordinal Models With Application in Occupational

and Constrained Ordinal Models With Application in Occupational and

Environmental Health Environmental Health. University of Iowa.


Carroll, R. J., & Pederson, S. (1993). On Robustness in the Logistic Regression Model.

Journal Royal Statistical Society, 55(3), 693–706.


Celik, A. O., & Guzel, B. E. (2017). Mathematics Teachers’ Knowledge of Student

Thinking and Its Evidences in Their Instruction. Journal on Mathematics

Education, 8(2), 199–210.


Chalmers, R. P. (2012). mirt : A Multidimensional Item Response Theory Package for

the R Environment. Journal of Statistical Software, 48(6).


Chalmers, R. P. (2016). Generating Adaptive and Non-adaptive Test Interfaces for

Multidimensional Item Response Theory Applications. Journal of Statistical

Software, 71.


Chan, S. W., Ismail, Z., & Sumintono, B. (2014). A Rasch Model Analysis on

Secondary Students’ Statistical Reasoning Ability in Descriptive Statistics.

Procedia - Social and Behavioral Sciences, 129, 133–139.


Chootongchai, S., & Songkram, N. (2018). Design and Development of SECI and

Moodle Online Learning Systems to Enhance Thinking and Innovation Skills for

Higher Education Learners. International Journal of Emerging Technologies in

Learning, 13(3), 154–172.


Christensen, R. H. B. (2019). ordinal: Regression Models for Ordinal Data. Retrieved



Christian, T. M., & Ayub, M. (2014). Exploration of Classification Using NBtree for

Predicting Students’ Performance. Proceedings of 2014 International Conference

on Data and Software Engineering, ICODSE 2014, 1–6.


Cladellas, R., Muro, A., Vargas-Guzmán, E. A., Bastardas, A., & Gomà-i-Freixanet,

M. (2017). Sensation Seeking and High School Performance. Personality and

Individual Differences, 117, 117–121.


Clarke, B. S., & Clarke, J. L. (2018). Predictive Statistics : Analysis and Inference

Beyond Models. Cambridge University Press.


Cohen, L., Manion, L., & Morrison, K. (2018). Research Methods in Education (8th

ed.). Routledge.


Columbus, L. (2019). Data Scientist Leads 50 Best Jobs In America For 2019

According To Glassdoor. Retrieved April 21, 2020, from


Copas, J. B. (1988). Binary Regression Models for Contaminated Data. Journal Royal

Statistical Society, 50(2), 225–265.


Crane, N., Zusho, A., Ding, Y., & Cancelli, A. (2017). Domain-specific Metacognitive

Calibration in Children With Learning Disabilities. Contemporary Educational

Psychology, 50, 72–79.


Cronbach, L. J. (1951). Coefficient Alpha and the Internal Structure of Tests.

Psychometrika, 16(3), 297–334.


Croux, C., & Haesbroeck, G. (2003). Implementing the Bianco and Yohai Estimator

for Logistic Regression. Computational Statistics & Data Analysis, 44, 273–295.


Croux, C., Flandre, C., & Haesbroeck, G. (2002). The Breakdown Behavior of the

Maximum Likelihood Estimator in the Logistic Regression Model. Statistics and

Probability Letters, 60(4), 377–386.


Croux, C., Haesbroeck, G., & Ruwet, C. (2013). Robust Estimation for Ordinal

Regression. Journal of Statistical Planning and Inference, 143(9), 1486–1499.


Das, A. K., & Rodriguez-Marek, E. (2019). A Predictive Analytics System for

Forecasting Student Academic Performance: Insights From a Pilot Project at

Eastern Washington University. In 2019 Joint 8th International Conference on

Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference

on Imaging, Vision & Pattern Recognition (icIVPR), 255–262.


de Kort, J. M., Dolan, C. V., Lubke, G. H., & Molenaar, D. (2017). Studying the

Strength of Prediction Using Indirect Mixture Modeling: Nonlinear Latent

Regression with Heteroskedastic Residuals. Structural Equation Modeling, 24(2),



Deanna, S. (2018). Logistic and Linear Regression Assumptions : Violation

Recognition and Control. In Recognition and Control, 1–21.


Dignan, L. (2019). Data Science Dominates Linkedin’s Emerging Jobs Ranking.

Retrieved April 21, 2020, from



Dobson, A. J., & Barnett, A. G. (2018). An Introduction to Generalized Linear Models

(4th ed.). Taylor & Francis Group.


Donoghoe, M. W. (2018). glm2: Fitting Generalized Linear Models. Retrieved from


Drath, R., & Horch, A. (2014). Industrie 4.0: Hit or Hype? IEEE Industrial Electronics

Magazine, 8(2), 56–58.


Dunham, B., Yapa, G., & Yu, E. (2015). Calibrating the Difficulty of an Assessment

Tool: The Blooming of a Statistics Examination. Journal of Statistics Education,



Edwards, J. M., & Finch, W. H. (2018). Recursive Partitioning Methods for Data

Imputation in the Context of Item Response Theory: a Monte Carlo Simulation.

Psicologica, 39(1), 88–117.


Ellis, J. L. (2019). Factor Analysis and Item Analysis. Retrieved from



Embretson, S. E., Reise, S., & Reise, S. P. (2000). Item Response Theory for

Psychologists (Multivariate Applications Book Series). New Jersey: Lawrence

Erlbaum Associates, Inc.


Engel, J. (1988). Polytomous Logistic Regression. Statistica Neerlandica, 42(4), 233–



Erda, G., Indahwati, & Djuraidah, A. (2019). Outlier Handling of Robust

Geographically and Temporally Weighted Regression. Journal of Physics:

Conference Series, 1175(1).


Fagerland, M. W., & Hosmer, D. W. (2012). A Generalized Hosmer-lemeshow

Goodness-of-fit Test for Multinomial Logistic Regression Models. Stata Journal,

12(3), 447–453.


Fagerland, M. W., & Hosmer, D. W. (2016). Tests for Goodness of Fit in Ordinal

Logistic Regression Models. Journal of Statistical Computation and Simulation,

86(17), 3398–3418.


Falk, C. F., & Ju, U. (2020). Estimation of Response Styles Using the Multidimensional

Nominal Response Model: A Tutorial and Comparison With Sum Scores.

Frontiers in Psychology, 11, 1–17.


Fernandez, D. B., & Lujan-Mora, S. (2017). Comparison of Applications for

Educational Data Mining in Engineering Education. 2017 IEEE World

Engineering Education Conference (EDUNINE), 81–85.


Fielding, A. (1999). Why Use Arbitrary Points Scores?: Ordered Categories in Models

of Educational Progress. Journal of the Royal Statistical Society: Series A

(Statistics in Society), 162(3), 303–328.


Fienberg, S. E. (1980). The Analysis of Cross-Classified Categorical Data: Second

Edition. Cambridge: Massachusetts Institute of Technology Press.


Fisher Jr., W. P. (2007). Rating Scale Instrument Quality Criteria. Rasch Measurement

Transaction, 21(1095).


Fitri, S., & Zahari, C. L. (2019). The Implementation of Blended Learning to Improve

Understanding of Mathematics. Journal of Physics: Conference Series, 1188(1).


Fleckenstein, J., Leucht, M., Pant, H. A., & Köller, O. (2016). Proficient Beyond

Borders: Assessing Non-native Speakers in a Native Speakers’ Framework.

Large-Scale Assessments in Education, 4(1).


Foster, R. C. (2020). A Generalized Framework for Classical Test Theory. Journal of

Mathematical Psychology, 96.


Fox, J., & Weisberg, S. (2012). An R Companion to Applied Regression: Third Edition.



Francis, E. (2018). Effects of Some Coding Techniques On Multicolinearity and Model

Statistics. Mathematical Theory and Modeling, 8(4), 156–167.


Franses, P. H., & Paap, R. (2010). Quantitative Models in Marketing Research.

Cambridge University Press.


Fuadiah, N. F., Suryadi, D., & Turmudi, T. (2019). Teaching and Learning Activities

in Classroom and Their Impact on Student Misunderstanding: A Case Study on

Negative Integers. International Journal of Instruction, 12(1), 407–424.


Gerber, N. L., & Price, J. K. (2018). Measures of Function and Health-Related Quality

of Life. Principles and Practice of Clinical Research. Elsevier Inc.


Gibbons, L. E., Crane, P. K., Seung, M., & Choi, W. (2016). Package “lordif” Type

Package Title Logistic Ordinal Regression Differential Item Functioning using



Golding, C. (2019). Discerning Student Thinking : a Practical Theoretical Framework

for Recognising or Informally Assessing Different Ways of Thinking. Teaching in

Higher Education, 24(4), 478-492.


Gómez-Rey, P., Fernández-Navarro, F., & Barberà, E. (2016). Ordinal Regression by

a Gravitational Model in the Field of Educational Data Mining. Expert Systems,

33(2), 161–175.


Goodhew, L. M., & Robertson, A. D. (2017). Exploring the Role of Content Knowledge

in Responsive Teaching. Physical Review Physics Education Research, 13(1), 1–



Goodman, L. A. (1979). Simple Models for the Analysis of Association in Crossclassifications

Having Ordered Categories. Journal of the American Statistical

Association, 74(367), 537–552.


Greenwell, B. M., Mccarthy, A. J., Boehmke, B. C., & Liu, D. (2018). Residuals and

Diagnostics for Binary and Ordinal Regression Models: An Introduction to the

sure Package. The R Journal, 10, 381–394.


Groll, A. (2020). GMMBoost: Likelihood-Based Boosting for Generalized Mixed

Models. Retrieved from


Grundspenkis, J. (2019). Intelligent Knowledge Assessment Systems: Myth or Reality.

Frontiers in Artificial Intelligence and Applications, 315, 31-46.


Guo, B., Zhang, R., Xu, G., Shi, C., & Yang, L. (2015). Predicting Students

Performance in Educational Data Mining. 2015 International Symposium on

Educational Technology (ISET), 125–128.


Hadar, L. L., & Tirosh, M. (2019). Creative Thinking in Mathematics Curriculum: an

Analytic Framework. Thinking Skills and Creativity, 33(September 2018),



Hadfield, J. (2019). MCMCglmm: MCMC Generalised Linear Mixed Models.

Retrieved from


Han, M., Tong, M., Chen, M., Liu, J., & Liu, C. (2017). Application of Ensemble

Algorithm in Students’ Performance Prediction. Proceedings - 2017 6th IIAI

International Congress on Advanced Applied Informatics, IIAI-AAI 2017, 735–



Hariharasudan, A., & Kot, S. (2018). A Scoping Review on Digital English and

Education 4.0 for Industry 4.0. Social Sciences, 7(11), 227.


Harrell, E. F. (2020). rms: Regression Modeling Strategies. Retrieved from


Hauck, W. W., & Donner, A. (1977). Wald’s Test as Applied to Hypotheses in Logit

Analysis. Journal of the American Statistical Association, 72(360), 851.


Hauke, J., & Kossowski, T. (2011). Comparison Of Values Of Pearson’s And

Spearman’s Correlation Coefficients On The Same Sets Of Data. Quaestiones

Geographicae, 30(2), 87–93.


Himelfarb, I. (2019). A Primer on Standardized Testing: History, Measurement,

Classical Test Theory, Item Response Theory, and Equating. Journal of

Chiropractic Education, 33(2), 151–163.


Hobza, T., Pardo, L., & Vajda, I. (2008). Robust Median Estimator in Logistic

Regression. Journal of Statistical Planning and Inference, 138, 3822–3840.


Hosmer, D. W., & Lemesbow, S. (1980). Goodness of Fit Tests for the Multiple

Logistic Regression Model. Communications in Statistics - Theory and Methods,

9(10), 1043–1069.


Hosmer, D. W., & Lemeshow, S. (2000). Applied Logistic Regression (2nd ed.). John

Wiley & Sons, Inc.


Hosseinian, S., & Morgenthaler, S. (2011). Robust Binary Regression. Journal of

Statistical Planning and Inference, 141(4), 1497–1509.


Hu, X. (2018). Foreign Language Education in Colleges and Universities Based on

Globalization Background. Educational Sciences: Theory & Practice, 18(6),



Hubert, M., Debruyne, M., & Rousseeuw, P. J. (2018). Minimum Covariance

Determinant and Extensions. Wiley Interdisciplinary Reviews: Computational

Statistics, 10(3), e1421.


Huo, X., & Cao, S. (2019). Aggregated inference. Wiley Interdisciplinary Reviews:

Computational Statistics, 11(1), 1–13.


Hussin, A. A. (2018). Education 4.0 Made Simple : Ideas For Teaching. International

Journal of Education and Literacy Studies, 6(3), 92–98.


Iannario, M., Clara, A., & Piccolo, D. (2016). Robustness Issues for CUB Models.

TEST, 25, 731-750.


Iannario, M., Monti, A. C., Piccolo, D., & Ronchetti, E. (2017). Robust Inference for

Ordinal Response Models, 11, 3407–3445.


Ikbal, S., Tamhane, A., Sengupta, B., Chetlur, M., Ghosh, S., & Appleton, J. (2015).

On Early Prediction of Risks in Academic Performance for Students. IBM Journal

of Research and Development, 59(6), 1–14.


Ikuma, L. H., Steele, A. dann, S., Adio, O., & Waggenspack, W. N. (2019). Large-scale

Student Programs Increase Persistence in STEM Fields in a Public University

Setting. Journal of Engineering Education, 108(1), 57–81.


Imdadullah, M., Aslam, M., & Altaf, S. (2016). mctest: An R Package for Detection of

Collinearity Among Regressors. The R Journal, 8(2), 495–505.


Imrey, P. B., Koch, G. G., Stokes, M. E., Darroch, J. N., Freeman, D. H., & Tolley, H.

D. (1981). Categorical Data Analysis: Some Reflections on the Log Linear Model

and Logistic Regression. Part I: Historical and Methodological Overview.

International Statistical Review / Revue Internationale de Statistique, 49(3), 265.


Irribarra, D. T., & Freud, R. (2020). WrightMap: IRT Item-Person Map with 'ConQuest'

Integration. Retrieved from


James, N., Harrell, Jr, & Shepherd, B. (2021). Bayesian Cumulative Probability Models

for Continuous and Mixed Outcomes.


Jayarajah, K., Saat, R. M., & Rauf, R. A. A. (2014). A Review of Science, Technology,

Engineering & Mathematics (STEM) Education Research From 1999-2013: A

Malaysian perspective. Eurasia Journal of Mathematics, Science and Technology

Education, 10(3), 155–163.


Jesson, J., Matheson, L., & Lacey, F. M. (2011). Doing Your Literature Review :

Traditional and Systematic Techniques. SAGE Publications Ltd.


John P. L., Hao Wu, H., & Yu, G. (2016). Building an Evaluation Scale using Item

Response Theory. Proc Conf Empir Methods Nat Lang Process, 648–657.


John, M., Bettye, S., Ezra, T., & Robert, W. (2016). A Formative Evaluation of a

Southeast High School Integrative Science, Technology, Engineering, and

Mathematics (STEM) Academy. Technology in Society, 45, 34–39.


Joyce, T., Crockett, S., Jaeger, D. A., Altindag, O., & O’Connell, S. D. (2015). Does

Classroom Time Matter? Economics of Education Review, 46, 64–77.


Judi, H. M., Mohamed, H., Ashari, N. S. @, Jenal, R., & Hanawi, S. A. (2012).

Alignment of Statistics Course using Examination Items. Procedia - Social and

Behavioral Sciences, 59, 264–269.


Kaiser, H. F. (1974). An Index of Factorial Simplicity. Psychometrika, 39, 31–36.


Kementerian Pendidikan Malaysia (2013). Malaysia Education Blueprint 2013-2025

(Preschool to Post- Secondary Education). Putrajaya Malaysia: Kementerian



Kementerian Pendidikan Malaysia (2015). Malaysia Education Blueprint 2015-2025

(Higher Education). Putrajaya Malaysia: Kementerian Pengajian Tinggi.


Kementerian Pendidikan Tinggi Malaysia (2016). Rubrik PNGK Bersepadu (iCGPA)

Panduan Pentaksiran Hasil Pembelajaran. Putrajaya Malaysia: Kementerian

Pendidikan Tinggi.


Kerlinger, F. N., & Lee, H. B. (2000). Foundations of Behavioral Research (4th ed.).

Fort Worth TX: Harcourt College Publishers.


Kesselmeier, M., & Bermejo, J. L. (2017). Robust Logistic Regression to Narrow Down

the Winner’s Curse for Rare and Recessive Susceptibility Variants. Briefings in

Bioinformatics, 18(6), 962–972.


Khajah, M. M., Huang, Y., Mozer, M. C., & Brusilovsky, P. (2015). Integrating

Knowledge Tracing and Item Response Theory : A Tale of Two Frameworks.

CEUR Workshop Proceedings, 1181, 7–15.


Kien-Kheng, F., Azlan, N., Noor, S., Ahmad, D., Lee, N., Leong, H., & Mohamed, I.

(2016). Relationship Between Cognitive Factors and Performance in an

Introductory Statistics Course : a Malaysian Case Study Introduction. Malaysian

Journal of Mathematical Sciences, 10(3), 269–282.


Kim, S. Y., Lee, W., & Kolen, M. J. (2019). Simple-Structure Multidimensional Item

Response Theory Equating for Multidimensional Tests. Educational and

Psychological Measurement, 80(1), 91-125.


Kline, P. (2014). An Easy Guide to Factor Analysis. Routledge.


Komarudin, U., Rustaman, N. Y., & Hasanah, L. (2017). Promoting Students’

Conceptual Understanding Using STEM. AIP Conference Proceedings, 1848(1).


Koretsky, M., Keeler, J., Ivanovitch, J., & Cao, Y. (2018). The Role of Pedagogical

Tools in Active Learning: a Case for Sense-making. International Journal of

STEM Education, 5(1).


Kosmidis, I. (2014). Improved Estimation in Cumulative Link Models. Journal of the

Royal Statistical Society: Series B, 76(1), 169–196.


Kosmidis, I., & Firth, D. (2009). Bias Reduction in Exponential Family Nonlinear

Models. Biometrika, 96(4), 793-804.


Krasilnikov, A., & Smirnova, A. (2017). Online Social Adaptation of First-year

Students and Their Academic Performance. Computers and Education, 113, 327–



Krishna Kishore, K. V., Venkatramaphanikumar, S., & Alekhya, S. (2014). Prediction

of Student Academic Progression: a Case Study on Vignan University. 2014

International Conference on Computer Communication and Informatics, 1–6.


Kumar, S. C., Chowdary, E. D., Venkatramaphanikumar, S., & Kishore, K. V. K.

(2016). M5P Model Tree in Predicting Student Performance: a Case Study. 2016

IEEE International Conference on Recent Trends in Electronics, Information &

Communication Technology (RTEICT), 1103–1107.


Kumari, P., Jain, P. K., & Pamula, R. (2018). An Efficient Use of Ensemble Methods

to Predict Students Academic Performance. 2018 4th International Conference on

Recent Advances in Information Technology (RAIT), 1–6.


Laerd Statistics (2020). Using the PLUM Procedure to Carry Out an Ordinal Regression

in SPSS. Retrieved July 14, 2020, from



Landis, J. R., & Koch, G. G. (1977). The Measurement of Observer Agreement for

Categorical Data. Biometrics, 33(1), 159.


Li, C., & Shepherd, B. E. (2012). A New Residual for Ordinal Outcomes. Biometrika,

99(2), 473–480.


Lin, M., Preston, A., Kharrufa, A., & Kong, Z. (2016). Making L2 Learners’ Reasoning

Skills Visible: the Potential of Computer Supported Collaborative Learning

Environments. Thinking Skills and Creativity, 22, 303–322.


Linacre, J. M. (2008). The Expected Value of a Point Biserial (or Similar) Correlation.

Retrieved October 22, 2019, from


Lipsitz, S. R., Fitzmaurice, G. M., & Molenberghs, G. (1996). Goodness-of-fit Tests for

Ordinal Response Regression Models. Applied Statistics, 45(2), 175–190.


Lipsitz, S. R., Fitzmaurice, G. M., Regenbogen, S. E., Sinha, D., Ibrahim, J. G., &

Gawande, A. A. (2012). Bias Correction for the Proportional Odds Logistic

Regression Model With Application to a Study of Surgical Complications. Journal

of the Royal Statistical Society. Series C: Applied Statistics, 62(2), 233–250.


Liu, D., & Zhang, H. (2017). Residuals and Diagnostics for Ordinal Regression Models:

A Surrogate Approach. Journal of the American Statistical Association, 113(522),



Lo, C. K., Hew, K. F., & Chen, G. (2017). Toward a Set of Design Principles for

Mathematics Flipped Classrooms: a Synthesis of Research in Mathematics

Education. Educational Research Review, 22, 50–73.


Lopez Guarin, C. E., Guzman, E. L., & Gonzalez, F. A. (2015). A Model to Predict

Low Academic Performance at a Specific Enrollment Using Data Mining. Revista

Iberoamericana de Tecnologias Del Aprendizaje, 10(3), 119–125.


Lord, F. M. (1952). A Theory of Test Scores. Psychometric Monograph, 7.


Lord, F. M. (1986). Maximum Likelihood and Bayesian Parameter Estimation in Item

Response Theory. Journal of Educational Measurement, 23(2), 157–162.


Ma, T., Li, H., Wm, E., Jj, K., Manne, U., Bae, S., … Kp, S. (2014). Robust Logistic

and Probit Methods for Binary and Multinomial Regression, 5(4).


Macfarlane, B. (2014). Student Performativity in Higher Education: Converting

Learning as a Private Space Into a Public Performance. Higher Education

Research & Development, 34(2), 338–350.


Magis, D., & Barrada, J. R. (2017). Computerized Adaptive Testing with R: Recent

Updates of the Package catR. Journal of Statistical Software, 76, 1–19.


Magis, D., Béland, S., Tuerlinckx, F., & de Boeck, P. (2010). A General Framework

and an R Package for the Detection of Dichotomous Differential Item Functioning.

Behavior Research Methods, 42(3), 847–862.


Mahmud, Z., Ismail, N. Z.-I., Kassim, N. L. A., & Zainol, M. S. (2018). The Effects of

Attitudes Towards Statistics, Perceived Ability, Learning Practices and Teaching

Practices on Students’ Performance in Statistics: A Review. Journal of Islamic

Thought and Civilization of the International Islamic University Malaysia (IIUM),

(Special Issue), 71–97.


Mair, P. (2020). CRAN Task View: Psychometric Models and Methods.


Mair, P., Hatzinger, R., Maier, M. J., Rusch, T., & Debelak, R. (2020).

eRm: Extended Rasch Modeling. Retrieved from https://CRAN.Rproject.



Maki, S., & Horita, T. (2017). Research on Statistical Literacy Using Japanese

Textbooks. 2017 6th IIAI International Congress on Advanced Applied

Informatics (IIAI-AAI), 711–714.


Manor, O., & Power, C. (2000). Dichotomous or Categorical Response? Analysing

Self-rated Health and Lifetime. International Journal of Epidemiology, 29(1), 149–157.


Marbouti, F., Diefes-Dux, H. A., & Madhavan, K. (2016). Models for Early Prediction

of at-risk Students in a Course Using Standards-based Grading. Computers and

Education, 103, 1–15.


Margot, K. C., & Kettler, T. (2019). Teachers’ Perception of STEM Integration and

Education: a Systematic Literature Review. International Journal of STEM

Education, 6.


Maria, M., Shahbodin, F., & Pee, N. C. (2018). Malaysian Higher Education System

Towards Industry 4.0- Current Trends Overview. AIP Conference Proceedings

2016, 1-7.


Masters, G. N. (1982). A Rasch Model for Partial Credit Scoring. Psychometrika, 47,



Mativo, J. M., & Huang, S. (2014). Prediction of Students’ Academic Performance:

Adapt a Methodology of Predictive Modeling for a Small Sample Size. 2014 IEEE

Frontiers in Education Conference (FIE) Proceedings, 1–3.


Mayilvaganan, M., & Kalpanadevi, D. (2014). Comparison of Classification

Techniques for Predicting the Cognitive Skill of Students in Education

Environment. 2014 IEEE International Conference on Computational Intelligence

and Computing Research, 1–4.


McCullagh, P. (1980). Regression Models for Ordinal Data. Journal of the Royal

Statistical Society. Series B, 42, 109–142.


McCullagh, P., & Nelder, J. A. (1989). Generalized Linear Models (2nd ed.). London,

New York: Chapman and Hall.


McKelvey, R. D., & Zavoina, W. (1975). A Statistical Model for the Analysis of Ordinal

Level Dependent Variables. Journal of Mathematical Sociology, 4, 103–120.


Meier, Y., Xu, J., Atan, O., & Van Der Schaar, M. (2016). Predicting Grades. IEEE

Transactions on Signal Processing, 64(4), 959–972.


Mejia, A., & Filus, A. (2018). Exploring Predictors of Impact of School-based

Management in Rural Mexico: Do Student Engagement, Teacher Attitudes and

Parent Involvement Predict Better Academic Outcomes? International Journal of

Educational Research, 88, 95–108.


Mignani, S., Monari, P., Cagnone, S., & Ricci, R. (2006). Multidimensional Versus

Unidimensional Models for Ability Testing. In Data Analysis, Classification and

the Forward Search, 339–346.


Milaturrahmah, N., Mardiyana, & Pramudya, I. (2017). Science, Technology,

Engineering, Mathematics (STEM) as Mathematics Learning Approach in 21st

Century. AIP Conference Proceedings, 1868(1).


Mircioiu, C., & Atkinson, J. (2017). A Comparison of Parametric and Non-Parametric

Methods Applied to a Likert Scale. Pharmacy (Basel, Switzerland), 5(2), 26.


Mishra, T., Kumar, D., & Gupta, S. (2014). Mining Students’ Data for Prediction

Performance. International Conference on Advanced Computing and

Communication Technologies, ACCT, 255–262.


Mohamad, M. M., Sulaiman, N. L., Sern, L. C., & Salleh, K. M. (2015). Measuring the

Validity and Reliability of Research Instruments. Procedia - Social and

Behavioral Sciences, 204, 164–171.


Mohamed Talib, A., Alomary, F. O., & Alwadi, H. F. (2018). Assessment of Student

Performance for Course Examination Using Rasch Measurement Model: A Case

Study of Information Technology Fundamentals Course. Education Research

International, 2018, 1–8.


Mohamed, H., Ashaari, N. S. @, Judi, H. M., & Wook, T. S. M. T. (2012). Factors

Affecting FTSM Students’ Achievement in Statistics Course. Procedia - Social

and Behavioral Sciences, 59, 125–129.


Mohd Ali, S., Norfarah, N., Ilya Syazwani, J. I., & Mohd Erfy, I. (2019). The Effect of

Computerized-adaptive Test on Reducing Anxiety Towards Math Test for

Polytechnic Students. Journal of Technical Education and Training, 11(4), 27–35.


Mohd Rasid, N. S., Md Nasir, N. A., A/l Aperar Singh, P. S., & Cheong, T. H. (2020).

STEM Integration: Factors Affecting Effective Instructional Practices in Teaching

Mathematics. Asian Journal of University Education, 16(1), 56.

Mourtzis, D., Vasilakopoulos, A., Zervas, E., & Boli, N. (2019). Manufacturing System

Design Using Simulation in Metal Industry Towards Education 4.0. Procedia

Manufacturing, 31, 155–161.


Muawiyah, D., Yamtinah, S., & Indriyanti, N. Y. (2018). Higher Education 4.0:

Assessment on Environmental Chemistry Course in Blended Learning Design.

Journal of Physics: Conference Series, 1097(1), 1–7.


Murad, H., Fleischman, A., Sadetzki, S., Geyer, O., & Freedman, L. S. (2003). Small

Samples and Ordered Logistic Regression: Does it Help to Collapse Categories of

Outcome? The American Statistician, 57(3), 155–160.


Mutanu, L., & Machoka, P. (2019). Enhancing Computer Students’ Academic

Performance Through Predictive Modelling - a Proactive Approach. 14th

International Conference on Computer Science and Education, ICCSE 2019, 97–



Muthukrishnan, R., & Myilsamy, R. (2010). M-Estimators in Regression Models.

Journal of Mathematics Research, 2(4), 23–27.


Nagelkerke, N. J. D. (1991). A Note on a General Definition of the Coefficient of

Determination. Biometrika, 78(3), 691-692.


Nahar, J., & Purwani, S. (2017). Application of Robust M-Estimator Regression in

Handling Data Outliers. In 4th ICRIEMS, 53–60.


Nering, L. M., & Ostini, R. (2011). Handbook of Polytomous Item Response Theory

Models. New York, NY: Taylor & Francis Group.


Noguez, J., Neri, L., Gonzalez-Nucamendi, A., & Robledo-Rella, V. (2016).

Characteristics of Self-regulation of Engineering Students to Predict and Improve

Their Academic Performance. 2016 IEEE Frontiers in Education Conference

(FIE), 1–8.


Norman, C. (2014). Ordinal Methods for Behavioral Data Analysis. Psychology Press.


Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric Theory. McGraw-Hill.


Nurgabyl, D., Kalzhanova, G., Ualiyev, N., & Abdoldinova, G. (2017). Construction

of a Mathematical Model for Calibrating Test Task Parameters and the Knowledge

Level Scale of University Students by Means of Testing. Eurasia Journal of

Mathematics, Science and Technology Education, 13(11), 7421–7429.


Olmus, H., Nazman, E., & Erbas, S. (2017). An Evaluation of the Two Parameter (2-

PL) IRT Models Through a Simulation Study. Gazi University Journal of Science,

30(1), 235–249.


Omar, N., Haris, S. S., Hassan, R., Arshad, H., Rahmat, M., Zainal, N. F. A., & Zulkifli,

R. (2012). Automated Analysis of Exam Questions According to Bloom’s

Taxonomy. Procedia - Social and Behavioral Sciences, 59(1956), 297–303.


Öztürk, N. K., & Karabatsos, G. (2017). A Bayesian Robust IRT Outlier-Detection

Model. Applied Psychological Measurement, 41(3), 195–208.


Özyurt, H., & Özyurt, Ö. (2015). Ability Level Estimation of Students on Probability

Unit via Computerized Adaptive Testing. Eurasian Journal of Educational

Research, 15(58), 27–44.


Pada, A. U. T., Kartowagiran, B., & Subali, B. (2016). Separation Index and Fit Items

of Creative Thinking Skills Assessment. Research and Evaluation in Education,

2(1), 1-2.


Papageorgiou, G., & Hinde, J. (2019). mixcat: Mixed Effects Cumulative Link and

Logistic Regression Models. Retrieved from https://CRAN.Rproject.



Pardo, A., Han, F., & Ellis, R. A. (2017). Combining University Student Self-regulated

Learning Indicators and Engagement With Online Learning Events to Predict

Academic Performance. IEEE Transactions on Learning Technologies, 10(1), 82–



Park, J. S., Park, C. G., & Lee, K. E. (2019). Simultaneous Outlier Detection and

Variable Selection via Difference-based Regression Model and Stochastic Search

Variable Selection. Communications for Statistical Applications and Methods,

26(2), 149–161.


Partchev, I. (2017). irtoys: A Collection of Functions Related to Item Response Theory

(IRT). Retrieved from


Passante, G., & Kohnle, A. (2019). Enhancing Student Visual Understanding of the

Time Evolution of Quantum Systems. Physical Review Physics Education

Research, 15(1), 1-14.


Peterson, B., & Harrell, F. E., Jr. (1990). Partial Proportional Odds Models for

Ordinal Response Variables. Applied Statistics, 39, 205–217.


Pößnecker, W., & Tutz, G. (2016). A General Framework for the Selection of Effect

Type in Ordinal Regression. Munich, Bavaria, Germany.


Pradeep, A., Das, S., & Kizhekkethottam, J. J. (2015). Students Dropout Factor

Prediction Using EDM Techniques. Proceedings of the IEEE International

Conference on Soft-Computing and Network Security, ICSNS 2015, 1-7.


Pregibon, D. (1982). Resistant Fits for Some Commonly Used Logistic Models with

Medical Applications. Biometrics, 38(2), 485–498.


Pruscha, H. (1994). Partial Residuals in Cumulative Regression Models for Ordinal

Data. Statistical Papers, 35(1), 273–284.


Pulkstenis, E., & Robinson, T. J. (2004). Goodness-of-fit Tests for Ordinal Response

Regression Models. Statistics in Medicine, 23(6), 999–1014.


Radmehr, F., & Drake, M. (2018a). An Assessment-based Model for Exploring the

Solving of Mathematical Problems: Utilizing Revised Bloom’s Taxonomy and

Facets of Metacognition. Studies in Educational Evaluation, 59, 41–51.


Radmehr, F., & Drake, M. (2018b). Revised Bloom’s Taxonomy and Major Theories

and Frameworks That Influence the Teaching, Learning, and Assessment of

Mathematics: a Comparison. International Journal of Mathematical Education in

Science and Technology, 50(6), 895-920.


Raines, T. C., Gordon, M., Harrell-Williams, L., Diliberto, R. A., & Parke, E. M.

(2017). Adaptive Skills and Academic Achievement in

Latino Students. Journal of Applied School Psychology, 33(4), 245–260.


Rajeswari, S., & Lawrance, R. (2016). Classification Model to Predict the Learners’

Academic Performance Using Big Data. 2016 International Conference on

Computing Technologies and Intelligent Data Engineering (ICCTIDE’16), 1–6.


Rasch, G. (1960). Probabilistic Models for Some Intelligence and Attainment Tests.

(Copenhagen, Danish Institute for Educational Research), expanded edition

(1980) with foreword and afterword by B. D. Wright. Chicago: The University of

Chicago Press.


Rasheed, B. A., Adnan, R., Saffari, S. E., & Pati, K. (2014). Robust Weighted Least

Squares Estimation of Regression Parameter in the Presence of Outliers and

Heteroscedastic Errors. Jurnal Teknologi, 71(1), 11–18.


Raus, M. I. M., Janor, R. M., Sadjirin, R., & Sahri, Z. (2014). The Development of i-

QuBES for UiTM: From Feasibility Study to the Design Phase. Proceedings -

2014 5th IEEE Control and System Graduate Research Colloquium, ICSGRC

2014, 96–101.


Reckase, M. D. (2009). Multidimensional Item Response Theory Models. New York,

NY: Springer New York.


Ren, Z., & Sweeney, M. (2016). Predicting Student Performance Using Personalized

Analytics. Computer, 49(4), 61–69.


Rezaie, M., & Golshan, M. (2015). Computer Adaptive Test (CAT): Advantages and

Limitations. International Journal of Educational Investigations Available

Online, 2(5), 128–137.


Riani, M., Torti, F., & Zani, S. (2012). Outliers and Robustness for Ordinal Data.

Modern Analysis of Customer Surveys: with applications using R (1st ed.), 155–



Ricardo, A. M., Douglas, R. M., Victor, J. Y., & Matias, S. B. (2019). Robust Statistics:

Theory and Methods (with R) (2nd ed.). John Wiley & Sons Ltd.


Riese, A., Rappaport, L., Alverson, B., Park, S., & Rockney, R. M. (2017). Clinical

Performance Evaluations of Third-Year Medical Students and Association With

Student and Evaluator Gender. Academic Medicine, 92(6), 835–840.


Ripley, B., Venables, B., Bates, D. M., Hornik, K., Gebhardt, A., & Firth, D. (2020).

MASS: Support Functions and Datasets for Venables and Ripley's MASS.

Retrieved from


Rizopoulos, D. (2006). ltm: An R Package for Latent Variable Modeling. Journal of

Statistical Software, 17(5), 1–25.


Rizopoulos, D. (2018). Package “ltm”: Latent Trait Models under IRT. Retrieved



Rojko, A. (2017). Industry 4.0 Concept: Background and Overview. International

Journal of Interactive Mobile Technologies (IJIM), 11(5), 77–90.


Ronald, K. H., & Russell, W. J. (1993). Comparison of Classical Test Theory and Item

Response Theory and Their Applications to Test Development. Educational

Measurement: Issues and Practice, 38-47.


Rosaini, R., Budiyono, B., & Pratiwi, H. (2019). Mathematics Teacher Supporting

Higher Order Thinking Skill of Students Through Assessment as Learning in

Instructional Model. Journal of Physics: Conference Series, 1157.


Rousseeuw, P. J., & Leroy, A. M. (1987). Robust Regression and Outlier Detection.

Hoboken, NJ, USA: John Wiley & Sons, Inc.


Rousseeuw, P. J., & van Driessen, K. (1999). A Fast Algorithm for the Minimum

Covariance Determinant Estimator. Technometrics, 41(3), 212.


Rubio, D. M., Berg-Weger, M., Tebb, S. S., Lee, E. S., & Rauch, S. (2003).

Objectifying Content Validity: Conducting a Content Validity Study in Social

Work Research. Social Work Research, 27(2), 94–104.


Ruckstuhl, A. (2016). Robust Fitting of Parametric Models Based on M-Estimation.


Rusch, T., Mair, P., & Hatzinger, R. (2013). Psychometrics with R: A Review of CRAN

Packages for Item Response Theory. Discussion Paper Series of the Center for

Empirical Research Methods, 1–28.


Rusimamto, P. W., Nurlaela, L., Sumbawati, M. S., Munoto, & Samani, M. (2019).

Development of Critical and Creative Thinking Skills to Increase Competence of

PLC Programming for Electrical Engineering Education Students. IOP

Conference Series: Materials Science and Engineering, 535(1).


Sagala, P. N., & Andriani, A. (2019). Development of Higher-Order Thinking Skills

(HOTS) Questions of Probability Theory Subject Based on Bloom’s Taxonomy.

Journal of Physics: Conference Series, 1188(1), 1–13.


Sagar, P., Prinima, & Indu (2017). Analysis of Prediction Techniques based on

Classification and Regression General Terms. International Journal of Computer

Applications, 163(7), 47-51.


Said-Metwaly, S., Kyndt, E., & Van den Noortgate, W. (2019). The Factor Structure

of the Verbal Torrance Test of Creative Thinking in an Arabic Context: Classical

Test Theory and Multidimensional Item Response Theory Analyses. Thinking

Skills and Creativity, 35.


Salim, N. R., Fauzi, A., & Ayub, M. (2017). Relationship Between Mathematics

Statistics Engagement and Attitudes Towards Statistics Among Undergraduate

Students in Malaysia. AIP Conference Proceedings, 1795.


Sall, J. (1991). A Monotone Regression Smoother Based on Ordinal Cumulative

Logistic Regression. ASA Proceedings of Statistical Computing Section, 276–281.


Salzberger, T., & Koller, M. (2019). The Direction of the Response Scale Matters-

Accounting for the Unit of Measurement. European Journal of Marketing, 53(5),



Samejima, F. (1972). A General Model for Free-Response. Psychometrika, 35(18), 139.


SAS Institute Inc. (2017). SAS/STAT® 14.3 User’s Guide: The CATMOD Procedure.

Retrieved from


SAS Institute Inc. (2019). SAS Help Center: PROC LOGISTIC Statement. Retrieved

July 12, 2020, from


SAS Institute Inc. (2020). What is a Data Scientist? Retrieved April 21, 2020, from


Seheult, A. H., Green, P. J., Rousseeuw, P. J., & Leroy, A. M. (2006). Robust

Regression and Outlier Detection. Journal of the Royal Statistical Society. Series

A (Statistics in Society), 152(1), 133.


Seifu, G. (2016). Assessment of the Implementation of Continuous Assessment: the

Case of METTU University. European Journal of Science and Mathematics

Education, 4(4), 534–544.


Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A Review on Predicting Student’s

Performance Using Data Mining Techniques. Procedia Computer Science, 72,



Sharif, S., & Atiany, T. A. M. (2018). Testing Several Correlation Matrices Using

Robust Approach. Asian Journal of Scientific Research, 11(1), 84–95.


Sheng, Y., & Wikle, C. K. (2009). Bayesian IRT Models Incorporating General and

Specific Abilities. Behaviormetrika, 36(1), 27–48.


Sikder, M. F., Uddin, M. J., & Halder, S. (2016). Predicting Students Yearly

Performance Using Neural Network: a Case Study of BSMRSTU. 2016 5th

International Conference on Informatics, Electronics and Vision (ICIEV), 524–



Simeckova, M. (2005). Maximum Weighted Likelihood Estimator in Logistic

Regression. In WDS’05 Proceedings of Contributed Papers, 144–148.


Slim, A., Heileman, G. L., Kozlick, J., & Abdallah, C. T. (2015). Predicting Student

Success Based on Prior Performance. In Proceedings - 2014 IEEE Symposium on

Computational Intelligence and Data Mining (CIDM), 410–415.


Smith, E. V. (2002). Detecting and Evaluating the Impact of Multidimensionality Using

Item Fit Statistics and Principal Component Analysis of Residuals. Journal of

Applied Measurement, 3(2), 205–231.


Smith, G. (2018). Step Away From Stepwise. Journal of Big Data, 5(32), 1–12.


Snell, E. J., Cox, D., & Cox, R. (1987). Applied Statistics: A Handbook of BMDP™

Analyses. Springer Science Business Media.


Solihatun, S., Rangka, I. B., Ratnasari, D., Radyati, A., Siregar, Y., Wulansari, L., …

Rahim, R. (2019). Measuring of Student Learning Performance Based on

Geometry Test for Middle Class in Elementary School Using Dichotomous Rasch

Analysis. Journal of Physics: Conference Series, 1157(3), 1-7.


Sorour, S. E., Mine, T., Goda, K., & Hirokawa, S. (2015). Predicting Students’ Grades

Based on Free Style Comments Data by Artificial Neural Network. Proceedings -

Frontiers in Education Conference, FIE, 1-9.


Sothan, S. (2018). The Determinants of Academic Performance: Evidence From a

Cambodian University. Studies in Higher Education, 44(11), 2096-2111.


SSI (2020a). BILOGMG. Retrieved July 14, 2020, from

SSI (2020b). PARSCALE. Retrieved July 14, 2020, from


Steyer, R. (2015). Classical (Psychometric) Test Theory. International Encyclopedia of

the Social & Behavioral Sciences, 3, 785-791.


Sturman, E. D., & Zappala-Piemme, K. (2017). Development of the Grit Scale for

Children and Adults and Its Relation to Student Efficacy, Test Anxiety, and

Academic Performance. Learning and Individual Differences, 59, 1–10.


Summers, M. M., Couch, B. A., Knight, J. K., Brownell, S. E., Crowe, A. J., Semsar,

K., … Smith, M. K. (2018). EcoEvo-MAPS: An Ecology and Evolution

Assessment for Introductory Through Advanced Undergraduates. CBE Life

Sciences Education, 17(2).


Susanti, Y., Pratiwi, H., H., S. S., & Liana, T. (2014). M Estimation, S Estimation, and

MM Estimation in Robust Regression. International Journal of Pure and Applied

Mathematics, 91(3), 349–360.


SwMATH (2020). MULTILOG- Mathematical Software. Retrieved July 14, 2020,



Tai, J., Dawson, P., Panadero, E., Boud, D., & Ajjawi, R. (2017). Developing

Evaluative Judgement: Enabling Students to Make Decisions About the Quality of

Work. Higher Education, 467–481.


TalentCorp (2019). Semak Apa Pekerjaan Masa Hadapan Untuk Anda. Petaling Jaya.

Retrieved from


Tawil, N. M., Ismail, N. A., Asshaari, I., Osman, H., Nopiah, Z. M., & Zaharim, A.

(2012). Comparing Lecture and E-learning as Learning Process in Mathematics

and Statistics Courses for Engineering Students in Universiti Kebangsaan

Malaysia. Procedia - Social and Behavioral Sciences, 60, 420–425.


Tekkumru-Kisa, M., & Stein, M. K. (2017). A Framework for Planning and Facilitating

Video-based Professional Development. International Journal of STEM Education, 4, 28.


Testa, S., Toscano, A., & Rosato, R. (2018). Distractor Efficiency in an Item Pool for

a Statistics Classroom Exam: Assessing Its Relation With Item Cognitive Level

Classified According to Bloom’s Taxonomy. Frontiers in Psychology, 9, 1–12.


Thaneerananon, T., Triampo, W., & Nokkaew, A. (2016). Development of a Test to

Evaluate Students’ Analytical Thinking Based on Fact versus Opinion

Differentiation. International Journal of Instruction, 9(2), 123–138.


Tharwat, A. (2009). Principal Component Analysis-A Tutorial.


Thiele, T., Singleton, A., Pope, D., & Stanistreet, D. (2016). Predicting Students’

Academic Performance Based on School and Socio-demographic Characteristics.

Studies in Higher Education, 41(8), 1424-1446.


Thompson, L. A. (2009). R (and S-PLUS) Manual to Accompany Agresti’s Categorical

Data Analysis (2002) 2nd edition. Categorical Data Analysis.


Tijmstra, J., & Bolsinova, M. (2019). Bayes Factors for Evaluating Latent Monotonicity

in Polytomous Item Response Theory Models. Psychometrika, 84(3), 846–869.


Tutz, G. (2014). Regression for Categorical Data. Cambridge: Cambridge University



Ueckert, S. (2018). Modeling Composite Assessment Data Using Item Response

Theory. CPT: Pharmacometrics and Systems Pharmacology, 7(4), 205–218.


Ünlü, A., & Yanagida, T. (2011). R You Ready for R?: The CRAN Psychometrics Task

View. British Journal of Mathematical and Statistical Psychology, 64(1), 182–



van der Linden, W. J. (2016). Handbook of Item Response Theory Volume One.

London, New York: Taylor & Francis Group.


van der Linden, W. J. (2018). Handbook of Item Response Theory Volume Three:

Applications. London, New York: Taylor & Francis Group.


van der Zanden, P. J. A. C., Denessen, E., Cillessen, A. H. N., & Meijer, P. C. (2018).

Domains and Predictors of First-year Student Success: a Systematic Review.

Educational Research Review, 23, 57–77.


Villagrá-Arnedo, C. J., Gallego-Durán, F. J., Llorens-Largo, F., Compañ-Rosique, P.,

Satorre-Cuerda, R., & Molina-Carmona, R. (2017).

Improving the Expressiveness of Black-box Models for Predicting Student

Performance. Computers in Human Behavior, 72, 621–631.


Villarroel, V., Boud, D., Bloxham, S., Bruna, D., & Bruna, C. (2020). Using Principles

of Authentic Assessment to Redesign Written Examinations and Tests.

Innovations in Education and Teaching International, 57(1), 38–49.


Vora, D. R., & Rajamani, K. (2019). A Hybrid Classification Model for Prediction of

Academic Performance of Students: a Big Data Application. Evolutionary



Walker, S. H., & Duncan, D. B. (1967). Estimation of the Probability of an Event as a

Function of Several Independent Variables. Biometrika, 54, 167–179.


Wang, J. C., & Holan, S. H. (2012). Bayesian Multi-regime Smooth Transition

Regression With Ordered Categorical Variables. Computational Statistics and

Data Analysis, 56(12), 4165–4179.


Wang, R., Hao, P., Zhou, X., & Campbell, A. T. (2015). SmartGPA: How Smartphones

Can Assess and Predict Academic Performance of College

Students. In the 2015 ACM International Joint Conference on Ubiquitous

Computing (UbiComp 2015), 19, 13–17.


Watan, S., & Sugiman. (2018). Exploring the Relationship Between Teachers’

Instructional and Students’ Geometrical Thinking Levels Based on Van Hiele

Theory. Journal of Physics: Conference Series, 1097(1).


Weng, T. S., & Yang, D. C. (2017). Research on Mathematical Animation Using Pascal

Animation as an Example. Eurasia Journal of Mathematics, Science and

Technology Education, 13(6), 1687–1699.


Whitney, B. M., Cheng, Y., Brodersen, A. S., & Hong, M. R. (2018). The Scale of

Student Engagement in Statistics: Development and Initial Validation. Journal of

Psychoeducational Assessment, 37(5), 553-565.


Wijekoon, C. N., Amaratunge, H., Silva, Y. De, & Senanayake, S. (2017). Emotional

Intelligence and Academic Performance of Medical Undergraduates: a Cross-sectional

Study in a Selected University in Sri Lanka. BMC Medical Education,

17(176), 1–11.


Williams, R. A. (2016). Ordinal Regression Models: Problems, Solutions, and

Problems With the Solutions. Stata Users Group, German Stata Users' Group

Meetings 2008.


Winsteps (2020). Rasch Analysis + Rasch Measurement Software + 1PL IRT.

Retrieved July 14, 2020, from


Wright, B. D., & Panchapakesan, N. (1969). A Procedure for Sample-Free Item

Analysis. Educational and Psychological Measurement, 29, 23–48.


Xu, J., Moon, K. H., & van der Schaar, M. (2017). A Machine Learning Approach for

Tracking and Predicting Student Performance in Degree Programs. IEEE Journal

of Selected Topics in Signal Processing, 11(5), 742–753.


Ye, F., & Lord, D. (2014). Comparing Three Commonly Used Crash Severity Models

on Sample Size Requirements: Multinomial Logit, Ordered Probit and Mixed

Logit Models. Analytic Methods in Accident Research, 1, 72–85.


Yee, T., & Moler, C. (2020). VGAM: Vector Generalized Linear and Additive Models.

Retrieved from


Yen, T. S., & Halili, S. H. (2015). Effective Teaching of Higher-Order Thinking (HOT)

in Education. Distance Education and E-Learning, 3(2), 41–47.


You, H. S., Kim, K., Black, K., & Min, K. W. (2018). Assessing Science Motivation

for College Students: Validation of the Science Motivation Questionnaire II Using

the Rasch-Andrich Rating Scale Model. Eurasia Journal of Mathematics, Science

and Technology Education, 14(4), 1161–1173.


Young, D. E., & Meredith, D. C. (2017). Using the Resources Framework to Design,

Assess, and Refine Interventions on Pressure in Fluids. Physical Review Physics

Education Research, 13(1), 1–16.


Yusof, A. L., Naim, N. F., Latip, M. F. A., Aminuddin, N., & Ya’acob, N. (2017).

Implementation of Integrated Cumulative Grade Point Average (iCGPA) Towards

Academic Excellence in Malaysia. In 2017 IEEE 9th International Conference on

Engineering Education (ICEED), 106–109.


Zainudin, S., Ahmad, K., Ali, N. M., & Zainal, N. F. A. (2012). Determining Course

Outcomes Achievement Through Examination Difficulty Index Measurement.

Procedia - Social and Behavioral Sciences, 59, 270–276.


Zhang, Q., & Stephens, M. (2016). Profiling Teacher Capacity in Statistical Thinking

of National Curriculum Reform: a Comparative Study Between Australia and

China. Eurasia Journal of Mathematics, Science and Technology Education,

12(4), 733–746.


Zollanvari, A., Kizilirmak, R. C., Kho, Y. H., & Hernandez-Torrano, D. (2017).

Predicting Students’ GPA and Developing Intervention Strategies Based on Self-

Regulatory Learning Behaviors. IEEE Access, 5, 23792-23802.


Zulkifli, F., Abidin, R. Z., & Mohamed, Z. (2019). Evaluating the Quality of Exam

Questions: a Multidimensional Item Response. International Journal of Recent

Technology and Engineering, 8(2 Special Issue 11), 606–612.


Zulkifli, F., Abidin, R. Z., Razi, N. F. M., Mohammad, N. H., Ahmad, R., & Azmi, A.

Z. (2018). Evaluating Quality and Reliability of Final Exam Questions for

Probability and Statistics Course Using Rasch Model. International Journal of

Engineering and Technology (UAE), 7(4), 32–36.

