| 
              UPSI Digital Repository (UDRep)               |   | 
| 
 | 
 | ||||||||||||||||||||||||
| Abstract : Universiti Pendidikan Sultan Idris | 
| Kajian ini bertujuan membangunkan model regresi ordinal teori respons item (TRI)
teguh dalam meramal prestasi gred peperiksaan akhir pelajar. Kaedah pembangunan
model adalah berasaskan model regresi ordinal iaitu model ganjil kumulatif (MGK) dan
analisis literatur bersistematik. MGK diubah suai dengan menerapkan TRI dan kaedah
teguh penganggar-M (pemberat Huber dan Tukey Bisquare). Sampel kajian terdiri
daripada 326 orang pelajar dari salah sebuah universiti awam di Malaysia yang
mendaftar kursus berkaitan STEM. Sementara enam orang pakar dalam bidang statistik
terlibat bagi mengesahkan kualiti sampel item soalan yang digunakan. Data kajian
dianalisis menggunakan analisis deskriptif, indeks tahap persetujuan Cohen Kappa,
analisis faktor, analisis pengukuran Rasch, plot diagnostik dan penyuaian model. Model
yang dibangunkan diuji kebagusannya terhadap data sebenar dan simulasi. Simulasi
Monte Carlo dijalankan berdasarkan faktor simulasi iaitu saiz sampel, kombinasi tahap
kesukaran, peratus pencemaran dan sisihan piawai data pencilan yang melibatkan
ukuran bias, ralat punca min kuasa dua, pekali penentuan dan statistik Lipsitz. Dapatan
kajian mendapati model yang menerapkan TRI dimensi berbilang memberikan hasil
penyuaian lebih baik berbanding model asas yang mana statistik Lipsitz bagi MGK-TRI
(522.78) adalah kurang daripada MGK (549.94). Manakala, penganggar-M dengan
pemberat Tukey Bisquare menunjukkan prestasi keteguhan lebih baik berbanding
pemberat Huber dan penganggar kebolehjadian maksimum. Kesimpulannya, kajian ini
berjaya membangunkan model ramalan prestasi gred peperiksaan akhir pelajar yang
menerapkan TRI dan kaedah teguh dalam mengatasi masalah multikolinearan dan
pengaruh data pencilan pada model regresi ordinal. Model yang dihasilkan memberikan
implikasi dari segi teoritikal, metodologi dan sumbangan kepada pihak-pihak
berkepentingan dalam statistik dan pendidikan, Kementerian Pendidikan Tinggi
Malaysia, universiti dan industri dalam meramal prestasi gred peperiksaan akhir pelajar. | 
| References | 
| Abd Mutalib, Z. (2018). UA Diberi Pilihan Laksana iCGPA. Berita Harian Online. Retrieved from https://www.bharian.com.my/berita/nasional/2018/06/440082/uadiberi- pilihan-laksana-icgpa. 
 Abdullah, A. H., Abidin, N. L. Z., & Ali, M. (2015). Analysis of Students’ Errors in Solving Higher Order Thinking Skills (HOTS) Problems for the Topic of Fraction. Asian Social Science, 11(21), 133–142. 
 Abdullah, A. H., Mokhtar, M., Halim, N. D. A., Ali, D. F., Tahir, L. M., & Kohar, U. H. A. (2017). Mathematics Teachers’ Level of Knowledge and Practice on the Implementation of Higher-Order Thinking Skills (HOTS). Eurasia Journal of Mathematics, Science and Technology Education, 13(1), 3–17. 
 Abreu, M. N., Siqueira, A. L., Cardoso, C. S., & Caiaffa, W. T. (2008). Ordinal Logistic Regression Models: Application in Quality of Life Studies. Cadernos de Saude Publica, 24 Suppl 4, s581–s591. 
 Adams, R. ., Wu, M. ., Cloney, D., & Wilson, M. R. (2020). ConQuest. Retrieved July 14, 2020, from https://www.acer.org/my/conquest 
 Adedayo, A. A., & Ojo, O. O. (2018). Bayesian Method for Solving the Problem of Multicollinearity in Regression. Afrika Statistika, 13(3), 1823–1834. 
 Adejo, O. W., & Connolly, T. (2018). Predicting Student Academic Performance Using Multi-model Heterogeneous Ensemble Approach. Journal of Applied Research in Higher Education, 10(1), 61–75. 
 Adnan, A., & Sugiarto, S. (2017). The Outlier Detection for Ordinal Data Using Scalling Technique of Regression Coefficients. In IOP Conf. Series: Journal of Physics: Conf. Series (Vol. 855, pp. 1–7). 
 Agresti, A. (1989). Tutorial on Modeling Ordered Categorical Response Data. Psychological Bulletin, 105(2), 290–301. 
 Agresti, A. (2010). Analysis of Ordinal Categorical Data (2nd ed.). John Wiley & Sons, Inc. 
 Agresti, A. (2013). Categorical Data Analysis. Wiley-Interscience. 
 Agus, M., Penna, M. P., Peró-Cebollero, M., & Guàrdia-Olmos, J. (2016). Assessing Probabilistic Reasoning in Verbal-numerical and Graphical-pictorial Formats: an Evaluation of the Psychometric Properties of an Instrument. Eurasia Journal of Mathematics, Science and Technology Education, 12(8), 2013–2038. 
 Akinoso, S. O. (2018). Mathematics Teachers Awareness of Teachable Moments in Nigerian Classroom. Eurasia Journal of Mathematics, Science and Technology Education, 14(2), 683–689. 
 Alhadlaq, A. M., Alshammari, O. F., Alsager, S. M., Neel, K. A. F., & Mohamed, A. G. (2015). Ability of Admissions Criteria to Predict Early Academic Performance Among Students of Health Science Colleges at King Saud University, Saudi Arabia. Journal of Dental Education, 79(6), 665–670. 
 Al-khadher, M. M. A., & Albursan, I. S. (2017). Accuracy of Measurement in the Classical and the Modern Test Theory : an Empirical Study on a Children Intelligence Test Accuracy of Measurement in the Classical and the Modern Test Theory : an Empirical Study on a Children Intelligence Test. International Journal of Psychological Studies, 9(1), 71. 
 Al-Sheeb, B. A., Hamouda, A. M., & Abdella, G. M. (2019). Modeling of Student Academic Achievement in Engineering Education Using Cognitive and Non- Cognitive Factors. Journal of Applied Research in Higher Education, 11(2), 178– 198. 
 Alzen, J. L., Langdon, L. S., & Otero, V. K. (2018). A Logistic Regression Investigation of the Relationship Between the Learning Assistant Model and Failure Rates in Introductory STEM Courses. International Journal of STEM Education, 5(1), 1– 12. 
 Ananth, C. V, & Kleinbaum, D. G. (1997). Regression Models for Ordinal Responses : A Review of Methods and Applications. International Journal of Epidermiology, 26(6), 1323–1333. 
 Anazifa, R. D., & Djukri. (2017). Project-Based Learning and Problem- Based Learning: Are They Effective to Improve Student’s Thinking Skills? Jurnal Pendidikan IPA Indonesia, 6(2), 346–355. 
 Anderson, J. A. (1984). Regression and Ordered Categorical Variables. Journal of the Royal Statistical Society. Series B (Methodological). WileyRoyal Statistical Society. 
 Anderson, L. W., Krathwohl, D. R., P.W., A., Cruikshank, K. A., Mayer, R. E., Pintrich, P. R., … Wittrock, M. C. (2001). A Taxonomy for Learning, Teaching, and Assessing: a Revision of Bloom’s Taxonomy of Educational Objectives. New York: Longman. 
 Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561-573. 
 Arco-tirado, J. L., Fernández-martín, F., Ramos-garcía, A. M., Littvay, L., & Villoria, J. (2018). A Counterfactual Impact Evaluation of a Bilingual Program on Students’ Grade Point Average at a Spanish University. Evaluation and Program Planning, 68(February), 81–89. 
 Ari, E., & Yildiz, Z. (2014). Parallel Lines Assumption in Ordinal Logistic Regression and Analysis Approaches. International Interdisciplinary Journal of Scientific Research, 1(3), 8–23. 
 Arievitch, I. M. (2020). The Vision of Developmental Teaching and Learning and Bloom’s Taxonomy of Educational Objectives. Learning, Culture and Social Interaction, 25, 100274. 
 Artusi, R., Verderio, P., & Marubini, E. (2002). Bravais-pearson and Spearman Correlation Coefficients: Meaning, Test of Hypothesis and Confidence Interval. The International Journal of Biological Markers, 17(2), 148–151. 
 Ashenafi, M. M., Riccardi, G., & Ronchetti, M. (2015). Predicting Students’ Final Exam Scores From Their Course Activities. In Proceedings - Frontiers in Education Conference, FIE, 2014. 
 Asshaari, I., Othman, H., Bahaludin, H., Ismail, N. A., & Nopiah, Z. M. (2012). Appraisal on Bloom’s Separation in Final Examination Question of Engineering Mathematics Courses Using Rasch Measurement Model. Procedia - Social and Behavioral Sciences, 60(2009), 172–178. 
 Athani, S. S., Kodli, S. A., Banavasi, M. N., & Hiremath, P. G. S. (2017). Student Academic Performance and Social Behavior Predictor Using Data Mining Techniques. In Proceedings - IEEE International Conference Computing Communication Automation ICCCA 2017 ,170–174. 
 Auerbach, A. J. J., & Andrews, T. C. (2018). Pedagogical Knowledge for Activelearning Instruction in Large Undergraduate Biology Courses: a Large-scale Qualitative Investigation of Instructor Thinking. International Journal of STEM Education, 5(1). 
 Ayers, E., & Junker, B. (2008). IRT Modeling of Tutor Performance to Predict End-of- Year Exam Scores. Educational and Psychological Measurement, 68(6), 972–987. 
 Azizah, U., & Nasrudin, H. (2018). Development of Chemistry Instructional Materials Based on Cooperative Group Investigation (CGI) to Empower Thinking Skills. Journal of Physics: Conference Series, 1108(1). 
 Bäcklin, C. L., & Gustafsson, M. G. (2018). Developer-Friendly and Computationally Efficient Predictive Modeling Without Information Leakage: the emil Package for R. Journal of Statistical Software, 85(13). 
 Badri, M., Alnuaimi, A., Mohaidat, J., Al Rashedi, A., Yang, G., & Al Mazroui, K. (2016). My Science Class and Expected Career Choices- a Structural Equation Model of Determinants Involving Abu Dhabi High School Students. International Journal of STEM Education, 3(1). 
 Bahrum, S., Wahid, N., & Ibrahim, N. (2017). Integration of STEM Education in Malaysia and Why to STEAM. International Journal of Academic Research in Business and Social Sciences, 7(6), 645–654. 
 Baily, C., Ryan, Q. X., Astolfi, C., & Pollock, S. J. (2017). Conceptual Assessment Tool for Advanced Undergraduate Electrodynamics. Physical Review Physics Education Research, 13(2), 1–10. 
 Baker, F. B., & Kim, S.-H. (2004). Item Response Theory: Parameter Estimation Techniques (2nd ed.). Taylor & Francis Group. 
 Bal, C., Demir, S., & Aladag, C. H. (2016). A Comparison of Different Model Selection Criteria for Forecasting EURO / USD Exchange Rates by Feed Forward Neural Network. In Proceedings - International Journal of Computing, Communications & Instrumentation Enggineering (IJCCIE), 3(2), 1–5. 
 Bana, M., & Ligas, M. (2014). Empirical Tests of Performance of Some M–estimators. Geodesy and Cartography, 63(2), 127–146. 
 Barlybayev, A., Sharipbay, A., Ulyukova, G., Sabyrov, T., & Kuzenbayev, B. (2016). Student’s Performance Evaluation by Fuzzy Logic. Procedia Computer Science, 102(August), 98–105. 
 Battauz, M. (2015). equateIRT : An R Package for IRT Test Equating . Journal of Statistical Software, 68(7). 
 Baur, T., & Lukes, D. (2009). An Evaluation of the IRT Models Through Monte Carlo Simulation. UW-L Journal of Undergraduate Research, (XII), 1–7. 
 Baygin, M., Yetis, H., Karakose, M., & Akin, E. (2016). An Effect Analysis of Industry 4.0 to Higher Education. In 2016 15th International Conference on Information Technology Based Higher Education and Training (ITHET), 1–4. 
 Begg, A. (1997). Some Emerging Influences Underpinning Assessment in Statistics. The Assessment Challenge in Statistics Education, 17–25. 
 Bellettini, C., Lonati, V., Malchiodi, D., Monga, M., Morpurgo, A., & Torelli, M. (2015). How Challenging Are Bebras Tasks? An IRT Analysis Based on the Performance of Italian Students. Annual Conference on Innovation and Technology in Computer Science Education, ITiCSE, 2015-June, 27–32. 
 Bender, R., & Benner, A. (2000). Calculating Ordinal Regression Models in SAS and S-Plus. Biometrical Journal, 42(6), 677–700. 
 Benešová, A., & Tupa, J. (2017). Requirements for Education and Qualification of People in Industry 4.0. Procedia Manufacturing, 11, 2195–2202. 
 Berg, R. G. van den. (2020). SPSS Factor Analysis- Absolute Beginners Tutorial. Retrieved July 14, 2020, from https://www.spss-tutorials.com/spss-factoranalysis- tutorial/ 
 Bernama (2018). UiTM Rombak iCGPA Supaya Lebih Mesra Pensyarah. Retrieved April 22, 2020, from http://www.astroawani.com/berita-malaysia/uitm-rombakicgpa- supaya-lebih-mesra-pensyarah-180416 
 Bianco, A. M., & Yohai, V. J. (1996). Robust Estimation in the Logistic Regression Model, 17–34. 
 Binh, H. T., & Duy, B. T. (2017). Predicting Students’ Performance Based on Learning Style by Using Artificial Neural Networks, In Proceedings - 2017 9th International Conference on Knowledge and Systems Engineering (KSE), 48-53 
 Bloom, B. S. (1956). Taxonomy of Educational Objectives – Handbook 1 Cognitive Domain. London: Longman. 
 Bock, R. D., & Aitkin, M. (1981). Marginal Maximum Likelihood Estimation of Item Parameters: Application of an EM Algorithm. Psychometrika, 46(4), 443–459. 
 Bond, T., & Fox, C. M. (2015). Applying The Rasch Model Fundamental Measurement in the Human Sciences (3rd ed.). New York: Routledge. 
 Bondell, H. D. (2008). A Characteristic Function Approach to the Biased Sampling Model, With Application to Robust Logistic Regression. Journal of Statistical Planning and Inference, 138, 742–755. 
 Bonsaksen, T., Brown, T., Lim, H. B., & Fong, K. (2017). Approaches to Studying Predict Academic Performance in Undergraduate Occupational Therapy Students: a Cross-cultural Study. BMC Medical Education, 17(1), 1–9. 
 Brassil, C. E., & Couch, B. A. (2019). Multiple-true-false Questions Reveal More Thoroughly the Complexity of Student Thinking Than Multiple-choice Questions: a Bayesian Item Response Model Comparison. International Journal of STEM Education, 6(1). 
 Brazeal, K. R., Brown, T. L., & Couch, B. A. (2016). Characterizing Student Perceptions of and Buy-in Toward Common Formative Assessment Techniques. CBE Life Sciences Education, 15(4). 
 Brester, C., Rönkkö, M., Kolehmainen, M., Semenkin, E., Kauhanen, J., Tuomainen, T.-P., … Ronkainen, K. (2018). Evolutionary Methods for Variable Selection in the Epidemiological Modeling of Cardiovascular Diseases. BioData Mining, 11(1), 1–14. 
 Buergin, R. (2020). vcrpart: Tree-Based Varying Coefficient Regression for Generalized Linear and Ordinal Mixed Models. Retrieved from https://CRAN.R project.org/package=vcrpart. 
 Bulut, O., & Sunbul, Ö. (2017). Monte Carlo Simulation Studies in Item Response Theory with the R Programming Language. Journal of Measurement and Evaluation in Education and Psychology, 8(3), 266–287. 
 Buniyamin, N., Mat, U. Bin, & Arshad, P. M. (2016). Educational Data Mining for Prediction and Classification of Engineering Students Achievement. IEEE 7th International Conference on Engineering Education, ICEED 2015, 49–53. 
 Butterworth, J., & Thwaites, G. (2013). Thinking Skills: Critical Thinking and Problem Solving (2nd ed.). Cambridge: Cambridge University Press. 
 Cai, L. (2010). Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item Factor Analysis. Journal of Educational and Behavioral Statistics, 35(3), 307– 335. 
 Capuano, A. W. (2012). Constrained Ordinal Models With Application in Occupational and Constrained Ordinal Models With Application in Occupational and Environmental Health Environmental Health. University of Iowa. 
 Carroll, R. J., & Pederson, S. (1993). On Robustness in the Logistic Regression Model. Journal Royal Statistical Society, 55(3), 693–706. 
 Celik, A. O., & Guzel, B. E. (2017). Mathematics Teachers’ Knowledge of Student Thinking and Its Evidences in Their Instruction. Journal on Mathematics Education, 8(2), 199–210. 
 Chalmers, R. P. (2012). mirt : A Multidimensional Item Response Theory Package for the R Environment. Journal of Statistical Software, 48(6). 
 Chalmers, R. P. (2016). Generating Adaptive and Non-adaptive Test Interfaces for Multidimensional Item Response Theory Applications. Journal of Statistical Software, 71. 
 Chan, S. W., Ismail, Z., & Sumintono, B. (2014). A Rasch Model Analysis on Secondary Students’ Statistical Reasoning Ability in Descriptive Statistics. Procedia - Social and Behavioral Sciences, 129, 133–139. 
 Chootongchai, S., & Songkram, N. (2018). Design and Development of SECI and Moodle Online Learning Systems to Enhance Thinking and Innovation Skills for Higher Education Learners. International Journal of Emerging Technologies in Learning, 13(3), 154–172. 
 Christensen, R. H. B. (2019). ordinal: Regression Models for Ordinal Data. Retrieved from https://CRAN.R-project.org/package=ordinal. 
 Christian, T. M., & Ayub, M. (2014). Exploration of Classification Using NBtree for Predicting Students’ Performance. Proceedings of 2014 International Conference on Data and Software Engineering, ICODSE 2014, 1–6. 
 Cladellas, R., Muro, A., Vargas-Guzmán, E. A., Bastardas, A., & Gomà-i-Freixanet, M. (2017). Sensation Seeking and High School Performance. Personality and Individual Differences, 117, 117–121. 
 Clarke, B. S., & Clarke, J. L. (2018). Predictive Statistics : Analysis and Inference Beyond Models. Cambridge University Press. 
 Cohen, L., Manion, L., & Morrison, K. (2018). Research Methods in Education (8th ed.). Routledge. 
 Columbus, L. (2019). Data Scientist Leads 50 Best Jobs In America For 2019 According To Glassdoor. Retrieved April 21, 2020, from https://www.forbes.com. 
 Copas, J. B. (1988). Binary Regression Models for Contaminated Data. Journal Royal Statistical Society, 50(2), 225–265. 
 Crane, N., Zusho, A., Ding, Y., & Cancelli, A. (2017). Domain-specific Metacognitive Calibration in Children With Learning Disabilities. Contemporary Educational Psychology, 50, 72–79. 
 Cronbach, L. J. (1951). Coefficient Alpha and the Internal Structure of Tests. Psychometrika, 16(3), 297–334. 
 Croux, C., & Haesbroeck, G. (2003). Implementing the Bianco and Yohai Estimator for Logistic Regression. Computational Statistics & Data Analysis, 44, 273–295. 
 Croux, C., Flandre, C., & Haesbroeck, G. (2002). The Breakdown Behavior of the Maximum Likelihood Estimator in the Logistic Regression Model. Statistics and Probability Letters, 60(4), 377–386. 
 Croux, C., Haesbroeck, G., & Ruwet, C. (2013). Robust Estimation for Ordinal Regression. Journal of Statistical Planning and Inference, 143(9), 1486–1499. 
 Das, A. K., & Rodriguez-Marek, E. (2019). A Predictive Analytics System for Forecasting Student Academic Performance: Insights From a Pilot Project at Eastern Washington University. In 2019 Joint 8th International Conference on Informatics, Electronics & Vision (ICIEV) and 2019 3rd International Conference on Imaging, Vision & Pattern Recognition (icIVPR), 255–262. 
 de Kort, J. M., Dolan, C. V., Lubke, G. H., & Molenaar, D. (2017). Studying the Strength of Prediction Using Indirect Mixture Modeling: Nonlinear Latent Regression with Heteroskedastic Residuals. Structural Equation Modeling, 24(2), 301–313. 
 Deanna, S. (2018). Logistic and Linear Regression Assumptions : Violation Recognition and Control. In Recognition and Control, 1–21. 
 Dignan, L. (2019). Data Science Dominates Linkedin’s Emerging Jobs Ranking. Retrieved April 21, 2020, from https://www.zdnet.com/article/data-sciencedominates- linkedins-emerging-jobs-ranking/ 
 Dobson, A. J., & Barnett, A. G. (2018). An Introduction to Generalized Linear Models (4th ed.). Taylor & Francis Group. 
 Donoghoe, M. W. (2018). glm2: Fitting Generalized Linear Models. Retrieved from https://CRAN.R-project.org/package=glm2. 
 Drath, R., & Horch, A. (2014). Industrie 4.0: Hit or Hype? IEEE Industrial Electronics Magazine, 8(2), 56–58. 
 Dunham, B., Yapa, G., & Yu, E. (2015). Calibrating the Difficulty of an Assessment Tool: The Blooming of a Statistics Examination. Journal of Statistics Education, 23(3). 
 Edwards, J. M., & Finch, W. H. (2018). Recursive Partitioning Methods for Data Imputation in the Context of Item Response Theory: a Monte Carlo Simulation. Psicologica, 39(1), 88–117. 
 Ellis, J. L. (2019). Factor Analysis and Item Analysis. Retrieved from ttps://www.applyingstatisticsinbehaviouralresearch.com. 
 Embretson, S. E., Reise, S., & Reise, S. P. (2000). Item Response Theory for Psychologists (Multivariate Applications Book Series). New Jersey: Lawrence Erlbaum Associates, Inc. 
 Engel, J. (1988). Polytomous Logistic Regression. Statistica Neerlandica, 42(4), 233– 252. 
 Erda, G., Indahwati, & Djuraidah, A. (2019). Outlier Handling of Robust Geographically and Temporally Weighted Regression. Journal of Physics: Conference Series, 1175(1). 
 Fagerland, M. W., & Hosmer, D. W. (2012). A Generalized Hosmer-lemeshow Goodness-of-fit Test for Multinomial Logistic Regression Models. Stata Journal, 12(3), 447–453. 
 Fagerland, M. W., & Hosmer, D. W. (2016). Tests for Goodness of Fit in Ordinal Logistic Regression Models. Journal of Statistical Computation and Simulation, 86(17), 3398–3418. 
 Falk, C. F., & Ju, U. (2020). Estimation of Response Styles Using the Multidimensional Nominal Response Model: A Tutorial and Comparison With Sum Scores. Frontiers in Psychology, 11, 1–17. 
 Fernandez, D. B., & Lujan-Mora, S. (2017). Comparison of Applications for Educational Data Mining in Engineering Education. 2017 IEEE World Engineering Education Conference (EDUNINE), 81–85. 
 Fielding, A. (1999). Why Use Arbitrary Points Scores?: Ordered Categories in Models of Educational Progress. Journal of the Royal Statistical Society: Series A (Statistics in Society), 162(3), 303–328. 
 Fienberg, S. E. (1980). The Analysis of Cross-Classified Categorical Data: Second Edition. Cambridge: Massachusetts Institute of Technology Press. 
 Fisher Jr., W. P. (2007). Rating Scale Instrument Quality Criteria. Rasch Measurement Transaction, 21(1095). 
 Fitri, S., & Zahari, C. L. (2019). The Implementation of Blended Learning to Improve Understanding of Mathematics. Journal of Physics: Conference Series, 1188(1). 
 Fleckenstein, J., Leucht, M., Pant, H. A., & Köller, O. (2016). Proficient Beyond Borders: Assessing Non-native Speakers in a Native Speakers’ Framework. Large-Scale Assessments in Education, 4(1). 
 Foster, R. C. (2020). A Generalized Framework for Classical Test Theory. Journal of Mathematical Psychology, 96. 
 Fox, J., & Weisberg, S. (2012). An R Companion to Applied Regression: Third Edition. SAGE. 
 Francis, E. (2018). Effects of Some Coding Techniques On Multicolinearity and Model Statistics. Mathematical Theory and Modeling, 8(4), 156–167. 
 Franses, P. H., & Paap, R. (2010). Quantitative Models in Marketing Research. Cambridge University Press. 
 Fuadiah, N. F., Suryadi, D., & Turmudi, T. (2019). Teaching and Learning Activities in Classroom and Their Impact on Student Misunderstanding: A Case Study on Negative Integers. International Journal of Instruction, 12(1), 407–424. 
 Gerber, N. L., & Price, J. K. (2018). Measures of Function and Health-Related Quality of Life. Principles and Practice of Clinical Research. Elsevier Inc. 
 Gibbons, L. E., Crane, P. K., Seung, M., & Choi, W. (2016). Package “lordif” Type Package Title Logistic Ordinal Regression Differential Item Functioning using IRT. 
 Golding, C. (2019). Discerning Student Thinking : a Practical Theoretical Framework for Recognising or Informally Assessing Different Ways of Thinking. Teaching in Higher Education, 24(4), 478-492. 
 Gómez-Rey, P., Fernández-Navarro, F., & Barberà, E. (2016). Ordinal Regression by a Gravitational Model in the Field of Educational Data Mining. Expert Systems, 33(2), 161–175. 
 Goodhew, L. M., & Robertson, A. D. (2017). Exploring the Role of Content Knowledge in Responsive Teaching. Physical Review Physics Education Research, 13(1), 1– 24. 
 Goodman, L. A. (1979). Simple Models for the Analysis of Association in Crossclassifications Having Ordered Categories. Journal of the American Statistical Association, 74(367), 537–552. 
 Greenwell, B. M., Mccarthy, A. J., Boehmke, B. C., & Liu, D. (2018). Residuals and Diagnostics for Binary and Ordinal Regression Models: An Introduction to the sure Package. The R Journal, 10, 381–394. 
 Groll, A. (2020). GMMBoost: Likelihood-Based Boosting for Generalized Mixed Models. Retrieved from https://CRAN.R-project.org/package=GMMBoost. 
 Grundspenkis, J. (2019). Intelligent Knowledge Assessment Systems: Myth or Reality. Frontiers in Artificial Intelligence and Applications, 315, 31-46. 
 Guo, B., Zhang, R., Xu, G., Shi, C., & Yang, L. (2015). Predicting Students Performance in Educational Data Mining. 2015 International Symposium on Educational Technology (ISET), 125–128. 
 Hadar, L. L., & Tirosh, M. (2019). Creative Thinking in Mathematics Curriculum: an Analytic Framework. Thinking Skills and Creativity, 33(September 2018), 100585. 
 Hadfield, J. (2019). MCMCglmm: MCMC Generalised Linear Mixed Models. Retrieved from https://CRAN.R-project.org/package=MCMCglmm 
 Han, M., Tong, M., Chen, M., Liu, J., & Liu, C. (2017). Application of Ensemble Algorithm in Students’ Performance Prediction. Proceedings - 2017 6th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2017, 735– 740. 
 Hariharasudan, A., & Kot, S. (2018). A Scoping Review on Digital English and Education 4.0 for Industry 4.0. Social Sciences, 7(11), 227. 
 Harrell, E. F. (2020). rms: Regression Modeling Strategies. Retrieved from https://CRAN.R-project.org/package=rms. 
 Hauck, W. W., & Donner, A. (1977). Wald’s Test as Applied to Hypotheses in Logit Analysis. Journal of the American Statistical Association, 72(360), 851. 
 Hauke, J., & Kossowski, T. (2011). Comparison Of Values Of Pearson’s And Spearman’s Correlation Coefficients On The Same Sets Of Data. Quaestiones Geographicae, 30(2), 87–93. 
 Himelfarb, I. (2019). A Primer on Standardized Testing: History, Measurement, Classical Test Theory, Item Response Theory, and Equating. Journal of Chiropractic Education, 33(2), 151–163. 
 Hobza, T., Pardo, L., & Vajda, I. (2008). Robust Median Estimator in Logistic Regression. Journal of Statistical Planning and Inference, 138, 3822–3840. 
 Hosmer, D. W., & Lemesbow, S. (1980). Goodness of Fit Tests for the Multiple Logistic Regression Model. Communications in Statistics - Theory and Methods, 9(10), 1043–1069. 
 Hosmer, D. W., & Lemeshow, S. (2000). Applied Logistic Regression (2nd ed.). John Wiley & Sons, Inc. 
 Hosseinian, S., & Morgenthaler, S. (2011). Robust Binary Regression. Journal of Statistical Planning and Inference, 141(4), 1497–1509. 
 Hu, X. (2018). Foreign Language Education in Colleges and Universities Based on Globalization Background. Educational Sciences: Theory & Practice, 18(6), 3400–3407. 
 Hubert, M., Debruyne, M., & Rousseeuw, P. J. (2018). Minimum Covariance Determinant and Extensions. Wiley Interdisciplinary Reviews: Computational Statistics, 10(3), e1421. 
 Huo, X., & Cao, S. (2019). Aggregated inference. Wiley Interdisciplinary Reviews: Computational Statistics, 11(1), 1–13. 
 Hussin, A. A. (2018). Education 4.0 Made Simple : Ideas For Teaching. International Journal of Education and Literacy Studies, 6(3), 92–98. 
 Iannario, M., Clara, A., & Piccolo, D. (2016). Robustness Issues for CUB Models. TEST, 25, 731-750. 
 Iannario, M., Monti, A. C., Piccolo, D., & Ronchetti, E. (2017). Robust Inference for Ordinal Response Models, 11, 3407–3445. 
 Ikbal, S., Tamhane, A., Sengupta, B., Chetlur, M., Ghosh, S., & Appleton, J. (2015). On Early Prediction of Risks in Academic Performance for Students. IBM Journal of Research and Development, 59(6), 1–14. 
 Ikuma, L. H., Steele, A. dann, S., Adio, O., & Waggenspack, W. N. (2019). Large-scale Student Programs Increase Persistence in STEM Fields in a Public University Setting. Journal of Engineering Education, 108(1), 57–81. 
 Imdadullah, M., Aslam, M., & Altaf, S. (2016). mctest: An R Package for Detection of Collinearity Among Regressors. The R Journal, 8(2), 495–505. 
 Imrey, P. B., Koch, G. G., Stokes, M. E., Darroch, J. N., Freeman, D. H., & Tolley, H. D. (1981). Categorical Data Analysis: Some Reflections on the Log Linear Model and Logistic Regression. Part I: Historical and Methodological Overview. International Statistical Review / Revue Internationale de Statistique, 49(3), 265. 
 Irribarra, D. T., & Freud, R. (2020). WrightMap: IRT Item-Person Map with 'ConQuest' Integration. Retrieved from https://CRAN.R-project.org/package=WrightMap. 
 James, N., Harrell, Jr, & Shepherd, B. (2021). Bayesian Cumulative Probability Models for Continuous and Mixed Outcomes. 
 Jayarajah, K., Saat, R. M., & Rauf, R. A. A. (2014). A Review of Science, Technology, Engineering & Mathematics (STEM) Education Research From 1999-2013: A Malaysian perspective. Eurasia Journal of Mathematics, Science and Technology Education, 10(3), 155–163. 
 Jesson, J., Matheson, L., & Lacey, F. M. (2011). Doing Your Literature Review : Traditional and Systematic Techniques. SAGE Publications Ltd. 
 John P. L., Hao Wu, H., & Yu, G. (2016). Building an Evaluation Scale using Item Response Theory. Proc Conf Empir Methods Nat Lang Process, 648–657. 
 John, M., Bettye, S., Ezra, T., & Robert, W. (2016). A Formative Evaluation of a Southeast High School Integrative Science, Technology, Engineering, and Mathematics (STEM) Academy. Technology in Society, 45, 34–39. 
 Joyce, T., Crockett, S., Jaeger, D. A., Altindag, O., & O’Connell, S. D. (2015). Does Classroom Time Matter? Economics of Education Review, 46, 64–77. 
 Judi, H. M., Mohamed, H., Ashari, N. S. @, Jenal, R., & Hanawi, S. A. (2012). Alignment of Statistics Course using Examination Items. Procedia - Social and Behavioral Sciences, 59, 264–269. 
 Kaiser, H. F. (1974). An Index of Factorial Simplicity. Psychometrika, 39, 31–36. 
 Kementerian Pendidikan Malaysia (2013). Malaysia Education Blueprint 2013-2025 (Preschool to Post- Secondary Education). Putrajaya Malaysia: Kementerian Pendidikan. 
 Kementerian Pendidikan Malaysia (2015). Malaysia Education Blueprint 2015-2025 (Higher Education). Putrajaya Malaysia: Kementerian Pengajian Tinggi. 
 Kementerian Pendidikan Tinggi Malaysia (2016). Rubrik PNGK Bersepadu (iCGPA) Panduan Pentaksiran Hasil Pembelajaran. Putrajaya Malaysia: Kementerian Pendidikan Tinggi. 
 Kerlinger, F. N., & Lee, H. B. (2000). Foundations of Behavioral Research (4th ed.). Fort Worth TX: Harcourt College Publishers. 
 Kesselmeier, M., & Bermejo, J. L. (2017). Robust Logistic Regression to Narrow Down the Winner’s Curse for Rare and Recessive Susceptibility Variants. Briefings in Bioinformatics, 18(6), 962–972. 
 Khajah, M. M., Huang, Y., Mozer, M. C., & Brusilovsky, P. (2015). Integrating Knowledge Tracing and Item Response Theory : A Tale of Two Frameworks. CEUR Workshop Proceedings, 1181, 7–15. 
 Kien-Kheng, F., Azlan, N., Noor, S., Ahmad, D., Lee, N., Leong, H., & Mohamed, I. (2016). Relationship Between Cognitive Factors and Performance in an Introductory Statistics Course : a Malaysian Case Study Introduction. Malaysian Journal of Mathematical Sciences, 10(3), 269–282. 
 Kim, S. Y., Lee, W., & Kolen, M. J. (2019). Simple-Structure Multidimensional Item Response Theory Equating for Multidimensional Tests. Educational and Psychological Measurement, 80(1), 91-125. 
 Kline, P. (2014). An Easy Guide to Factor Analysis. Routledge. 
 Komarudin, U., Rustaman, N. Y., & Hasanah, L. (2017). Promoting Students’ Conceptual Understanding Using STEM. AIP Conference Proceedings, 1848(1). 
 Koretsky, M., Keeler, J., Ivanovitch, J., & Cao, Y. (2018). The Role of Pedagogical Tools in Active Learning: a Case for Sense-making. International Journal of STEM Education, 5(1). 
 Kosmidis, I. (2014). Improved Estimation in Cumulative Link Models. Journal of the Royal Statistical Society: Series B, 76(1), 169–196. 
 Kosmidis, I., & Firth, D. (2009). Bias Reduction in Exponential Family Nonlinear Models. Biometrika, 96(4), 793-804. 
 Krasilnikov, A., & Smirnova, A. (2017). Online Social Adaptation of First-year Students and Their Academic Performance. Computers and Education, 113, 327– 338. 
 Krishna Kishore, K. V., Venkatramaphanikumar, S., & Alekhya, S. (2014). Prediction of Student Academic Progression: a Case Study on Vignan University. 2014 International Conference on Computer Communication and Informatics, 1–6. 
 Kumar, S. C., Chowdary, E. D., Venkatramaphanikumar, S., & Kishore, K. V. K. (2016). M5P Model Tree in Predicting Student Performance: a Case Study. 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), 1103–1107. 
 Kumari, P., Jain, P. K., & Pamula, R. (2018). An Efficient Use of Ensemble Methods to Predict Students Academic Performance. 2018 4th International Conference on Recent Advances in Information Technology (RAIT), 1–6. 
 Laerd Statistics (2020). Using the PLUM Procedure to Carry Out an Ordinal Regression in SPSS. Retrieved July 14, 2020, from https://statistics.laerd.com/spsstutorials/ ordinal-regression-using-spss-statistics-2.php 
 Landis, J. R., & Koch, G. G. (1977). The Measurement of Observer Agreement for Categorical Data. Biometrics, 33(1), 159. 
 Li, C., & Shepherd, B. E. (2012). A New Residual for Ordinal Outcomes. Biometrika, 99(2), 473–480. 
 Lin, M., Preston, A., Kharrufa, A., & Kong, Z. (2016). Making L2 Learners’ Reasoning Skills Visible: the Potential of Computer Supported Collaborative Learning Environments. Thinking Skills and Creativity, 22, 303–322. 
 Linacre, J. M. (2008). The Expected Value of a Point Biserial (or Similar) Correlation. Retrieved October 22, 2019, from https://www.winsteps.com/winman/correlations.htm 
 Lipsitz, S. R., Fitzmaurice, G. M., Molenberghs, G., Lipsitzt, B. S. R., Farber, D., & Fitzmaurice, M. (1996). Goodness-of-fit Tests for Ordinal Response Regression Models, 45(2), 175–190. 
 Lipsitz, S. R., Fitzmaurice, G. M., Regenbogen, S. E., Sinha, D., Ibrahim, J. G., & Gawande, A. A. (2012). Bias Correction for the Proportional Odds Logistic Regression Model With Application to a Study of Surgical Complications. Journal of the Royal Statistical Society. Series C: Applied Statistics, 62(2), 233–250. 
 Liu, D., & Zhang, H. (2017). Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach. Journal of the American Statistical Association, 113(522), 845–854. 
 Lo, C. K., Hew, K. F., & Chen, G. (2017). Toward a Set of Design Principles for Mathematics Flipped Classrooms: a Synthesis of Research in Mathematics Education. Educational Research Review, 22, 50–73. 
 Lopez Guarin, C. E., Guzman, E. L., & Gonzalez, F. A. (2015). A Model to Predict Low Academic Performance at a Specific Enrollment Using Data Mining. Revista Iberoamericana de Tecnologias Del Aprendizaje, 10(3), 119–125. 
 Lord, F. M. (1952). A Theory of Test Scores. Psychometric Monograph, 7. 
 Lord, F. M. (1986). Maximum Likelihood and Bayesian Parameter Estimation in Item Response Theory. Journal of Educational Measurement, 23(2), 157–162. 
 Ma, T., Li, H., Wm, E., Jj, K., Manne, U., Bae, S., … Kp, S. (2014). Robust Logistic and Probit Methods for Binary and Multinomial Regression, 5(4). 
 Macfarlane, B. (2014). Student Performativity in Higher Education: Converting Learning as a Private Space Into a Public Performance, Higher Education Research & Development. 34(2), 338-350. 
 Magis, D., & Barrada, J. R. (2017). Computerized Adaptive Testing with R : Recent Updates of the Package catR . Journal of Statistical Software, 76, 1-19. 
 Magis, D., Béland, S., Tuerlinckx, F., & de Boeck, P. (2010). A General Framework and an R Package for the Detection of Dichotomous Differential Item Functioning. Behavior Research Methods, 42(3), 847–862. 
 Mahmud, Z., Ismail, N. Z.-I., Kassim, N. L. A., & Zainol, M. S. (2018). The Effects Of Attitudes Towards Statistics, Perceived Ability, Learning Practices And Teaching Practices On Students’ Performance In Statistics: A Review. Journal of Islamic Thought and Civilization of the International Islamic University Malaysia (Iium), (Special Issue), 71–97. 
 Mair, P. (2020). CRAN Task View: Psychometric Models and Methods. 
 Mair, P., Hatzinger, R., Maier, M. J., Rusch, T., Debelak, R., & Maintainer (2020). eRm: Extended Rasch Modeling. Retrieved from https://CRAN.Rproject. org/package=eRm. 
 Maki, S., & Horita, T. (2017). Research on Statistical Literacy Using Japanese Textbooks. 2017 6th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), 711–714. 
 Manor, O., & Power, C. (2000). Dichotomous or Categorical Response? Analysing Self-rated Health and Lifetime. Int J Epidemiol, 29(1), 149–157. 
 Marbouti, F., Diefes-Dux, H. A., & Madhavan, K. (2016). Models for Early Prediction of at-risk Students in a Course Using Standards-based Grading. Computers and Education, 103, 1–15. 
 Margot, K. C., & Kettler, T. (2019). Teachers’ Perception of STEM Integration and Education: a Systematic Literature Review. International Journal of STEM Education, 6. 
 Maria, M., Shahbodin, F., & Pee, N. C. (2018). Malaysian Higher Education System Towards Industry 4.0- Current Trends Overview. AIP Conference Proceedings 2016, 1-7. 
 Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174. 
 Mativo, J. M., & Huang, S. (2014). Prediction of Students’ Academic Performance: Adapt a Methodology of Predictive Modeling for a Small Sample Size. 2014 IEEE Frontiers in Education Conference (FIE) Proceedings, 1–3. 
 Mayilvaganan, M., & Kalpanadevi, D. (2014). Comparison of Classification Techniques for Predicting the Cognitive Skill of Students in Education Environment. 2014 IEEE International Conference on Computational Intelligence and Computing Research, 1–4. 
 McCullagh, P. (1980). Regression Models for Ordinal Data. Journal of the Royal Statistical Society. Series B, 42, 109–142. 
 McCullagh, P., & Nelder, J. A. (1989). Generalized Linear Models (2nd ed.). London, New York: Chapman and Hall. 
 Mckelvey, D. R., & Zavoina, W. (1975). A Statistical Model for the Analysis of Ordinal Level Dependent Variables. Journal of Mathematical Sociology, 4, 103–120. 
 Meier, Y., Xu, J., Atan, O., & Van Der Schaar, M. (2016). Predicting Grades. IEEE Transactions on Signal Processing, 64(4), 959–972. 
 Mejia, A., & Filus, A. (2018). Exploring Predictors of Impact of School-based Management in Rural Mexico: Do Student Engagement, Teacher Attitudes and Parent Involvement Predict Better Academic Outcomes? International Journal of Educational Research, 88, 95–108. 
 Mignani, S., Monari, P., Cagnone, S., & Ricci, R. (2006). Multidimensional Versus Unidimensional Models for Ability Testing. In Data Analysis, Classification and the Forward Search, 339–346. 
 Milaturrahmah, N., Mardiyana, & Pramudya, I. (2017). Science, Technology, Engineering, Mathematics (STEM) as Mathematics Learning Approach in 21st Century. AIP Conference Proceedings, 1868(1). 
 Mircioiu, C., & Atkinson, J. (2017). A Comparison of Parametric and Non-Parametric Methods Applied to a Likert Scale. Pharmacy (Basel, Switzerland), 5(2), 26. 
 Mishra, T., Kumar, D., & Gupta, S. (2014). Mining Students’ Data for Prediction Performance. International Conference on Advanced Computing and Communication Technologies, ACCT, 255–262. 
 Mohamad, M. M., Sulaiman, N. L., Sern, L. C., & Salleh, K. M. (2015). Measuring the Validity and Reliability of Research Instruments. Procedia - Social and Behavioral Sciences, 204, 164–171. 
 Mohamed Talib, A., Alomary, F. O., & Alwadi, H. F. (2018). Assessment of Student Performance for Course Examination Using Rasch Measurement Model: A Case Study of Information Technology Fundamentals Course. Education Research International, 2018, 1–8. 
 Mohamed, H., Ashaari, N. S. @, Judi, H. M., & Wook, T. S. M. T. (2012). Factors Affecting FTSM Students’ Achievement in Statistics Course. Procedia - Social and Behavioral Sciences, 59, 125–129. 
 Mohd Ali, S., Norfarah, N., Ilya Syazwani, J. I., & Mohd Erfy, I. (2019). The Effect of Computerized-adaptive Test on Reducing Anxiety Towards Math Test for Polytechnic Students. Journal of Technical Education and Training, 11(4), 27–35. 
 Mohd Rasid, N. S., Md Nasir, N. A., A/l Aperar Singh, P. S., & Cheong, T. H. (2020). STEM Integration: Factors Affecting Effective Instructional Practices in Teaching Mathematics. Asian Journal of University Education, 16(1), 56. Mourtzis, D., Vasilakopoulos, A., Zervas, E., & Boli, N. (2019). Manufacturing System Design Using Simulation in Metal Industry Towards Education 4.0. Procedia Manufacturing, 31, 155–161. 
 Muawiyah, D., Yamtinah, S., & Indriyanti, N. Y. (2018). Higher Education 4.0: Assessment on Environmental Chemistry Course in Blended Learning Design. Journal of Physics: Conference Series, 1097(1), 1–7. 
 Murad, H., Fleischman, A., Sadetzki, S., Geyer, O., & Freedman, L. S. (2003). Small Samples and Ordered Logistic Regression: Does it Help to Collapse Categories of Outcome? The American Statistician, 57(3), 155–160. 
 Mutanu, L., & Machoka, P. (2019). Enhancing Computer Students’ Academic Performance Through Predictive Modelling - a Proactive Approach. 14th International Conference on Computer Science and Education, ICCSE 2019, 97– 102. 
 Muthukrishnan, R., & Myilsamy, R. (2010). M-Estimators in Regression Models. Journal of Mathematics Research, 2(4), 23–27. 
 Nagelkerke, N. J. D. (1991). A Note on a General Definition of the Coefficient of Determination. Biometrika, 78(3), 691-692. 
 Nahar, J., & Purwani, S. (2017). Application of Robust M-Estimator Regression in Handling Data Outliers. In 4th ICRIEMS, 53–60. 
 Nering, L. M., & Ostini, R. (2011). Handbook of Polytomous Item Response Theory Models. New York, NY: Taylor & Francis Group. 
 Noguez, J., Neri, L., Gonzalez-Nucamendi, A., & Robledo-Rella, V. (2016). Characteristics of Self-regulation of Engineering Students to Predict and Improve Their Academic Performance. 2016 IEEE Frontiers in Education Conference (FIE), 1–8. 
 Norman, C. (2014). Ordinal Methods for Behavioral Data Analysis. Psychology Press. 
 Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric Theory. McGraw-Hill. 
 Nurgabyl, D., Kalzhanova, G., Ualiyev, N., & Abdoldinova, G. (2017). Construction of a Mathematical Model for Calibrating Test Task Parameters and the Knowledge Level Scale of University Students by Means of Testing. Eurasia Journal of Mathematics, Science and Technology Education, 13(11), 7421–7429. 
 Olmus, H., Nazman, E., & Erbas, S. (2017). An Evaluation of the Two Parameter (2- PL) IRT Models Through a Simulation Study. Gazi University Journal of Science, 30(1), 235–249. 
 Omar, N., Haris, S. S., Hassan, R., Arshad, H., Rahmat, M., Zainal, N. F. A., & Zulkifli, R. (2012). Automated Analysis of Exam Questions According to Bloom’s Taxonomy. Procedia - Social and Behavioral Sciences, 59(1956), 297–303. 
 Öztürk, N. K., & Karabatsos, G. (2017). A Bayesian Robust IRT Outlier-Detection Model. Applied Psychological Measurement, 41(3), 195–208. 
 Özyurt, H., & Özyurt, Ö. (2015). Ability Level Estimation of Students on Probability Unit via Computerized Adaptive Testing. Eurasian Journal of Educational Research, 15(58), 27–44. 
 Pada, A. U. T., Kartowagiran, B., & Subali, B. (2016). Separation Index and Fit Items of Creative Thinking Skills Assessment. Research and Evaluation in Education, 2(1), 1-2. 
 Papageorgiou, G., & Hinde, J. (2019). mixcat: Mixed Effects Cumulative Link and Logistic Regression Models. Retrieved from https://CRAN.Rproject. org/package=mixcat. 
 Pardo, A., Han, F., & Ellis, R. A. (2017). Combining University Student Self-regulated Learning Indicators and Engagement With Online Learning Events to Predict Academic Performance. IEEE Transactions on Learning Technologies, 10(1), 82– 92. 
 Park, J. S., Park, C. G., & Lee, K. E. (2019). Simultaneous Outlier Detection and Variable Selection via Difference-based Regression Model and Stochastic Search Variable Selection. Communications for Statistical Applications and Methods, 26(2), 149–161. 
 Partchev, I. (2017). irtoys: A Collection of Functions Related to Item Response Theory (IRT). Retrieved from https://CRAN.R-project.org/package=irtoys. 
 Passante, G., & Kohnle, A. (2019). Enhancing Student Visual Understanding of the Time Evolution of Quantum Systems. Physical Review Physics Education Research, 15(1), 1-14. 
 Peterson, B., & Harrell, Frank E., J. (1990). Partial Proportional Odds Models for Ordinal Response Variables. Applied Statistics, 39, 205–217. 
 Pößnecker, W., & Tutz, G. (2016). A General Framework for the Selection of Effect Type in Ordinal Regression. Munich, Bavaria, Germany. 
 Pradeep, A., Das, S., & Kizhekkethottam, J. J. (2015). Students Dropout Factor Prediction Using EDM Techniques. Proceedings of the IEEE International Conference on Soft-Computing and Network Security, ICSNS 2015, 1-7. 
 Pregibon, D. (1982). Resistant Fits for Some Coxnmolily Used Logistic Models with Medical Applications. Biometrics, 38(2), 485–498. 
 Pruscha, H. (1994). Partial Residuals in Cumulative Regression Models for Ordinal Data. Statistical Papers, 35(1), 273–284. 
 Pulkstenis, E., & Robinson, T. J. (2004). Goodness-of-fit Tests for Ordinal Response Regression Models. Statistics in Medicine, 23(6), 999–1014. 
 Radmehr, F., & Drake, M. (2018a). An Assessment-based Model for Exploring the Solving of Mathematical Problems: Utilizing Revised Bloom’s Taxonomy and Facets of Metacognition. Studies in Educational Evaluation, 59, 41–51. 
 Radmehr, F., & Drake, M. (2018b). Revised Bloom’s Taxonomy and Major Theories and Frameworks That Influence the Teaching, Learning, and Assessment of Mathematics: a Comparison. International Journal of Mathematical Education in Science and Technology, 50(6), 895-920. 
 Raines, T. C., Gordon, M., Harrell-williams, L., Diliberto, R. A., Parke, E. M., Raines, T. C., … Diliberto, R. A. (2017). Adaptive Skills and Academic Achievement in Latino Students. Journal of Applied School Psychology, 33(4), 245–260. 
 Rajeswari, S., & Lawrance, R. (2016). Classification Model to Predict the Learners’ Academic Performance Using Big Data. 2016 International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE’16), 1–6. 
 Rasch, G. (1960). Probabilistic Models for Some Intelligence and Attainment Tests. (Copenhagen, Danish Institute for Educational Research), expanded edition (1980) with foreword and afterword by B. D. Wright. Chicago: The University of Chicago Press. 
 Rasheed, B. A., Adnan, R., Saffari, S. E., & Pati, K. (2014). Robust Weighted Least Squares Estimation of Regression Parameter in the Presence of Outliers and Heteroscedastic Errors. Jurnal Teknologi, 71(1), 11–18. 
 Raus, M. I. M., Janor, R. M., Sadjirin, R., & Sahri, Z. (2014). The Development of i- QuBES for UiTM: From Feasibility Study to the Design Phase. Proceedings - 2014 5th IEEE Control and System Graduate Research Colloquium, ICSGRC 2014, 96–101. 
 Reckase, M. D. (2009). Multidimensional Item Response Theory Models. New York, NY: Springer New York. 
 Ren, Z., & Sweeney, M. (2016). Predicting Student Performance Using Personalized Analytics. Computer, 49(4), 61–69. 
 Rezaie, M., & Golshan, M. (2015). Computer Adaptive Test (CAT): Advantages and Limitations. International Journal of Educational Investigations Available Online, 2(5), 128–137. 
 Riani, M., Torti, F., & Zani, S. (2012). Outliers and Robustness for Ordinal Data. Modern Analysis of Customer Surveys: with applications using R (1st ed.), 155– 169. 
 Ricardo, A. M., Douglas, R. M., Victor, J. Y., & Matias, S. B. (2019). Robust Statistics Theory and Methods (with R) (2nd ed.). John Wiley & Sons Ltd. 
 Riese, A., Rappaport, L., Alverson, B., Park, S., & Rockney, R. M. (2017). Clinical Performance Evaluations of Third-Year Medical Students and Association With Student and Evaluator Gender. Academic Medicine, 92(6), 835–840. 
 Ripley, B., Venables, B., Bates, D. M., Hornik, K., Gebhardt, A., & Firth, D. (2020). MASS: Support Functions and Datasets for Venables and Ripley's MASS. Retrieved from https://CRAN.R-project.org/package=MASS. 
 Rizopoulos, D. (2006). ltm : An R Package for Latent Variable Modeling. Journal Of Statistical Software, 17(5), 1–25. 
 Rizopoulos, D. (2018). Package “ltm” Title Latent Trait Models under IRT. Retrieved from http://www.jstatsoft.org/v17/. 
 Rojko, A. (2017). Industry 4.0 Concept : Background and Overview. International Journal of Interactive Mobile Technologies (IJIM), 11(5), 77–90. 
 Ronald, K. H., & Russell W. J. (1993). Comparison of Classical Test Theory and Item Response Theory and Their Applications to Test Development. Educational Measurement: Issues and Practice, 38-47. 
 Rosaini, R., Budiyono, B., & Pratiwi, H. (2019). Mathematics Teacher Supporting Higher Order Thinking Skill of Students Through Assessment as Learning in Instructional Model. Journal of Physics: Conf. Series, 1157. 
 Rousseeuw, P. J., & Leroy, A. M. (1987). Robust Regression and Outlier Detection. Hoboken, NJ, USA: John Wiley & Sons, Inc. 
 Rousseeuw, P. J., & van Driessen, K. (1999). A Fast Algorithm for the Minimum Covariance Determinant Estimator. Technometrics, 41(3), 212. 
 Rubio, D. M., Berg-Weger, M., Tebb, S. S., Lee, E. S., & Rauch, S. (2003). Objectifying Content Validity: Conducting a Content Validity Study in Social Work Research. Social Work Research, 27(2), 94–104. 
 Ruckstuhl, A. (2016). Robust Fitting of Parametric Models Based on M-Estimation. 
 Rusch, T., Mair, P., & Hatzinger, R. (2013). Psychometrics with R: A Review of CRAN Packages for Item Response Theory. Discussion Paper Series of the Center for Empirical Research Methods, 1–28. 
 Rusimamto, P. W., Nurlaela, L., Sumbawati, M. S., Munoto, & Samani, M. (2019). Development of Critical and Creative Thinking Skills to Increase Competence of PLC Programming for Electrical Engineering Education Students. IOP Conference Series: Materials Science and Engineering, 535(1). 
 Sagala, P. N., & Andriani, A. (2019). Development of Higher-Order Thinking Skills (HOTS) Questions of Probability Theory Subject Based on Bloom’s Taxonomy. Journal of Physics: Conference Series, 1188(1), 1–13. 
 Sagar, P., Prinima, & Indu (2017). Analysis of Prediction Techniques based on Classification and Regression General Terms. International Journal of Computer Applications, 163(7), 47-51. 
 Said-metwaly, S., Kyndt, E., & Noortgate, W. Van Den. (2019). The Factor Structure of the Verbal Torrance Test of Creative Thinking in an Arabic Context: Classical Test Theory and Multidimensional Item Response Theory Analyses. Thinking Skills and Creativity, 35. 
 Salim, N. R., Fauzi, A., & Ayub, M. (2017). Relationship Between Mathematics Statistics Engagement and Attitudes Towards Statistics Among Undergraduate Students in Malaysia. AIP Conference Proceedings, 1795. 
 Sall, J. (1991). A Monotone Regression Smoother Based on Ordinal Cumulative Logistic Regression. ASA Proceedings of Statistical Computing Section, 276–281. 
 Salzberger, T., & Koller, M. (2019). The Direction of the Response Scale Matters- Accounting for the Unit of Measurement. European Journal of Marketing, 53(5), 871–891. 
 Samejima, F. (1972). A General Model for Free-Response. Psychometrika, 35(18),139. 
 SAS Institute Inc. (2017). SAS/STAT ® 14.3 User’s Guide The CATMOD Procedure. Retrieved from http://support.sas.com/thirdpartylicenses. 
 SAS Institute Inc. (2019). SAS Help Center: PROC LOGISTIC Statement. Retrieved July 12, 2020, from https://documentation.sas.com. 
 SAS Institute Inc. (2020). What is a Data Scientist?. Retrieved April 21, 2020, from https://www.sas.com/en_my/insights/analytics/what-is-a-data-scientist.html 
 Seheult, A. H., Green, P. J., Rousseeuw, P. J., & Leroy, A. M. (2006). Robust Regression and Outlier Detection. Journal of the Royal Statistical Society. Series A (Statistics in Society), 152(1), 133. 
 Seifu, G. (2016). Assessment of the Implementation of Continuous Assessment : the Case of METTU University. Europian Journal of Science and Mathematics Education, 4(4), 534–544. 
 Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A Review on Predicting Student’s Performance Using Data Mining Techniques. Procedia Computer Science, 72, 414–422. 
 Sharif, S., & Atiany, T. A. M. (2018). Testing Several Correlation Matrices Using Robust Approach. Asian Journal of Scientific Research, 11(1), 84–95. 
 Sheng, Y., & Wikle, C. K. (2009). Bayesian IRT Models Incorporating General and Specific Abilities. Behaviormetrika, 36(1), 27–48. 
 Sikder, M. F., Uddin, M. J., & Halder, S. (2016). Predicting Students Yearly Performance Using Neural Network: a Case Study of BSMRSTU. 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV), 524– 529. 
 Simeckova, M. (2005). Maximum Weighted Likelihood Estimator in Logistic Regression. In WDS’05 Proceedings of Contributed Papers, 144–148. 
 Slim, A., Heileman, G. L., Kozlick, J., & Abdallah, C. T. (2015). Predicting Student Success Based on Prior Performance. In Proceedings - 2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM), 410–415. 
 Smith, E. V. (2002). Detecting and Evaluating the Impact of Multidimensionality Using Item Fit Statistics and Principal Component Analysis of Residuals. Journal of Applied Measurement, 3(2), 205–231. 
 Smith, G. (2018). Step Away From Stepwise. Journal of Big Data, 5(32), 1–12. 
 Snell, E. J., Cox, D., & Cox, R. (1987). Applied Statistics: A Handbook of BMDPTM Analyses. Springer Science Business Media. 
 Solihatun, S., Rangka, I. B., Ratnasari, D., Radyati, A., Siregar, Y., Wulansari, L., … Rahim, R. (2019). Measuring of Student Learning Performance Based on Geometry Test for Middle Class in Elementary School Using Dichotomous Rasch Analysis. Journal of Physics: Conference Series, 1157(3), 1-7. 
 Sorour, S. E., Mine, T., Goda, K., & Hirokawa, S. (2015). Predicting Students’ Grades Based on Free Style Comments Data by Artificial Neural Network. Proceedings - Frontiers in Education Conference, FIE, 1-9. 
 Sothan, S. (2018). The Determinants of Academic Performance : Evidence From a Cambodian University. Studies in Higher Education, 44(11), 2096-2111. 
 SSI (2020a). BILOGMG. Retrieved July 14, 2020, from https://ssicentral.com/index.php/products/bilogmg-gen/ SSI (2020b). PARSCALE. Retrieved July 14, 2020, from https://ssicentral.com/index.php/products/psl-general/ 
 Steyer, R. (2015). Classical (Psychometric) Test Theory. International Encyclopedia of the Social & Behavioral Sciences, 3, 785-791. 
 Sturman, E. D., & Zappala-Piemme, K. (2017). Development of the Grit Scale for Children and Adults and Its Relation to Student Efficacy, Test Anxiety, and Academic Performance. Learning and Individual Differences, 59, 1–10. 
 Summers, M. M., Couch, B. A., Knight, J. K., Brownell, S. E., Crowe, A. J., Semsar, K., … Smith, M. K. (2018). EcoEvo-MAPS: An Ecology and Evolution Assessment for Introductory Through Advanced Undergraduates. CBE Life Sciences Education, 17(2). 
 Susanti, Y., Pratiwi, H., H., S. S., & Liana, T. (2014). M Estimation, S Estimation, and MM Estimation in Robust Regression. International Journal of Pure and Applied Mathematics, 91(3), 349–360. 
 SwMATH (2020). MULTILOG- Mathematical Software. Retrieved July 14, 2020, from http://swmath.org/software/24168 
 Tai, J., Dawson, P., Panadero, E., Boud, D., & Ajjawi, R. (2017). Developing Evaluative Judgement: Enabling Students to Make Decisions About the Quality of Work. Higher Education, 467–481. 
 TalentCorp (2019). Semak Apa Pekerjaan Masa Hadapan Untuk Anda. Petaling Jaya. Retrieved from https://www.talentcorp.com.my/clients/TalentCorp. 
 Tawil, N. M., Ismail, N. A., Asshaari, I., Osman, H., Nopiah, Z. M., & Zaharim, A. (2012). Comparing Lecture and E-learning as Learning Process in Mathematics and Statistics Courses for Engineering Students in Universiti Kebangsaan Malaysia. Procedia - Social and Behavioral Sciences, 60, 420–425. 
 Tekkumru-Kisa, M., & Stein, M. K. (2017). A Framework for Planning and Facilitating Video-based Professional Development. International Journal of STEM Education, 4, 28. 
 Testa, S., Toscano, A., & Rosato, R. (2018). Distractor Efficiency in an Item Pool for a Statistics Classroom Exam: Assessing Its Relation With Item Cognitive Level Classified According to Bloom’s Taxonomy. Frontiers in Psychology, 9, 1–12. 
 Thaneerananon, T., Triampo, W., & Nokkaew, A. (2016). Development of a Test to Evaluate Students’ Analytical Thinking Based on Fact versus Opinion Differentiation. International Journal of Instruction, 9(2), 123–138. 
 Tharwat, A. (2009). Principal Component Analysis-A Tutorial. 
 Thiele, T., Singleton, A., Pope, D., & Stanistreet, D. (2016). Predicting Students’ Academic Performance Based on School and Socio-demographic Characteristics. Studies in Higher Education, 41(8), 1424-1446. 
 Thompson, L. A. (2009). R (and S-PLUS) Manual to Accompany Agresti’s Categorical Data Analysis (2002) 2nd edition. Categorical Data Analysis. 
 Tijmstra, J., & Bolsinova, M. (2019). Bayes Factors for Evaluating Latent Monotonicity in Polytomous Item Response Theory Models. Psychometrika, 84(3), 846–869. 
 Tutz, G. (2014). Regression for Categorical Data. Cambridge: Cambridge University Press. 
 Ueckert, S. (2018). Modeling Composite Assessment Data Using Item Response Theory. CPT: Pharmacometrics and Systems Pharmacology, 7(4), 205–218. 
 Ünlü, A., & Yanagida, T. (2011). R You Ready for R?: The CRAN Psychometrics Task View. British Journal of Mathematical and Statistical Psychology, 64(1), 182– 186. 
 van der Linden, W. J. (2016). Handbook of Item Response Theory Volume One. London, New York: Taylor & Francis Group. 
 van der Linden, W. J. (2018). Handbook of Item Response Theory Volume Three: Applications. London, New York: Taylor & Francis Group. 
 van der Zanden, P. J. A. C., Denessen, E., Cillessen, A. H. N., & Meijer, P. C. (2018). Domains and Predictors of First-year Student Success: a Systematic Review. Educational Research Review, 23, 57–77. 
 Villagrá-Arnedo, C. J., Gallego-Durán, F. J., Llorens-Largo, F., Compañ-Rosique, P., Satorre-Cuerda, R., Molina-Carmona, R., … Molina-Carmona, R. (2017). Improving the Expressiveness of Black-box Models for Predicting Student Performance. Computers in Human Behavior, 72, 621–631. 
 Villarroel, V., Boud, D., Bloxham, S., Bruna, D., & Bruna, C. (2020). Using Principles of Authentic Assessment to Redesign Written Examinations and Tests. Innovations in Education and Teaching International, 57(1), 38–49. 
 Vora, D. R., & Rajamani, K. (2019). A Hybrid Classification Model for Prediction of Academic Performance of Students : a Big Data Application. Evolutionary Intelligence. 
 Walker, S. H., & Duncan, D. B. (1967). Estimation of the Probability of an Event as a Function of Several Independent Variables. Biometrika, 54, 167–179. 
 Wang, J. C., & Holan, S. H. (2012). Bayesian Multi-regime Smooth Transition Regression With Ordered Categorical Variables. Computational Statistics and Data Analysis, 56(12), 4165–4179. 
 Wang, R., Hao, P., Zhou, X., Campbell, A. T., & College, D. (2015). SmartGPA: Academic Performance Can Assess and Predict How Smartphones of College Students. In the 2015 ACM International Joint Conference on Ubiquitous Computing (UbiComp 2015), 19, 13–17. 
 Watan, S., & Sugiman. (2018). Exploring the Relationship Between Teachers’ Instructional and Students’ Geometrical Thinking Levels Based on Van Hiele Theory. Journal of Physics: Conference Series, 1097(1). 
 Weng, T. S., & Yang, D. C. (2017). Research on Mathematical Animation Using Pascal Animation as an Example. Eurasia Journal of Mathematics, Science and Technology Education, 13(6), 1687–1699. 
 Whitney, B. M., Cheng, Y., Brodersen, A. S., & Hong, M. R. (2018). The Scale of Student Engagement in Statistics: Development and Initial Validation. Journal of Psychoeducational Assessment, 37(5), 553-565. 
 Wijekoon, C. N., Amaratunge, H., Silva, Y. De, & Senanayake, S. (2017). Emotional Intelligence and Academic Performance of Medical Undergraduates : a Crosssectional Study in a Selected University in Sri Lanka. BMC Medical Education, 17(176), 1–11. 
 Williams, R. A. (2016). Ordinal Regression Models : Problems, Solutions, and Problems With the Solutions. Stata Users Group, German Stata Users' Group Meetings 2008. 
 Winsteps (2020). Rasch Analysis + Rasch Measurement Software + 1PL IRT. Retrieved July 14, 2020, from https://www.winsteps.com/index.htm 
 Wright, B. D., & Panchapakesan, N. (1969). A Procedure for Sample-Free Item Analysis. Educational and Psychological Measurement, 29, 23–48. 
 Xu, J., Moon, K. H., & van der Schaar, M. (2017). A Machine Learning Approach for Tracking and Predicting Student Performance in Degree Programs. IEEE Journal of Selected Topics in Signal Processing, 11(5), 742–753. 
 Ye, F., & Lord, D. (2014). Comparing Three Commonly Used Crash Severity Models on Sample Size Requirements: Multinomial Logit, Ordered Probit and Mixed Logit Models. Analytic Methods in Accident Research, 1, 72–85. 
 Yee, T., & Moler, C. (2020). VGAM: Vector Generalized Linear and Additive Models. Retrieved from https://CRAN.R-project.org/package=VGAM. 
 Yen, T. S., & Halili, S. H. (2015). Effective Teaching of Higher-Order Thinking (HOT) in Education. Distance Education and E-Learning, 3(2), 41–47. 
 You, H. S., Kim, K., Black, K., & Min, K. W. (2018). Assessing Science Motivation for College Students: Validation of the Science Motivation Questionnaire II Using the Rasch-andrich Rating Scale Model. Eurasia Journal of Mathematics, Science and Technology Education, 14(4), 1161–1173. 
 Young, D. E., & Meredith, D. C. (2017). Using the Resources Framework to Design, Assess, and Refine Interventions on Pressure in Fluids. Physical Review Physics Education Research, 13(1), 1–16. 
 Yusof, A. L., Naim, N. F., Latip, M. F. A., Aminuddin, N., & Ya’acob, N. (2017). Implementation of Integrated Cumulative Grade Point Average (iCGPA) Towards Academic Excellence in Malaysia. In 2017 IEEE 9th International Conference on Engineering Education (ICEED), 106–109. 
 Zainudin, S., Ahmad, K., Ali, N. M., & Zainal, N. F. A. (2012). Determining Course Outcomes Achievement Through Examination Difficulty Index Measurement. Procedia - Social and Behavioral Sciences, 59, 270–276. 
 Zhang, Q., & Stephens, M. (2016). Profiling Teacher Capacity in Statistical Thinking of National Curriculum Reform: a Comparative Study Between Australia and China. Eurasia Journal of Mathematics, Science and Technology Education, 12(4), 733–746. 
 Zollanvari, A., Kizilirmak, R. C., Kho, Y. H., & Hernandez-Torrano, D. (2017). Predicting Students’ GPA and Developing Intervention Strategies Based on Self- Regulatory Learning Behaviors. IEEE Access, 5, 23792-23802. 
 Zulkifli, F., Abidin, R. Z., & Mohamed, Z. (2019). Evaluating the Quality of Exam Questions: a Multidimensional Item Response. International Journal of Recent Technology and Engineering, 8(2 Special Issue 11), 606–612. 
 Zulkifli, F., Abidin, R. Z., Razi, N. F. M., Mohammad, N. H., Ahmad, R., & Azmi, A. Z. (2018). Evaluating Quality and Reliability of Final Exam Questions for Probability and Statistics Course Using Rasch Model. International Journal of Engineering and Technology(UAE), 7(4), 32–36. 
 | 
| This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials. You may use the digitized material for private study, scholarship, or research. |