UPSI Digital Repository (UDRep)
|
|
|
Abstract : Universiti Pendidikan Sultan Idris |
Kajian ini bertujuan menganalisis kualiti item aneka pilihan soalan peperiksaan
Pengajian Am Semester 3 STPM 2017. Kajian juga meneliti sumbangan item terhadapujian, kebolehpercayaan anggaran skor kebolehan calon, skor kebolehan calon
berbanding kesukaran item, perbezaan pencapaian calon mengikut aliran, dan item
berat sebelah. Kajian menggunakan pendekatan kuantitatif berbentuk ex-post facto.
Seramai 1415 calon peperiksaan Pengajian Am Semester 3 STPM 2017 telah dipilih
secara pensampelan rawak berstrata sebagai subjek kajian. Instrumen kajian ialah item
aneka pilihan soalan peperiksaan Pengajian Am Semester 3 STPM 2017. Dapatan
kajian mendapati kualiti semua item tinggi kerana mempunyai kesahan konstruk
dengan fit statistics (0.5 < MNSQ < 1.5). Kebolehpercayaan ujian tinggi, iaitu 1.00
(>0.94). Analisis Parameter kesukaran item mendapati 2 item sukar (1.00 ≥ b ≤ 3.00),
2 item mudah (-3.00 ≥ b ≤ -1.00), dan 11 item sederhana (-1.00 ≥ b ≤ 1.00). Analisis
kesepadanan model mendapati semua item fit untuk tujuan pengujian (0.5 < MNSQ 5%). Analisis
kebolehpercayaan anggaran skor kebolehan calon mendapati PRI = 0.50 dan PSI = 1.00
di bawah nilai minimum. Ukuran julat kebolehan calon lebih besar berbanding julat
kesukaran item. Kajian juga mendapati pencapaian calon Aliran Sains lebih tinggi
berbanding calon Aliran Kemanusiaan, iaitu perbezaan pencapaian sebanyak 0.20
logits. Kesan DIF wujud dalam Aliran Sains bagi item 1, iaitu calon lelaki (-0.54 logits)
lebih mudah menjawab berbanding calon perempuan (-1.09 logits). Kesimpulannya,
item aneka pilihan soalan peperiksaan Pengajian Am Semester 3 STPM 2017 sesuai
untuk pengujian. Implikasi kajian mendapati penelitian item menggunakan Model
Pengukuran Rasch dapat menentukan kualiti item dengan baik. |
References |
Abdullah, A. Z. (1985). Analisis Soalan. Kuala Lumpur: Majlis Peperiksaan Malaysia. Ahmad, A., & Awang, M. I. (2008). Pengukuran dan Penilaian. Selangor Darul Ehsan: Dawama Sdn. Bhd. Ahmad Zawawi A. (1985). Analisa Soalan Majlis Peperiksaan Malaysia. Kuala Lumpur Akta MPM, M. P. (2014). Undang-Undang Malaysia Cetakan Semula. Kuala Lumpur: Pesuruhjaya Penyemak Undang-Undang Malaysia. Amin, L. (2011). Psychometric Methods to Develop and to Analyze Clinical Measures: A Comparison and Contrast of Rasch Analysis and Classical Test Theory Analysis of the PedsQL 4.0 Generic Core Scale (Parent-report) in a Childhood Canser Sample. Open Access dissertations and thesis. Paper 6053. Anastasi, A., & Urbina, S. (1997). Psychological Testing (7th Edition). New York: Macmillan. Anastasi, A. (1988). Psychological Testing. New York: Macmillan. Andrich, D. (1978). A Rating Formulation for Ordered Response Categories. Psychometrika, 43, 357-374. Andrich, D., & Mercer, A. (1997). International Perspectives on Selection Methods of Entry Into Higher Education. Canberra: National Board of Employment, Education and Training and Higher Education Council. Ariffin, S. R. (2008). Inovasi dalam Pengukuran dan Penilaian Pendidikan: Bangi: Penerbit Universiti Kebangsaan Malaysia. Arsaythamby, V., & Rosna, A. H. (2016). Teori Ujian dan Pentaksiran Pendidikan. Sintok: UUM Press. Baker, F. B. (2001). The Basic of Item Response Theory (2nd Ed.). Wisconsin: ERIC Clearinghouse on Assessment and Evaluation. Bejar, I. I. (1983). Applications of Item Response Theory. Educational Research Institute: Research Institute of British Columbia. Bond, R. D., & Fox, C. M. (2012). Applying the Rasch Model: Fundamental Measurement in The Human Sciences (2nd Edition). New York: Routledge Taylor & Francis Group. Bond, T., & Fox, C. (2001). Applying the Rasch Model. Mahwah, NJ: Lawrence Erlbaum Associates. Bond, T., & Fox, C. (2007). Applying The Rasch Model: Fundamental Measurement in The Human Sciences. Mahwah, NJ USA: Lawrence Associates. Bond T. G. & Fox C.M. 2015. Applying the Rasch Model: Fundamental Measurement in the Human Sciences. Ed. ke-3. Mahwah, NJ: L. Erlbaum. Brown, F. G. (1989). Improving ESL Placement Tests Using Two Perspectives. TESOL Quarterly, 23 (1), 65-83. Chong, H. Y. (2013). A Simple Guide to The Item Response Theory (IRT) and Rasch Modeling. Chua, Y. P. (2007). Kaedah dan Statistik Penyelidikan Buku 1: Kaedah Penyelidikan. Malaysia: McGraw-Hill (Malaysia) Sdn. Bhd. Clark & Cooper (2000), An Overview of the Representation and Discovery of Causal Relationships Using Bayesian Networks. In Glymour and Cooper, 1999, Pages 3-62. Cohen, R. J., & Swerdlik, M. E. (2005). Psychological Testing and Assessment. New York: McGraw Hill. Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. Philadelphia: Harcourt Brace Jovanovich College Publishers. Crocker, L., & Algina, J. (2008). Introduction to Classical & Modern Test Theory. Ohio, USA: Cengage Learning. Crocker, L., & Algina, J. (2008). Introduction to Classical and Modern Test Theory. USA: Cengage Learning. Culligan, B. (2011). Item Respons Theory, Reliability and Standard Error. Tokyo: Aoyama Gakuin Women's Junior College. De Ayala, R. J. (2009). The Theory and Practice of Item Response Theory. New York: The Guilford Press. De Ayala, R. J. (2009). The Theory and Practice of Item Response Theory. Pyschometrika 75(4), 778-779. De Champlain, A. F. (2010). A Primer on Classical Test Theory and Item Response Theory for Assessments in Medical Education. Medical Education, 44, 109-117. DeMars, C. (2010). Item Respons Theory: Understanding Statistic Measurement. New York: Oxford University Press, Inc. DeWitt, J., Archer, L., Osborne, J., Dillon, J., Willis, B., & Wong, B. (2011). High Aspirations but Low Progression: The Science Aspirations - Careers Paradox Amongst Minority Ethnic Students. International Journal of Science and Mathematics Education 9(2), 243-271. Donald, M. (2003). Origins of The Modern Mind: Three Stages in The Evolution of Culture and Cognition. Cambridge MA: Harvard University Press. Ducan, G. J., Dowsett, C. J., Claessens, A., Magnuson, K., Huston, A. C., & Klebanov, P. (2007). School Readiness and Later Acvhievement. Developmental Psychology. Duong, M. (2004). Introduction to Item Response Theory and Its Applications. Proseminar in Learning, Technology and Culture. Eckes, T. (2015). Introduction to Many-Facet Rasch Measurement: Language Testing and Evaluation 2nd Revised and Updated Edition. Frankfurt am Main. Peter Lang. Embretson, S. E., & Reise, S. P. (2000). Item Respons Theory for Psychologist. New Jersey: Lawrence Erlbaum Associates, Inc. Englehard, G., Jr. (2013). Invariant measurement. New York: Routledge. Fox, T. B. (2007). Applying Rasch model: Fundamental Measurement in the Human Sciences (2nd Ed.). Mahwah, NJ: Erlbaum. Fox, C. M., & Jones, J. A. (1998). Uses of Rasch Modeling in Counseling Psychology Research. Journal of Counseling Psychology, 45, 30-45. Furr, R. M., & Bacharach, V. R. (2008). Psychometrics: An Introduction Second Edition. United States of America: Sage Publication. Gay, L. R., Mills, G. E., & Airasian, P. (2009). Educational Research Competencies for Analysis and Applications. NJ USA: Pearson Education. George, E. J. & Stefanie, A. W., (2018). Invariant Measurement With Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments. Third Avenue, New York. Routledge. Gregory, R. J. (1996). Norms and Reliability. In R. J. Gregory, Psychologcal Testing: History, Principles, and Applications (pp. 62-105). Needham Heights: Allyn & Bacon. Gregory, R. J. (2007). Psychological Testing: History, Principles, and Applications (5th Ed.). Boston: Pearson International. Gunnar, G. (2009). Journal of Rehabilitation Medicine. European Board of Physical and Rehabilitation Medicine, 41, 41(13):1021-3. Hair, J., Black, W., Barbin, B., Anderson, R., & Tatham, R. (2006). Multivariate Data Analysis (6th Ed.). Uppersaddle River, N.J.: Pearson Prentice Hall. Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of Item Respons Theory. USA: Sage Publication. Hambleton, R. K. (2000). The Next Generationof The ITC Test Translation and Adaptation Guidelines. European Journal of Psychological Assessment, 164-172. Hambleton, R. K., & Jones, R. W. (1993). Comparison of Classical Test Theory and Item Response Theory and Their Applications to Test Development. Educational Measurement: Issues and Practice, 12(3), 38-47. Hishamuddin. (2016). Analisis Item Aneka Pilihan Peperiksaan Akhir Anatomi dan Fisiologi Diploma Kejururawatan Kementerian Kesihatan Malaysia. Universiti Pendidikan Sultan Idris. Tanjung Malim Perak Holmes, W., Bix, B. & Shea, J. (1996). SF-20 Score and Item Distribution in a Human Immunodeficiency Virus-Seropositive Sample. Medical Care; 34: 562-569. Holmes, W. & Shea, J. (1997). A New HIV Disease-Specific Quality of Life Questionnaire. Hudiya, A., & Aidah, A. (2017). Pembangunan Instrumen Jurang Digital Bagie-Pembelajaran Menggunakan Analisis Rasch. Bangi: Universiti Kebangsaan Malaysia. Impara, J. C., & Plake, B. S. (1997). Standard-Setting: An Alternative Approach. Journal of Education Measurement, 34, 353-366. Jackson, T. R., Draugalis, J. R., Slack, M. K., Zachry, W. M., DÁgostino, J. (2002). Validation of Authentic Performance Assessment: A Process Suited for Rasch Modeling. Am J Pharm Educ 2002; 66: 233-243. Jaeger, R. M. (1993). Statistics: A Spectator Sport (Second Edition). Newbury Park, CA: Sage. Kaplan, R. M., & Saccuzo, D. P. (1997). Psychological Testing: Principles, Application and Issues. Pacific Grove: Brooks Cole Pub. Company. Kementerian Pendidikan Malaysia (KPM). (2020). Pendidikan Prasekolah. Diperolehi pada 17 Julai 2020 daripada www.moe.edu.my. Kementerian Pendidikan Malaysia (KPM). (2020). Pendidikan Rendah. Diperolehi pada 17 Julai 2020 daripada www.moe.edu.my. Kementerian Pendidikan Malaysia (KPM). (2020). Pendidikan Menengah. Diperolehi pada 17 Julai 2020 daripada www.moe.edu.my. Kementerian Pendidikan Malaysia (KPM). (2020). Pendidikan Lepasan Menengah. Diperolehi pada 17 Julai 2020 daripada www.moe.edu.my. Lai, J. S. & Eton, D. T. (2002). Clinically Meaningful Gaps. Rasch Measurement Transactions 15(4): 850 Lamoureux, E. L., Pesudovs, K., Thumboo, J., Saw, S. M., & Wong, T. Y. (2009). An Evaluation of the Reliability and Validity of The Visual Functioning Questionnaire (VF-11) Using Rasch Analysis in An Asian Population. Linacre, J. M. (2006). Data Variance Explained by Measures. Rasch Measurement Transactions. 20:1045 - 1047. Linacre, J. M. (2007). Standard Errors and Reliabilities: Rasch and Row Score. Rasch Measurement Transaction, 20(4), 1086. Retrieved from http://www.rasch.org/rmt/rmt204f.htm. Linacre, J. M. (2010). A User's Guide to WINSTEPS Rasch-Model Computer Programs. Chicago: MESA Press. Linacre, J. M. (2013). FACETS Version 3.71.3 [Computer Software and manual]. Chicago: Winsteps.com. Lord, F. M. (1980). Application of Item Response Theory to Practical Testing Problems. Hillside. Linden, W. J., & Ronald, K. H. (2010). Handbook of Modern Item Respons Theory. New York: Springer-Verlag New York Inc. Lord, F. M. (1983). Small Justifies the Rasch Model. New Horizons in Testing, 51-61. Lord, F. M., & Novick, M. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison Wesley. Ludlow, L. H., & Guida, F. (1991). Constructing a Scale of Academic Anxiety. Chicago, IL: Paper Presented at the American Educational Research Association Annual Meeting. Luecht, R. M. (1998). Computer-assisted test assembly using optimization heuristics. Applied Psychological Measurement, 22, 224-236. Malaysia Majlis Peperiksaan. (2012). Pengajian Am: Sukatan Pelajaran dan Kertas Soalan Contoh. Sungai Buloh, Selangor Darul Ehsan: Tihani Cetak Sdn Bhd. McManus, I. C., Powis, D. A., Wakeford, R., Forguson, E., James, D., & Richards, P. (2005). Intellectual Aptitude Tests and A Levels for Selecting UK School Leaver Entrants for Medical School. British Medical Journal, 331, 555-559. Messick, S. (1995). Validity of Psychological Assessment: Validation of Inferences From Persons' Responses and Performances as Scientific Inquiry into Score Meaning. American Psychologist, 741-749. Messick, S. (1989). Educational Measurement (3rd Edition). New York: Macmillan. Miller T.R & Cleary T.A. (1993). Direction of Wording effects in balance scales. Educations and Psychological Measurement. 53, 51-60. Mohamed Mustafa, I. (2017). Pengumuman Keputusan Peperiksaan Sijil Tinggi Persekolahan Malaysia (STPM) Tahun 2017. Putrajaya: Majlis Peperiksaan Malaysia. MPM. (2017). Laporan Tahunan. Batu Caves Selangor. MPM. (2013). Manual Teknik dan Gaya Penulisan Soalan Majlis Peperiksaan Malaysia. Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. MPM. (2013). Pengajian Am: Sukatan Pelajaran dan Kertas Soalan. Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. MPM. (2017). Pengajian Am: Manual Pelaksanaan Kerja Kursus Mata Pelajaran Pengajian Am STPM. Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. MPM. (2017). Kriteria Pentaksiran Kerja Kursus Mata Pelajaran Pengajian Am STPM. Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. MPM. (2018). Skema Peperiksaan Sijil Tinggi Persekolahan Malaysia (STPM). Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. MPM. (2019). Panduan Pelaksanaan Kerja Kursus Sijil Tinggi Persekolahan Malaysia (STPM) Edisi Ke-5. Batu Caves, Kuala Lumpur: Majlis Peperiksaan Malaysia. Murphy, K., & Davidshofer, C. (1998). Psychological Testing: Principles and Application (4th Ed.) . Englewood Cliffs NJ: Prentice Hall. Nazlinda Abdullah. (2011). Performance Indicator on Parallel Circuit Conceptual Test (PCCTUT): A Fit Statistic Approach. Procedia-Social and Behavioral Sciences 90 (2013), 431 - 440. Norasmah, O., Suria, M. S., Halizah, H., & Haryaty, A. W. (2014). Assessing Construct Validity and Reliability of Competitiveness Scale Using Rasch Model Approach. WEI International Academic Conference Proceedings. Bali Indonesia. Novick, M. R. (1966). The Axioms and Principal Result of Classical Test Theory. Journal of Mathematical Psychology 3, 1-18. Nunnally, J. C. (1978). The Study of Change in Evaluation Research: Principle Concerning Measurement, Experimental Design, and Analysis. Dalam Elmer, L. S. & Guttentag, M. (Eds). Handbook of Evaluation Research. Beverly Hills, California: Sage. Omar, M. H. (2003, May 5-8). Pentaksiran Pendidikan Alaf Baru. Kertas Kerja Seminar Persidangan Pendidikan Kebangsaan. Kuala Lumpur. Osterlind, S. J. (1992). Constructing Test Item (2nd Edition). London: Kluwer Academic Publishers. Pallant, J. F., & Tennant, A. (2007). An Introduction to the Rasch Measurement Model: An Example Using the Hospital Anxiety and Depression Scale (HADS). British Journal of Clinical Psychology, 46, 1-18. Parra-Lopez, E., & Oreja-Rodriguez, J. R. (2014). Evaluation of the competitiveness of tourist zones of an island destination: An application of a many-facets Rasch model (MFRM). Journal of Destination Marketing and Management, 3(2), 114–121. Payne, D. A. (1974). The Assessment of Learning. Lexington MA: Heath. Popham, W. (1975). Educational Evaluation. New Jersey: Prentice Hall Inc. Popham, W. (1981). Modern Educational Measurement. Englewood Cliffs NJ: Prentice Hall. Pustaka, D. B. (2005). Kamus Dewan Edisi Keempat. Kuala Lumpur. Rafidah M. A., & Mohd Effendi Ewan M. M. (2019). Kesahan dan Kebolehpercayaan Instrumen I-CGPKM Menggunakan Model Rasch. Journal of Quality Measurement and Analysis. JQMA 15(1) 2019, 1-14 Rahaya, A. S. (2003). Teori, Konsep & Amalan Dalam Pengukuran dan Penilaian. Kuala Lumpur: Percetakan Ampang Press Sdn Bhd. Raju, N. S., Price, L. R., Oshima, T., & Nering, M. L. (2006). Standardize Conditional SEM: A Case for Conditional Reliability. Applied Psychology Measurement, 30(X), 1-12. Rasch, G. (1960). Probabilistic Model For Some Intelligence and Attainment Tests (Reprint, with Foreword and Afterword by B. D. Wright, Chicago: University of Chicago Press, 1980). Copenhagen, Denmark: Denmarks Paedogogiske Institut. Reeve, B. B. (2006 ). Introduction to Item Response Theory (IRT) Modeling and Applications for Survey Research. One day workshop sponsored by the Joint Program in Survey Methodology (JPSM): Bethesda, MD. Reeve, B. B. (2002). An Introduction to Modern Measurement Theory. USA: Springer. Ronald K. Hambleton & Russell W. Jones. (1993). Comparison of Classical Test Theory and Item Respons Theory and Their Application to Test Development. Educational Measurement: Issues and Practice, 12(3), 38-47. Ruhaibah, H. (2015). Ciri-Ciri Psikometrik "Malaysian University Selection Inventory"(MUnSyI) Menggunakan Model Pengukuran Rasch . Pulau Pinang: Universiti Sains Malaysia. Schumacker, R. E. & Smith, E. V. (2007). A Rasch Perspective. Educational and Psychological Measurement, V67 N3. 394-409. Siti Eshah, M. (2018). Teori Respons Item Dalam Penyelidikan. Tanjong Malim: Penerbit Universiti Pendidikan Sultan Idris. Smith, R. M. (2000). Fit Analysis in Latent Trait Measurement Models. Journal of Applied Measurement, 1, 199-218. Streiner, D. L., & Norman, G. R. (2008). Health Measurement Scale: A Practical Guide to Their Development and Use (4th Ed.). Oxford UK: Oxford University Press. Suen, H. K. (2010). Principles of Test Theories. New Jersey: Lawrence Erlbaum Associates, Inc., Publishers. Teasdale, T. W., & Owen, D. R. (1984). Heredity and Familial Environment in Intelligence and Educational Level a Sibling Study. Nature Publishing Group 309, 620-622. Tineke, B., Luke, H., & Aaron, O. B., (2018) Going online: The effect of mode of delivery on performances and perceptions on an English L2 writing test suite. United Kingdom. Assessing Writing 36. 3-18 Thomas, E., (2015). Language Testing and Evaluation: Introduction to Many-Facet Rasch Measurement (Analyzing and Evaluating Rater-Mediated Assessments). New York: Peter Lang GmbH. Thompson, N. A. (2009). Ability Estimation With Item Response Theory. White Paper. Assessment Systems Corporation 2233 University Avenue, Suite 200: St. Paul, Minnesota 55114. Tuckman, B. W. (1978). Conducting Educational Research. New York: Harcout Brace Jovanovich Inc. William, J. B., & John R. S. (2020). Advances in Rasch Analyses in The Human Sciences. Switzerland. Springer Nature. William, J. B., John R. S., & Melissa, S. Y. (2014). Rasch Analysis in The Human Sciences. New York London. Springer Dordrecht Heidelberg. Willnat L. & Weaver D. H. (2005). Global Trends in Communication Education and Research. Journalism Education in the United States. 37-52 Wu, M. L & Adams, R, J. (2007). Applying the Rasch Model to Psycho-Social Measurement: A Practical Approach. Educational Measurement Solutions. Melbourne. Wright, B. D., & Masters, G. (1982). Rating Scale Analysis. Chicago, IL: Pluribus Press. Zeng, J., & Wyse, A. (2009). Introduction to Classical Test Theory. Michigan: W. Allegan Lansing.
|
This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials. You may use the digitized material for private study, scholarship, or research. |