UPSI Digital Repository (UDRep)
|
|
|
Abstract : Universiti Pendidikan Sultan Idris |
The Spearman rho nonparametric correlation coefficient is widely used to measure the strength and degree of association between two variables. However, outliers in the data can skew the results, leading to inaccurate results as the Spearman correlation coefficient is sensitive toward outliers. Thus, the robust approach is used to construct a robust model which is highly resistant to data contamination. The robustness of an estimator is measured by the breakdown point which is the smallest fraction of outliers in a sample data without affecting the estimator entirely. To overcome this problem, the aim of this study is two-fold. Firstly, researchers have proposed a robust Spearman correlation coefficient model based on the MMestimator, called the MM-Spearman correlation coefficient. Secondly, to test the performance of the proposed model, it was tested by the Monte Carlo simulation and contaminated air pollution data in Kuala Terengganu, Terengganu, Malaysia. The data have been contaminated from 10% to 50% outliers. The performance of the MMSpearman correlation coefficient properties was evaluated by statistical measurements such as standard error, mean squared error, root mean squared error and bias. The MMSpearman correlation coefficient model outperformed the classical model, producing significantly smaller standard error, mean squared error, and root mean squared error values. The robustness of the model was evaluated using the breakdown point, which measures the smallest fraction of outliers that can be present in sample data without entirely affecting the estimator. The hybrid MM-Spearman correlation coefficient model demonstrated high robustness and efficiently handled data contamination up to 50%. However, the study has a limitation in that it can only overcome data contamination up to a maximum of 50%. Despite this limitation, the proposed model provides accurate and efficient results, enabling management authorities to make sound decisions without being affected by contaminated data. The MM-Spearman correlation coefficient model provides a valuable tool for researchers and decision-makers, allowing them to analyze data with a high degree of accuracy and robustness, even in the presence of outliers. 2023 by authors, all rights reserved. |
References |
Renaud, O., & Victoria-Feser, M. P. “A robust coefficient of determination for regression”. Journal of Statistical Planning and Inference, vol. 140, no. 7, pp. 1852-1862, 2010, https://doi.org/10.1016/j.jspi.2010.01.008. Daszykowski, M., Kaczmarek, K., Heyden, Y. V., Walczak, B. “Robust Statistics in Data Analysis: A Review Basic Concepts”. Journal Chemometrics and Intelligent Laboratory Systems, vol. 85, no. 2, pp. 203–219, 2007, https://doi.org/10.1016/j.chemolab.2006.06.016 Maronna, R. A., Martin, R. D., Yohai, V. J., SalibiánBarrera, M. Robust statistics: theory and methods (with R). John Wiley & Sons. 2019, pp. 17-85. Abdullah, M., Analisis Regresi (1st ed.). Kuala Lumpur: Perpustakaan Dewan Bahasa dan Pustaka. 1994, pp. 95-115. Croux, C., Dehon, C. “Influence Functions of the Spearman and Kendall Correlation Measures”. Statistical Method & Applications, vol. 19, pp. 497-515, 2010, https://doi.org/10.1007/s10260-010-0142-z Huber, P. J. “Robust Estimation of Location Parameter”. The Annals of Mathematical Statistics, vol. 35, No.1, pp.73-101, 1964. https://www.jstor.org/stable/2238020 Yohai, V. J. High Breakdown-Point and High Efficiency Robust Estimates for Regression. The Annals of Statistics, 15, pp. 642-656, 1987. https://www.jstor.org/stable/2241331. Maru, A. P., Modi, C. K., & Nataraj, P. S. V. "Comparison of Robust MM Estimator and Robust M Estimator Based Denoising Filters for Gray Level Image Denoising," 2012 International Conference on Communication Systems and Network Technologies, Rajkot, Gujarat, India, 2012, pp. 109-113, doi: 10.1109/CSNT.2012.33. Huber, P. J. “Robust Regression: Asymptotics, Conjectures and Monte Carlo”. The Annals of Statistics, vol. 1, No. 5, pp. 799-821. 1973. https://www.jstor.org/stable/2958283 Huber, P. J. Robust Statistics (1st ed.). New York: Wiley Interscience Publication. 1981, pp. 199-237 Lola, M. S., “ Fuzzy Parametric Sample Selection Model: Monte Carlo Simulation Approach”. Journal of Statistical Computation and Simulation. vol. 83, no. 6, pp. 992-1006, 2013. https://doi.org/10.1080/00949655.2011.646277 Pascual, L., Romo, J., Ruiz, E. “Bootsrap Prediction for Returns and Volatile in GARCH Model”. Computational Statistics and Data Analysis. vol. 50, no. 9, pp. 2293-2312, 2006. DOI: 10.1016/j.csda.2004.12.008. Zainuddin, N. H., Lola, M.S., and Nur Shazrahanim, K. “Modelling Moving Centerline Exponentially Weighted Moving Average (MCEWMA) with bootstrap approach: Case study on sukuk musyarakah of Rantau Abang Capital Berhad, Malaysia”. International Journal of Applied, Business and Economic Research. vol. 14, no. 2, pp. 621-638, 2016. Zainuddin, N. H., Lola, M. S. “The Performance of BBMCEWMA Model: Case Study on Sukuk Rantau Abang Capital Berhad, Malaysia”. International Journal of Applied, Business and Economic Research. vol. 14, no. 2, pp. 63-77, 2016. Abdullah, M. “On a Robust Correlation Coefficient”. Journal of the Royal Statistical Society, Series D (The Statistician). vol. 39, no. 4, pp. 455-460. 1990. https://doi.org/10.2307/2349088 Mundform, D. J., Schaffer, Jay, K. M., Jin, S. D., Thongteeraparp, A., Supawan, P. “Number of Replications Required in Monte Carlo Simulation Studies: A Synthesis of Four Studies". Journal of Modern Applied Statistical Methods. vol. 10, no. 1, pp.19-28, 2011. 10.22237/jmasm/1304222580 Razali, M.R., Lola, M. S., Abd Wahid, M. E.,, Zainuddin, N. H., Abdullah, M. T., K Abdul Hamid, A. A., Chandra Segaran, T., Lassaw, S., and Djauhari, M. “A Hybrid Logistic Regression Model with a Bootstrap Approach to Improve the Accuracy of the Performance of Jellyfish Collagen Data”. Journal of Sustainability Science and Management. vol. 16, No. 6, pp. 191-203, 2021. |
This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials. You may use the digitized material for private study, scholarship, or research. |