UPSI Digital Repository (UDRep)
|
|
|
Abstract : Universiti Pendidikan Sultan Idris |
Principal component analysis (PCA) guided clustering approach is widely used in high dimensional data especially in identifying the spatial distribution patterns of daily torrential rainfall. Typically, a common method of identifying rainfall patterns for climatological investigation employed T mode based Pearson correlation matrix to extract the relative variance retained. However, rainfall data in Peninsular Malaysia involve skewed observations which only take positive values and are skewed towards higher values. Thus, applying PCA based Pearson correlation on rainfall data set could affect cluster partitions and generate extremely unbalanced clusters in a high dimensional space. Another issues that crop up in this study is classical clustering techniques have drawbacks where these techniques require certain assumptions which contradict with the characteristics of rainfall and these techniques divides the database of rainfall patterns into clusters with the assumption that every rainfall pattern belongs only to one specific cluster. Therefore, the main objective of this research is to include a robust spatial and temporal daily torrential rainfall pattern approach in the PCA and also to employ the biclustering method to daily torrential rainfall analysis. Firstly, a robust dimension reduction method in Principal Component Analysis (PCA) is used to rectify the issue of unbalanced clusters in rainfall patterns due to the skewed nature of rainfall data. Secondly, a series of two-way clustering methods known as biclustering that could be applied to daily torrential rainfall analysis in order to show clear classification in defining daily rainfall patterns in Peninsular Malaysia. As results, it has a substantial improvement with the robust PCA combined with bicluster analysis in terms of the average number of clusters obtained and its cluster quality. The outputs can be beneficial to those who would like to apply a more accurate regional future rainfall data in their studies. |
References |
V. Moron, et al., “Weather Types Across the Maritime Continent: from the Diurnal Cycle to Interannual Variations,” Frontiers in Environmental Science. 3(44), 2015.
N. H. Ahmad, et al., “Hierarchical Cluster Approach for Regionalization of Peninsular Malaysia based on the Precipitation Amount,” Journal of Physics: Conference Series. 423, pp. 1-10, 2013.
G. S. Siva, et al., “Cluster Analysis Approach to Study the Rainfall Pattern in Visakhapatnam District,” Weekly Science Research Journal. 1(31), 2014.
R. Romero, et al., “Daily Rainfall Patterns in the Spanish Mediterranean Area: An Objective Classification,” International Journal of Climatology.19, pp. 95-112, 1999.
D. Penarrocha, “Classification of Daily Rainfall Patterns in a Mediterranean Area with Extreme Intensity Levels: The Valencia Region,” Internation Journal of Climatology.22, pp. 677-695, 2002.
G. Sumner, et al., “The Impact of Surface Circulations on the Daily Rainfall over Mallorca,” International Journal of Climatology.15, pp. 673–696, 1995.
P. Wickramagamage, “Seasonality and spatial pattern of rainfall of Sri Lanka: Exploratory factor analysis,” International Journal of Climatology. 30, pp. 1235-1245, 2010.
N. S. Chok, “Pearson’s Versus Sperman’s and Kendall’s Correlation Coefficients for Continuous Data”, 2008. |
This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials. You may use the digitized material for private study, scholarship, or research. |