Research Article

K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data

Volume: 2 Number: 1 September 23, 2019
EN

K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data

Abstract


Cluster analysis by k-means algorithm by R programming is the scope of the current paper. The study assesses the similarity of the sampling data derived from the GIS project by homogeneity of their attribute parameters aimed to analyze similar clusters of the observa- tion data by the variety of parameters: geology (similar location on the tectonic plates, sediment thickness, igneous volcanic areas), bathymetry (similar depth ranges) and geomorphology (similar slope steepness and aspect). The geological case study is Mariana Trench. Clustering as ef- fective statistical method to detect similar groups in the data set. Tech- nically, major used R libraries include {cluster}, {factoextra}, {ggplot2}. Minor R libraries include {wordcloud}, {tm}. Several clusters were tested from 2 to 7, optical number is 5. The findings include following computed and visualized results illustrated by 8 figures: 1) correlation matrix show- ing crossing correlations in the combination of factors; 2) comparison of the bi-factors in-between the factors revealed pairwise correlation; 3) pairwise comparative analysis enabled to observe an influence on the variables as bi-factors: in response to the decreasing sediment thickness, slope angles go in parallel; 4) the location of the volcanic igneous ar- eas cause a cyclic repetition of the curve for the slope angles, and those of the volcanic zones have correlation with the slope angle and aspect degree. Findings reveals that four variables affect geomorphology of the trench: slope angle, sediment thickness, aspect degree and volcanic ig- neous areas. The paper includes 7 listings of R programming codes for repeatability of the algorithms in similar research.


Keywords

References

  1. Ciaccio, A.D., Coli, M., Angulo Ibanez, J.M.: Studies in Theoretical and Applied Statistics Selected Papers of the Statistical Societies, chap. Advanced Statistical Methods for the Anaysis of Large Data Sets, p. 464. Springer (2012). https://doi.org/10.1007/978-3-642-21037-2
  2. Cielen,D., Meysman, A. D. B., M., A.: Introducing Data Science. Big Data, Machine Learning and More, Using Python Tools. Manning, Shelter Island, U.S. (2016)
  3. van Haren, H., Berndt, C., Klaucke, I.: Ocean mixing in deep-sea trenches: New insights from the Challenger Deep, Mariana Trench. Deep-Sea Research Part I: Oceanographic Research Papers (2017). https://doi.org/10.1016/j.dsr.2017.09.003
  4. Hartwell, A.M., Voight, J.R., Wheat, C.G.: Clusters of deep-sea egg-brooding octopods associated with warm fluid discharge: An ill-fated fragment of a larger, discrete population? Deep-Sea Research Part I: Oceanographic Research Papers 135, 1–8 (2018). https://doi.org/10.1016/j.dsr.2018.03.011
  5. Hessler, R.R., Ingram, C.L., Yayanos, A.A., Burnett, B.: Scavenging amphipods from the floor of the Philippine Trench. Deep-Sea Research Part I: Oceanographic Research Papers 25, 1029–1047 (1978)
  6. Ichino, M.C., Clark, M.R., Drazen, J.C., Jamieson, A., Jones, D.O.B., Martin, A.P., Rowden, A.A., Shank, T.M., Yancey, P.H., Ruhl, H.A.: The distribution of benthic biomass in hadal trenches: A modelling approach to investi- gate the effect of vertical and lateral organic matter transport to the seafloor. Deep-Sea Research Part I: Oceanographic Research Papers 100, 21–33 (2015). https://doi.org/10.1016/j.dsr.2015.01.010
  7. Itoh, M., Kawamura, K., Kitahashi, T., kiKojima, S., Katagiri, H., Shimanaga, M.: Bathymetric patterns of meiofaunal abundance and biomass associated with the Kuril and Ryukyu trenches, western North Pacific Ocean. Deep-Sea Research Part I: Oceanographic Research Papers 58, 86–97 (2011). https://doi.org/10.1016/j.dsr.2010.12.004
  8. Jamieson, A.J., Fujii, T.: Trench Connection. Biology Letters 7, 641–643 (2011). https://doi.org/10.1098/rsbl.2011.0231

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Publication Date

September 23, 2019

Submission Date

April 20, 2019

Acceptance Date

August 3, 2019

Published in Issue

Year 2019 Volume: 2 Number: 1

APA
Lemenkova, P. (2019). K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data. International Journal of Informatics and Applied Mathematics, 2(1), 1-26. https://izlik.org/JA46AF99JS
AMA
1.Lemenkova P. K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data. IJIAM. 2019;2(1):1-26. https://izlik.org/JA46AF99JS
Chicago
Lemenkova, Polina. 2019. “K-Means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data”. International Journal of Informatics and Applied Mathematics 2 (1): 1-26. https://izlik.org/JA46AF99JS.
EndNote
Lemenkova P (September 1, 2019) K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data. International Journal of Informatics and Applied Mathematics 2 1 1–26.
IEEE
[1]P. Lemenkova, “K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data”, IJIAM, vol. 2, no. 1, pp. 1–26, Sept. 2019, [Online]. Available: https://izlik.org/JA46AF99JS
ISNAD
Lemenkova, Polina. “K-Means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data”. International Journal of Informatics and Applied Mathematics 2/1 (September 1, 2019): 1-26. https://izlik.org/JA46AF99JS.
JAMA
1.Lemenkova P. K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data. IJIAM. 2019;2:1–26.
MLA
Lemenkova, Polina. “K-Means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data”. International Journal of Informatics and Applied Mathematics, vol. 2, no. 1, Sept. 2019, pp. 1-26, https://izlik.org/JA46AF99JS.
Vancouver
1.Polina Lemenkova. K-means Clustering in R Libraries {cluster} and {factoextra} for Grouping Oceanographic Data. IJIAM [Internet]. 2019 Sep. 1;2(1):1-26. Available from: https://izlik.org/JA46AF99JS

International Journal of Informatics and Applied Mathematics