Research Article
BibTex RIS Cite

Applying Permanova for Multivariate Analysis of Variance in Health Studies

Year 2025, Volume: 8 Issue: 2, 428 - 434, 15.03.2025
https://doi.org/10.34248/bsengineering.1611775

Abstract

Health data often do not meet the normality assumption, which limits the applicability of traditional analysis of variance methods. The aim of this study is to propose a methodological framework for analyzing such data by examining PERMANOVA (Permutational Multivariate Analysis of Variance), a method that does not require the normality assumption and is particularly suitable for complex datasets, within the context of maternal health data. In the context of maternal healthcare in Bangladesh, the effects of two independent variables—risk and age factors—on multivariate response variables such as Systolic Blood Pressure (mmHg), Diastolic Blood Pressure (mmHg), Blood Sugar (mmol/L), Body Temperature (Fahrenheit), and Heart Rate (beats per minute) were examined using the PERMANOVA method. The first independent variable represents the risk factor, comprising three different risk levels (low, mid, high), while the second independent variable represents the age factor, divided into four age groups (young, adolescent, middle-aged, menopausal). The dependent variables did not follow a normal distribution, as confirmed by the Anderson-Darling test and Mardia’s multivariate normality test. As a result of the PERMANOVA analysis, it was determined that at least two mean differences between the groups of the risk factor and the age factor were statistically significant in terms of the response variables (P<0.01). Furthermore, pairwise comparisons of the factor groups revealed that the mean differences between the low, mid, and high levels of the risk factor, as well as the mean differences among the young, adolescent, and middle-aged groups of the age factor, were statistically significant (P<0.01). However, the mean difference between the middle-aged and menopausal groups for the age factor was found to be statistically insignificant (P>0.01). The PERMANOVA method is recommended for researchers to accurately determine whether the mean differences in factor levels are statistically significant or to identify threshold values of the groups by using multiple response variables simultaneously and performing pairwise comparisons of factor groups.

Ethical Statement

The datasets used in this study are publicly accessible and can be found at the following link: https://www.kaggle.com/datasets/csafrit2/maternal-health-risk-data

Thanks

We sincerely thank Ahmed and Kashem for making the open-access dataset used in this study available.

References

  • Ahmed M, Kashem MA. 2020. IoT-based risk level prediction model for maternal health care in the context of Bangladesh. 2nd International Conference on Sustainable Technologies for Industry 4.0 (STI), December 19-20, Dhaka, Bangladesh, pp: 1-6. https://doi.org/10.1109/STI50764.2020.9350320
  • Anderson MJ. 2001. A new method for non-parametric multivariate analysis of variance. Austral Ecol, 26(1): 32-46. https://doi.org/10.1111/j.1442-9993.2001.01070.pp.x
  • Anderson MJ. 2014. Permutational multivariate analysis of variance (PERMANOVA). Wiley StatsRef: Statistics Reference Online. https://doi.org/10.1002/9781118445112.stat07841
  • Hervé M. 2022. RVAideMemoire: Testing and plotting procedures for biostatistics [Computer software]. CRAN. https://CRAN.R-project.org/package=RVAideMemoire
  • Koç Ş, Çanga Boğa D, Önem AB, Yavuz E, Şahin M. 2019. Monte Carlo simulation study robustness of MANOVA test statistics in Bernoulli and uniform distribution. BSJ Eng Sci, 2(2): 42-51. https://doi.org/10.34248/bsengineering.563330
  • Korkmaz S, Göksülük D, Zararsiz G. 2014. MVN: An R package for assessing multivariate normality. The R J, 6(2): 151-162. https://doi.org/10.32614/RJ-2014-031
  • Nascimento FJA, Dahl M, Deyanova D, Lyimo LD, Bik HM, Schuelke T, Bonaglia S. 2019. Above-below surface interactions mediate effects of seagrass disturbance on meiobenthic diversity, nematode and polychaete trophic structure. Commun Biol, 2: 362. https://doi.org/10.1038/s42003-019-0607-z
  • Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Szoecs E, Wagner H. 2022. Vegan: Community ecology package [Computer software]. CRAN. https://CRAN.R-project.org/package=vegan
  • Pasin O, Ankarali H, Cangür S, Sungur MA. 2016. Nonparametric multivariate analysis of variance and its application in the field of health. J Info Technol, 9(1): 13-20.
  • R Core Team. 2023. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL: https://www.R-project.org/
  • Sowers JR, Epstein M, Frohlich ED. 2001. Diabetes, hypertension, and cardiovascular disease: An update. Hypertension, 37(4): 1053-1059. https://doi.org/10.1161/01.HYP.37.4.1053
  • Şahin M, Koç Ş. 2018. A Monte Carlo simulation study robustness of MANOVA test statistics in Bernoulli distribution. Süleyman Demirel Univ J Inst Nat App Sci, 22(3): 1125-1131.
  • Tabachnick BG, Fidell LS, Ullman JB. 2013. Using multivariate statistics. Pearson, London, UK, pp: 983 (6th ed.).
  • Tinsley HEA, Brown SD. 2000. Multivariate statistics and mathematical modeling. In Handbook of applied multivariate statistics and mathematical modeling. Academic Press, Cambridge, US, pp: 3–36. https://doi.org/10.1016/B978-012691360-6/50002-8
  • Togunwa TO, Babatunde AO, Abdullah KU. 2023. Deep hybrid model for maternal health risk classification in pregnancy: Synergy of ANN and random forest. Front AI, 6: 1213436. https://doi.org/10.3389/frai.2023.1213436
  • Underwood AJ. 1981. Techniques of analysis of variance in experimental marine biology and ecology. Oceanog Marine Biol: An Annual Rev, 19(Suppl 1): 513-605.
  • Wikipedia. 2023. Akaike information criterion. URL: https://en.wikipedia.org/wiki/Akaike_information_criterion https://en.wikipedia.org/wiki/Bayesian_information_criterion (accessed date: September 30, 2023).

Applying Permanova for Multivariate Analysis of Variance in Health Studies

Year 2025, Volume: 8 Issue: 2, 428 - 434, 15.03.2025
https://doi.org/10.34248/bsengineering.1611775

Abstract

Health data often do not meet the normality assumption, which limits the applicability of traditional analysis of variance methods. The aim of this study is to propose a methodological framework for analyzing such data by examining PERMANOVA (Permutational Multivariate Analysis of Variance), a method that does not require the normality assumption and is particularly suitable for complex datasets, within the context of maternal health data. In the context of maternal healthcare in Bangladesh, the effects of two independent variables—risk and age factors—on multivariate response variables such as Systolic Blood Pressure (mmHg), Diastolic Blood Pressure (mmHg), Blood Sugar (mmol/L), Body Temperature (Fahrenheit), and Heart Rate (beats per minute) were examined using the PERMANOVA method. The first independent variable represents the risk factor, comprising three different risk levels (low, mid, high), while the second independent variable represents the age factor, divided into four age groups (young, adolescent, middle-aged, menopausal). The dependent variables did not follow a normal distribution, as confirmed by the Anderson-Darling test and Mardia’s multivariate normality test. As a result of the PERMANOVA analysis, it was determined that at least two mean differences between the groups of the risk factor and the age factor were statistically significant in terms of the response variables (P<0.01). Furthermore, pairwise comparisons of the factor groups revealed that the mean differences between the low, mid, and high levels of the risk factor, as well as the mean differences among the young, adolescent, and middle-aged groups of the age factor, were statistically significant (P<0.01). However, the mean difference between the middle-aged and menopausal groups for the age factor was found to be statistically insignificant (P>0.01). The PERMANOVA method is recommended for researchers to accurately determine whether the mean differences in factor levels are statistically significant or to identify threshold values of the groups by using multiple response variables simultaneously and performing pairwise comparisons of factor groups.

Ethical Statement

The datasets used in this study are publicly accessible and can be found at the following link: https://www.kaggle.com/datasets/csafrit2/maternal-health-risk-data

Thanks

We sincerely thank Ahmed and Kashem for making the open-access dataset used in this study available.

References

  • Ahmed M, Kashem MA. 2020. IoT-based risk level prediction model for maternal health care in the context of Bangladesh. 2nd International Conference on Sustainable Technologies for Industry 4.0 (STI), December 19-20, Dhaka, Bangladesh, pp: 1-6. https://doi.org/10.1109/STI50764.2020.9350320
  • Anderson MJ. 2001. A new method for non-parametric multivariate analysis of variance. Austral Ecol, 26(1): 32-46. https://doi.org/10.1111/j.1442-9993.2001.01070.pp.x
  • Anderson MJ. 2014. Permutational multivariate analysis of variance (PERMANOVA). Wiley StatsRef: Statistics Reference Online. https://doi.org/10.1002/9781118445112.stat07841
  • Hervé M. 2022. RVAideMemoire: Testing and plotting procedures for biostatistics [Computer software]. CRAN. https://CRAN.R-project.org/package=RVAideMemoire
  • Koç Ş, Çanga Boğa D, Önem AB, Yavuz E, Şahin M. 2019. Monte Carlo simulation study robustness of MANOVA test statistics in Bernoulli and uniform distribution. BSJ Eng Sci, 2(2): 42-51. https://doi.org/10.34248/bsengineering.563330
  • Korkmaz S, Göksülük D, Zararsiz G. 2014. MVN: An R package for assessing multivariate normality. The R J, 6(2): 151-162. https://doi.org/10.32614/RJ-2014-031
  • Nascimento FJA, Dahl M, Deyanova D, Lyimo LD, Bik HM, Schuelke T, Bonaglia S. 2019. Above-below surface interactions mediate effects of seagrass disturbance on meiobenthic diversity, nematode and polychaete trophic structure. Commun Biol, 2: 362. https://doi.org/10.1038/s42003-019-0607-z
  • Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Szoecs E, Wagner H. 2022. Vegan: Community ecology package [Computer software]. CRAN. https://CRAN.R-project.org/package=vegan
  • Pasin O, Ankarali H, Cangür S, Sungur MA. 2016. Nonparametric multivariate analysis of variance and its application in the field of health. J Info Technol, 9(1): 13-20.
  • R Core Team. 2023. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL: https://www.R-project.org/
  • Sowers JR, Epstein M, Frohlich ED. 2001. Diabetes, hypertension, and cardiovascular disease: An update. Hypertension, 37(4): 1053-1059. https://doi.org/10.1161/01.HYP.37.4.1053
  • Şahin M, Koç Ş. 2018. A Monte Carlo simulation study robustness of MANOVA test statistics in Bernoulli distribution. Süleyman Demirel Univ J Inst Nat App Sci, 22(3): 1125-1131.
  • Tabachnick BG, Fidell LS, Ullman JB. 2013. Using multivariate statistics. Pearson, London, UK, pp: 983 (6th ed.).
  • Tinsley HEA, Brown SD. 2000. Multivariate statistics and mathematical modeling. In Handbook of applied multivariate statistics and mathematical modeling. Academic Press, Cambridge, US, pp: 3–36. https://doi.org/10.1016/B978-012691360-6/50002-8
  • Togunwa TO, Babatunde AO, Abdullah KU. 2023. Deep hybrid model for maternal health risk classification in pregnancy: Synergy of ANN and random forest. Front AI, 6: 1213436. https://doi.org/10.3389/frai.2023.1213436
  • Underwood AJ. 1981. Techniques of analysis of variance in experimental marine biology and ecology. Oceanog Marine Biol: An Annual Rev, 19(Suppl 1): 513-605.
  • Wikipedia. 2023. Akaike information criterion. URL: https://en.wikipedia.org/wiki/Akaike_information_criterion https://en.wikipedia.org/wiki/Bayesian_information_criterion (accessed date: September 30, 2023).
There are 17 citations in total.

Details

Primary Language English
Subjects Biostatistics, Applied Statistics
Journal Section Research Articles
Authors

Fahrettin Kaya 0000-0003-1666-4859

Publication Date March 15, 2025
Submission Date January 2, 2025
Acceptance Date February 2, 2025
Published in Issue Year 2025 Volume: 8 Issue: 2

Cite

APA Kaya, F. (2025). Applying Permanova for Multivariate Analysis of Variance in Health Studies. Black Sea Journal of Engineering and Science, 8(2), 428-434. https://doi.org/10.34248/bsengineering.1611775
AMA Kaya F. Applying Permanova for Multivariate Analysis of Variance in Health Studies. BSJ Eng. Sci. March 2025;8(2):428-434. doi:10.34248/bsengineering.1611775
Chicago Kaya, Fahrettin. “Applying Permanova for Multivariate Analysis of Variance in Health Studies”. Black Sea Journal of Engineering and Science 8, no. 2 (March 2025): 428-34. https://doi.org/10.34248/bsengineering.1611775.
EndNote Kaya F (March 1, 2025) Applying Permanova for Multivariate Analysis of Variance in Health Studies. Black Sea Journal of Engineering and Science 8 2 428–434.
IEEE F. Kaya, “Applying Permanova for Multivariate Analysis of Variance in Health Studies”, BSJ Eng. Sci., vol. 8, no. 2, pp. 428–434, 2025, doi: 10.34248/bsengineering.1611775.
ISNAD Kaya, Fahrettin. “Applying Permanova for Multivariate Analysis of Variance in Health Studies”. Black Sea Journal of Engineering and Science 8/2 (March 2025), 428-434. https://doi.org/10.34248/bsengineering.1611775.
JAMA Kaya F. Applying Permanova for Multivariate Analysis of Variance in Health Studies. BSJ Eng. Sci. 2025;8:428–434.
MLA Kaya, Fahrettin. “Applying Permanova for Multivariate Analysis of Variance in Health Studies”. Black Sea Journal of Engineering and Science, vol. 8, no. 2, 2025, pp. 428-34, doi:10.34248/bsengineering.1611775.
Vancouver Kaya F. Applying Permanova for Multivariate Analysis of Variance in Health Studies. BSJ Eng. Sci. 2025;8(2):428-34.

                                                24890