MANAGING MULTICOLLINEARITY AND LINEAR DEPENDENCE IN PCA OF OLIVE OIL USING FATTY ACID AND TRIGLYCERIDE PROFILES
Öz
In this study, principal component analysis (PCA) was applied to fatty acid and triglyceride (TAG) profiles of extra virgin olive oils (n=40) obtained from Kahramanmaraş (Türkiye) during the 2023/2024 harvest season to diagnose and address matrix instabilities caused by multicollinearity and linear dependence. A total of 34 variables (23 experimental and 11 derived) were used to reduce data dimensionality and determine sample variability. The initial PCA attempt with standardized variables showed poor factorability (Kaiser-Meyer-Olkin measure, KMO=0.13; measure of sampling adequacy, MSA<0.40), and Bartlett’s test of sphericity could not be calculated because the correlation matrix was not positive definite. Multicollinearity and linear dependence were assessed using Pearson correlations and regression-based diagnostics (variance inflation factor, V IF; t olerance i ndex, T I; condition index, CI; and variance decomposition proportions, VDP). Most derived variables showing high correlations and redundant information were removed from the dataset, reducing the number of variables to 23, and in the repeated PCA, Bartlett’s test of sphericity became significant (P<0.001), but the KMO value of 0.49 indicated that the model still had insufficient factorability. An optimized 17-variable model was obtained through a stepwise screening based on MSA (<0.40) and multicollinearity criteria (VIF>10; TI<0.10). The final m odel p roduced 5 principal c omponents ( PCs) t hat e xplained 7 4% of t he t otal variance and reached an acceptable level of sampling adequacy (KMO=0.70). After Promax rotation, variables were mostly loaded uniquely and strongly on the relevant PCs in the pattern matrix, while secondary loadings were limited in the structure matrix. In the score analysis, most samples showed separation on the PC1-PC2 plane. Additionally, only 10 samples (25%) exceeded the standardized z-score threshold (|z|>2). Overall, the results indicated that for reliable and interpretable PCA modelling of the olive oil data, it is necessary to clearly manage factorability and multicollinearity issues and to carefully examine the correlation matrix structures and the score distributions.
Anahtar Kelimeler
Etik Beyan
Kaynakça
- Abdi, H., Williams, L.J. (2010). Principal component analysis. WIREs Computational Statistics, 2, 433-459. https://doi.org/10.1002/wics.101
- Agozzino, P., Avellone, G., Bongiorno, D., Ceraulo, L., Indelicato, S., Indelicato, S., Vèkey, K. (2010). Determination of the cultivar and aging of Sicilian olive oils using HPLC‐MS and linear discriminant analysis. Journal of Mass Spectrometry, 45, 989-995. https://doi.org/10.1002/jms.1791
- Amaral, J.S., Mafra, I., Oliveira, M.B.P. (2010). Characterization of three Portuguese varietal olive oils based on fatty acids, triacylglycerols, phytosterols and vitamin E profiles: application of chemometrics. In: Olives and Olive Oil in Health and Disease Prevention, Preedy, V.R., Watson, R.R. (Eds.). Elsevier, Oxford, UK, pp. 581-589.
- AOCS, 2009. Preparation of Methyl Esters of Fatty Acids (Ce 2-66). Official Methods and Recommended Practices of the American Oil Chemists' Society, The Association, IL, USA.
- Bosque-Sendra, J.M., Cuadros-Rodríguez, L., Ruiz-Samblás, C., de la Mata, A.P. (2012). Combining chromatography and chemometrics for the characterization and authentication of fats and oils from triacylglycerol compositional data—A review. Analytica Chimica Acta, 724, 1-11. https://doi.org/10.1016/j. aca.2012.02.041
- Brown, J.D. (2009). Principal components analysis and exploratory factor analysis - Definitions, differences, and choices. Shiken: JALT Testing & Evaluation SIG Newsletter, 13, 26-30.
- Dıraman, H. (2010). Characterization by chemometry of the most important domestic and foreign olive cultivars from the National Olive Collection Orchard of Turkey. Grasasy Aceites, 61, 341-351.
- Ergin, M., Can, A.S., Koşkan, Ö. (2023). Factor rotation methods in factor analysis: An application on agricultural data. Ziraat Fakültesi Dergisi, 18, 134-142. https://doi.org/10.54975/isubuzfd.1370165
Ayrıntılar
Birincil Dil
İngilizce
Konular
Gıda Mühendisliği
Bölüm
Araştırma Makalesi
Yazarlar
Ayşe Gizem Gül
Bu kişi benim
0009-0006-3439-5751
Türkiye
Yayımlanma Tarihi
31 Mart 2026
Gönderilme Tarihi
17 Şubat 2026
Kabul Tarihi
17 Mart 2026
Yayımlandığı Sayı
Yıl 2026 Cilt: 51 Sayı: 2
GIDA Dergisi Creative Commons Atıf-Gayri Ticari 4.0 (CC BY-NC 4.0) Uluslararası Lisansı ile lisanslanmıştır.