Research Article
BibTex RIS Cite

Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL

Year 2024, Volume: 22 Issue: 3, 224 - 235, 18.12.2024
https://doi.org/10.24323/akademik-gida.1603881

Abstract

A user-friendly MS Excel® spreadsheet as a freeware (R-BioXL) was developed to fit mathematical models to experimental data. (R-BioXL is available to everyone at https://drive.google.com/drive/folders/1GyjT3Z_CJQZu6ASb4LQBlS-ajLa_nF6X?usp=sharing) Initially, users are expected to enter their X-Y data and define their parameters of the model. Then, a model equation should also be entered again by users. Users can visualize data (scatter plot) and model fit (line plot) with the defined initial estimates of parameters on the same graph by default. Squared differences between experimental data and model estimates are calculated automatically. Users can change the initial estimates of the parameters to make the model closer to the data instantly, and Solver Add-In of Excel® should be used to minimize the sum of squared error by changing the parameter values. After the parameters are obtained, standard errors (by using “SolverAid” macro), 95 and 99% confidence intervals of the parameters, p values to determine the statistical significance of the parameters, and goodness-of-fit indices are calculated as the last step. All results can be saved on a different Excel® working page. Whole procedure takes a couple of minutes (~3 to 10 min) depending on the Excel® experience of the user. The utility, accuracy and reliability of the spreadsheet was shown by applying two-parameter (non-linear) Michealis-Menten equation for enzyme kinetics, three-parameter (linear) van Deemter equation for chromatography, and four-parameter (non-linear) modified Gompertz equation for microbial growth. In conclusion, R-BioXL can be safely and freely used to describe the experimental data with Excel® knowledge, without any skills in programming and without additional cost for other software package.

References

  • [1] Hu, W., Xie, J., Chau, H.W., Si, B.C. (2015). Evaluation of parameter uncertainties in nonlinear regression using Microsoft Excel Spreadsheet. Environmental Systems Research, 4, 4.
  • [2] Serment-Moreno, V. (2021). Microbial Modeling Needs for the Nonthermal Processing of Foods. Food Engineering Reviews, 13, 465–489.
  • [3] Leylak, C., Yurdakul, M., Buzrul, S. (2020). Use of Excel in food science 1: Linear regression. Food and Health, 6, 186–198.
  • [4] Brown, A.M. (2001). A step-by-step guide to non-linear regression analysis of experimental data using a Microsoft Excel spreadsheet. Computer Methods and Programs Biomedicine, 65, 191–200.
  • [5] Kemmer, G., Keller, S. (2010). Nonlinear least-squares data fitting in Excel spreadsheets. Nature Protocols, 5, 267–281.
  • [6] Yurdakul, M., Leylak, C., Buzrul, S. (2020). Use of Excel in food science 2: Non-linear regression. Food and Health, 6, 199–212.
  • [7] van Boekel, M.A.J.S. (2022). Kinetics of heat-induced changes in dairy products: Developments in data analysis and modelling techniques. International Dairy Journal, 126, 105187.
  • [8] van Boekel, M.A.J.S. (1996). Statistical aspects of kinetic modeling for food science problems. Journal of Food Science, 61, 477–485.
  • [9] de Levie, R. (2004). Advanced Excel for scientific data analysis. New York, USA, Oxford University Press.
  • [10] Chase, A.M., von Meier, H.C., Menna, V.J. (1962). The non-competitive inhibition and irreversible inactivation of yeast. Journal of Cellular and Comparative Physiology, 59, 1–13.
  • [11] van Boekel, M.A.J.S. (2008). Kinetic Modeling of Reactions in Foods. Boca Raton, CRC Press.
  • [12] Moody, H.W. (1982). The evaluation of the parameters in the van Deemter equation. Journal of Chemical Education, 59, 290–291.
  • [13] Lambert, R.J.W., Mytilinaios, I., Maitland, L., Brown, A.M. (2012). Monte Carlo simulation of parameter confidence intervals for non-linear regression analysis of biological data using Microsoft Excel. Computer Methods and Programs Biomedicine, 107, 155–163.
  • [14] Zwietering, M.H., Jongenburger, I., Rombouts, F.M., Van’t Riet, K. (1990). Modeling of the bacterial growth curve. Applied Environmental Microbiology, 56, 1875–1881.
  • [15] Alcantara, I.M., Naranjo, J., Lang, Y. (2022). Model selection using PRESS statistic. Computational Statistics, 38, 285–298.
  • [16] Öksüz, H.B., Buzrul, S. (2020). Monte Carlo analysis for microbial growth curves. Journal of Microbiology, Biotechnology and Food Sciences, 10, 418–423.
  • [17] de Levie R (2012). Collinearity in least-squares analysis. Journal of Chemical Education, 89, 68–78.
  • [18] de Levie R (2012). Nonisothermal analysis of solution kinetics by spreadsheet simulation. Journal of Chemical Education, 89, 79–86.
  • [19] Bergtold, J,S,, Pokharel, K.P., Featherstone, A.M., Mo, L. (2018). On the examination of the reliability of statistical software for estimating regression models with discrete dependent variables. Computational Statistics, 33, 757–786.
  • [20] McCullough, B.D., Wilson, B. (1999). On the accuracy of statistical procedures in Microsoft Excel 97. Computational Statistics and Data Analysis, 31, 27–37.
  • [21] McCullough, B.D., Wilson, B. (2000). On the accuracy of statistical procedures in Microsoft Excel 2000 and Excel XP. Computational Statistics and Data Analysis, 40, 713–721.
  • [22] McCullough, B.D., Wilson, B. (2005). On the accuracy of statistical procedures in Microsoft Excel 2003. Computational Statistics and Data Analysis, 49, 1244–1252.
  • [23] McCullough, B.D., Heiser, D.A. (2008). On the accuracy of statistical procedures in Microsoft Excel 2007. Computational Statistics and Data Analysis, 52, 4570–4578.
  • [24] Mélard, G. (2014). On the accuracy of statistical procedures in Microsoft Excel 2010. Computational Statistics, 29, 1095–1128.
  • [25] Geeraerd, A.H., Valdramidis, V.P., van Impe, J.F. (2005). GInaFiT, a freeware tool to assess non-log-linear microbial survivor curves. International Journal of Food Microbiology, 102, 95–105.
  • [26] Baranyi, J., Roberts, T.A. (1994). A dynamic approach to predicting bacterial growth in food. International Journal of Food Microbiology, 23, 277–294.
  • [27] Johnson, M.L. (1992). Why, when, and how biochemists should use least squares. Analytical Biochemistry, 206, 215–225.
  • [28] Buzrul, S. (2021). Monte Carlo simulation in Microsoft Excel: Confidence intervals of model parameters for non-linear regression used in food sciences. Akademik Gıda, 19, 291–299.
  • [29] Press, W.H.,. Teukolsky, S.A., Vetterling, W.T., Flannery, B.P. (1989). Numerical Recipes: The Art of Scientific Computing. Cambridge University Press, New York.
  • [30] Straume, M., Johnson, M.L. (1992). Monte Carlo Method for determining complete confidence probability distributions of estimated model parameters. Methods in Enzymology, 210, 117–129.
  • [31] Öksüz, H.B., Buzrul, S. (2021). An Excel-based, user-friendly freeware tool to describe microbial growth curves: ÖK-BUZ GRoFiT. Journal of Tekirdag Agricultural Faculty, 18, 521–532
  • [32] Buzrul, S. (2024). Fen Bilimleri ve Mühendislik Uygulamalarında Deneysel Verilerin Matematik Modellerle Tanımlanması. Excel Uygulamalı Anlatım. Ankara, Türkiye, Akademisyen Kitabevi.

Biyolojik Veriler İçin MS Excel® Hesap Çizelgesi Aracı: R-BioXL

Year 2024, Volume: 22 Issue: 3, 224 - 235, 18.12.2024
https://doi.org/10.24323/akademik-gida.1603881

Abstract

Deneysel verilere matematik modelleri uydurmak için ücretsiz yazılım olarak kullanıcı dostu bir MS Excel® hesap çizelgesi aracı (R-BioXL) geliştirilmiştir. (R-BioXL https://drive.google.com/drive/folders/1GyjT3Z_CJQZu6ASb4LQBlS-ajLa_nF6X?usp=sharing bağlantısından herkese açıktır) Başlangıçta, kullanıcıların X-Y verilerini bu araca girmeleri ve model parametrelerini tanımlamaları beklenmektedir. Daha sonra model denkleminin de yine kullanıcılar tarafından girilmesi gerekmektedir. Kullanıcılar verileri (dağılım grafiği) ve model uyumunu (çizgi grafiği) girilen ilk parametre değerleri ile aynı grafik üzerinde varsayılan olarak gözlemleyebilmektedirler. Deneysel veriler ve model tahminleri arasındaki farkların karesi otomatik olarak hesaplanmaktadır. Kullanıcılar modeli verilere anında daha yakın hale getirmek için parametrelerin ilk değerlerini değiştirebilir. Excel®’in Çözücü eklentisi parametre değerlerini değiştirerek hataların karesinin toplamını en aza indirmek için kullanılmalıdır. Parametreler elde edildikten sonra son adım olarak standart hatalar (“SolverAid” makrosu kullanılarak), parametrelerin %95 ve %99 güven aralıkları, parametrelerin istatistiksel anlamlılığını belirlemek için p değerleri ve uyum iyiliği indeksleri hesaplanır. Tüm sonuçlar farklı bir Excel® çalışma sayfasına kaydedilebilir. Tüm bu prosedür, kullanıcının Excel® deneyimine bağlı olarak birkaç dakika (~3 ila 10 dakika) sürebilir. Aracın kullanımı, doğruluğu ve güvenilirliği enzim kinetiği için iki parametreli (doğrusal olmayan) Michealis-Menten denklemi, kromatografi için üç parametreli (doğrusal) van Deemter denklemi ve mikrobiyal büyüme için dört parametreli (doğrusal olmayan) modifiye Gompertz denklemi uygulanarak gösterilmiştir. Sonuç olarak, R-BioXL, Excel® bilgisi ile deneysel verileri tanımlamak için, herhangi bir programlama becerisi gerektirmeden ve diğer yazılım paketleri için ek maliyet olmadan güvenle ve serbestçe kullanılabilir.

References

  • [1] Hu, W., Xie, J., Chau, H.W., Si, B.C. (2015). Evaluation of parameter uncertainties in nonlinear regression using Microsoft Excel Spreadsheet. Environmental Systems Research, 4, 4.
  • [2] Serment-Moreno, V. (2021). Microbial Modeling Needs for the Nonthermal Processing of Foods. Food Engineering Reviews, 13, 465–489.
  • [3] Leylak, C., Yurdakul, M., Buzrul, S. (2020). Use of Excel in food science 1: Linear regression. Food and Health, 6, 186–198.
  • [4] Brown, A.M. (2001). A step-by-step guide to non-linear regression analysis of experimental data using a Microsoft Excel spreadsheet. Computer Methods and Programs Biomedicine, 65, 191–200.
  • [5] Kemmer, G., Keller, S. (2010). Nonlinear least-squares data fitting in Excel spreadsheets. Nature Protocols, 5, 267–281.
  • [6] Yurdakul, M., Leylak, C., Buzrul, S. (2020). Use of Excel in food science 2: Non-linear regression. Food and Health, 6, 199–212.
  • [7] van Boekel, M.A.J.S. (2022). Kinetics of heat-induced changes in dairy products: Developments in data analysis and modelling techniques. International Dairy Journal, 126, 105187.
  • [8] van Boekel, M.A.J.S. (1996). Statistical aspects of kinetic modeling for food science problems. Journal of Food Science, 61, 477–485.
  • [9] de Levie, R. (2004). Advanced Excel for scientific data analysis. New York, USA, Oxford University Press.
  • [10] Chase, A.M., von Meier, H.C., Menna, V.J. (1962). The non-competitive inhibition and irreversible inactivation of yeast. Journal of Cellular and Comparative Physiology, 59, 1–13.
  • [11] van Boekel, M.A.J.S. (2008). Kinetic Modeling of Reactions in Foods. Boca Raton, CRC Press.
  • [12] Moody, H.W. (1982). The evaluation of the parameters in the van Deemter equation. Journal of Chemical Education, 59, 290–291.
  • [13] Lambert, R.J.W., Mytilinaios, I., Maitland, L., Brown, A.M. (2012). Monte Carlo simulation of parameter confidence intervals for non-linear regression analysis of biological data using Microsoft Excel. Computer Methods and Programs Biomedicine, 107, 155–163.
  • [14] Zwietering, M.H., Jongenburger, I., Rombouts, F.M., Van’t Riet, K. (1990). Modeling of the bacterial growth curve. Applied Environmental Microbiology, 56, 1875–1881.
  • [15] Alcantara, I.M., Naranjo, J., Lang, Y. (2022). Model selection using PRESS statistic. Computational Statistics, 38, 285–298.
  • [16] Öksüz, H.B., Buzrul, S. (2020). Monte Carlo analysis for microbial growth curves. Journal of Microbiology, Biotechnology and Food Sciences, 10, 418–423.
  • [17] de Levie R (2012). Collinearity in least-squares analysis. Journal of Chemical Education, 89, 68–78.
  • [18] de Levie R (2012). Nonisothermal analysis of solution kinetics by spreadsheet simulation. Journal of Chemical Education, 89, 79–86.
  • [19] Bergtold, J,S,, Pokharel, K.P., Featherstone, A.M., Mo, L. (2018). On the examination of the reliability of statistical software for estimating regression models with discrete dependent variables. Computational Statistics, 33, 757–786.
  • [20] McCullough, B.D., Wilson, B. (1999). On the accuracy of statistical procedures in Microsoft Excel 97. Computational Statistics and Data Analysis, 31, 27–37.
  • [21] McCullough, B.D., Wilson, B. (2000). On the accuracy of statistical procedures in Microsoft Excel 2000 and Excel XP. Computational Statistics and Data Analysis, 40, 713–721.
  • [22] McCullough, B.D., Wilson, B. (2005). On the accuracy of statistical procedures in Microsoft Excel 2003. Computational Statistics and Data Analysis, 49, 1244–1252.
  • [23] McCullough, B.D., Heiser, D.A. (2008). On the accuracy of statistical procedures in Microsoft Excel 2007. Computational Statistics and Data Analysis, 52, 4570–4578.
  • [24] Mélard, G. (2014). On the accuracy of statistical procedures in Microsoft Excel 2010. Computational Statistics, 29, 1095–1128.
  • [25] Geeraerd, A.H., Valdramidis, V.P., van Impe, J.F. (2005). GInaFiT, a freeware tool to assess non-log-linear microbial survivor curves. International Journal of Food Microbiology, 102, 95–105.
  • [26] Baranyi, J., Roberts, T.A. (1994). A dynamic approach to predicting bacterial growth in food. International Journal of Food Microbiology, 23, 277–294.
  • [27] Johnson, M.L. (1992). Why, when, and how biochemists should use least squares. Analytical Biochemistry, 206, 215–225.
  • [28] Buzrul, S. (2021). Monte Carlo simulation in Microsoft Excel: Confidence intervals of model parameters for non-linear regression used in food sciences. Akademik Gıda, 19, 291–299.
  • [29] Press, W.H.,. Teukolsky, S.A., Vetterling, W.T., Flannery, B.P. (1989). Numerical Recipes: The Art of Scientific Computing. Cambridge University Press, New York.
  • [30] Straume, M., Johnson, M.L. (1992). Monte Carlo Method for determining complete confidence probability distributions of estimated model parameters. Methods in Enzymology, 210, 117–129.
  • [31] Öksüz, H.B., Buzrul, S. (2021). An Excel-based, user-friendly freeware tool to describe microbial growth curves: ÖK-BUZ GRoFiT. Journal of Tekirdag Agricultural Faculty, 18, 521–532
  • [32] Buzrul, S. (2024). Fen Bilimleri ve Mühendislik Uygulamalarında Deneysel Verilerin Matematik Modellerle Tanımlanması. Excel Uygulamalı Anlatım. Ankara, Türkiye, Akademisyen Kitabevi.
There are 32 citations in total.

Details

Primary Language English
Subjects Food Engineering
Journal Section Research Papers
Authors

Hasan Basri Öksüz This is me 0000-0001-5740-8793

Sencer Buzrul 0000-0003-2272-3827

Publication Date December 18, 2024
Submission Date May 31, 2024
Acceptance Date December 6, 2024
Published in Issue Year 2024 Volume: 22 Issue: 3

Cite

APA Öksüz, H. B., & Buzrul, S. (2024). Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL. Akademik Gıda, 22(3), 224-235. https://doi.org/10.24323/akademik-gida.1603881
AMA Öksüz HB, Buzrul S. Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL. Akademik Gıda. December 2024;22(3):224-235. doi:10.24323/akademik-gida.1603881
Chicago Öksüz, Hasan Basri, and Sencer Buzrul. “Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL”. Akademik Gıda 22, no. 3 (December 2024): 224-35. https://doi.org/10.24323/akademik-gida.1603881.
EndNote Öksüz HB, Buzrul S (December 1, 2024) Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL. Akademik Gıda 22 3 224–235.
IEEE H. B. Öksüz and S. Buzrul, “Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL”, Akademik Gıda, vol. 22, no. 3, pp. 224–235, 2024, doi: 10.24323/akademik-gida.1603881.
ISNAD Öksüz, Hasan Basri - Buzrul, Sencer. “Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL”. Akademik Gıda 22/3 (December2024), 224-235. https://doi.org/10.24323/akademik-gida.1603881.
JAMA Öksüz HB, Buzrul S. Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL. Akademik Gıda. 2024;22:224–235.
MLA Öksüz, Hasan Basri and Sencer Buzrul. “Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL”. Akademik Gıda, vol. 22, no. 3, 2024, pp. 224-35, doi:10.24323/akademik-gida.1603881.
Vancouver Öksüz HB, Buzrul S. Regression Tool in MS Excel® Spreadsheets for Biological Data: R-BioXL. Akademik Gıda. 2024;22(3):224-35.