Research Article
BibTex RIS Cite
Year 2019, Volume: 48 Issue: 1, 242 - 254, 01.02.2019

Abstract

References

  • Aitchison, J. The statistical analysis of compositional data, Monographs on Statistics and Applied Probability, Chapman & Hall, London, 1986.
  • Aitchison, J., Barcelo-Vidal, C., Martin-Fernandez, J.A. and Pawlowsky-Glahn, V. Logratio analysis and compositional distance, Mathematical Geosciences 32 (3), 271-275, 2000.
  • Billheimer, D., Guttorp, P. and Fagan, W.F. Statistical interpretation of species composition, Journal of the American Statistical Association 96 (456), 1205-1214, 2001.
  • Chacon, J.E., Mateu-Figueras, G. and Martin-Fernandez, J.A. Gaussian kernels for density estimation with compositional data, Computers & Geosciences 37 (5), 702-711, 2011.
  • Egozcue, J.J. and Pawlowsky-Glahn, V. Groups of parts and their balances in compositional data analysis, Mathematical Geosciences 37 (7), 795-828, 2005.
  • Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. and Barcelo-Vidal, C. Isometric logratio transformations for compositional data analysis, Mathematical Geosciences 35 (3), 279-300, 2003.
  • Filzmoser, P. Stat{DA}: statistical analysis for environmental data, R package version 1.6.3, 2011.
  • Martin-Fernandez, J.A., Barcelo-Vidal, C. and Pawlowsky-Glahn, V. Dealing with zeros and missing values in compositional data sets using nonparametric imputation, Mathematical Geosciences 35 (3), 253-278, 2003.
  • Mart\'{\i}n-Fernandez, J.A., Palarea-Albaladejo, J. and Olea, R.A. Dealing with zeros, in Compositional Data Analysis: Theory and Applications, V. Pawlowsky-Glahn and A. Buccianti, eds., John Wiley \& Sons Ltd., Chichester, 47-62, 2011.
  • Martin-Fernandez, J.A., Hron, K., Templ, M., Filzmoser, P. and Palarea-Albaladejo, J. Model-based replacement of rounded zeros in compositional data: classical and robust approaches, Computational Statistics & Data Analysis 56 (9), 2688-2704, 2012.
  • Mateu-Figueras, G., Pawlowsky-Glahn, V. and Egozcue, J.J. The normal distribution in some constrained sample spaces, Sort-statistics and Operations Research Transactions 37 (1), 29-56, 2008.
  • Palarea-Albaladejo J. and Martin-Fernandez, J.A. A modified EM alr-algorithm for replacing rounded zeros in compositional data sets, Computers & Geosciences 34 (8), 902-917, 2008.
  • Palarea-Albaladejo, J. and Martin-Fernandez, J.A. Values below detection limit in compositional chemical data, Analytica Chimica Acta 764, 32-43, 2013.
  • Palarea-Albaladejo, J. and Martin-Fernandez, J.A. zCompositions-R package for multivariate imputation of left-censored data under a compositional approach, Chemometrics and Intelligent Laboratory Systems 143, 85-96, 2015.
  • Palarea-Albaladejo, J., Martin-Fernandez, J.A. and Gomez-Garcia, J. A parametric approach for dealing with compositional rounded zeros, Mathematical Geosciences 39 (7), 625-645, 2007.
  • Pawlowsky-Glahn, V. and Buccianti, A. Compositional Data Analysis: Theory and Applications}, John Wiley & Sons Ltd., Chichester, 2011.
  • Pawlowsky-Glahn, V. and Egozcue, J.J. Geometric approach to statistical analysis on the simplex, Stochastic Environmental Research and Risk Assessment 15 (5), 384-398, 2001.
  • Pawlowsky-Glahn, V., Egozcue, J.J. and Tolosana-Delgado, R. Modeling and analysis of compositional data, Statistics in Practice, John Wiley & Sons, Ltd., Chichester, 2015.
  • Rizzo, M.L. and Szekely, G.J. Energy: E-statistics (energy statistics), R package version 1.1-0, 2008.
  • Silverman, B.W. Density estimation for statistics and data analysis, Chapman & Hall, London, 1986.
  • van den Boogaart, K.G. and Tolosana-Delgado, R. Analyzing compositional data with R, Springer, Heidelberg, 2013.

A kernel density approach for replacing rounded zeros in compositional data sets

Year 2019, Volume: 48 Issue: 1, 242 - 254, 01.02.2019

Abstract

The logratio methodology widely used in compositional data analysis is not applicable when some components have rounded zeros. There are many univariate and multivariate methods that have been used to deal with rounded zeros. However, both of them have restrictions: the univariate methods replaced the rounded zeros only using the information of the corresponding component; the multivariate methods need to assume the distribution of transformed data. When the form of the distribution function is unknown, a multivariate nonparametric replacement approach is proposed in this paper. The proposed method uses conditional expected value based on isometric logratio coordinates to replace rounded zeros, in which the conditional density is estimated through multivariate Gauss kernel function. The permutation invariance and invariance under change of orthonormal basis are also presented. Simulation studies show that the proposed method has better performance than previous methods as the percentage of rounded zeros increases. The proposed method is also applied on the moss data from the Kola project.

References

  • Aitchison, J. The statistical analysis of compositional data, Monographs on Statistics and Applied Probability, Chapman & Hall, London, 1986.
  • Aitchison, J., Barcelo-Vidal, C., Martin-Fernandez, J.A. and Pawlowsky-Glahn, V. Logratio analysis and compositional distance, Mathematical Geosciences 32 (3), 271-275, 2000.
  • Billheimer, D., Guttorp, P. and Fagan, W.F. Statistical interpretation of species composition, Journal of the American Statistical Association 96 (456), 1205-1214, 2001.
  • Chacon, J.E., Mateu-Figueras, G. and Martin-Fernandez, J.A. Gaussian kernels for density estimation with compositional data, Computers & Geosciences 37 (5), 702-711, 2011.
  • Egozcue, J.J. and Pawlowsky-Glahn, V. Groups of parts and their balances in compositional data analysis, Mathematical Geosciences 37 (7), 795-828, 2005.
  • Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. and Barcelo-Vidal, C. Isometric logratio transformations for compositional data analysis, Mathematical Geosciences 35 (3), 279-300, 2003.
  • Filzmoser, P. Stat{DA}: statistical analysis for environmental data, R package version 1.6.3, 2011.
  • Martin-Fernandez, J.A., Barcelo-Vidal, C. and Pawlowsky-Glahn, V. Dealing with zeros and missing values in compositional data sets using nonparametric imputation, Mathematical Geosciences 35 (3), 253-278, 2003.
  • Mart\'{\i}n-Fernandez, J.A., Palarea-Albaladejo, J. and Olea, R.A. Dealing with zeros, in Compositional Data Analysis: Theory and Applications, V. Pawlowsky-Glahn and A. Buccianti, eds., John Wiley \& Sons Ltd., Chichester, 47-62, 2011.
  • Martin-Fernandez, J.A., Hron, K., Templ, M., Filzmoser, P. and Palarea-Albaladejo, J. Model-based replacement of rounded zeros in compositional data: classical and robust approaches, Computational Statistics & Data Analysis 56 (9), 2688-2704, 2012.
  • Mateu-Figueras, G., Pawlowsky-Glahn, V. and Egozcue, J.J. The normal distribution in some constrained sample spaces, Sort-statistics and Operations Research Transactions 37 (1), 29-56, 2008.
  • Palarea-Albaladejo J. and Martin-Fernandez, J.A. A modified EM alr-algorithm for replacing rounded zeros in compositional data sets, Computers & Geosciences 34 (8), 902-917, 2008.
  • Palarea-Albaladejo, J. and Martin-Fernandez, J.A. Values below detection limit in compositional chemical data, Analytica Chimica Acta 764, 32-43, 2013.
  • Palarea-Albaladejo, J. and Martin-Fernandez, J.A. zCompositions-R package for multivariate imputation of left-censored data under a compositional approach, Chemometrics and Intelligent Laboratory Systems 143, 85-96, 2015.
  • Palarea-Albaladejo, J., Martin-Fernandez, J.A. and Gomez-Garcia, J. A parametric approach for dealing with compositional rounded zeros, Mathematical Geosciences 39 (7), 625-645, 2007.
  • Pawlowsky-Glahn, V. and Buccianti, A. Compositional Data Analysis: Theory and Applications}, John Wiley & Sons Ltd., Chichester, 2011.
  • Pawlowsky-Glahn, V. and Egozcue, J.J. Geometric approach to statistical analysis on the simplex, Stochastic Environmental Research and Risk Assessment 15 (5), 384-398, 2001.
  • Pawlowsky-Glahn, V., Egozcue, J.J. and Tolosana-Delgado, R. Modeling and analysis of compositional data, Statistics in Practice, John Wiley & Sons, Ltd., Chichester, 2015.
  • Rizzo, M.L. and Szekely, G.J. Energy: E-statistics (energy statistics), R package version 1.1-0, 2008.
  • Silverman, B.W. Density estimation for statistics and data analysis, Chapman & Hall, London, 1986.
  • van den Boogaart, K.G. and Tolosana-Delgado, R. Analyzing compositional data with R, Springer, Heidelberg, 2013.
There are 21 citations in total.

Details

Primary Language English
Subjects Statistics
Journal Section Statistics
Authors

Jiajia Chen

Xiaoqin Zhang This is me

Shengjia Li This is me

Publication Date February 1, 2019
Published in Issue Year 2019 Volume: 48 Issue: 1

Cite

APA Chen, J., Zhang, X., & Li, S. (2019). A kernel density approach for replacing rounded zeros in compositional data sets. Hacettepe Journal of Mathematics and Statistics, 48(1), 242-254.
AMA Chen J, Zhang X, Li S. A kernel density approach for replacing rounded zeros in compositional data sets. Hacettepe Journal of Mathematics and Statistics. February 2019;48(1):242-254.
Chicago Chen, Jiajia, Xiaoqin Zhang, and Shengjia Li. “A Kernel Density Approach for Replacing Rounded Zeros in Compositional Data Sets”. Hacettepe Journal of Mathematics and Statistics 48, no. 1 (February 2019): 242-54.
EndNote Chen J, Zhang X, Li S (February 1, 2019) A kernel density approach for replacing rounded zeros in compositional data sets. Hacettepe Journal of Mathematics and Statistics 48 1 242–254.
IEEE J. Chen, X. Zhang, and S. Li, “A kernel density approach for replacing rounded zeros in compositional data sets”, Hacettepe Journal of Mathematics and Statistics, vol. 48, no. 1, pp. 242–254, 2019.
ISNAD Chen, Jiajia et al. “A Kernel Density Approach for Replacing Rounded Zeros in Compositional Data Sets”. Hacettepe Journal of Mathematics and Statistics 48/1 (February 2019), 242-254.
JAMA Chen J, Zhang X, Li S. A kernel density approach for replacing rounded zeros in compositional data sets. Hacettepe Journal of Mathematics and Statistics. 2019;48:242–254.
MLA Chen, Jiajia et al. “A Kernel Density Approach for Replacing Rounded Zeros in Compositional Data Sets”. Hacettepe Journal of Mathematics and Statistics, vol. 48, no. 1, 2019, pp. 242-54.
Vancouver Chen J, Zhang X, Li S. A kernel density approach for replacing rounded zeros in compositional data sets. Hacettepe Journal of Mathematics and Statistics. 2019;48(1):242-54.