Year 2019,
Volume: 48 Issue: 1, 242 - 254, 01.02.2019
Jiajia Chen
,
Xiaoqin Zhang
Shengjia Li
References
- Aitchison, J. The statistical analysis of compositional data, Monographs
on Statistics and Applied Probability, Chapman & Hall, London, 1986.
- Aitchison, J., Barcelo-Vidal, C., Martin-Fernandez, J.A. and
Pawlowsky-Glahn, V. Logratio analysis and compositional distance, Mathematical Geosciences 32 (3), 271-275, 2000.
- Billheimer, D., Guttorp, P. and Fagan, W.F. Statistical interpretation of
species composition, Journal of the American Statistical Association 96 (456), 1205-1214, 2001.
- Chacon, J.E., Mateu-Figueras, G. and Martin-Fernandez, J.A.
Gaussian kernels for density estimation with compositional data,
Computers & Geosciences 37 (5), 702-711, 2011.
- Egozcue, J.J. and Pawlowsky-Glahn, V. Groups of parts and their balances
in compositional data analysis, Mathematical Geosciences 37 (7), 795-828, 2005.
- Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. and Barcelo-Vidal, C.
Isometric logratio transformations for compositional data analysis,
Mathematical Geosciences 35 (3), 279-300, 2003.
- Filzmoser, P. Stat{DA}: statistical analysis for environmental data, R
package version 1.6.3, 2011.
- Martin-Fernandez, J.A., Barcelo-Vidal, C. and Pawlowsky-Glahn, V.
Dealing with zeros and missing values in compositional data sets using
nonparametric imputation, Mathematical Geosciences 35 (3), 253-278, 2003.
- Mart\'{\i}n-Fernandez, J.A., Palarea-Albaladejo, J. and Olea, R.A.
Dealing with zeros, in Compositional Data Analysis: Theory
and Applications, V. Pawlowsky-Glahn and A. Buccianti, eds., John Wiley \&
Sons Ltd., Chichester, 47-62, 2011.
- Martin-Fernandez, J.A., Hron, K., Templ, M., Filzmoser, P. and
Palarea-Albaladejo, J. Model-based replacement of rounded zeros in
compositional data: classical and robust approaches, Computational Statistics & Data Analysis 56 (9), 2688-2704, 2012.
- Mateu-Figueras, G., Pawlowsky-Glahn, V. and Egozcue, J.J. The normal
distribution in some constrained sample spaces, Sort-statistics and Operations Research Transactions 37 (1), 29-56, 2008.
- Palarea-Albaladejo J. and Martin-Fernandez, J.A. A modified EM
alr-algorithm for replacing rounded zeros in compositional data sets,
Computers & Geosciences 34 (8), 902-917, 2008.
- Palarea-Albaladejo, J. and Martin-Fernandez, J.A. Values below
detection limit in compositional chemical data, Analytica Chimica Acta 764, 32-43, 2013.
- Palarea-Albaladejo, J. and Martin-Fernandez, J.A.
zCompositions-R package for multivariate imputation of
left-censored data under a compositional approach, Chemometrics and Intelligent Laboratory Systems 143,
85-96, 2015.
- Palarea-Albaladejo, J., Martin-Fernandez, J.A. and
Gomez-Garcia, J. A parametric approach for dealing with
compositional rounded zeros, Mathematical Geosciences 39 (7), 625-645, 2007.
- Pawlowsky-Glahn, V. and Buccianti, A. Compositional Data Analysis:
Theory and Applications}, John Wiley & Sons Ltd., Chichester, 2011.
- Pawlowsky-Glahn, V. and Egozcue, J.J. Geometric approach to statistical
analysis on the simplex, Stochastic Environmental Research and Risk Assessment 15 (5), 384-398, 2001.
- Pawlowsky-Glahn, V., Egozcue, J.J. and Tolosana-Delgado, R. Modeling and
analysis of compositional data, Statistics in Practice, John Wiley & Sons,
Ltd., Chichester, 2015.
- Rizzo, M.L. and Szekely, G.J. Energy: E-statistics (energy
statistics), R package version 1.1-0, 2008.
- Silverman, B.W. Density estimation for statistics and data analysis,
Chapman & Hall, London, 1986.
- van den Boogaart, K.G. and Tolosana-Delgado, R. Analyzing compositional
data with R, Springer, Heidelberg, 2013.
A kernel density approach for replacing rounded zeros in compositional data sets
Year 2019,
Volume: 48 Issue: 1, 242 - 254, 01.02.2019
Jiajia Chen
,
Xiaoqin Zhang
Shengjia Li
Abstract
The logratio methodology widely used in compositional data analysis is not applicable when some components have rounded zeros. There are many univariate and multivariate methods that have been used to deal with rounded zeros. However, both of them have restrictions: the univariate methods replaced the rounded zeros only using the information of the corresponding component; the multivariate methods need to assume the distribution of transformed data. When the form of the distribution function is unknown, a multivariate nonparametric replacement approach is proposed in this paper. The proposed method uses conditional expected value based on isometric logratio coordinates to replace rounded zeros, in which the conditional density is estimated through multivariate Gauss kernel function. The permutation invariance and invariance under change of orthonormal basis are also presented. Simulation studies show that the proposed method has better performance than previous methods as the percentage of rounded zeros increases. The proposed method is also applied on the moss data from the Kola project.
References
- Aitchison, J. The statistical analysis of compositional data, Monographs
on Statistics and Applied Probability, Chapman & Hall, London, 1986.
- Aitchison, J., Barcelo-Vidal, C., Martin-Fernandez, J.A. and
Pawlowsky-Glahn, V. Logratio analysis and compositional distance, Mathematical Geosciences 32 (3), 271-275, 2000.
- Billheimer, D., Guttorp, P. and Fagan, W.F. Statistical interpretation of
species composition, Journal of the American Statistical Association 96 (456), 1205-1214, 2001.
- Chacon, J.E., Mateu-Figueras, G. and Martin-Fernandez, J.A.
Gaussian kernels for density estimation with compositional data,
Computers & Geosciences 37 (5), 702-711, 2011.
- Egozcue, J.J. and Pawlowsky-Glahn, V. Groups of parts and their balances
in compositional data analysis, Mathematical Geosciences 37 (7), 795-828, 2005.
- Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. and Barcelo-Vidal, C.
Isometric logratio transformations for compositional data analysis,
Mathematical Geosciences 35 (3), 279-300, 2003.
- Filzmoser, P. Stat{DA}: statistical analysis for environmental data, R
package version 1.6.3, 2011.
- Martin-Fernandez, J.A., Barcelo-Vidal, C. and Pawlowsky-Glahn, V.
Dealing with zeros and missing values in compositional data sets using
nonparametric imputation, Mathematical Geosciences 35 (3), 253-278, 2003.
- Mart\'{\i}n-Fernandez, J.A., Palarea-Albaladejo, J. and Olea, R.A.
Dealing with zeros, in Compositional Data Analysis: Theory
and Applications, V. Pawlowsky-Glahn and A. Buccianti, eds., John Wiley \&
Sons Ltd., Chichester, 47-62, 2011.
- Martin-Fernandez, J.A., Hron, K., Templ, M., Filzmoser, P. and
Palarea-Albaladejo, J. Model-based replacement of rounded zeros in
compositional data: classical and robust approaches, Computational Statistics & Data Analysis 56 (9), 2688-2704, 2012.
- Mateu-Figueras, G., Pawlowsky-Glahn, V. and Egozcue, J.J. The normal
distribution in some constrained sample spaces, Sort-statistics and Operations Research Transactions 37 (1), 29-56, 2008.
- Palarea-Albaladejo J. and Martin-Fernandez, J.A. A modified EM
alr-algorithm for replacing rounded zeros in compositional data sets,
Computers & Geosciences 34 (8), 902-917, 2008.
- Palarea-Albaladejo, J. and Martin-Fernandez, J.A. Values below
detection limit in compositional chemical data, Analytica Chimica Acta 764, 32-43, 2013.
- Palarea-Albaladejo, J. and Martin-Fernandez, J.A.
zCompositions-R package for multivariate imputation of
left-censored data under a compositional approach, Chemometrics and Intelligent Laboratory Systems 143,
85-96, 2015.
- Palarea-Albaladejo, J., Martin-Fernandez, J.A. and
Gomez-Garcia, J. A parametric approach for dealing with
compositional rounded zeros, Mathematical Geosciences 39 (7), 625-645, 2007.
- Pawlowsky-Glahn, V. and Buccianti, A. Compositional Data Analysis:
Theory and Applications}, John Wiley & Sons Ltd., Chichester, 2011.
- Pawlowsky-Glahn, V. and Egozcue, J.J. Geometric approach to statistical
analysis on the simplex, Stochastic Environmental Research and Risk Assessment 15 (5), 384-398, 2001.
- Pawlowsky-Glahn, V., Egozcue, J.J. and Tolosana-Delgado, R. Modeling and
analysis of compositional data, Statistics in Practice, John Wiley & Sons,
Ltd., Chichester, 2015.
- Rizzo, M.L. and Szekely, G.J. Energy: E-statistics (energy
statistics), R package version 1.1-0, 2008.
- Silverman, B.W. Density estimation for statistics and data analysis,
Chapman & Hall, London, 1986.
- van den Boogaart, K.G. and Tolosana-Delgado, R. Analyzing compositional
data with R, Springer, Heidelberg, 2013.