Research Article
BibTex RIS Cite
Year 2018, Volume: 47 Issue: 5, 1335 - 1347, 16.10.2018

Abstract

References

  • Arazia, A., Ben-Jacob, E.B. and Yechiali, U. Bridging genetic networks and queueing theory, Physica A 332, 585-616, 2004.
  • Barriga, G.D.C. and Louzada, F. The zero-inflated Conway-Maxwell-Poisson distribution: Bayesian inference, regression modeling and influence diagnostic, Statistical Methodology 21, 23-34, 2014.
  • Chatla, S.B. and Shmueli, G. An efficient estimation of Conway-Maxwell-Poisson regession and additive model with an application to bike sharing, arXiv:1610.08244v2, 2016.
  • Conway, R.W. and Maxwell, W.L. A queuing model with state dependent service rates, Journal of Industrial Engineering 12, 132-136, 1962.
  • Elgart, V., Jia, T., and Kulkarni, R.V. Applications of Little's law to stochastic models of gene expression, Physical Review E 82, 021901(1-6), 2010.
  • Fraser D. and Kærn M. A chance at survival: gene expression noise and phenotypic diver- sification strategies, Molecular Microbiology 71, 1333-1340, 2009.
  • Gaunt, R.E., Iyengar, S., Olde Daalhuis, A.B., and Simsek, B. An asymptotic expansion for the normalizing constant of Conway-Maxwell-Poisson distribution, arXiv:1612:06618, 2016.
  • Golding, I., Paulsson, J., and Cox, E.C. Real-time kinetics of gene activity in individual bacteria, Cell 123, 1025-1036, 2005.
  • Guikema, S.D. and Goffelt, J.P. A flexible count data regression model for risk analysis, Risk Analysis 28, 213-223, 2008.
  • Imoto, K. A generalized Conway-Maxwell-Poisson distribution which includes the negative binomial distribution, Applied Mathematics and Computation 247, 824-834, 2014.
  • Lord, D., Guikema, S.D., and Geedipally, S.R. Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes, Accident Analysis & Prevention 40, 1123-1134, 2008.
  • Minka, T., Shmueli, G., Kadane, J.B., Borle, S., and Boadwright, P. Computing with the COM-Poisson distribution, Technical Report 776, Statistics Department, Carnegie Mellon University, 2003.
  • Mitarai N, Semsey A, Sneppen K. (2015) Dynamic competition between transcription initiation and repression: Role of non-equilibrium steps in cell to cell heterogeneity. arXiv: 1502.03011v3.
  • Peccoud, J. and Ycart, B. Markovian modeling of gene-product synthesis, Theoretical Population Biology 48, 222-234, 1995.
  • Pogany T.K. Integral form of the COM-Poisson renormalization constant, Statistics & Probability Letters 119, 144-145, 2016.
  • Raj, A., Peskin, C.S., Tranchina, D., Vargas, D.Y., and Tyagi, S. Stochastic mRNA synthesis in mammalian cells, PLOS Biology 4, 1707-1719, 2006.
  • Sellers, K.F., Borle, S., and Shmueli, G. The COM-Poisson model for count data: a survey of methods and applications, Applied Stochastic Models in Business and Industry 28, 104- 116, 2012.
  • Sellers, K.F. and Raim, A.M. A flexible zero-inflated model to address data dispersion, Computational Statistics and Data Analysis 99, 68-80, 2016.
  • Sellers, K.F. and Shmueli, G. A flexible regression model for count data, Annals of Applied Statistics 4, 943-961, 2010.
  • Shahrezaei, V. and Swain, P.S. Analytical distributions for stochastic gene expression, Proceedings of the National Academy of Sciences 45, 17256-17261, 2008.
  • Shmueli, G., Minka, T., Kadane, J.B., Borle, S., and Boatwright, P. A useful distribution for fitting discrete data: revival of the Conway-Maxwell-Poisson distribution, Journal of the Royal Statistical Society C 54, 127-142, 2005.
  • Simsek B. Applications of Point Process Models to Imaging and Biology, PhD Dissertation, Statistics Department, University of Pittsburgh, 2016.
  • Simsek, B. and Iyengar, S. Approximating the the Conway-Maxwell-Poisson normalizing constant, Filomat 30, 953-960, 2016.
  • So, L., Ghosh, A., Zong, C., Sepulveda, L.A., Segev R., and Golding I. General properties of transcriptional time series in Escherichia coli, Nature Genetics 43, 554-560, 2011.
  • Trcek, T., Chao, J.A., Larson, D.R., Park, H.Y., Zenklusen, D., Shenoy, S.M., and Singer,R.H. Single-mRNA counting using fluorescent in situ hybridization in budding yeast, NatureProtocols 7, 408-419, 2012.
  • Vuong, Q.H. Likelihood ratio tests for model selection and non-nested hypotheses, Econometrica. 57, 307-333, 1989.
  • Zenklusen, D., Larson, D.R., and Singer, R.H. Single-RNA counting reveals alternative modes of gene expression in yeast, Natural Structural & Molecular Biology 15, 12631271, 2008.
  • Zhang, H., Pounds, S.B., and Tang, L. Statistical methods for overdispersion in mRNA-Seq count data, Open Bioinformatics Journal 7, 34-40, 2013.

Overdispersed count models for mRNA transcription

Year 2018, Volume: 47 Issue: 5, 1335 - 1347, 16.10.2018

Abstract

Direct detection of gene activity is often not possible because new proteins from an individual activation event are masked by proteins remaining from previous events. Thus, researchers determine gene activation or inactivation by observing messenger RNA (mRNA) production instead. Typically, mRNA transcription occurs in short rapid bursts when the gene is in its on-state, and no transcriptions during its offstate. This burstiness of mRNA production is not well modeled by a Poisson process. We propose the Conway-Maxwell-Poisson (COM- Poisson) distribution as a potential alternative to the more common negative binomial (NB) distribution. We use the generalized linear model version of these models to incorporate covariate information. We also consider zero inflation to model excess zero counts. We use data from E. coli bacteria and mammalian cells to illustrate our proposed methods. We find that when there is a biophysically derived distribution, this distribution performs well. We also show that in the absence of such biophysical knowledge, the COM-Poisson is competitive with the NB. Both the COM-Poisson and NB arise in queueing theory, suggesting that further application of that framework to study mRNA dynamics would be useful.

References

  • Arazia, A., Ben-Jacob, E.B. and Yechiali, U. Bridging genetic networks and queueing theory, Physica A 332, 585-616, 2004.
  • Barriga, G.D.C. and Louzada, F. The zero-inflated Conway-Maxwell-Poisson distribution: Bayesian inference, regression modeling and influence diagnostic, Statistical Methodology 21, 23-34, 2014.
  • Chatla, S.B. and Shmueli, G. An efficient estimation of Conway-Maxwell-Poisson regession and additive model with an application to bike sharing, arXiv:1610.08244v2, 2016.
  • Conway, R.W. and Maxwell, W.L. A queuing model with state dependent service rates, Journal of Industrial Engineering 12, 132-136, 1962.
  • Elgart, V., Jia, T., and Kulkarni, R.V. Applications of Little's law to stochastic models of gene expression, Physical Review E 82, 021901(1-6), 2010.
  • Fraser D. and Kærn M. A chance at survival: gene expression noise and phenotypic diver- sification strategies, Molecular Microbiology 71, 1333-1340, 2009.
  • Gaunt, R.E., Iyengar, S., Olde Daalhuis, A.B., and Simsek, B. An asymptotic expansion for the normalizing constant of Conway-Maxwell-Poisson distribution, arXiv:1612:06618, 2016.
  • Golding, I., Paulsson, J., and Cox, E.C. Real-time kinetics of gene activity in individual bacteria, Cell 123, 1025-1036, 2005.
  • Guikema, S.D. and Goffelt, J.P. A flexible count data regression model for risk analysis, Risk Analysis 28, 213-223, 2008.
  • Imoto, K. A generalized Conway-Maxwell-Poisson distribution which includes the negative binomial distribution, Applied Mathematics and Computation 247, 824-834, 2014.
  • Lord, D., Guikema, S.D., and Geedipally, S.R. Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes, Accident Analysis & Prevention 40, 1123-1134, 2008.
  • Minka, T., Shmueli, G., Kadane, J.B., Borle, S., and Boadwright, P. Computing with the COM-Poisson distribution, Technical Report 776, Statistics Department, Carnegie Mellon University, 2003.
  • Mitarai N, Semsey A, Sneppen K. (2015) Dynamic competition between transcription initiation and repression: Role of non-equilibrium steps in cell to cell heterogeneity. arXiv: 1502.03011v3.
  • Peccoud, J. and Ycart, B. Markovian modeling of gene-product synthesis, Theoretical Population Biology 48, 222-234, 1995.
  • Pogany T.K. Integral form of the COM-Poisson renormalization constant, Statistics & Probability Letters 119, 144-145, 2016.
  • Raj, A., Peskin, C.S., Tranchina, D., Vargas, D.Y., and Tyagi, S. Stochastic mRNA synthesis in mammalian cells, PLOS Biology 4, 1707-1719, 2006.
  • Sellers, K.F., Borle, S., and Shmueli, G. The COM-Poisson model for count data: a survey of methods and applications, Applied Stochastic Models in Business and Industry 28, 104- 116, 2012.
  • Sellers, K.F. and Raim, A.M. A flexible zero-inflated model to address data dispersion, Computational Statistics and Data Analysis 99, 68-80, 2016.
  • Sellers, K.F. and Shmueli, G. A flexible regression model for count data, Annals of Applied Statistics 4, 943-961, 2010.
  • Shahrezaei, V. and Swain, P.S. Analytical distributions for stochastic gene expression, Proceedings of the National Academy of Sciences 45, 17256-17261, 2008.
  • Shmueli, G., Minka, T., Kadane, J.B., Borle, S., and Boatwright, P. A useful distribution for fitting discrete data: revival of the Conway-Maxwell-Poisson distribution, Journal of the Royal Statistical Society C 54, 127-142, 2005.
  • Simsek B. Applications of Point Process Models to Imaging and Biology, PhD Dissertation, Statistics Department, University of Pittsburgh, 2016.
  • Simsek, B. and Iyengar, S. Approximating the the Conway-Maxwell-Poisson normalizing constant, Filomat 30, 953-960, 2016.
  • So, L., Ghosh, A., Zong, C., Sepulveda, L.A., Segev R., and Golding I. General properties of transcriptional time series in Escherichia coli, Nature Genetics 43, 554-560, 2011.
  • Trcek, T., Chao, J.A., Larson, D.R., Park, H.Y., Zenklusen, D., Shenoy, S.M., and Singer,R.H. Single-mRNA counting using fluorescent in situ hybridization in budding yeast, NatureProtocols 7, 408-419, 2012.
  • Vuong, Q.H. Likelihood ratio tests for model selection and non-nested hypotheses, Econometrica. 57, 307-333, 1989.
  • Zenklusen, D., Larson, D.R., and Singer, R.H. Single-RNA counting reveals alternative modes of gene expression in yeast, Natural Structural & Molecular Biology 15, 12631271, 2008.
  • Zhang, H., Pounds, S.B., and Tang, L. Statistical methods for overdispersion in mRNA-Seq count data, Open Bioinformatics Journal 7, 34-40, 2013.
There are 28 citations in total.

Details

Primary Language English
Subjects Mathematical Sciences
Journal Section Statistics
Authors

Burcin Simsek

Satish Iyengar This is me

Publication Date October 16, 2018
Published in Issue Year 2018 Volume: 47 Issue: 5

Cite

APA Simsek, B., & Iyengar, S. (2018). Overdispersed count models for mRNA transcription. Hacettepe Journal of Mathematics and Statistics, 47(5), 1335-1347.
AMA Simsek B, Iyengar S. Overdispersed count models for mRNA transcription. Hacettepe Journal of Mathematics and Statistics. October 2018;47(5):1335-1347.
Chicago Simsek, Burcin, and Satish Iyengar. “Overdispersed Count Models for MRNA Transcription”. Hacettepe Journal of Mathematics and Statistics 47, no. 5 (October 2018): 1335-47.
EndNote Simsek B, Iyengar S (October 1, 2018) Overdispersed count models for mRNA transcription. Hacettepe Journal of Mathematics and Statistics 47 5 1335–1347.
IEEE B. Simsek and S. Iyengar, “Overdispersed count models for mRNA transcription”, Hacettepe Journal of Mathematics and Statistics, vol. 47, no. 5, pp. 1335–1347, 2018.
ISNAD Simsek, Burcin - Iyengar, Satish. “Overdispersed Count Models for MRNA Transcription”. Hacettepe Journal of Mathematics and Statistics 47/5 (October 2018), 1335-1347.
JAMA Simsek B, Iyengar S. Overdispersed count models for mRNA transcription. Hacettepe Journal of Mathematics and Statistics. 2018;47:1335–1347.
MLA Simsek, Burcin and Satish Iyengar. “Overdispersed Count Models for MRNA Transcription”. Hacettepe Journal of Mathematics and Statistics, vol. 47, no. 5, 2018, pp. 1335-47.
Vancouver Simsek B, Iyengar S. Overdispersed count models for mRNA transcription. Hacettepe Journal of Mathematics and Statistics. 2018;47(5):1335-47.