<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>ajps</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Anatolian Journal of Pharmaceutical Sciences</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2980-0463</issn>
                                                                                            <publisher>
                    <publisher-name>İnönü Üniversitesi</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id/>
                                                                                                                                                                                            <title-group>
                                                                                                                        <article-title>Concurrent Generation of Binary, Ordinal, and Count Data with Specified  Marginal and Associational Quantities in Pharmaceutical Sciences</article-title>
                                                                                                                                        </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0003-2482-703X</contrib-id>
                                                                <name>
                                    <surname>Demirtaş</surname>
                                    <given-names>Hakan</given-names>
                                </name>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                <name>
                                    <surname>Coşar</surname>
                                    <given-names>Kübra</given-names>
                                </name>
                                                                    <aff>ANKARA HACI BAYRAM VELI UNIVERSITY</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                <name>
                                    <surname>Altuntaş</surname>
                                    <given-names>Mutlu</given-names>
                                </name>
                                                                    <aff>SINOP UNIVERSITY</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20221231">
                    <day>12</day>
                    <month>31</month>
                    <year>2022</year>
                </pub-date>
                                        <volume>1</volume>
                                        <issue>1</issue>
                                        <fpage>7</fpage>
                                        <lpage>32</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20221225">
                        <day>12</day>
                        <month>25</month>
                        <year>2022</year>
                    </date>
                                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2022, Anatolian Journal of Pharmaceutical Sciences</copyright-statement>
                    <copyright-year>2022</copyright-year>
                    <copyright-holder>Anatolian Journal of Pharmaceutical Sciences</copyright-holder>
                </permissions>
            
                                                                                                <abstract><p>This manuscript is concerned with establishing a unified framework for concurrently generating data sets that include three major kinds of variables (i.e., binary, ordinal, and count) when the marginal distributions and a feasible association structure are specified for simulation purposes. The simulation paradigm has been commonly utilized in pharmaceutical practice. A central aspect of every simulation study is the quantification of the model components and parameters that jointly define a scientific process. When this quantification goes beyond the deterministic tools, researchers often resort to random number generation (RNG) in finding simulation-based solutions to address the stochastic nature of the problem. Although many RNG algorithms have appeared in the literature, a major limitation is that most of them were not devised to simultaneously accommodate all variable types mentioned above. Thus, these algorithms provide only an incomplete solution, as real data sets include variables of different kinds. This work represents an important augmentation of the existing methods as it is a systematic attempt and comprehensive investigation for mixed data generation. We provide an algorithm that is designed for generating data of mixed marginals; illustrate its operational, logistical, and computational details; and present ideas on how it can be extended to span more sophisticated distributional settings in terms of a broader range of marginal features and associational quantities.</p></abstract>
                                                                                    
            
                                                            <kwd-group>
                                                    <kwd>Biserial correlation</kwd>
                                                    <kwd>  phi coefficient</kwd>
                                                    <kwd>  simulation</kwd>
                                                    <kwd>  tetrachoric correlation</kwd>
                                                    <kwd>  random number generation</kwd>
                                                    <kwd>  mixed data</kwd>
                                            </kwd-group>
                                                        
                                                                                                                                                    </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">[1] Demirtas H. A method for multivariate ordinal data generation given marginal distributions and correlations. Journal of Statistical Computation and Simulation, 2006; 76: 1017- 1025.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">[2] Demirtas H, Doganay B. Simultaneous generation of binary and normal data with specified marginal and association structures. Journal of Biopharmaceutical Statistics, 2012; 22: 223-236.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">[3] Demirtas H, Yavuz Y. Concurrent generation of ordinal and normal data. Journal of Biopharmaceutical Statistics, 2015; 25: 635-650.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">[4] Amatya A, Demirtas H. Simultaneous generation of multivariate mixed data with Poisson and normal marginals. Journal of Statistical Computation and Simulation, 2015a; 85: 3129-3139.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">[5] Emrich JL, Piedmonte MR. A method for generating high-dimensional multivariate binary variates. The American Statistician, 1991; 45: 302—304.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">[6] Demirtas H, Hedeker D. A practical way for computing approximate lower and upper correlation bounds. The American Statistician, 2011; 65: 104-109.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">[7] Demirtas H, Hedeker D. Computing the point-biserial correlation under any underlying continuous distribution. Communications in Statistics-Simulation and Computation, 2016; Anat. J. Pharm. Sci 2022; 45: 2744-2751.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">[8] Demirtas H, Ahmadian R, Atis S, Can FE, Ercan I. A nonnormal look at polychoric correlations: Modeling the change in correlations before and after discretization. Computational Statistics, 2016; 31: 1385-1401.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">[9] Ferrari PA, Barbiero A. Simulating ordinal data. Multivariate Behavioral Research, 2012; 47: 566-589.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">[10] Yahav I, Shmueli G. On generating multivariate Poisson data in management science applications. Applied Stochastic Models in Business and Industry, 2012; 28: 91—102.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">[11] Fleishman AI. A method for simulating non-normal distributions. Psychometrika, 1978; 43: 521-532.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">[12] Vale CD, Maurelli VA. Simulating multivariate nonnormal distributions. Psychometrika, 1983; 48: 465-471.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">[13] Headrick T.C. Power Method Polynomials and Other Transformations. Boca Raton. 2010; FL: Chapman and Hall/CRC.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">[14] Demirtas H, Hedeker D, Mermelstein J.M. Simulation of massive public health data by power polynomials. Statistics in Medicine, 2012; 31: 3337-3346.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">[15] Demirtas H. Concurrent generation of binary and nonnormal continuous data through fifth order power polynomials. Communications in Statistics-Simulation and Computation, 2017a; 46: 344-357.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">[16] Demirtas H. On accurate and precise generation of generalized Poisson vari- ates. Communications in Statistics-Simulation and Computation, 2017b; 46: 489-499.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">[17] Nelsen RB. An Introduction to Copulas. 2006; Berlin, Germany: Springer.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">[18] Higham N.J. Computing the nearest correlation matrix - a problem from finance. IMA Journal of Numerical Analysis, 2002; 22: 329-343.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">[19] Demirtas H. Simulation-driven inferences for multiply imputed longitudinal datasets. Statistica Neerlandica, 2004a; 58: 466-482.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">[20] Demirtas H. Assessment of relative improvement due to weights within generalized estimating equations framework for incomplete clinical trials data. Journal of Biopharmaceutical Statistics, 2004b; 14: 1085-1098.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">[21] Demirtas H. Multiple imputation under Bayesianly smoothed pattern-mixture models for non-ignorable drop-out. Statistics in Medicine, 2005a; 24: 2345-2363.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">[22] Demirtas H. Bayesian analysis of hierarchical pattern-mixture models for clinical trials data with attrition and comparisons to commonly used ad-hoc and model- based approaches. Journal of Biopharmaceutical Statistics, 2005b; 15: 383-402.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">[23] Demirtas H. Practical advice on how to impute continuous data when the ultimate interest centers on dichotomized outcomes through pre-specified thresholds. Communications in Statistics-Simulation and Computation, 2007a; 36: 871-889.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">[24] Demirtas H. The design of simulation studies in medical statistics. Statistics in Medicine, 2007b; 26: 3818-3821.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">[25] Demirtas H. On imputing continuous data when the eventual interest pertains to ordinalized outcomes via threshold concept. Computational Statistics and Data Analysis, 2008; 52: 2261-2271.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">[26] Demirtas H. Multiple imputation under the generalized lambda distribution. Journal of Biopharmaceutical Statistics, 2009a; 19: 77-89.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">[27] Demirtas H. A distance-based rounding strategy for post-imputation ordinal data. Journal of Applied Statistics, 2010a; 37: 489-500.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">[28] Demirtas H, Hedeker D. Gaussianizationbased quasi-imputation and expansion strategies for incomplete correlated binary responses. Statistics in Medicine, 2007; 26: 782-799.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">[29] Demirtas H, Hedeker D. An imputation strategy for incomplete longitudinal ordinal data. Statistics in Medicine, 2008a; 27: 4086-4093.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">[30] Demirtas H, Hedeker D. Multiple imputation under power polynomials. Communications in Statistics-Simulation and Computation, 2008b; 37: 1682-1695.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">[31] Demirtas H, Hedeker D. Imputing continuous data under some non-Gaussian distributions. Statistica Neerlandica, 2008c; 62: 193-205.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">[32] Demirtas H, Schafer JL. On the performance of random-coefficient patternmixture models for non-ignorable drop-out. Statistics in Medicine, 2003; 22: 2253-2575.</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">[33] Demirtas H, Arguelles LM, Chung H, Hedeker D. On the performance of bias reduction techniques for variance estimation in approximate Bayesian bootstrap imputation. Computational Statistics and Data Analysis, 2007; 51: 4064-4068.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">[34] Demirtas H, Freels SA, Yucel RM. Plausibility of multivariate normality assumption when multiply imputing nonGaussian continuous outcomes: A simulation assessment. Journal of Statistical Computation and Simulation, 2008; 78: 69-84.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">[35] Inan G, Demirtas H, Gao R. PoisBinOrd: Data Generation with Poisson, Binary, and Ordinal Components, 2021; URL:http://CRAN.Rproject.org/package=PoisBinOrd</mixed-citation>
                    </ref>
                                    <ref id="ref36">
                        <label>36</label>
                        <mixed-citation publication-type="journal">[36] R Development Core Team. R: A Language and Environment for Statistical Computing. Foundation for Statistical Computing. 2023: Vienna, Austria.</mixed-citation>
                    </ref>
                                    <ref id="ref37">
                        <label>37</label>
                        <mixed-citation publication-type="journal">[37] Amatya A, Demirtas H. MultiOrd: An R package for generating correlated ordinal data. Communications in Statistics-Simulation and Computation, 2015b; 44: 1683-1691.</mixed-citation>
                    </ref>
                                    <ref id="ref38">
                        <label>38</label>
                        <mixed-citation publication-type="journal">[38] Amatya A, Demirtas H. OrdNor: An R package for concurrent generation of ordinal and normal data. Journal of Statistical Software, 2015c; 68: 1-14.</mixed-citation>
                    </ref>
                                    <ref id="ref39">
                        <label>39</label>
                        <mixed-citation publication-type="journal">[39] Amatya A, Demirtas H. Concurrent generation of multivariate mixed data with variables of dissimilar types. Journal of Statistical Computation and Simulation, 2016; 86: 3595-3607.</mixed-citation>
                    </ref>
                                    <ref id="ref40">
                        <label>40</label>
                        <mixed-citation publication-type="journal">[40] Amatya A, Demirtas H. PoisNor: An R package for generation of multivariate data with Poisson and normal marginals. Communications in Statistics-Simulation and Computation, 2017; 46: 2241-2253.</mixed-citation>
                    </ref>
                                    <ref id="ref41">
                        <label>41</label>
                        <mixed-citation publication-type="journal">[41] Demirtas H. Rounding strategies for multiply imputed binary data. Biometrical Journal, 2009b; 51: 677-688.</mixed-citation>
                    </ref>
                                    <ref id="ref42">
                        <label>42</label>
                        <mixed-citation publication-type="journal">[42] Demirtas H. Multiple imputation for longitudinal data under a Bayesian multilevel model. Communications in Statistics-Theory and Methods, 2009c; 38: 2812-2828.</mixed-citation>
                    </ref>
                                    <ref id="ref43">
                        <label>43</label>
                        <mixed-citation publication-type="journal">[43] Demirtas H. An application of multiple imputation under the two generalized parametric families. Journal of Data Science, 2010b; 8: 443-455.</mixed-citation>
                    </ref>
                                    <ref id="ref44">
                        <label>44</label>
                        <mixed-citation publication-type="journal">[44] Demirtas H. Joint generation of binary and nonnormal continuous data. Journal of Biometrics and Biostatistics, 2014; 5: 1-9.</mixed-citation>
                    </ref>
                                    <ref id="ref45">
                        <label>45</label>
                        <mixed-citation publication-type="journal">[45] Demirtas H. A multiple imputation framework for massive multivariate data of different variable types: A Monte-Carlo technique. In ICSA Book Series in Statistics, John Dean Chen and Ding-Geng (Din) Chen (Eds): Monte-Carlo Simulation- Based Statistical Modeling. Singapore: Springer, 2017c: 143-162.</mixed-citation>
                    </ref>
                                    <ref id="ref46">
                        <label>46</label>
                        <mixed-citation publication-type="journal">[46] Demirtas H. Inducing any feasible level of correlation to bivariate data with any marginals. The American Statistician, 2019; 73: 273-277.</mixed-citation>
                    </ref>
                                    <ref id="ref47">
                        <label>47</label>
                        <mixed-citation publication-type="journal">[47] Demirtas H, Gao R. Mixed data generation packages and related computational tools in R. Communications in StatisticsSimulation and Computation, 2022; 51: 4520-4563.</mixed-citation>
                    </ref>
                                    <ref id="ref48">
                        <label>48</label>
                        <mixed-citation publication-type="journal">[48] Demirtas H, Amatya A, Doganay B. BinNor: An R package for concur- rent generation of binary and normal data. Communications in Statistics-Simulation and Computation, 2014; 43: 569-579.</mixed-citation>
                    </ref>
                                    <ref id="ref49">
                        <label>49</label>
                        <mixed-citation publication-type="journal">[49] Demirtas H, Allozi R, Hu Y, Inan G, Ozbek L. Joint generation of binary, ordinal, count, and normal data with specified marginal and association structures in Monte-Carlo 
 simulations.In ICSA Book Series in Statistics, John Dean Chen and Ding-Geng (Din) Chen (Eds): Monte-Carlo Simulation-Based Statistical Modeling. Singapore: Springer, 2017: 3-15.</mixed-citation>
                    </ref>
                                    <ref id="ref50">
                        <label>50</label>
                        <mixed-citation publication-type="journal">[50] Gao R, Demirtas H. CorrToolBox: an R package for modeling correlational transformations in discretization contexts. Communications in Statistics-Simulation and Computation, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref51">
                        <label>51</label>
                        <mixed-citation publication-type="journal">[51] Li H, Demirtas H, Chen R. RNGforGPD: An R package for generation of univariate and multivariate generalized Poisson data. The R Journal, 2020; 12: 120-133.</mixed-citation>
                    </ref>
                                    <ref id="ref52">
                        <label>52</label>
                        <mixed-citation publication-type="journal">[52] Demirtas H, Amatya A, Pugach O, Cursio J, Shi F, Morton D, Doganay B. Accuracy versus convenience: a simulation-based comparison of two continuous imputation models for incomplete ordinal longitudinal clinical trials data. Statistics and Its Interface, 2009; 2: 449-456.</mixed-citation>
                    </ref>
                                    <ref id="ref53">
                        <label>53</label>
                        <mixed-citation publication-type="journal">[53] Demirtas H, Vardar-Acar, C. Anatomy of correlational magnitude transforma- tions in latency and discretization contexts in MonteCarlo studies. In ICSA Book Series in Statistics, John Dean Chen and Ding-Geng (Din) Chen (Eds): Monte-Carlo SimulationBased Statistical Modeling. Singapore: Springer, 2017; 59-84.</mixed-citation>
                    </ref>
                                    <ref id="ref54">
                        <label>54</label>
                        <mixed-citation publication-type="journal">[54] Demirtas H. A note on the relationship between the phi coefficient and the tetrachoric correlation under nonnormal underlying distributions. The American Statistician, 2016; 70: 143-148.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
