<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>buje</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Bogazici University Journal of Education</journal-title>
            </journal-title-group>
                            <issn pub-type="ppub">2822-5600</issn>
                                        <issn pub-type="epub">2822-5597</issn>
                                                                                            <publisher>
                    <publisher-name>Boğaziçi Üniversitesi</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id/>
                                                                                                                                                                                            <title-group>
                                                                                                                        <trans-title-group xml:lang="en">
                                    <trans-title>Striking the Balance between Validity and Reliability of a Listening Test in Turkish as a Second Language</trans-title>
                                </trans-title-group>
                                                                                                                                        </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                <name>
                                    <surname>Tozlu</surname>
                                    <given-names>Emel</given-names>
                                </name>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                <name>
                                    <surname>Ünaldı</surname>
                                    <given-names>Aylin</given-names>
                                </name>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20181217">
                    <day>17</day>
                    <month>12</month>
                    <year>2018</year>
                </pub-date>
                                        <volume>34</volume>
                                        <issue>1</issue>
                                        <fpage>49</fpage>
                                        <lpage>73</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20171207">
                        <day>07</day>
                        <month>12</month>
                        <year>2017</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20180716">
                        <day>16</day>
                        <month>07</month>
                        <year>2018</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 1976, Boğaziçi Üniversitesi Eğitim Dergisi</copyright-statement>
                    <copyright-year>1976</copyright-year>
                    <copyright-holder>Boğaziçi Üniversitesi Eğitim Dergisi</copyright-holder>
                </permissions>
            
                                                                                                <trans-abstract xml:lang="en">
                            <p>Evidence of an assessment tool's efficacy is needed to justify the decisions made on the basis of its scores. Validity evidence can be collected from several sources, including the stages before and after test administration. The present study presents several types of validity evidence for a Turkish as a Second Language (TSL) Academic Listening Test in order to establish its efficacy. It reports cognitive, contextual and scoring validity (reliability) evidence from the first and second versions of the test and investigates whether the modifications made after the first administration improved the quality of the test. The study concludes that although the changes made to the first version strengthened the validity claims in terms of cognitive and contextual requirements, the reliability scores worsened in the second version. This finding is a reminder that building the foundations of a test firmly, by operationalizing the necessary contextual features and cognitive processes, is necessary but does not by itself guarantee the technical quality of the items. Scoring validity must be established carefully as well. The study exemplifies a thorough attempt at establishing the validity of a TSL test from multiple perspectives and aims to serve as a model for further test development in TSL.</p></trans-abstract>
                                                                                    
            
                                                                                
                                                <kwd-group xml:lang="en">
                                                    <kwd>Cognitive validity</kwd>
                                                    <kwd>contextual validity</kwd>
                                                    <kwd>scoring validity</kwd>
                                                    <kwd>assessment of listening in Turkish as a second language</kwd>
                                            </kwd-group>
                                                                                                                                        </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">ALTE. (2011). Manual for language test development and examining. Retrieved from https://www.alte.org/resources/Documents/ManualLanguageTest-Alte2011_EN.pdf</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">Bachman, L. F. (1990). Fundamental considerations in language testing. Oxford: Oxford University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">Bachman, L. F., &amp; Palmer, A. S. (1996). Language testing in practice: Designing and developing useful language tests. Oxford: Oxford University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">Borsboom, D., Mellenbergh, G. J., &amp; van Heerden, J. (2004). The concept of validity. Psychological Review, 111(4), 1061-1071.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">Council of Europe. (2001). Common European framework of reference for languages: Learning, teaching and assessment. Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">EALTA guidelines for good practice in language testing and assessment. Retrieved from http://www.ealta.eu.org/documents/archive/guidelines/English.pdf</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">Elliott, M., &amp; Wilson, J. (2013). Context validity. In A. Geranpayeh &amp; L. Taylor (Eds.), Examining listening: Research and practice in assessing second language listening, Studies in language testing, 35 (pp. 152-241). Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">Field, J. (2013). Cognitive validity. In A. Geranpayeh &amp; L. Taylor (Eds.), Examining listening: Research and practice in assessing second language listening, Studies in language testing, 35 (pp. 77-151). Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">Geranpayeh, A. (2013). Scoring validity. In A. Geranpayeh &amp; L. Taylor (Eds.), Examining listening: Research and practice in assessing second language listening, Studies in language testing, 35 (pp. 242-272). Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">Kane, M. T. (2013). Validating the interpretations and uses of test scores. Journal of Educational Measurement, 50(1), 1-73.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">Khalifa, H., &amp; Weir, C. J. (2009). Examining reading: Research and practice in assessing second language reading, Studies in language testing, 29. Cambridge: UCLES/Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Knoch, U., &amp; Elder, C. (2013). A framework for validating post-entry language assessments (PELAs). Papers in Language Testing and Assessment, 2(2), 48-66.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">McNamara, T. (2000). Language testing. Oxford: Oxford University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">Messick, S. A. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (pp. 13-103). New York: American Council on Education/Macmillan Publishing Company.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">Richards, J. C. (1983). Listening comprehension: Approach, design, procedure. TESOL Quarterly, 17(2), 219-240.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">Tozlu, E. (2017). The development of a listening test for learners of Turkish as a foreign language (Unpublished master's thesis). Boğaziçi University, İstanbul, Turkey.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Trim, J. L. M. (2009). Breakthrough. Retrieved from https://www.coe.int/t/dg4/linguistic/Source/FinalBreakthrough%20specification_6Nov01.rtf</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Van Ek, J., &amp; Trim, J. L. M. (1991a). Threshold 1990. Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Van Ek, J., &amp; Trim, J. L. M. (1991b). Waystage 1990. Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">Van Ek, J., &amp; Trim, J. L. M. (2001). Vantage. Cambridge: Cambridge University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">Walt, J. L., &amp; Steyn, F. (2008). The validation of language tests. Stellenbosch Papers in Linguistics, 38, 191-204.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Weir, C. J. (1993). Understanding and developing language tests. New York &amp; Toronto: Prentice-Hall.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">Weir, C. J. (2005). Language testing and validation. Hampshire: Palgrave Macmillan.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">Young, J. W., So, Y., &amp; Ockey, G. J. (2013). Guidelines for best test development practices to ensure validity and fairness for international English language proficiency assessments. Educational Testing Service. https://www.ets.org/s/about/pdf/best_practices_ensure_validity_fairness_english_language_assessments.pdf</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
