Research Article

Word Frequency: New York Times Throughout the Times

Volume: 8 Number: 2 December 22, 2024

Abstract

This project investigates the evolution of written English over the past century using a machine learning model trained on leading articles from The New York Times published between 1920 and 2020. The primary aim is to predict the year in which a given sentence could have been written, based on linguistic patterns such as word usage and sentence structure. By analyzing these patterns, the model provides insight into how the styles and trends of written English have changed over time. The model's predictions are grounded in extensive data analysis and machine learning techniques, which support a high degree of accuracy. The study highlights the dynamic nature of language and demonstrates how computational methods can be applied in linguistic research. The findings are relevant to historical linguistics and literary studies because they offer a quantifiable method for tracking linguistic change. The work can also aid the development of tools for temporal text classification, benefiting fields such as digital humanities and archival studies. Understanding how language evolves is important for preserving cultural heritage and for improving communication strategies across media.
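The idea of dating a sentence from its word usage can be illustrated with a small sketch. The code below is not the authors' model: it is a deliberately simple baseline that assumes each word carries a "typical year" (the mean year of the training sentences containing it) and that rarer words deserve more weight (an inverse-document-frequency style weighting, chosen here as an assumption). The corpus `TOY_CORPUS` and the function names `fit` and `predict_year` are hypothetical and exist only for illustration.

```python
import math
import re
from collections import defaultdict

# Hypothetical toy corpus of (year, sentence) pairs; NOT taken from the NYT dataset.
TOY_CORPUS = [
    (1925, "the motor car halted before the telegraph office"),
    (1930, "a wireless telegraph message reached the steamship"),
    (1995, "the company launched a website on the internet"),
    (2015, "she posted the video online from her smartphone"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def fit(corpus):
    """Map each word to (mean year of sentences containing it, IDF-style weight)."""
    years_by_word = defaultdict(list)
    for year, sentence in corpus:
        # Count each word once per sentence so the weight reflects document frequency.
        for word in set(tokenize(sentence)):
            years_by_word[word].append(year)

    n_docs = len(corpus)
    model = {}
    for word, years in years_by_word.items():
        # Smoothed inverse document frequency: rarer words get larger weights.
        idf = math.log((1 + n_docs) / (1 + len(years))) + 1.0
        model[word] = (sum(years) / len(years), idf)
    return model


def predict_year(model, sentence, fallback=None):
    """Estimate a year as the weighted average of the known words' mean years."""
    weighted_sum = 0.0
    total_weight = 0.0
    for word in tokenize(sentence):
        if word in model:
            mean_year, idf = model[word]
            weighted_sum += mean_year * idf
            total_weight += idf
    if total_weight == 0.0:
        # No known words: the sketch cannot say anything about the sentence.
        return fallback
    return weighted_sum / total_weight


if __name__ == "__main__":
    model = fit(TOY_CORPUS)
    print(predict_year(model, "a telegraph from the office"))
    print(predict_year(model, "a video on her smartphone"))
```

A baseline of this kind ignores sentence structure entirely, so it covers only the word-usage half of the patterns the abstract mentions; a real system would need a far larger corpus and a learned model to capture the rest.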

Details

Primary Language

English

Subjects

Data Mining and Knowledge Discovery, Artificial Intelligence (Other)

Journal Section

Research Article

Early Pub Date

December 22, 2024

Publication Date

December 22, 2024

Submission Date

November 8, 2024

Acceptance Date

December 21, 2024

Published in Issue

Year 2024 Volume: 8 Number: 2

APA
Aşıroğlu, M., & Olca, E. (2024). Word Frequency: New York Times Throughout the Times. International Journal of Multidisciplinary Studies and Innovative Technologies, 8(2), 163-170. https://izlik.org/JA39XN76TM