Research Article

Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP)

Volume: IDAP-2023 : International Artificial Intelligence and Data Processing Symposium Number: IDAP-2023 October 18, 2023
TR EN

Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP)

Abstract

In this era of big data, the streaming, storage, and analysis of large amounts of data present a variety of challenges. Several challenges must be addressed by designers of data-intensive systems in order to retrieve useful information. Collecting, storing, and analyzing data requires a collection and analytics platform comprised of an appropriate choice of data processing and analytics technologies in order to acquire meaningful insight. In this paper, we report on TweetCASP (Tweet Collection, Analytics and Storage Platfrom), which gathers tweets based on user-entered keywords using Twitter's Streaming API, providing an environment for real-time analytics on streaming data and permanently storing data in an Apache Cassandra NoSQL datastore to fulfill future batch-oriented data processing requirements. Moreover, The TweetCASP presents an example of a data-intensive system used by software developers, designers, and researchers for data collecting and analytics.

Keywords

References

  1. Amghar, S., Cherdal, S., & Mouline, S. (2020). Storing , preprocessing and analyzing tweets : finding the suitable noSQL system. https://doi.org/10.1080/1206212X.2020.1846946
  2. Anderson, K. M., Aydin, A. A., Barrenechea, M., Cardenas, A., Hakeem, M., & Jambi, S. (2015). Design Challenges/Solutions for Environments Supporting the Analysis of Social Media Data in Crisis Informatics Research. 2015 48th Hawaii International Conference on System Sciences, 2015-March, 163–172. https://doi.org/10.1109/HICSS.2015.29
  3. Anderson, K. M., & Schram, A. (2011). Design and implementation of a data analytics infrastructure in support of crisis informatics research: NIER track. 2011 33rd International Conference on Software Engineering (ICSE), 844–847. https://doi.org/10.1145/1985793.1985920
  4. ApacheCassandra. (2022). ApacheCassandra.pdf. https://cassandra.apache.org/_/index.html Aswathy, A., Prabha, R., Gopal, L. S., Pullarkatt, D., & Ramesh, M. V. (2022). An efficient twitter data collection and analytics framework for effective disaster management. 2022 IEEE Delhi Section Conference, DELCON 2022. https://doi.org/10.1109/DELCON54057.2022.9753627
  5. Aydin, A. A. (2016). INCREMENTAL DATA COLLECTION & ANALYTICS THE DESIGN OF NEXT-GENERATION CRISIS INFORMATICS SOFTWARE [Ph.D., University of Colorado Boulder].
  6. https://www.proquest.com/pagepdf/1834583278/Record/9F7C2D640FDE4BCCPQ/3?accountid=16268 Aydin, A. A., & Anderson, K. M. (2017). Batch to Real-Time : Incremental Data Collection & Analytics Platform.
  7. Proceedings of the 50th Hawaii International Conference on System Sciences, 5911–5920. http://hdl.handle.net/10125/41876
  8. Aydin, A. A., & Anderson, K. M. (2020). Data modelling for large-scale social media analytics: design challenges and lessons learned. International Journal of Data Mining, Modelling and Management, 12(4), 386. https://doi.org/10.1504/IJDMMM.2020.111409

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Publication Date

October 18, 2023

Submission Date

August 16, 2023

Acceptance Date

August 27, 2023

Published in Issue

Year 2023 Volume: IDAP-2023 : International Artificial Intelligence and Data Processing Symposium Number: IDAP-2023

APA
Doguc, T. B., & Aydın, A. A. (2023). Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP). Computer Science, IDAP-2023 : International Artificial Intelligence and Data Processing Symposium(IDAP-2023), 165-171. https://doi.org/10.53070/bbd.1344271
AMA
1.Doguc TB, Aydın AA. Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP). JCS. 2023;IDAP-2023 : International Artificial Intelligence and Data Processing Symposium(IDAP-2023):165-171. doi:10.53070/bbd.1344271
Chicago
Doguc, Tugba Beril, and Ahmet Arif Aydın. 2023. “Designing a Platform for Tweet Collection, Analytics and Storage (TweetCASP)”. Computer Science IDAP-2023 : International Artificial Intelligence and Data Processing Symposium (IDAP-2023): 165-71. https://doi.org/10.53070/bbd.1344271.
EndNote
Doguc TB, Aydın AA (October 1, 2023) Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP). Computer Science IDAP-2023 : International Artificial Intelligence and Data Processing Symposium IDAP-2023 165–171.
IEEE
[1]T. B. Doguc and A. A. Aydın, “Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP)”, JCS, vol. IDAP-2023 : International Artificial Intelligence and Data Processing Symposium, no. IDAP-2023, pp. 165–171, Oct. 2023, doi: 10.53070/bbd.1344271.
ISNAD
Doguc, Tugba Beril - Aydın, Ahmet Arif. “Designing a Platform for Tweet Collection, Analytics and Storage (TweetCASP)”. Computer Science IDAP-2023 : INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM/IDAP-2023 (October 1, 2023): 165-171. https://doi.org/10.53070/bbd.1344271.
JAMA
1.Doguc TB, Aydın AA. Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP). JCS. 2023;IDAP-2023 : International Artificial Intelligence and Data Processing Symposium:165–171.
MLA
Doguc, Tugba Beril, and Ahmet Arif Aydın. “Designing a Platform for Tweet Collection, Analytics and Storage (TweetCASP)”. Computer Science, vol. IDAP-2023 : International Artificial Intelligence and Data Processing Symposium, no. IDAP-2023, Oct. 2023, pp. 165-71, doi:10.53070/bbd.1344271.
Vancouver
1.Tugba Beril Doguc, Ahmet Arif Aydın. Designing a platform for Tweet Collection, Analytics and Storage (TweetCASP). JCS. 2023 Oct. 1;IDAP-2023 : International Artificial Intelligence and Data Processing Symposium(IDAP-2023):165-71. doi:10.53070/bbd.1344271

Cited By

The Creative Commons Attribution 4.0 International License 88x31.png is applied to all research papers published by JCS and

A Digital Object Identifier (DOI) Logo_TM.png is assigned for each published paper