Research Article

Single and Binary Performance Comparison of Data Compression Algorithms for Text Files

Volume: 12 Number: 3 September 28, 2023
EN

Single and Binary Performance Comparison of Data Compression Algorithms for Text Files

Abstract

Data compression is a technique used to reduce the size of a file. To reduce the size of a file, unnecessary information is removed or parts that repeat the same information are stored once. Thus a lossless compression is achieved. The extracted file has all the features of the compressed original file and can be used in the same way. Data compression can be done using different techniques. Some of these techniques are Huffman coding, Lempel-Ziv-Welch coding and Burrows-Wheeler Transform. Techniques such as Huffman coding, Lempel-Ziv-Welch coding and Burrows-Wheeler Transform are some of them. Which technique to use depends on the type and size of the data to be compressed. Huffman, Lempel-Ziv-Welch, Burrows-Wheeler Transform and Deflate algorithms are the most widely used techniques for text compression. Each algorithm uses different approaches and can produce different results in terms of compression ratios and performance. In this study, different data compression techniques were measured on specific data sets by using them individually and in pairs on top of each other. The most successful result was obtained with the Deflate algorithm when used alone and the achieved compression ratio was 29.08. When considered in the form of stacked pairs, the compression ratio of the Burrows-Wheeler Transform and Deflate gave the best result as 57.36. In addition, when compression is performed in pairs, which algorithm is applied first and which algorithm is applied afterwards can make a significant difference in the compression ratio. In this study, the performance measurements obtained by applying the algorithms in different orders are compared and suggestions are presented to obtain optimum performance

Keywords

References

  1. [1] M. Ignatoski, J. Lerga, L. Stanković, and M. Daković, ‘Comparison of entropy and dictionary based text compression in English, German, French, Italian, Czech, Hungarian, Finnish, and Croatian’, Mathematics, vol. 8, no. 7, p. 1059, Jul. 2020, doi: 10.3390/MATH8071059.
  2. [2] I. B. Ginzburg, S. N. Padalko, and M. N. Terentiev, ‘Short Message Compression Scheme for Wireless Sensor Networks’, Moscow Work. Electron. Netw. Technol. MWENT 2020 - Proc., Mar. 2020, doi: 10.1109/MWENT47943.2020.9067371.
  3. [3] M. R. Hasan, ‘Data Compression using Huffman based LZW Encoding Technique’, Int. J. Sci. Eng. Res., vol. Volume 2, no. 11, pp. 1–7, 2011, Accessed: Mar. 20, 2023. [Online]. Available: http://www.ijser.org
  4. [4] V. Ratnam Anappindi, ‘Issue 8 www.jetir.org (ISSN-2349-5162)’, JETIREZ06012 J. Emerg. Technol. Innov. Res., vol. 8, 2021, doi: 10.1109/EDSSC.2017.8126506.J.
  5. [5] A. Habib, M. J. Islam, and M. S. Rahman, ‘A dictionary-based text compression technique using quaternary code’, Iran J. Comput. Sci., vol. 3, no. 3, pp. 127–136, Sep. 2020, doi: 10.1007/s42044-019-00047-w.
  6. [6] S. S and R. L, ‘Text Compression Algorithms - a Comparative Study’, ICTACT J. Commun. Technol., vol. 02, no. 04, pp. 444–451, 2011, doi: 10.21917/ijct.2011.0062.
  7. [7] M. A. Rahman and M. Hamada, ‘Burrows–wheeler transform based lossless text compression using keys and Huffman coding’, Symmetry (Basel)., vol. 12, no. 10, pp. 1–14, Oct. 2020, doi: 10.3390/sym12101654.
  8. [8] L. Barua, P. K. Dhar, L. Alam, and I. Echizen, ‘Bangla text compression based on modified lempel-Ziv-welch algorithm’, ECCE 2017 - Int. Conf. Electr. Comput. Commun. Eng., pp. 855–859, Apr. 2017, doi: 10.1109/ECACE.2017.7913022.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Early Pub Date

September 23, 2023

Publication Date

September 28, 2023

Submission Date

May 24, 2023

Acceptance Date

September 4, 2023

Published in Issue

Year 2023 Volume: 12 Number: 3

APA
Keskin, S., Sevli, O., & Okatan, E. (2023). Single and Binary Performance Comparison of Data Compression Algorithms for Text Files. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, 12(3), 783-796. https://doi.org/10.17798/bitlisfen.1301546
AMA
1.Keskin S, Sevli O, Okatan E. Single and Binary Performance Comparison of Data Compression Algorithms for Text Files. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2023;12(3):783-796. doi:10.17798/bitlisfen.1301546
Chicago
Keskin, Serkan, Onur Sevli, and Ersan Okatan. 2023. “Single and Binary Performance Comparison of Data Compression Algorithms for Text Files”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 12 (3): 783-96. https://doi.org/10.17798/bitlisfen.1301546.
EndNote
Keskin S, Sevli O, Okatan E (September 1, 2023) Single and Binary Performance Comparison of Data Compression Algorithms for Text Files. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 12 3 783–796.
IEEE
[1]S. Keskin, O. Sevli, and E. Okatan, “Single and Binary Performance Comparison of Data Compression Algorithms for Text Files”, Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 12, no. 3, pp. 783–796, Sept. 2023, doi: 10.17798/bitlisfen.1301546.
ISNAD
Keskin, Serkan - Sevli, Onur - Okatan, Ersan. “Single and Binary Performance Comparison of Data Compression Algorithms for Text Files”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 12/3 (September 1, 2023): 783-796. https://doi.org/10.17798/bitlisfen.1301546.
JAMA
1.Keskin S, Sevli O, Okatan E. Single and Binary Performance Comparison of Data Compression Algorithms for Text Files. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2023;12:783–796.
MLA
Keskin, Serkan, et al. “Single and Binary Performance Comparison of Data Compression Algorithms for Text Files”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 12, no. 3, Sept. 2023, pp. 783-96, doi:10.17798/bitlisfen.1301546.
Vancouver
1.Serkan Keskin, Onur Sevli, Ersan Okatan. Single and Binary Performance Comparison of Data Compression Algorithms for Text Files. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2023 Sep. 1;12(3):783-96. doi:10.17798/bitlisfen.1301546

Cited By

Bitlis Eren University
Journal of Science Editor
Bitlis Eren University Graduate Institute
Bes Minare Mah. Ahmet Eren Bulvari, Merkez Kampus, 13000 BITLIS