• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Dynamic Prime Chunking Algorithm for Data Deduplication in Cloud Storage

Vol. 15, No. 4, April 30, 2021
10.3837/tiis.2021.04.009, Download Paper (Free):

Abstract

The data deduplication technique identifies the duplicates and minimizes the redundant storage data in the backup server. The chunk level deduplication plays a significant role in detecting the appropriate chunk boundaries, which solves the challenges such as minimum throughput and maximum chunk size variance in the data stream. To provide the solution, we propose a new chunking algorithm called Dynamic Prime Chunking (DPC). The main goal of DPC is to dynamically change the window size within the prime value based on the minimum and maximum chunk size. According to the result, DPC provides high throughput and avoid significant chunk variance in the deduplication system. The implementation and experimental evaluation have been performed on the multimedia and operating system datasets. DPC has been compared with existing algorithms such as Rabin, TTTD, MAXP, and AE. Chunk Count, Chunking time, throughput, processing time, Bytes Saved per Second (BSPS) and Deduplication Elimination Ratio (DER) are the performance metrics analyzed in our work. Based on the analysis of the results, it is found that throughput and BSPS have improved. Firstly, DPC quantitatively improves throughput performance by more than 21% than AE. Secondly, BSPS increases a maximum of 11% than the existing AE algorithm. Due to the above reason, our algorithm minimizes the total processing time and achieves higher deduplication efficiency compared with the existing Content Defined Chunking (CDC) algorithms.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
M. Ellappan and A. S, "Dynamic Prime Chunking Algorithm for Data Deduplication in Cloud Storage," KSII Transactions on Internet and Information Systems, vol. 15, no. 4, pp. 1342-1359, 2021. DOI: 10.3837/tiis.2021.04.009.

[ACM Style]
Manogar Ellappan and Abirami S. 2021. Dynamic Prime Chunking Algorithm for Data Deduplication in Cloud Storage. KSII Transactions on Internet and Information Systems, 15, 4, (2021), 1342-1359. DOI: 10.3837/tiis.2021.04.009.

[BibTeX Style]
@article{tiis:24527, title="Dynamic Prime Chunking Algorithm for Data Deduplication in Cloud Storage", author="Manogar Ellappan and Abirami S and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2021.04.009}, volume={15}, number={4}, year="2021", month={April}, pages={1342-1359}}