KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Optimization Driven MapReduce Framework for Indexing and Retrieval of Big Data

Vol. 14, No. 5, May 31, 2020
DOI: 10.3837/tiis.2020.05.002

Abstract

With ongoing technical advances, the volume of big data grows daily, to the point that traditional software tools struggle to handle it. In addition, the presence of imbalanced data in big data is a major concern for the research community. To manage big data effectively and to deal with imbalanced data, this paper proposes a new indexing algorithm for retrieving big data in the MapReduce framework. In the mappers, the data are clustered using the Sparse Fuzzy-c-means (Sparse FCM) algorithm. The reducer combines the clusters generated by the mappers and clusters the data again with the Sparse FCM algorithm. The requested data are then located by two-level query matching: the first level determines the relevant cluster, and the second level accesses the requested data within that cluster. The data are ranked using the proposed Monarch chaotic whale optimization algorithm (M-CWOA), which combines Monarch butterfly optimization (MBO) [22] and the chaotic whale optimization algorithm (CWOA) [21]. Here, the Parametric Enabled-Similarity Measure (PESM) is adapted to measure the similarity between two datasets. The proposed M-CWOA outperformed other methods, with a maximum precision of 0.9237, recall of 0.9371, and F1-score of 0.9223.
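The indexing and retrieval flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions: plain fuzzy c-means stands in for Sparse FCM, Euclidean distance stands in for both PESM and the M-CWOA ranking, the map and reduce phases are ordinary function calls rather than MapReduce jobs, and every function name (fuzzy_c_means, build_index, query) is hypothetical.

```python
import math


def fuzzy_c_means(points, c, m=2.0, iters=30):
    """Plain fuzzy c-means (assumed stand-in for Sparse FCM); returns c centroids."""
    ordered = sorted(points)
    n, dim = len(ordered), len(ordered[0])
    # Deterministic initialisation: evenly spaced picks from the sorted points.
    centroids = [ordered[j * (n - 1) // max(c - 1, 1)] for j in range(c)]
    exponent = 2.0 / (m - 1.0)
    for _ in range(iters):
        # Membership of every point in every cluster (each row sums to 1).
        memberships = []
        for p in ordered:
            d = [max(math.dist(p, ctr), 1e-12) for ctr in centroids]
            memberships.append(
                [1.0 / sum((d[j] / d[k]) ** exponent for k in range(c))
                 for j in range(c)]
            )
        # Centroid update: mean of the points weighted by membership ** m.
        centroids = []
        for j in range(c):
            w = [u[j] ** m for u in memberships]
            total = sum(w)
            centroids.append(tuple(
                sum(wi * p[t] for wi, p in zip(w, ordered)) / total
                for t in range(dim)
            ))
    return centroids


def nearest(point, centroids):
    """Index of the centroid closest to point (Euclidean, assumed in place of PESM)."""
    return min(range(len(centroids)),
               key=lambda j: math.dist(point, centroids[j]))


def build_index(partitions, c):
    # Map phase: cluster each data partition independently.
    local = [ctr for part in partitions for ctr in fuzzy_c_means(part, c)]
    # Reduce phase: combine the mapper centroids and cluster them again.
    centroids = fuzzy_c_means(local, c)
    # Index: file every record under its nearest final centroid.
    buckets = {j: [] for j in range(c)}
    for part in partitions:
        for rec in part:
            buckets[nearest(rec, centroids)].append(rec)
    return centroids, buckets


def query(q, centroids, buckets, k=3):
    # Level 1: pick the cluster whose centroid best matches the query.
    cluster = nearest(q, centroids)
    # Level 2: rank that cluster's records (distance is an assumed
    # substitute for the M-CWOA ranking) and return the top k.
    ranked = sorted(buckets[cluster], key=lambda r: math.dist(q, r))
    return ranked[:k]


# Toy data: two well-separated groups spread over two partitions.
partitions = [
    [(0, 0), (1, 1), (10, 10), (11, 11)],
    [(0, 1), (1, 0), (10, 11), (11, 10)],
]
centroids, buckets = build_index(partitions, c=2)
results = query((9.5, 9.5), centroids, buckets, k=2)
print(results)
```

The point of the two-level structure is that the second-level search only scans one cluster's records instead of the whole dataset; the clustering and similarity choices above are placeholders that could be swapped for the paper's Sparse FCM and PESM.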



Cite this article

[IEEE Style]
H. B. Abdalla, A. M. Ahmed, and M. A. Al Sibahee, "Optimization Driven MapReduce Framework for Indexing and Retrieval of Big Data," KSII Transactions on Internet and Information Systems, vol. 14, no. 5, pp. 1886-1908, 2020. DOI: 10.3837/tiis.2020.05.002.

[ACM Style]
Hemn Barzan Abdalla, Awder Mohammed Ahmed, and M. A. Al Sibahee. 2020. Optimization Driven MapReduce Framework for Indexing and Retrieval of Big Data. KSII Transactions on Internet and Information Systems, 14, 5, (2020), 1886-1908. DOI: 10.3837/tiis.2020.05.002.

[BibTeX Style]
@article{tiis:23548,
  title={Optimization Driven MapReduce Framework for Indexing and Retrieval of Big Data},
  author={Hemn Barzan Abdalla and Awder Mohammed Ahmed and M. A. Al Sibahee},
  journal={KSII Transactions on Internet and Information Systems},
  DOI={10.3837/tiis.2020.05.002},
  volume={14},
  number={5},
  year={2020},
  month={May},
  pages={1886-1908}
}