• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

A Classification Algorithm Based on Data Clustering and Data Reduction for Intrusion Detection System over Big Data

Vol. 13, No. 7, July 30, 2019
10.3837/tiis.2019.07.021, Download Paper (Free):

Abstract

With the rapid development of network, Intrusion Detection System(IDS) plays a more and more important role in network applications. Many data mining algorithms are used to build IDS. However, due to the advent of big data era, massive data are generated. When dealing with large-scale data sets, most data mining algorithms suffer from a high computational burden which makes IDS much less efficient. To build an efficient IDS over big data, we propose a classification algorithm based on data clustering and data reduction. In the training stage, the training data are divided into clusters with similar size by Mini Batch K-Means algorithm, meanwhile, the center of each cluster is used as its index. Then, we select representative instances for each cluster to perform the task of data reduction and use the clusters that consist of representative instances to build a K-Nearest Neighbor(KNN) detection model. In the detection stage, we sort clusters according to the distances between the test sample and cluster indexes, and obtain k nearest clusters where we find k nearest neighbors. Experimental results show that searching neighbors by cluster indexes reduces the computational complexity significantly, and classification with reduced data of representative instances not only improves the efficiency, but also maintains high accuracy.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
Q. Wang, X. Ouyang and J. Zhan, "A Classification Algorithm Based on Data Clustering and Data Reduction for Intrusion Detection System over Big Data," KSII Transactions on Internet and Information Systems, vol. 13, no. 7, pp. 3714-3732, 2019. DOI: 10.3837/tiis.2019.07.021.

[ACM Style]
Qiuhua Wang, Xiaoqin Ouyang, and Jiacheng Zhan. 2019. A Classification Algorithm Based on Data Clustering and Data Reduction for Intrusion Detection System over Big Data. KSII Transactions on Internet and Information Systems, 13, 7, (2019), 3714-3732. DOI: 10.3837/tiis.2019.07.021.