• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Density-based Outlier Detection in Multi-dimensional Datasets

Vol. 16, No. 12, December 31, 2022
DOI: 10.3837/tiis.2022.12.002

Abstract

Density-based outlier detection is an active research topic in data mining. A point is judged to be an outlier on the basis of the density of the points near it. Existing density-based detection algorithms have high time complexity. To reduce it, a new outlier detection algorithm, DODMD (Density-based Outlier Detection in Multidimensional Datasets), is proposed. First, the concept of a micro-cluster is introduced on the basis of the ZH-tree. Each leaf node is regarded as a micro-cluster, and computation is performed per micro-cluster so that points can be filtered in batches. To obtain n sets of approximate outliers quickly, a greedy method is used to calculate the boundary of LOF, and the minimum value is marked as LOF_min. Second, the outliers can be filtered out by LOF_min, the real outliers are calculated, and the result set is then updated to tighten the boundary. Finally, the accuracy and efficiency of the DODMD algorithm are verified on a real dataset and a synthetic dataset, respectively.
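
The role of the LOF_min threshold can be illustrated with a minimal sketch. This is not the DODMD algorithm: it assumes LOF denotes the standard local outlier factor, computes it by brute force with no ZH-tree or micro-clusters, and uses a min-heap whose smallest entry stands in for LOF_min. The function names (`knn`, `lof_scores`, `top_n_outliers`) and the sample points are hypothetical.

```python
import heapq
import math


def knn(points, i, k):
    """Return the k nearest neighbours of points[i] as (distance, index) pairs."""
    dists = sorted(
        (math.dist(points[i], points[j]), j)
        for j in range(len(points)) if j != i
    )
    return dists[:k]


def lof_scores(points, k):
    """Brute-force LOF for every point (quadratic number of distances).

    Assumes no group of more than k coincident points, so that the
    reachability distances do not all collapse to zero.
    """
    neighbours = [knn(points, i, k) for i in range(len(points))]
    k_dist = [nb[-1][0] for nb in neighbours]  # distance to the k-th neighbour

    def lrd(i):
        # Local reachability density: inverse of the mean reachability distance.
        reach = [max(k_dist[j], d) for d, j in neighbours[i]]
        return len(reach) / sum(reach)

    lrds = [lrd(i) for i in range(len(points))]
    # LOF: mean density of the neighbours divided by the point's own density.
    return [
        sum(lrds[j] for _, j in neighbours[i]) / (len(neighbours[i]) * lrds[i])
        for i in range(len(points))
    ]


def top_n_outliers(scores, n):
    """Keep the n highest scores; heap[0][0] plays the role of LOF_min."""
    heap = []  # min-heap of (score, index)
    for idx, score in enumerate(scores):
        if len(heap) < n:
            heapq.heappush(heap, (score, idx))
        elif score > heap[0][0]:
            # Beats the current threshold: replace the weakest result.
            heapq.heapreplace(heap, (score, idx))
        # Otherwise score <= LOF_min and the point cannot enter the result set.
    return sorted(heap, reverse=True)


if __name__ == "__main__":
    # Hypothetical data: a tight cluster of five points plus one isolated point.
    pts = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (10, 10)]
    print(top_n_outliers(lof_scores(pts, k=2), n=1))
```

In this sketch every exact score is computed before the threshold is applied, so nothing is saved. The abstract describes using bounds on LOF so that points can be discarded against LOF_min in batches, without computing their exact scores.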



Cite this article

[IEEE Style]
X. Wang, Z. Cao, R. Zhan, M. Bai, Q. Ma, G. Li, "Density-based Outlier Detection in Multi-dimensional Datasets," KSII Transactions on Internet and Information Systems, vol. 16, no. 12, pp. 3815-3835, 2022. DOI: 10.3837/tiis.2022.12.002.

[ACM Style]
Xite Wang, Zhixin Cao, Rongjuan Zhan, Mei Bai, Qian Ma, and Guanyu Li. 2022. Density-based Outlier Detection in Multi-dimensional Datasets. KSII Transactions on Internet and Information Systems, 16, 12, (2022), 3815-3835. DOI: 10.3837/tiis.2022.12.002.

[BibTeX Style]
@article{tiis:38207, title="Density-based Outlier Detection in Multi-dimensional Datasets", author="Xite Wang and Zhixin Cao and Rongjuan Zhan and Mei Bai and Qian Ma and Guanyu Li", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2022.12.002}, volume={16}, number={12}, year="2022", month={December}, pages={3815-3835}}