• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

SVM-Based Incremental Learning Algorithm for Large-Scale Data Stream in Cloud Computing

Vol. 8, No.10, October 31, 2014
10.3837/tiis.2014.10.005, Full Text:

Abstract

We have witnessed the rapid development of information technology in recent years. One of the key phenomena is the fast, near-exponential increase of data. Consequently, most of the traditional data classification methods fail to meet the dynamic and real-time demands of todays data processing and analyzing needs--especially for continuous data streams. This paper proposes an improved incremental learning algorithm for a large-scale data stream, which is based on SVM (Support Vector Machine) and is named DS-IILS. The DS-IILS takes the load condition of the entire system and the node performance into consideration to improve efficiency. The threshold of the distance to the optimal separating hyperplane is given in the DS-IILS algorithm. The samples of the history sample set and the incremental sample set that are within the scope of the threshold are all reserved. These reserved samples are treated as the training sample set. To design a more accurate classifier, the effects of the data volumes of the history sample set and the incremental sample set are handled by weighted processing. Finally, the algorithm is implemented in a cloud computing system and is applied to study user behaviors. The results of the experiment are provided and compared with other incremental learning algorithms. The results show that the DS-IILS can improve training efficiency and guarantee relatively high classification accuracy at the same time, which is consistent with the theoretical analysis.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
Ning Wang, Yang Yang, Liyuan Feng, Zhenqiang Mi, Kun Meng and Qing Ji, "SVM-Based Incremental Learning Algorithm for Large-Scale Data Stream in Cloud Computing," KSII Transactions on Internet and Information Systems, vol. 8, no. 10, pp. 3378-3393, 2014. DOI: 10.3837/tiis.2014.10.005

[ACM Style]
Wang, N., Yang, Y., Feng, L., Mi, Z., Meng, K., and Ji, Q. 2014. SVM-Based Incremental Learning Algorithm for Large-Scale Data Stream in Cloud Computing. KSII Transactions on Internet and Information Systems, 8, 10, (2014), 3378-3393. DOI: 10.3837/tiis.2014.10.005