• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Centralized Machine Learning Versus Federated Averaging: A Comparison using MNIST Dataset


Abstract

A flood of information has occurred with the rise of the internet and digital devices in the fourth industrial revolution era. Every millisecond, massive amounts of structured and unstructured data are generated; smartphones, wearable devices, sensors, and self-driving cars are just a few examples of devices that currently generate massive amounts of data in our daily. Machine learning has been considered an approach to support and recognize patterns in data in many areas to provide a convenient way to other sectors, including the healthcare sector, government sector, banks, military sector, and more. However, the conventional machine learning model requires the data owner to upload their information to train the model in one central location to perform the model training. This classical model has caused data owners to worry about the risks of transferring private information because traditional machine learning is required to push their data to the cloud to process the model training. Furthermore, the training of machine learning and deep learning models requires massive computing resources. Thus, many researchers have jumped to a new model known as "Federated Learning". Federated learning is emerging to train Artificial Intelligence models over distributed clients, and it provides secure privacy information to the data owner. Hence, this paper implements Federated Averaging with a Deep Neural Network to classify the handwriting image and protect the sensitive data. Moreover, we compare the centralized machine learning model with federated averaging. The result shows the centralized machine learning model outperforms federated learning in terms of accuracy, but this classical model produces another risk, like privacy concern, due to the data being stored in the data center. The MNIST dataset was used in this experiment.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
S. Peng, Y. Yang, M. Mao, D. Park, "Centralized Machine Learning Versus Federated Averaging: A Comparison using MNIST Dataset," KSII Transactions on Internet and Information Systems, vol. 16, no. 2, pp. 742-756, 2022. DOI: 10.3837/tiis.2022.02.020.

[ACM Style]
Sony Peng, Yixuan Yang, Makara Mao, and Doo-Soon Park. 2022. Centralized Machine Learning Versus Federated Averaging: A Comparison using MNIST Dataset. KSII Transactions on Internet and Information Systems, 16, 2, (2022), 742-756. DOI: 10.3837/tiis.2022.02.020.

[BibTeX Style]
@article{tiis:25317, title="Centralized Machine Learning Versus Federated Averaging: A Comparison using MNIST Dataset", author="Sony Peng and Yixuan Yang and Makara Mao and Doo-Soon Park and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2022.02.020}, volume={16}, number={2}, year="2022", month={February}, pages={742-756}}