• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

An Efficient Machine Learning-based Text Summarization in the Malayalam Language


Abstract

Automatic text summarization is the process of condensing a large body of content into a shorter text that retains the significant information. Malayalam is one of the more difficult languages spoken in parts of India, most commonly in Kerala and Lakshadweep. Natural language processing research on Malayalam is relatively scarce because of the complexity of the language and the scarcity of available resources. This paper proposes an approach to summarizing Malayalam documents by training a model based on the Support Vector Machine classification algorithm. Several features of the text are used for training so that the system can output the most important information from the input text. The classifier assigns sentences to four separate classes (most important, important, average, and least significant), and the system builds a summary of the input document from this classification. The user can select a compression ratio, and the system outputs a summary of the corresponding size. Model performance is measured on Malayalam documents of different genres as well as on documents from the same domain. The model is evaluated using the content evaluation measures precision, recall, F-score, and relative utility. The precision and recall values obtained indicate that the model is reliable and that its summaries are more relevant than those of the other summarizers.
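
The selection and evaluation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes the SVM has already labeled each sentence, and the class names, the `summarize` and `precision_recall_f` helpers, and the rule of rounding the sentence count up are all hypothetical choices made for illustration.

```python
import math

# Hypothetical names for the four importance classes; a lower rank means more important.
CLASS_RANK = {
    "most_important": 0,
    "important": 1,
    "average": 2,
    "least_significant": 3,
}


def summarize(sentences, labels, compression_ratio):
    """Keep about `compression_ratio` of the sentences, preferring higher classes.

    `labels` holds one predicted class name per sentence (assumed to come from
    a trained classifier). The chosen sentences are returned in document order.
    """
    if not 0 < compression_ratio <= 1:
        raise ValueError("compression_ratio must be in (0, 1]")
    if len(sentences) != len(labels):
        raise ValueError("sentences and labels must have the same length")
    if not sentences:
        return []
    # Assumption: round up so that at least one sentence is always kept.
    keep = max(1, math.ceil(len(sentences) * compression_ratio))
    # sorted() is stable, so ties within a class keep their document order.
    ranked = sorted(range(len(sentences)), key=lambda i: CLASS_RANK[labels[i]])
    chosen = sorted(ranked[:keep])
    return [sentences[i] for i in chosen]


def precision_recall_f(selected, reference):
    """Sentence-level precision, recall, and F-score against a reference summary."""
    selected, reference = set(selected), set(reference)
    overlap = len(selected & reference)
    precision = overlap / len(selected) if selected else 0.0
    recall = overlap / len(reference) if reference else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_score


if __name__ == "__main__":
    doc = ["s1", "s2", "s3", "s4"]
    predicted = ["average", "most_important", "least_significant", "important"]
    summary = summarize(doc, predicted, compression_ratio=0.5)
    print(summary)
    print(precision_recall_f(summary, ["s2", "s3"]))
```

Restoring document order after ranking keeps the extracted summary readable. Relative utility, the fourth measure named in the abstract, is not covered by this sketch.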



Cite this article

[IEEE Style]
R. P. Haroon, A. G. M and B. N. U, "An Efficient Machine Learning-based Text Summarization in the Malayalam Language," KSII Transactions on Internet and Information Systems, vol. 16, no. 6, pp. 1778-1799, 2022. DOI: 10.3837/tiis.2022.06.001.

[ACM Style]
Rosna P Haroon, Abdul Gafur M, and Barakkath Nisha U. 2022. An Efficient Machine Learning-based Text Summarization in the Malayalam Language. KSII Transactions on Internet and Information Systems, 16, 6, (2022), 1778-1799. DOI: 10.3837/tiis.2022.06.001.

[BibTeX Style]
@article{tiis:25754, title="An Efficient Machine Learning-based Text Summarization in the Malayalam Language", author="Rosna P Haroon and Abdul Gafur M and Barakkath Nisha U", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2022.06.001}, volume={16}, number={6}, year="2022", month={June}, pages={1778-1799}}