Researh of Multi-label Data Classification Solutions
Technological Sciences
Emilija Valujavičiūtė
Vilnius Tech, Lithuania
Published 2024-04-09
https://doi.org/10.15388/JMD.2022.2.5
PDF
HTML

Keywords

multi-label classification
lithuanian language
multi-label text data
text classification
category detection method
category membership method
category combination detection method

How to Cite

Valujavičiūtė, E. (2024) “Researh of Multi-label Data Classification Solutions”, Jaunųjų mokslininkų darbai, 52(2), pp. 50–59. doi:10.15388/JMD.2022.2.5.

Abstract

The article analyzes the impact of the chosen method of model application on the classification of multi-label texts written in the Lithuanian language. The article presents a study of mult-label data classification methods in Lithuanian, which includes an analysis of the accuracy of the application of data classification methods for the automatic classification of multiclass texts written in Lithuanian. The classification methods, evaluation criteria, their applicability and the principles of data preparation for classification are reviewed. After preparing the text data for classification tasks, 44 combinations of classifiers were formed for the study and classification was performed using 3 different methods of multi-label data classification: category detection, category membership and category combination detection. The results obtained are compared in terms of time and classification accuracy, identifying the best performing classifiers and identifying the differences and advantages of the classification methods used.

PDF
HTML
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Downloads

Download data is not yet available.