View publication
Title | Multi-label Learning on Low Label Density Sets with Few Examples |
Authors | Matías Vergara, Benjamin Bustos, Iván Sipirán, Tobias Schreck, Stefan Lengauer |
Publication date | March 2025 |
Abstract |
Multi-label learning has experienced an immense growth in the last years due to the multiple real-life applications to which it is applicable, such as the classification of protein functions, or musical genres, among others. This has led to the proposal of categories for multi-label classification (MLC) problems that seek to establish guidelines for the different configurations, given either by the quality or quantity of the labels, the number of examples for training, etc. Such is the case for the class of problems known as "Challenging MLC", those in which the universe of labels incorporates obstacles either in terms of quality (erroneously assigned labels, unseen labels, etc.) or quantity (thousands or millions of labels). Different methods have been developed to address these cases, and yet few efforts have been directed towards the case where, despite having a large label universe, the number of examples is small (of the same order as the labels), thus posing a more complex scenario. In this paper, we examine one important real-world problem case -- the labeling of Geometric surface patterns, appearing on pottery objects from the Classical era. As we will show, existing methods from the state of the art can provide baseline performance, but cannot yet comprehensively address this and similar application problems. We present and encompassing experimental comparison of state of the art methods, detailing advantages and problems. We contribute a processing pipeline that allows us to achieve effective classifications. Our work addresses the importance case when the universe of labels admits a feasible simplification through natural language processing (NLP) techniques and augmentation of visual training data. Based on an in-depth analysis of results, we propose practical guidelines on how to face similar problems, regarding both the selection of techniques and the analysis of results. We also identify pressing issues for current research to make multi-labeling more widely applicable and functional. |
Pages | article 125942 |
Volume | 265 |
Journal name | Expert Systems with Applications |
Publisher | Elsevier Science (Amsterdam, The Netherlands) |
Reference URL |
![]() |