Logo 知识与财富的链接
深度长尾学习研究综述

深度长尾学习研究综述

ISSN:0254-4156
2025年第51卷第5期
韩佳艺1,刘建伟1,陈德华2,徐璟东1,代琪1,夏鹏飞2 HAN Jia-Yi1,LIU Jian-Wei1,CHEN De-Hua2,XU Jing-Dong1,DAI Qi1,XIA Peng-Fei2
1.中国石油大学(北京)人工智能学院自动化系 北京 102249;2.东华大学计算机科学与技术学院 上海 201620 1. Department of Automation, College of Artificial Intelligence, China University of Petroleum, Beijing 102249;2. College of Computer Science and Technology, Donghua University, Shanghai 201620

深度学习是一门依赖于数据的科学, 传统深度学习方法假定在平衡数据集上训练模型, 然而, 现实世界中大规模数据集通常表现出长尾分布现象, 样本数量众多的少量头部类主导模型训练, 而大量尾部类样本数量过少, 难以得到充分学习. 近年来, 长尾学习掀起学术界的研究热潮. 本文综合梳理和分析近年来发表在高水平会议或期刊上的文献, 对长尾学习进行全面综述. 具体而言, 根据深度学习模型设计流程, 将图像识别领域的长尾学习算法分为丰富样本数量与语义信息的优化样本空间方法, 关注特征提取器、分类器、logits和损失函数这四个基本组成部分的优化模型方法, 以及通过引入辅助任务来帮助模型训练并在多个空间共同优化长尾学习模型的辅助任务学习3大类, 根据提出的分类方法综合对比分析每类长尾学习方法的优缺点. 然后, 进一步将基于样本数量的狭义长尾学习概念推广至多尺度广义长尾学习. 此外, 对文本数据、语音数据等其他数据形式下的长尾学习算法进行简要评述. 最后, 讨论目前长尾学习面临的可解释性较差、数据质量较低等挑战, 并展望如多模态长尾学习、半监督长尾学习等未来具有潜力的发展方向.


Deep learning is a science that depends on data. Traditional deep learning methods unrealistically assume that the training models are on balanced datasets. In real-world large-scale datasets, a long-tailed distribution often occurs, with a few head classes having many samples dominating model training, while many tail classes have too few samples to be adequately learned. In recent years, the long-tailed learning has set off a research upsurge in academic circles. In this paper, we comb and analyze the literature published in high-level conferences or journals to provide a comprehensive survey of long-tailed learning.
Specifically, we categorize long-tailed learning algorithms in the field of image recognition into three main types according to the design process of deep learning models: Optimizing the sample space by enriching the quantity and semantic information of samples, optimizing the model by focusing on the four fundamental components of feature extractor, classifier, logits and loss function, and auxiliary task learning, which involves introducing auxiliary tasks to aid model training and jointly optimizing long-tailed learning models across multiple spaces. Additionally, a comprehensive comparative analysis of the strengths and weaknesses of each category is conducted based on the proposed classification method. We further extend the concept of narrow long-tail learning based on the number of samples to multi-scale generalized long-tailed learning. In addition, we briefly review long-tailed learning algorithms in other data forms, such as text data and voice data. Finally, we discuss the current challenges faced by long-tailed learning, such as poor interpretability and low data quality, and look forward to the future development directions, such as multimodal long-tailed learning and semi-supervised long-tailed learning.

认领
收 藏
点 赞
认领进度
0 %

发表评论

ISSN:0254-4156
2025年第51卷第5期

用户信息设置