[2102.01243] PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation

Audio event classification is an active research area with a wide range of applications. Since the release of AudioSet, classification accuracy has improved substantially, mostly through novel model architectures and attention modules. However, we find that appropriate training techniques are equally important for building audio event classification models with AudioSet, yet they have not received the attention they deserve. To fill this gap, we present PSLA, a collection of training techniques that noticeably boost model accuracy: ImageNet pretraining, balanced sampling, data augmentation, label enhancement, and model aggregation, along with their design choices. By training an EfficientNet with these techniques, we obtain a model that achieves a new state-of-the-art mean average precision (mAP) of 0.474 on AudioSet, outperforming the previous best system's 0.439.
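
The abstract reports results as mean average precision (mAP). As a rough illustration of what that metric measures, here is a minimal sketch of one common definition: per-class average precision over ranked clips, macro-averaged across classes. The function names and the choice to score a class with no positives as 0.0 are assumptions made for this example, not the paper's evaluation code.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average precision for one class: mean of precision@k at each positive's rank k."""
    # Rank clips from highest to lowest score (ties keep their input order).
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions: List[float] = []
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precisions.append(hits / rank)
    # Assumption: a class with no positive clips contributes 0.0.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(
    label_matrix: Sequence[Sequence[int]],
    score_matrix: Sequence[Sequence[float]],
) -> float:
    """Macro-average of per-class AP; rows are clips, columns are classes."""
    num_classes = len(label_matrix[0])
    aps = []
    for c in range(num_classes):
        class_labels = [row[c] for row in label_matrix]
        class_scores = [row[c] for row in score_matrix]
        aps.append(average_precision(class_labels, class_scores))
    return sum(aps) / num_classes


# Toy multi-label example: 4 clips, 2 classes (values are made up).
toy_labels = [[1, 0], [0, 1], [1, 0], [0, 0]]
toy_scores = [[0.9, 0.2], [0.8, 0.9], [0.7, 0.1], [0.1, 0.3]]
toy_map = mean_average_precision(toy_labels, toy_scores)
```

Because each class is averaged with equal weight, rare classes influence the score as much as frequent ones, which is one reason a technique such as balanced sampling can matter for this metric.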

1 mentions: @shinmura0
Keywords: pretrain
Date: 2021/02/19 05:21

Referring Tweets

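@shinmura0 PSLA: Improving Audio Event Classification... t.co/qkhI3fgPf6 ・A paper that achieves SOTA on AudioSet, which could be called the ImageNet of audio ・Uses EfficientNetB2 (pretrained on ImageNet) ・In its conclusion, points out label errors in the evaluation data (continued)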

Related Entries

Read more [2001.10072] ABCTracker: an easy-to-use, cloud-based application for tracking multiple objects
0 users, 1 mentions 2020/01/31 02:20
Read more [1904.07223] Joint Discriminative and Generative Learning for Person Re-identification
0 users, 1 mentions 2020/06/23 03:51
Read more [1912.04761] Sound Event Detection of Weakly Labelled Data with CNN-Transformer and Automatic Thresh...
0 users, 1 mentions 2020/09/16 14:22
Read more [1911.10676] Attribute Restoration Framework for Anomaly Detection
0 users, 1 mentions 2020/11/20 05:21
Read more Rainforest Connection Species Audio Detection | Kaggle
0 users, 1 mentions 2021/02/18 03:51
Read more Rainforest Connection Species Audio Detection | Kaggle
0 users, 1 mentions 2021/02/19 00:52