This paper presents kNN-CTC, a novel approach that overcomes these challenges by leveraging Connectionist Temporal Classification (CTC) pseudo labels to establish frame-level audio-text key-value pairs, circumventing the need for precise ground truth alignments.
Dec 13, 2023
In this paper, we propose MADI, a novel UDA approach for ASR via inter-domain MAtching and intra-domain DIscrimination, which improves the model transferability by fine-grained inter-domain matching and discriminability by intra-domain contrastive discrimination simultaneously.
Feb 16, 2023