I am currently a master’s student at the College of Computer Science, Nankai University, under the guidance of Professor Yong Qin since 2022. I am poised to embark on a Ph.D. journey, transitioning seamlessly from my master’s program in September 2024. Prior to this, I earned my B.E. degree from Dalian University of Technology (DUT) in 2022. My research interests include automatic speech recognition and domain adaptation.
This paper presents kNN-CTC, a novel approach that overcomes these challenges by leveraging Connectionist Temporal Classification (CTC) pseudo labels to establish frame-level audio-text key-value pairs, circumventing the need for precise ground truth alignments.
Dec 13, 2023
In this paper, we propose MADI, a novel UDA approach for ASR via inter-domain MAtching and intra-domain DIscrimination, which improves the model transferability by fine-grained inter-domain matching and discriminability by intra-domain contrastive discrimination simultaneously.
Feb 16, 2023