About Me
Hello! I’m Roger, a 3rd-year master's student at the NTU Speech Processing and Machine Learning Lab, supervised by Prof. Lin-shan Lee and Prof. Hung-yi Lee. I work on topics related to human-like machine learning, such as unsupervised speech segmentation/recognition and audio-visual learning.
I’m currently interning with the Speech team at Samsung AI Center in Cambridge.
Publications
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun
NeurIPS 2024
[paper] [code]
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Haibin Wu, Yuan Tseng, Hung-yi Lee
Interspeech 2024
[dataset] [paper] [code]
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee
ICASSP 2024
[paper] [code] [submission platform]
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences
Yuan Tseng, Cheng-I Lai, Hung-yi Lee
ICASSP 2023
[paper] [code]
On the Utility of Self-supervised Models for Prosody-related Tasks
Guan-Ting Lin*, Chi-Luen Feng*, Wei-Ping Huang^, Yuan Tseng^, Tzu-Han Lin^, Chen-An Li^, Hung-yi Lee, Nigel G. Ward
SLT 2022 (Best Paper Award)
[paper] [code]