About Me

Hello! I’m Roger, an 2nd-year masters student currently working on several topics in unsupervised speech processing at NTU Speech Processing and Machine Learning Lab supervised by Prof. Lin-shan Lee and Prof. Hung-yi Lee. I used to work on unsupervised music source separation, and I also have experience in various applied ML projects.

Publications

  • REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
    Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun
    in submission
    [paper] [code] (TBA)

  • AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
    Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee
    ICASSP 2024
    [paper] [code] [submission platform]

  • Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences
    Yuan Tseng, Cheng-I Lai, Hung-yi Lee
    ICASSP 2023
    [paper] [code]

  • On the Utility of Self-supervised Models for Prosody-related Tasks
    Guan-Ting Lin*, Chi-Luen Feng*, Wei-Ping Huang^, Yuan Tseng^, Tzu-Han Lin^, Chen-An Li^, Hung-yi Lee, Nigel G. Ward.
    SLT 2022 (Best Paper Award)
    [paper] [code]

Talks

  • AV-SUPERB: Audio-visual Representations and How to Evaluate Them, ASRU 2023 SPARKS workshop [slides]