📝 Publications

TSD 2025
sym

RECA-PD: A Robust Explainable Cross-Attention Method for Speech-based Parkinson’s Disease Classification
Terry Yi Zhong, Cristian Tejedor-Garcia, Martha Larson, Bastiaan R. Bloem. [Code]

Contribution:

  • A novel, robust, and explainable method for speech-based PD classification that delivers more clinically relevant explanations
  • RECA-PD offer a better trade-off between explainability and performance

INTERSPEECH 2025 Evaluating the Usefulness of Non-Diagnostic Speech Data for Developing Parkinson’s Disease Classifiers

Terry Yi Zhong, Esther Janse, Cristian Tejedor-Garcia, Louis ten Bosch, Martha Larson.

ICASSP 2024 SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription

Yongyi Zang*, Yi Zhong*, Frank Cwitkowitz, Zhiyao Duan. [Demo Page]

INTERSPEECH 2024 GTR-Voice: Articulatory Phonetics Informed Controllable Expressive Speech Synthesis

Zehua Kcriss Li, Meiying Melissa Chen, Yi Zhong, Pinxin Liu, Zhiyao Duan. [Demo Page]

INTERSPEECH 2023
sym

EE-TTS: Emphatic Expressive TTS with Linguistic Information
Yi Zhong, Chen Zhang, Xule Liu, Chenxi Sun, Weishan Deng, Haifeng Hu, Zhongqian Sun. [Demo Page]

Contribution:

  • EE-TTS can identify appropriate emphasis positions from text and synthesize expressive speech with emphasis and linguistic information.
  • This work outperforms baseline with expressiveness-MOS improvements from 3.76 to 4.25 and naturalness-MOS from 3.67 to 4.34.
  • EE-TTS helps build AI playmate services for the world’s most-played mobile MOBA game Honor of Kings (DAU 100+ million).