📝 Publications

RECA-PD: A Robust Explainable Cross-Attention Method for Speech-based Parkinson’s Disease Classification
Terry Yi Zhong, Cristian Tejedor-Garcia, Martha Larson, Bastiaan R. Bloem. [Code]
Contribution:
- A novel, robust, and explainable method for speech-based PD classification that delivers more clinically relevant explanations
- RECA-PD offer a better trade-off between explainability and performance
INTERSPEECH 2025
Evaluating the Usefulness of Non-Diagnostic Speech Data for Developing Parkinson’s Disease Classifiers
Terry Yi Zhong, Esther Janse, Cristian Tejedor-Garcia, Louis ten Bosch, Martha Larson.
ICASSP 2024
SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription
Yongyi Zang*, Yi Zhong*, Frank Cwitkowitz, Zhiyao Duan. [Demo Page]
INTERSPEECH 2024
GTR-Voice: Articulatory Phonetics Informed Controllable Expressive Speech Synthesis
Zehua Kcriss Li, Meiying Melissa Chen, Yi Zhong, Pinxin Liu, Zhiyao Duan. [Demo Page]

EE-TTS: Emphatic Expressive TTS with Linguistic Information
Yi Zhong, Chen Zhang, Xule Liu, Chenxi Sun, Weishan Deng, Haifeng Hu, Zhongqian Sun. [Demo Page]
Contribution:
- EE-TTS can identify appropriate emphasis positions from text and synthesize expressive speech with emphasis and linguistic information.
- This work outperforms baseline with expressiveness-MOS improvements from 3.76 to 4.25 and naturalness-MOS from 3.67 to 4.34.
- EE-TTS helps build AI playmate services for the world’s most-played mobile MOBA game Honor of Kings (DAU 100+ million).