Haozhe Zhang

I am currently a second-year student enrolled in the School of Computer Science at Carnegie Mellon University (CMU), pursuing a Master of Intelligent Information Systems (MIIS) degree with an anticipated graduation date of December 2023. My academic journey has been enriched by the privilege of being a part of Prof. Alex Hauptmann's esteemed team, where I am actively engaged in the development of a speech recognition pipeline for the KAIROS project during my directed study at CMU.

I obtained my dual Bachelor of Science in Data Science degree from Duke Univeristy and Duke Kunshan Univeristy. Throughout my undergraduate tenure, I am fortumate to work with Prof. Ming Li on innovative projects centered around voice conversion (VC).

Broadly, my research interests encompass the expansive realms of machine learning and artificial intelligence. And My research endeavors have centered on speech processing, with a particular emphasis on advancing the domains of voice conversion (VC) and text-to-speech synthesis (TTS).

Publications

Following is a list of my publications on Google Scholar

[1] Yaogen Yang, Haozhe Zhang(Joint first author), Zexin Cai, Yao Shi, Ming Li
Electrolaryngeal speech enhancement based on a two stage framework with bottleneck feature refinement and voice conversion.
Biomedical Signal Processing and Control 80 (2023): 104279.

[2] Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines.
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022.

[3] Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss
arXiv preprint arXiv:2104.10832 (2021).