I am currently starting the third year of my PhD at the School of Data Science, The Chinese University of Hong Kong (Shenzhen). My chief supervisor is Prof. Satoshi Nakamura and my co-supervisor is Prof. Haizhou Li. I completed my bachelor’s degree at the School of Electronic and Information Engineering, Beihang University, and then my master’s degree in Integrated Circuit Engineering at Tsinghua University in 2020, supervised by Prof. Shouyi Yin. From 2020 to 2022, I worked as a speech algorithm engineer at Lenovo and Meituan.
My research interests include speech-to-speech translation, speech-to-text translation, and speech generation.
🔥 News
- 2024.08: 🎉🎉 We released our training code in the hope of contributing to the language model-based speech translation community.
- 2024.05: 🎉🎉 Our speech-to-text translation paper was accepted by ACL 2024.
- 2023.12: 🎉🎉 Our accent conversion paper was accepted by ICASSP 2024.
📖 Education
- 2022.09 - 2024.10 (now), Ph.D. candidate at the School of Data Science, The Chinese University of Hong Kong (Shenzhen), Shenzhen.
- 2017.09 - 2020.06, M.E. in Integrated Circuit Engineering, Tsinghua University, Beijing.
- 2013.09 - 2017.06, B.E. at the School of Electronic and Information Engineering, Beihang University, Beijing.
📝 Publications
- Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura “LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models” (Accepted by ACL 2024)
- Xi Chen, Jiakun Pei, Liumeng Xue, Mingyang Zhang “Transfer the linguistic representations from TTS to accent conversion with non-parallel data” (Accepted by ICASSP 2024)
- Xueyao Zhang, Liumeng Xue, Yicheng Gu, Yuancheng Wang, Jiaqi Li, Haorui He, Chaoren Wang, Songting Liu, Xi Chen, et al. “Amphion: An Open-Source Audio, Music and Speech Generation Toolkit” (Accepted by SLT 2024)
- Huiyu Shi, Xi Chen, Tianlong Kong, Shouyi Yin, Peng Ouyang “GLMSnet: Single Channel Speech Separation Framework in Noisy and Reverberant Environments” (Accepted by ASRU 2021)
- Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin “Transformer with bidirectional decoder for speech recognition” (Accepted by InterSpeech 2020)
- Xi Chen, Shouyi Yin, Dandan Song, Peng Ouyang, Leibo Liu, Shaojun Wei “Small-footprint keyword spotting with graph convolutional network” (Accepted by ASRU 2019)
- Zeqing Zhao*, Xi Chen*, Hui Liu, Xuyang Wang, Lin Yang, Junjie Wang “Sptts: Parallel speech synthesis without extra aligner model” (Accepted by APSIPA ASC 2021)
- Ruiqi Guo, Yonggang Liu, Shixuan Zheng, Ssu-Yen Wu, Peng Ouyang, Win-San Khwa, Xi Chen, et al. “A 5.1 pJ/neuron 127.3 us/inference RNN-based speech recognition processor using 16 computing-in-memory SRAM macros in 65nm CMOS” (Accepted by VLSI 2019)
🎖 Honors and Awards
- Duan Yong Ping Travel Award, 2024
- National Encouragement Scholarship in China, 2016
🧑‍🏫 Teaching
- Lead TA, AIR6001 Advanced Artificial Intelligence, Fall 2024
- TA, CSC3160 Fundamentals of Speech and Language Processing, Spring 2023
- TA, CSC3100 Data Structures, Fall 2022, Fall 2023
- TA, DDA2003 Visual Analytics_L01, Spring 2024