Electronic Engineering Department, The Chinese University of Hong Kong

Assistant Professor
_{B.Eng. (SCUT), Master (SCUT), Ph.D. (Surrey)}

Rm 413, Ho Sin Hang Engineering Building

Tel: +852 3943 0869

This email address is being protected from spambots. You need JavaScript enabled to view it.

Research Interests:

Audio / Music signal processing and generation, computer audition, spatial audio, multi-modality signal processing.

http://www.ee.cuhk.edu.hk/qqkong/

Resume of Career

Qiuqiang Kong is currently an assistant professor at the EE department of the Chinese University of Hong Kong. He received his Ph.D. degree from the University of Surrey, Guildford, UK, in 2019. Following his Ph.D., he joined ByteDance as a research scientist. His research topic includes the classification, detection, separation, and generation of general sounds and music. He was the top 2% scientist in 2021 in “Updated science-wide author databases of standardized citation indicators. He was known for developing pretrained audio neural networks (PANNs) for audio tagging and was awarded the IEEE SPS Young Author Best Paper in 2022. He won the detection and classification of acoustic scenes and events (DCASE) challenge in 2017. He was known for transcribing the largest piano MIDI dataset GiantMIDI-Piano in the world. He has co-authored over 50 papers in journals and conferences, including IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), ICASSP, INTERSPEECH, IJCAI, DCASE, EUSIPCO, LVA-ICA. He has been cited 3156 times, with an H-index of 28 till Sep. 2023. He assisted with organizing the LVA-ICA 2018 in Guildford, UK and the DCASE 2018 Workshop in Woking, UK. He is serving as a co-editor for the Frontiers in Signal Processing journal.

Current Research Interests

Audio / Music signal processing and generation, computer audition, spatial audio, multi-modality signal processing.

Highlights of Recent Achievements

Journal publication "PANNs: Large-scale pretrained audio neural networks for audio pattern recognition" receives the IEEE SPS Young Author Best Paper in 2022.
Conference publication "Performance MIDI-to-score conversion by neural beat tracking." receives the Best Paper Award at ISMIR conference in 2022.

External Service in Recent 3 Years

Serves as the co-editor for the Frontiers in Signal Processing journal in 2022.

Publication for the Past 3 Years

Journal Publications

Qiuqiang Kong, Li, Bochen and Chen, Jitong and Wang, Yuxuan, “Giantmidi-piano: A large-scale midi dataset for classical piano music,” Accepted by Transactions of the International Society for Music Information Retrieval (TISMIR), 2022.
Jie jiang, Qiuqiang Kong, Mark D. Plumbley, Nigel Gilbert, “Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances,” ACM Transactions on Knowledge Discovery from Data (TKDD), 2021.
Jingqiao Zhao, Qiuqiang Kong, Xiaoning Song, Zhenhua Feng, and Xiaojun Wu. “Feature Alignment for Robust Acoustic Scene Classification across Devices,” IEEE Signal Processing Letters (SPL), 2021.
Qiuqiang Kong, Bochen Li, Xuchen Song, Yuan Wan, and Yuxuan Wang. “High-resolution Piano Transcription with Pedals by Regressing Onset and Offset Times,” IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021.
Qiuqiang Kong, YinCao, Turab Iqbal, Yuxuan Wang, WenwuWang, and Mark D. Plumbley, “PANNs:Large- scale pretrained audio neural networks for audio pattern recognition,” IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP, IEEE SPS Best Paper Award), 2020.
Qiuqiang Kong, Yang Xu, Wenwu Wang, and Mark D. Plumbley, “Sound event detection of weakly labelled data with CNN-transformer and automatic threshold optimization,” IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020.
Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, and Yuxing Peng, “Audio Tagging by Cross Filtering Noisy Labels,” IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020.
Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Bjö̈rn W. Schuller, “CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification,” IEEE Transactions on Multimedia (TMM), 2020.

Conference Publications

Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang. “Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang. “Simple pooling front-ends for efficient audio classification”. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley. “Ontology-aware Learning and Evaluation for Audio Tagging.”, In INTERSPEECH, 2023.
Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, Lilian H. Tang, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang. “Visually-aware audio captioning with adaptive audio-visual attention.” In INTERSPEECH, 2023.
Hu, Jinbo, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, and Jun Yang. “”Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains.” In Workshop on the Detection and Classification of Acoustic Scenes and Events (DCASE), 2022.
Liu, Haohe, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, and Mark D. Plumbley. “Segment-level Metric Learning for Few-shot Bioacoustic Event Detection.” In Workshop on the Detection and Classification of Acoustic Scenes and Events (DCASE), 2022.
Lele Liu, Qiuqiang Kong, Veronica Morfi, Emmanouil Benetos. “Performance MIDI-to-score conversion by neural beat tracking.” In International Society for Music Information Retrieval (ISMIR, Best Paper Award), 2022.
Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, and Jun Yang. “A TrackWise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection.” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, and DeLiang Wang. “VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration.” INTERSPEECH, 2022.
Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, and DeLiang Wang. “Neural Vocoder is All You Need for Speech Super-resolution.” INTERSPEECH, 2022.
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, and Wenwu Wang. “Separate What You Describe: Language-Queried Audio Source Separation.” INTERSPEECH, 2022.
Lam Pham, Chris Baume, Qiuqiang Kong. “An Audio-Based Deep Learning Framework For BBC Television Programme Classification.” European Signal Processing Conference (EUSIPCO), 2021.
Qiuqiang, Kong, Yin Cao, Haohe Liu, Keunwoo Choi, and Yuxuan Wang. “Decoupling magnitude and phase estimation with deep ResUNet for music source separation.” In International Society for Music Information Retrieval (ISMIR), 2021.
Liwei Lin, Qiuqiang Kong, Junyan Jiang, and Gus Xia. “A Unified Model for Zero-shot Music Source Separation.” In International Society for Music Information Retrieval (ISMIR), 2021.
Qiuqiang Kong, Haohe Liu, Xingjian Du, Li Chen, Rui Xia, and Yuxuan Wang. “Speech enhancement with weakly labelled data from AudioSet.” In INTERSPEECH, 2021.
Lam Pham, Chris Baume, Qiuqiang Kong,Tassadaq Hussain, Wenwu Wang, and Mark Plumbley.“An Audio-Based Deep Learning Framework For BBC Television Programme Classification.” European Signal Processing Conference (EUSIPCO), 2021.
Xingjian Du, Bilei Zhu, Qiuqiang Kong, and Zejun Ma, “Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling.” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.
Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, and Mark D. Plumbley. “An improved event-independent network for polyphonic sound event localization and detection.” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.
Turab Iqbal, Yin Cao, Qiuqiang Kong, Mark D.Plumbley, and Wenwu Wang, “LearningWithOut-of-Distribution Data for Audio Classification,” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.
Qiuqiang Kong,Yuxuan Wang,Xuchen Song,Yin Cao, Wenwu Wang, and Mark D. Plumbley, “Source separation with weakly labelled data: An approach to computational auditory scene analysis,” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.

Homepage

Prof KONG, Qiuqiang 孔秋强