ScholarMate
客服热线:400-1616-289

Attention based gender and nationality information exploration for speaker identification

Tang, Yong; Liu, Chuang; Leng, Yan*; Zhao, Weiwei; Sun, Jiande; Sun, Chengli; Wang, Rongyan; Yuan, Qi; Li, Dengwang; Xu, Huaqiang
Science Citation Index Expanded
南昌航空大学; 1; y

摘要

Gender and nationality information has not been exploited in large-scale speaker recognition despite being provided in the popular VoxCeleb1 dataset. This paper explores methods that combine high -level features extracted from the gender and nationality information with low-level acoustic features for speaker identification. To our knowledge, this is the first time that the gender and nationality information provided in VoxCeleb1 is utilized in speaker identification. Specifically, we propose Gender-Guided Spectrogram-Attention network and Nationality-Guided Spectrogram-Attention network that embed gender and nationality information into the spectrogram features, respectively. The resulting gender and nationality embeddings are then used with the spectrogram features together for classification. Experimental results show that the proposed methods can successfully capture the gender and nationality information of the speakers, and can effectively improve speaker identification accuracy.

关键词

Speaker identification Gender Nationality High-level features Attention network