Abstract
Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. In this paper, we propose a novel method for the use of inter-speaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture inter-speaker information from output space of those statistical models. In order to sufficiently utilize inter-speaker information, a rival penalized encoding rule is proposed to design supervised learning pairs for training the neural network. Comparative results in the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.