Speaker Identification using Multi Methods of Features Extraction
Main Article Content
Abstract
The primary challenge in identifying speakers is extracting recognition features from speech signals to optimize classification algorithms' performance. Several methods are proposed in this article for extracting differential features from an audio signal in order to classify the speaker. The following methods were used to obtain the features of the audio signal in this approach: Power Spectral Density (PSD), Short Term Energy (STE), Fast Fourier Transform (FFT), Hue Seven Moment Invariants method (HSMI), Mel Frequency Cepstrum Coefficients (MFCC), cross-correlation estimates of MFCC (XCORR), and Linear predictive coding (LPC). The classification methods in this paper are the artificial neural network (ANN), the Euclidean distance, and the autocorrelation, where the results obtained from the experiments showed that the accuracy rate is more than 96%.
Downloads
Metrics
Article Details
Licensing
TURCOMAT publishes articles under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This licensing allows for any use of the work, provided the original author(s) and source are credited, thereby facilitating the free exchange and use of research for the advancement of knowledge.
Detailed Licensing Terms
Attribution (BY): Users must give appropriate credit, provide a link to the license, and indicate if changes were made. Users may do so in any reasonable manner, but not in any way that suggests the licensor endorses them or their use.
No Additional Restrictions: Users may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.