Enhancing Speaker Recognition Robustness with Scalable Deep Learning Models and MFCC Features

Yasir Hussein Shakir; Eshaq Aziz Awadh  AL Mandhari; Ali  Alkhazraji; Reem Ali  Mutlag

doi:10.63496/ejcs.Vol1.Iss5.185

PDF

Published: 01-11-2025

DOI: https://doi.org/10.63496/ejcs.Vol1.Iss5.185

Keywords:

Speaker Recognition, Deep Learning, MFCC, FFNN, EPNN, FCBP

Yasir Hussein Shakir

College of Graduate Studies (COGS), Universiti Tenaga Nasional (UNITEN) Kajang, Malaysia.

Eshaq Aziz Awadh AL Mandhari

Graduate School of Technology at Asia Pacific University of Technology and Innovation (APU) in Malaysia

Ali Alkhazraji

Computer Science Department, Faculty of Sciences, Lebanese University, Hadat Campus, Beirut, Lebanon

Reem Ali Mutlag

College of Graduate Studies (COGS), University Tenaga Nasional (UNITEIN), Kajang, Malaysia

Abstract

Speaker recognition is the process of distinguishing various speakers within recordings of sounds or stream. Several variables contribute to the task's complexity, including variances in structure, overlapping sound events, as well as the presence of multiple noise sources after recorded. Despite the plethora of algorithms that have been developed to extract this data for identification purposes, capturing speaker-specific attributes from the often intricate sound mix is still a difficulty for machines. Earlier methods have used discriminative models to decode voice data, but with increasing computation capability, generative models are taking some ground. While they are functional for various speech types missing transition or clarity, the scalability of these models is questionable. To address this issue in this paper, the different databases used to train deep learning models like the Feed Forward Neural Network (FFNN), Forward Cascade Back Propagation (FCBP), and Elman Propagation Neural Network (EPNN) are trained in such a way that addresses scalability problems of the models.

Issue

Vol. 1 No. 5 (2025): Issue 5 (2025)

Section

Articles

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Enhancing Speaker Recognition Robustness with Scalable Deep Learning Models and MFCC Features. (2025). East Journal of Computer Science, 1(5), 1-16. https://doi.org/10.63496/ejcs.Vol1.Iss5.185

Enhancing Speaker Recognition Robustness with Scalable Deep Learning Models and MFCC Features

Abstract

Issue

Section

How to Cite

Similar Articles

Journal Info

Guidelines

Follow Us

Contact Us

Similar Articles

Enhancing Alzheimer’s disease Classification from MRI Scans Using Deep Learning techniques

Optimizing Software Quality: Integrating Test Case Prioritization, Defect Prediction, and Resource Allocation Strategies

Predict Covid-19 Pandemic Phases using Several Machine Learning Algorithms

A Comprehensive Review of Learning-Based Anomaly Detection Techniques in IoT Security Systems

Face Detecting and Recognizing using 3D local Binary Pattern

Early Detection of Chronic Kidney Disease (CKD)Using Machine Learning Algorithms

Fingerprint-Based Cryptographic Identity: A Custom Recognition Pipeline with Key Pair Generation

Article Sidebar

Main Article Content

Abstract

Article Details

Issue

Section

How to Cite

Similar Articles