I try to design with tensorflow a model to recognize speaker with audio wav files. I use VoxCeleb dataset.
I read lots of paper on it but I don't achieve to design a proper model. I just get 30% accuracy on my dataset.
I just use MFCC algorithm to extract patterns of my sound signal. I...