Digital Speech Recognition


Digital Signal Processing Project.

In this project, we implemented speech recognition for spoken digits. We chose MFCC (mel-frequency cepstral coefficients) as our feature extractor and used naive Bayes, random forest, KNN, decision tree, and SVM classifiers.
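As a rough illustration of the pipeline described above, the following is a minimal NumPy-only sketch of MFCC extraction followed by a KNN vote. It is not the project's actual code: the function names (`mfcc`, `mel_filterbank`, `knn_predict`) and all parameters (25 ms frames, 10 ms step, 2048-point FFT, 26 mel filters, 13 coefficients, pre-emphasis of 0.97, `k=3`) are assumptions chosen for illustration.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising slope
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling slope
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def mfcc(signal, sample_rate=44100, frame_len=0.025, frame_step=0.010,
         n_fft=2048, n_filters=26, n_coeffs=13):
    """Return an array of shape (n_frames, n_coeffs). All defaults are assumed values."""
    signal = np.asarray(signal, dtype=np.float64)
    # Pre-emphasis boosts high frequencies.
    emphasized = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    flen = int(round(frame_len * sample_rate))
    fstep = int(round(frame_step * sample_rate))
    if len(emphasized) < flen:
        emphasized = np.pad(emphasized, (0, flen - len(emphasized)))
    n_frames = 1 + (len(emphasized) - flen) // fstep
    idx = np.arange(flen)[None, :] + fstep * np.arange(n_frames)[:, None]
    frames = emphasized[idx] * np.hamming(flen)
    # Power spectrum of each frame (rfft zero-pads frames up to n_fft).
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(np.maximum(energies, 1e-10))
    # Unnormalized DCT-II over the filter axis, keeping the first n_coeffs terms.
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n_filters)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n_filters))
    return log_energies @ basis.T

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k nearest training vectors (Euclidean distance)."""
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = train_y[np.argsort(dists)[:k]]
    values, counts = np.unique(nearest, return_counts=True)
    return values[np.argmax(counts)]
```

One simple way to use this, again as an assumption rather than the project's method, is to average the MFCC frames of each utterance into a single fixed-length vector (`mfcc(samples).mean(axis=0)`) and pass those vectors to `knn_predict` with the digit labels.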

Experiment Result

Confusion Matrix of Offline Test

Confusion Matrix of Online Test in Real World

Details

  • Voice Sample Rate: 44.1kHz
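Because features depend on the sample rate, it can help to verify that each recording really is 44.1 kHz before extracting features. The sketch below uses the standard-library `wave` module listed under Environment; the helper name `load_wav` and the assumption of 16-bit PCM input are illustrative and not taken from the project.

```python
import wave
import numpy as np

EXPECTED_RATE = 44100  # 44.1 kHz, as stated above

def load_wav(path, expected_rate=EXPECTED_RATE):
    """Read a 16-bit PCM WAV file and return mono samples scaled to [-1, 1)."""
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        if rate != expected_rate:
            raise ValueError(f"expected {expected_rate} Hz, got {rate} Hz")
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch assumes 16-bit samples")
        n_channels = wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    # WAV PCM data is little-endian signed 16-bit.
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float64) / 32768.0
    if n_channels > 1:
        # Average interleaved channels down to mono.
        samples = samples.reshape(-1, n_channels).mean(axis=1)
    return samples
```

Recordings made at another rate would need resampling first; this helper simply rejects them.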

Environment:

  • Python 3
  • NumPy
  • Wave
  • PyAudio
  • Matplotlib
  • Keras 2.1
  • TensorBoard

Tool Box:

  • Run