Hi d94mandal, have a look at this Stanford seminar on speech recognition: https://www.youtube.com/watch?v=RBgfLvAOrss
The DeepSpeech code implementation might also help: https://github.com/mozilla/DeepSpeech
Hello, feel free to check this review of speech recognition systems:
https://www.researchgate.net/publication/306010331_Speech_Recognition_System_-_A_Review
M.R