[Notes] Listen, attend and spell: A neural network for large vocabulary conversational speech recognition

Click this link to the original paper. Main Point: This paper proposes an HMM-free speech recognition system consisting of an end-to-end model, which outputs the conditional probability of a character sequence given the acoustic signal, and a corresponding decoding mechanism that produces the recognition result. The end-to-end model can be divided into two parts, an encoder and a […]
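
To make "conditional probability of a character sequence given the acoustic signal" concrete, here is a minimal toy sketch. It assumes the usual chain-rule factorization for sequence models, where the sequence probability is the product of per-character conditionals P(y_i | x, y_<i). The function name `sequence_log_prob` and the hand-written distributions are hypothetical stand-ins for a decoder's outputs, not code from the paper.

```python
import math


def sequence_log_prob(step_distributions, target):
    """Return log P(target | x) as the sum of per-character log-probabilities.

    step_distributions[i] is assumed to be a dict mapping each candidate
    character to P(y_i = char | x, y_<i), as a decoder might emit at step i.
    """
    if len(step_distributions) != len(target):
        raise ValueError("need exactly one distribution per target character")
    total = 0.0
    for dist, ch in zip(step_distributions, target):
        # Chain rule: log of a product is the sum of the logs.
        total += math.log(dist[ch])
    return total


# Hypothetical per-step distributions for a two-character transcript.
toy_steps = [
    {"h": 0.9, "i": 0.1},  # P(y_1 | x)
    {"h": 0.2, "i": 0.8},  # P(y_2 | x, y_1 = "h")
]
log_p = sequence_log_prob(toy_steps, "hi")
```

Working in log space avoids numerical underflow on long transcripts; a decoding mechanism would then search for the character sequence with the highest such score.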