A novel method that applies deep learning to lip reading.
Scenario | Epoch | CER | WER | BLEU |
---|---|---|---|---|
Unseen speakers [C] | N/A | N/A | N/A | N/A |
Unseen speakers | N/A | N/A | 0.226 | N/A |
Overlapped speakers [C] | N/A | N/A | N/A | N/A |
Overlapped speakers | N/A | N/A | N/A | N/A |
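
The CER and WER columns are edit-distance error rates at the character and word level. The sketch below shows how such metrics are commonly computed, assuming the usual Levenshtein definition normalized by reference length. The function names (`edit_distance`, `wer`, `cer`) are illustrative and not taken from this repository, whose evaluation code may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (insert/delete/substitute cost 1)."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

For example, `wer("bin blue at f two now", "bin red at f two now")` counts one substituted word out of six reference words.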
- PyTorch with nn.CTCLoss (0.5 development builds, or the 1.0 release and later)
Modify options.toml, then run:
python main.py options.toml
This model uses the GRID corpus (http://spandh.dcs.shef.ac.uk/gridcorpus/). Please extract the mouth region using dlib or other tools.
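
One possible way to do the mouth extraction with dlib is sketched below. It assumes dlib's standard 68-point landmark model, in which points 48 to 67 outline the mouth. The helper names (`mouth_bbox`, `extract_mouth`), the `pad` margin, and the `predictor_path` argument are hypothetical and not part of this repository. The crop size and any resizing expected by the model are not specified here, so adjust them to match your options.toml.

```python
MOUTH_LANDMARKS = range(48, 68)  # mouth points in dlib's 68-point landmark model


def mouth_bbox(points, image_width, image_height, pad=10):
    """Return (left, top, right, bottom) around points, padded and clipped to the image."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    left = max(min(xs) - pad, 0)
    top = max(min(ys) - pad, 0)
    right = min(max(xs) + pad, image_width)
    bottom = min(max(ys) + pad, image_height)
    return left, top, right, bottom


def extract_mouth(image_path, predictor_path, pad=10):
    """Crop the mouth region of the first detected face, or return None if no face is found."""
    import dlib  # imported lazily so mouth_bbox works without dlib installed

    image = dlib.load_rgb_image(image_path)  # H x W x 3 numpy array
    detector = dlib.get_frontal_face_detector()
    predictor = dlib.shape_predictor(predictor_path)  # path to a 68-point landmark model file

    faces = detector(image, 1)  # upsample once to help with small faces
    if len(faces) == 0:
        return None

    shape = predictor(image, faces[0])
    points = [(shape.part(i).x, shape.part(i).y) for i in MOUTH_LANDMARKS]
    height, width = image.shape[:2]
    left, top, right, bottom = mouth_bbox(points, width, height, pad)
    return image[top:bottom, left:right]
```

Keeping the bounding-box arithmetic in its own function makes it easy to check without a landmark model, and the same box can be reused across the frames of a video if a fixed crop is preferred.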