The Topic links take you to the slides for each lecture.
We aim to post the slides for a lecture by 10pm the night
before that lecture. The PDF links in the Readings
column take you to PDF versions of all required readings (that is,
if no PDF version is available for a paper, the paper is
not required reading).
Key for sources of readings:
[Holmes]: Speech Synthesis and Recognition, J. Holmes, W. Holmes.
[R+S]: Theory and Applications of Digital Speech Processing,
Rabiner, Schafer.
[R+J]: Fundamentals of Speech Recognition, Rabiner, Juang.
[J+M]: Speech and Language Processing, Jurafsky, Martin, 2nd ed.
[Jelinek]: Statistical Methods for Speech Recognition, Jelinek.
[HAH]: Spoken Language Processing, Huang, Acero, Hon.
Required: MFCC: [HAH] Sec. 6.5.2; LPC: [R+S] Sec. 9.2-9.2.2; PLP: [Gold+Morgan] Sec. 22.1-22.2; DTW: [Holmes] Sec. 8.6-8.7. Optional: DTW: [R+J] p. 200-226, [Sakoe+Chiba] paper. (PDF)
Required: HMM's: [Rabiner] A tutorial on HMM's, [Poritz] HMM's: A Guided Tour, [Holmes] p. 133-158, [HAH] p. 385-396, p. 441-443, [Duda+Hart+Stork] p. 128-138. (PDF)
Optional: class n-grams: [Brown] paper; grammatical LM's: [Chelba] paper; topic LM's: [Seymore] paper; maximum entropy and triggers: [Rosenfeld] paper; everything and a bag of chips: [Goodman] paper; Model M: [Chen] paper; neural network LM's: [Bengio et al.] paper. (PDF)
Required: [HAH] p. 107-109, p. 444-451 (MAP and MLLR); [HAH] p. 522-525 (CMR), p. 528-529 (retraining); [Leggetter+Woodland] paper, [Gales] paper (MLLR); [Gauvain+Lee] paper (MAP). (PDF)
Required: [Duda+Hart+Stork] p. 114-124 (LDA); [Povey+Woodland] paper (MMI); [Povey+Woodland] paper (MPE); [Mangu+Brill+Stolcke] paper (consensus decoding); [Fiscus] paper (ROVER system combination). (PDF)