Department of Electrical Engineering - Columbia University

[SEAS logo]

ELEN E6820 - Spring 2008

SPEECH AND AUDIO PROCESSING AND RECOGNITION

Home page

Course outline

Matlab scripts

Problem sets

Projects

Columbia Courseworks

Course outline

Links take you to the slide pack for that lecture, as soon as it is available. Currently, the links take you to the slide packs from last year. I will be revising these to a greater or lesser extend through the semester, and the actual slides to be used in the lecture will be posted by the night before class.

Lecture Date Topic Paper presentation
1 2008-01-24 Course introduction: DSP review, Timescale modification
2 2008-01-31 Acoustics fundamentals: Sound, waves, waveguides, resonance, energy transfer. The Phase Vocoder, Flanagan & Golden, 1966
3 2008-02-07 Machine learning, classification, and generative models Physical Modeling Using Digital Waveguides, Smith, 1993
4 2008-02-14 Auditory perception fundamentals: the ear, auditory physiology, psychophysics, auditory scene analysis A tutorial on hidden Markov models and selected applications in speech recognition, Rabiner, Proc. IEEE 1989 (Spencer)
5 2008-02-21 Speech models and speech synthesis: LPC, cepstrum, harmonic+noise Chimaeric sounds reveal dichotomies in auditory perception, Smith, Delgutte, and Oxenham. Nature, 2002 (Haiyu)
6 2008-02-28 Nonspeech: Nonspeech and music signals, sinewave modeling The IBM Expressive Speech Synthesis System, Hamza, Bakis, Eide, Picheny, and Pitrelli. ICSLP 2004
7 2008-03-06 Compression: Speech coding & high-quality audio compression Estimating and interpreting the instantaneous frequency of a signal: I. Fundamentals, Boashash, Proc. IEEE 1992
  2008-03-13 Midterm Project proposals
  2008-03-20 Spring break - no lecture
8 2008-03-27 Spatial sound & rendering A Tutorial on MPEG/Audio compression, Pan, 1995 (Felix)
9 2008-04-03 Speech Recognition: Features, Hidden Markov Models Learning a Precedence Effect-Like Weighting Function for the Generalized Cross-Correlation Framework Wilson & Darrell TASLP, 2006 (Yiyin)
10 2008-04-10 Sound mixtures & separation: CASA, ICA, and model-based separation Weighted finite-state transducers in speech recognition, Mohri, Pereira, Riley. Computer Speech and Language, 2002 (Fadi)
11 2008-04-17 Music analysis & recognition: Transcription, summarization, and similarity Factorial models and refiltering for speech separation and denoising, Roweis, 2003 (David)
12 2008-04-24 Analysis of Everyday Sounds : Content-based retrieval of large-scale archives etc. Non-negative Matrix Factorization for Polyphonic Music Transcription, Smaragdis & Brown, 2003 (Karl)
  2008-05-01 Project presentations


Valid HTML 4.01! Dan Ellis <dpwe@ee.columbia.edu>
Last updated: Thu Apr 24 14:42:29 EDT 2008