|
Home page
Course outline
Matlab scripts
Problem sets
Projects
Columbia Courseworks
|
Course outline
Links take you to the slide pack for that lecture, as soon as it is
available. Currently, the links take you to the slide packs from last
year. I will be revising these to a greater or lesser extend through
the semester, and the actual slides to be used in the lecture will be
posted by the night before class.
| Lecture |
Date |
Topic |
Paper presentation |
| 1 |
2008-01-24 |
Course introduction: DSP review, Timescale modification
|
| 2 |
2008-01-31 |
Acoustics fundamentals:
Sound, waves, waveguides, resonance, energy transfer.
|
The Phase Vocoder, Flanagan & Golden, 1966 |
| 3 |
2008-02-07 |
Machine learning, classification, and generative models
|
Physical Modeling Using Digital Waveguides, Smith, 1993 |
| 4 |
2008-02-14 |
Auditory perception fundamentals:
the ear, auditory physiology, psychophysics, auditory scene analysis
|
A tutorial on hidden Markov models and selected applications in speech recognition, Rabiner, Proc. IEEE 1989 (Spencer) |
| 5 |
2008-02-21 |
Speech models and speech synthesis:
LPC, cepstrum, harmonic+noise
|
Chimaeric sounds reveal dichotomies in auditory perception, Smith, Delgutte, and Oxenham. Nature, 2002 (Haiyu) |
| 6 |
2008-02-28 |
Nonspeech:
Nonspeech and music signals, sinewave modeling
|
The IBM Expressive Speech Synthesis
System, Hamza, Bakis, Eide, Picheny, and Pitrelli. ICSLP 2004 |
| 7 |
2008-03-06 |
Compression:
Speech coding & high-quality audio compression
|
Estimating and interpreting the instantaneous frequency of a signal: I. Fundamentals, Boashash, Proc. IEEE 1992
|
| |
2008-03-13 |
Midterm Project proposals |
| |
2008-03-20 |
Spring break - no lecture |
| 8 |
2008-03-27 |
Spatial sound & rendering
|
A Tutorial on MPEG/Audio compression, Pan, 1995 (Felix) |
| 9 |
2008-04-03 |
Speech Recognition:
Features, Hidden Markov Models
|
Learning a Precedence Effect-Like Weighting Function for the Generalized Cross-Correlation Framework Wilson & Darrell TASLP, 2006 (Yiyin) |
| 10 |
2008-04-10 |
Sound mixtures & separation: CASA, ICA, and model-based separation
|
Weighted finite-state transducers in speech recognition, Mohri, Pereira, Riley. Computer Speech and Language, 2002 (Fadi) |
| 11 |
2008-04-17 |
Music analysis & recognition: Transcription, summarization, and similarity
|
Factorial models and refiltering for speech separation and denoising, Roweis, 2003 (David)
|
| 12 |
2008-04-24 |
Analysis of Everyday Sounds
:
Content-based retrieval of large-scale archives etc.
|
Non-negative Matrix Factorization for Polyphonic Music Transcription, Smaragdis & Brown, 2003 (Karl)
|
| |
2008-05-01 |
Project presentations |
Dan Ellis
<dpwe@ee.columbia.edu>
Last updated: Thu Apr 24 14:42:29 EDT 2008
|