Ron Weiss

I am a postdoc working on music information retrieval with Juan Bello at MARL at NYU. Previously, I was a graduate research assistant working with Dan Ellis in the Laboratory for the Recognition and Organization of Speech and Audio (LabROSA). I defended my dissertation in May 2009 (watch me write it at about 50,000 * real-time here).

My research interests lie at the intersection of audio signal processing and machine learning. My dissertation research was devoted to model based source separation, but I also found time to do a bit of music signal analysis to create some wacky remixes on the side. I'm currently focusing more on music information retrieval. You can find more (outdated) information on my projects page.

You might also be interested in some of my freely available code, including assorted Python audio processing modules, and useful Matlab tools for functional programming, easier plotting, training GMMs/HMMs, and interfacing with HTK.

Teaching

I have taught/been a teaching assistant for:

Publications

[1] M. I. Mandel, R. J. Weiss, and D. P. W. Ellis. Model-Based Expectation-Maximization Source Separation and Localization. IEEE Transactions on Audio, Speech, and Language Processing, 18(2):382-394, February 2010. [ bib | DOI | .pdf ]
[2] R. J. Weiss and D. P. W. Ellis. Speech Separation Using Speaker-Adapted Eigenvoice Speech Models. Computer Speech and Language, 24(1):16-29, 2010. Speech Separation and Recognition Challenge. [ bib | DOI | .pdf ]
[3] R. J. Weiss and D. P. W. Ellis. A Variational EM Algorithm for Learning Eigenvoice Parameters in Mixed Signals. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 113-116, Taipei, Taiwan, April 2009. [ bib | DOI | poster | .pdf ]
[4] R. J. Weiss. Underdetermined Source Separation Using Speaker Subspace Models. PhD thesis, Department of Electrical Engineering, Columbia University, 2009. [ bib | slides | .pdf ]
[5] R. J. Weiss, M. I. Mandel, and D. P. W. Ellis. Source Separation Based on Binaural Cues and Source Model Constraints. In Proc. Interspeech, pages 419-422, Brisbane, Australia, September 2008. [ bib | http | poster | .pdf ]
[6] R. J. Weiss and T. Kristjansson. DySANA: Dynamic Speech and Noise Adaptation for Voice Activity Detection. In Proc. Interspeech, pages 127-130, Brisbane, Australia, September 2008. [ bib | http | poster | .pdf ]
[7] R. J. Weiss and D. P. W. Ellis. Monaural Speech Separation Using Source-Adapted Models. In Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pages 114-117, New Paltz, NY, October 2007. [ bib | DOI | web | slides | .pdf ]
[8] R. J. Weiss and D. P. W. Ellis. Estimating Single-Channel Source Separation Masks: Relevance Vector Machine Classifiers vs. Pitch-Based Masking. In Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA), pages 31-36, Pittsburgh, PA, September 2006. [ bib | http | slides | .pdf ]
[9] D. P. W. Ellis and R. J. Weiss. Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages V-957-960, Toulouse, France, May 2006. [ bib | DOI | .pdf ]

Contact information

Send email to ronw at ee.columbia.edu


Last updated on January 7, 2010.