[1]
|
S. Hershey, S. Chaudhuri, D. P. W. Ellis, J. F. Gemmeke, A. Jansen, R. C.
Moore, M. Plakal, D. Platt, R. A. Saurous, B. Seybold, M. Slaney, R. J.
Weiss, and K. Wilson.
CNN Architectures for Large-Scale Audio Classification.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), New Orleans, USA, March 2017.
[ bib |
web |
.pdf ]
|
[2]
|
T. N. Sainath, R. J. Weiss, K. W. Wilson, B. Li, A. Narayanan, E. Variani,
M. Bacchiani, I. Shafran, A. Senior, K. W. Chin, A. Misra, and C. Kim.
Multichannel Signal Processing with Deep Neural Networks for
Automatic Speech Recognition.
IEEE Transactions on Audio, Speech, and Language Processing,
February 2017.
[ bib |
DOI |
.pdf ]
|
[3]
|
T. N. Sainath, R. J. Weiss, K. W. Wilson, B. Li, A. Narayanan, E. Variani,
M. Bacchiani, I. Shafran, A. Senior, K. W. Chin, A. Misra, and C. Kim.
Raw Multichannel Processing Using Deep Neural Networks.
In New Era for Robust Speech Recognition: Exploiting Deep
Learning. Springer, 2017.
to appear.
[ bib |
.pdf ]
|
[4]
|
B. Li, T. N. Sainath, R. J. Weiss, K. W. Wilson, and M. Bacchiani.
Neural Network Adaptive Beamforming for Robust Multichannel
Speech Recognition.
In Proc. Interspeech, San Francisco, USA, September 2016.
[ bib |
.pdf ]
|
[5]
|
T. N. Sainath, A. Narayanan, R. J. Weiss, E. Variani, K. W. Wilson,
M. Bacchiani, and I. Shafran.
Reducing the Computational Complexity of Multimicrophone
Acoustic Models with Integrated Feature Extraction.
In Proc. Interspeech, San Francisco, USA, September 2016.
[ bib |
.pdf ]
|
[6]
|
T. N. Sainath, R. J. Weiss, K. W. Wilson, A. Narayanan, and M. Bacchiani.
Factored Spatial and Spectral Multichannel Raw Waveform
CLDNNs.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), Shanghai, China, March 2016.
[ bib |
.pdf ]
|
[7]
|
T. N. Sainath, R. J. Weiss, K. W. Wilson, A. Narayanan, M. Bacchiani, and
A. Senior.
Speaker Location and Microphone Spacing Invariant Acoustic
Modeling from Raw Multichannel Waveforms.
In Proc. IEEE Automatic Speech Recognition and Understanding
Workshop (ASRU), Scottsdale, USA, December 2015.
[ bib |
.pdf ]
|
[8]
|
T. N. Sainath, R. J. Weiss, A. Senior, K. W. Wilson, and O. Vinyals.
Learning the Speech Front-End with Raw Waveform CLDNNs.
In Proc. Interspeech, Dresden, Germany, September 2015.
[ bib |
.pdf ]
|
[9]
|
Y. Hoshen, R. J. Weiss, and K. W. Wilson.
Speech Acoustic Modeling from Raw Multichannel Waveforms.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), Brisbane, Australia, April 2015.
[ bib |
.pdf ]
|
[10]
|
J. Weston, R. Weiss, and H. Yee.
Affinity Weighted Embedding.
In Proc. International Conference on Machine Learning (ICML),
pages 1215-1223, Beijing, China, June 2014.
[ bib |
http |
.pdf ]
|
[11]
|
J. Weston, R. J. Weiss, and H. Yee.
Nonlinear Latent Factorization by Embedding Multiple User
Interests.
In Proc. ACM Conference on Recommender Systems (RecSys),
pages 65-68, Hong Kong, October 2013.
[ bib |
DOI |
.pdf ]
|
[12]
|
J. Weston, H. Yee, and R. J. Weiss.
Learning to Rank Recommendations with the k-order Statistic
Loss.
In Proc. ACM Conference on Recommender Systems (RecSys),
pages 245-248, Hong Kong, October 2013.
[ bib |
DOI |
.pdf ]
|
[13]
|
J. Weston, R. Weiss, and H. Yee.
Affinity Weighted Embedding.
In International Conference on Learning Representations
(ICLR), Scottsdale, USA, May 2013.
[ bib |
http |
.pdf ]
|
[14]
|
J. Weston, C. Wang, R. Weiss, and A. Berenzweig.
Latent Collaborative Retrieval.
In Proc. International Conference on Machine Learning (ICML),
Edinburgh, Scotland, June 2012.
[ bib |
http |
.pdf ]
|
[15]
|
R. J. Weiss and J. P. Bello.
Unsupervised Discovery of Temporal Structure in Music.
IEEE Journal of Selected Topics in Signal Processing,
5(6):1240-1251, October 2011.
[ bib |
DOI |
.pdf ]
|
[16]
|
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel,
M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos,
D. Cournapeau, M. Brucher, M. Perrot, and É. Duchesnay.
scikit-learn: Machine Learning in Python.
Journal of Machine Learning Research, 12:2825-2830, October
2011.
[ bib |
http |
.pdf ]
|
[17]
|
R. J. Weiss, M. I. Mandel, and D. P. W. Ellis.
Combining Localization Cues and Source Model Constraints for
Binaural Source Separation.
Speech Communication, 53(5):606-621, May 2011.
Special issue on Perceptual and Statistical Audition.
[ bib |
DOI |
.pdf ]
|
[18]
|
T. Bertin-Mahieux, G. Grindlay, R. J. Weiss, and D. P. W. Ellis.
Evaluating Music Sequence Models Through Missing Data.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), pages 177-180, Prague, Czech Republic,
May 2011.
[ bib |
DOI |
.pdf ]
|
[19]
|
R. J. Weiss and J. P. Bello.
Identifying Repeated Patterns in Music Using Sparse Convolutive
Non-Negative Matrix Factorization.
In Proc. International Society for Music Information Retrieval
Conference (ISMIR), pages 123-128, Utrecht, Netherlands, August 2010.
Best Paper Award.
[ bib |
web |
slides |
.pdf ]
|
[20]
|
T. Bertin-Mahieux, R. J. Weiss, and D. P. W. Ellis.
Clustering Beat-Chroma Patterns in a Large Music Database.
In Proc. International Society for Music Information Retrieval
Conference (ISMIR), pages 111-116, Utrecht, Netherlands, August 2010.
[ bib |
web |
.pdf ]
|
[21]
|
T. Cho, R. J. Weiss, and J. P. Bello.
Exploring Common Variations in State of the Art Chord
Recognition Systems.
In Proc. Sound and Music Computing Conference (SMC), pages
1-8, Barcelona, Spain, July 2010.
[ bib |
.pdf ]
|
[22]
|
M. I. Mandel, R. J. Weiss, and D. P. W. Ellis.
Model-Based Expectation-Maximization Source Separation and
Localization.
IEEE Transactions on Audio, Speech, and Language Processing,
18(2):382-394, February 2010.
[ bib |
DOI |
web |
.pdf ]
|
[23]
|
R. J. Weiss and D. P. W. Ellis.
Speech Separation Using Speaker-Adapted Eigenvoice Speech
Models.
Computer Speech and Language, 24(1):16-29, January 2010.
Speech Separation and Recognition Challenge.
[ bib |
DOI |
.pdf ]
|
[24]
|
R. J. Weiss and D. P. W. Ellis.
A Variational EM Algorithm for Learning Eigenvoice Parameters
in Mixed Signals.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), pages 113-116, Taipei, Taiwan, April
2009.
[ bib |
DOI |
poster |
.pdf ]
|
[25]
|
R. J. Weiss.
Underdetermined Source Separation Using Speaker Subspace
Models.
PhD thesis, Department of Electrical Engineering, Columbia
University, 2009.
[ bib |
slides |
.pdf ]
|
[26]
|
R. J. Weiss, M. I. Mandel, and D. P. W. Ellis.
Source Separation Based on Binaural Cues and Source Model
Constraints.
In Proc. Interspeech, pages 419-422, Brisbane, Australia,
September 2008.
[ bib |
http |
poster |
.pdf ]
|
[27]
|
R. J. Weiss and T. Kristjansson.
DySANA: Dynamic Speech and Noise Adaptation for Voice
Activity Detection.
In Proc. Interspeech, pages 127-130, Brisbane, Australia,
September 2008.
[ bib |
http |
poster |
.pdf ]
|
[28]
|
R. J. Weiss and D. P. W. Ellis.
Monaural Speech Separation Using Source-Adapted Models.
In Proc. IEEE Workshop on Applications of Signal Processing to
Audio and Acoustics (WASPAA), pages 114-117, New Paltz, USA, October
2007.
[ bib |
DOI |
web |
slides |
.pdf ]
|
[29]
|
R. J. Weiss and D. P. W. Ellis.
Estimating Single-Channel Source Separation Masks: Relevance
Vector Machine Classifiers vs. Pitch-Based Masking.
In Proc. ISCA Tutorial and Research Workshop on Statistical
Perceptual Audition (SAPA), pages 31-36, Pittsburgh, USA, September 2006.
[ bib |
http |
slides |
.pdf ]
|
[30]
|
D. P. W. Ellis and R. J. Weiss.
Model-Based Monaural Source Separation Using a Vector-Quantized
Phase-Vocoder Representation.
In Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP), pages V-957-960, Toulouse, France, May
2006.
[ bib |
DOI |
.pdf ]
|