xie05layered

Lexing Xie, Lyndon Kennedy, Shih-Fu Chang, Ajay Divakaran, Huifang Sun, Ching-Yung Lin. Layered Dynamic Mixture Model for Pattern Discovery in Asynchronous Multi-modal Streams. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, PA, March 2005.

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

We propose a layered dynamic mixture model for asynchronous multi-modal fusion for unsupervised pattern discovery in video. The lower layer of the model uses generative temporal structures such as a hierarchical hidden Markov model to convert the audio-visual streams into mid-level labels; it also models the correlations in text with probabilistic latent semantic analysis. The upper layer fuses the statistical evidence across diverse modalities with a flexible meta-mixture model that assumes loose temporal correspondence. Evaluation on a large news database shows that multi-modal clusters have better correspondence to news topics than audio-visual clusters alone; novel analysis techniques suggest that meaningful clusters occur when the prediction of salient features by the model concurs with those shown in the story clusters.

See also

[ xie04discover ]

Contact

Lexing Xie
Lyndon Kennedy
Shih-Fu Chang
Ching-Yung Lin

BibTeX Reference

@InProceedings{xie05layered,
   Author = {Xie, Lexing and Kennedy, Lyndon and Chang, Shih-Fu and Divakaran, Ajay and Sun, Huifang and Lin, Ching-Yung},
   Title = {Layered Dynamic Mixture Model for Pattern Discovery in Asynchronous Multi-modal Streams},
   BookTitle = {International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
   Address = {Philadelphia, PA},
   Month = {March},
   Year = {2005}
}

This document was translated automatically from BibTeX by bib2html (Copyright 2003 © Eric Marchand, INRIA, Vista Project).