Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui. Audio-Visual Atoms for Generic Video Concept Classification. ACM Transactions on Multimedia Computing, Communications and Applications, 6(3):1-19, 2010.

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


We investigate the challenging issue of joint audio-visual analysis of generic videos targeting concept detection. We extract a novel local representation, Audio-Visual Atom (AVA), which is defined as a region track associated with regional visual features and audio onset features. We develop a hierarchical algorithm to extract visual atoms from generic videos, and locate energy onsets from the corresponding soundtrack by time-frequency analysis. Audio atoms are extracted around energy onsets. Visual and audio atoms form AVAs, based on which discriminative audio-visual codebooks are constructed for concept detection. Experiments over Kodak's consumer benchmark videos confirm the effectiveness of our approach.


Wei Jiang
Shih-Fu Chang

BibTeX Reference

   Author = {Jiang, Wei and Cotton, Courtenay and Chang, Shih-Fu and Ellis, Dan and Loui, Alexander C.},
   Title = {Audio-Visual Atoms for Generic Video Concept Classification},
   Journal = {ACM Transactions on Multimedia Computing, Communications and Applications},
   Volume = {6},
   Number = {3},
   Pages = {1--19},
   Publisher = {ACM},
   Address = {New York, NY, USA},
   Year = {2010}
