

Hualu Wang, Ajay Divakaran, Anthony Vetro, Shih-Fu Chang, Huifang Sun. Survey of Compressed-Domain Features Used in Audio-Visual Indexing and Analysis. Journal of Visual Communication and Image Representation, 14(2):150-183, June 2003.


Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


Abstract

In this paper, we attempt to provide a comprehensive, high-level review of audio-visual features that can be extracted from standard compressed domains, such as MPEG-1 and MPEG-2. The paper is motivated by the large body of active research on the extraction and application of compressed-domain features in fields such as indexing, filtering, and manipulation. Compressed-domain approaches avoid the expensive computation and memory requirements of decoding and/or re-encoding. Selected features are categorized into four groups: spatial visual (e.g., color, texture, edge, shape), motion (e.g., motion field, trajectory), audio (e.g., energy, spectral features, pitch), and coding (e.g., bit rate, frame/block type). For each feature, we briefly discuss the extraction methods, computational complexity, potential effectiveness in applications, and possible limitations caused by compressed-domain approaches. Finally, we discuss the possibility of extracting some important MPEG-7 visual and audio descriptors directly from the compressed domain.


Contact

Shih-Fu Chang

BibTeX Reference

   Author = {Wang, Hualu and Divakaran, Ajay and Vetro, Anthony and Chang, Shih-Fu and Sun, Huifang},
   Title = {Survey of Compressed-Domain Features Used in Audio-Visual Indexing and Analysis},
   Journal = {Journal of Visual Communication and Image Representation},
   Volume = {14},
   Number = {2},
   Pages = {150--183},
   Month = {June},
   Year = {2003}


