Hari Sundaram, Shih-Fu Chang. Constrained Utility Maximization for generating Visual Skims. In IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'2001), Kauai, HI, December 2001.
Download paper: Adobe portable document (pdf)
Copyright notice:This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.
In this paper, we present a novel algorithm to generate visual skims, that do not contain audio, from computable scenes. Visual skims are useful for browsing digital libraries, and for on-demand summaries in set-top boxes. A computable scene is a chunk of data that exhibits consistencies with respect to chromaticity, lighting and sound. First, we define visual complexity of a shot to be its Kolmogorov complexity. Then, we conduct experiments that help us map the complexity of a shot into the minimum time required for its comprehension. Second, we analyze the grammar of the film language, since it makes the shot sequence meaningful. We achieve a target skim time by minimizing a sequence utility function. It is subject to shot duration constraints, and penalty functions based on sequence rhythm, and information loss. This helps us determine individual shot durations as well as the shots to drop. Our user studies show good results on skims with compression rates up to 80%
@InProceedings{dvmmPub208,
Author = {Sundaram, Hari and Chang, Shih-Fu},
Title = {Constrained Utility Maximization for generating Visual Skims},
BookTitle = {IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'2001)},
Address = {Kauai, HI},
Month = {December},
Year = {2001}
}
Get EndNote Reference (.ref)
For problems or questions regarding this web site contact The
Web Master.
This document was translated automatically from BibTEX by bib2html (Copyright 2003 © Eric Marchand, INRIA, Vista Project).