Jump to : Download | Abstract | Contact | BibTex reference | EndNote reference |

jiang:icassp2009

Wei Jiang, Lexing Xie, Shih-Fu Chang. Visual saliency with side information. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2009.

Download [help]

Download paper: Adobe portable document (pdf)

Copyright notice:This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

We propose novel algorithms for organizing large image and video datasets using both the visual content and the associated sideinformation, such as time, location, authorship, and so on. Earlier research have used side-information as pre-filter before visual analysis is performed, and we design a machine learning algorithm to model the join statistics of the content and the side information. Our algorithm, Diverse-Density Contextual Clustering (D2C2), starts by finding unique patterns for each sub-collection sharing the same side-info, e.g., scenes from winter. It then finds the common patterns that are shared among all subsets, e.g., persistent scenes across all seasons. These unique and common prototypes are found with Multiple Instance Learning and subsequent clustering steps. We evaluate D2C2 on two web photo collections from Flickr and one news video collection from TRECVID. Results show that not only the visual patterns found by D2C2 are intuitively salient across different seasons, locations and events, classifiers constructed from the unique and common patterns also outperform state-of-the-art bag-of-features classifiers

Contact

Wei Jiang
Lexing Xie
Shih-Fu Chang

BibTex Reference

@InProceedings{jiang:icassp2009,
   Author = {Jiang, Wei and Xie, Lexing and Chang, Shih-Fu},
   Title = {Visual saliency with side information},
   BookTitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
   Month = {April},
   Year = {2009}
}

EndNote Reference [help]

Get EndNote Reference (.ref)

 
bar

For problems or questions regarding this web site contact The Web Master.

This document was translated automatically from BibTEX by bib2html (Copyright 2003 © Eric Marchand, INRIA, Vista Project).