Jump to : Download | Abstract | Contact | BibTex reference | EndNote reference |


Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun. Feature Selection for Unsupervised Discovery of Statistical Temporal Structures in Video. In IEEE International Conference on Image Processing (ICIP), Barcelona, Spain, September 2003.

Download [help]

Download paper: Adobe portable document (pdf)

Copyright notice:This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


We present algorithms for automatic feature selection for unsupervised structure discovery from video sequences. Feature selection in this scenario is hard because of the absence of class labels to evaluate against, and the temporal correlation among samples that prevents the direct estimation of posterior probabilities of the cluster given the sequence. The overall problem of structure discovery is formulated as simultaneously finding the statistical descriptions of structure and locating segments that matches the descriptions. Under Markov assumptions among events, structures in the video are modelled with hierarchical hidden Markov models, with efficient algorithms to jointly learn the model parameters and the optimal model complexity. Feature selection iterates between a wrapper step that partitions the large feature pool into consistent subsets, and a filter step that eliminate redundancy within these subsets, respectively. The feature subsets are then ranked according to the normalized Bayesian Information criteria, and the learning results from these ranked subsets can be evaluated and interpreted by a human observer. Results on soccer and baseball videos show that the automatically selected feature set coincides with those selected with domain knowledge and intuition, while achieving a correspondence comparable to that of supervised learning against manually labelled ground truth


Lexing Xie
Shih-Fu Chang

BibTex Reference

   Author = {Xie, Lexing and Chang, Shih-Fu and Divakaran, Ajay and Sun, Huifang},
   Title = {Feature Selection for Unsupervised Discovery of Statistical Temporal Structures in Video},
   BookTitle = {IEEE International Conference on Image Processing (ICIP)},
   Address = {Barcelona, Spain},
   Month = {September},
   Year = {2003}

EndNote Reference [help]

Get EndNote Reference (.ref)


For problems or questions regarding this web site contact The Web Master.

This document was translated automatically from BibTEX by bib2html (Copyright 2003 © Eric Marchand, INRIA, Vista Project).