Jump to : Download | Abstract | Contact | BibTex reference | EndNote reference |


Yu-Gang Jiang, Zhenguo Li, Shih-Fu Chang. Modeling Scene and Object Contexts for Human Action Retrieval with Few Examples. IEEE Transactions on Circuits and Systems for Video Technology, 21:674-681, May 2011.

Download [help]

Download paper: Adobe portable document (pdf)

Copyright notice:This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


The use of context knowledge is critical for understanding human actions, which typically occur under particular scene settings with certain object interactions. For instance, "driving car" usually happens outdoors, and "kissing" involves two people moving towards each other. In this paper, we investigate the problem of context modeling for human action retrieval. We first identify ten simple object-level action atoms relevant to many human actions, e.g., "people getting closer". With the action atoms and several background scene classes, we show that action retrieval can be improved through modeling action-scene-object dependency. An algorithm inspired by the popular semi-supervised learning paradigm is introduced for this purpose. One important contribution of this work is to show that modeling the dependencies among actions, objects, and scenes can be efficiently achieved with very few examples. Such a solution has tremendous potential in practice as it is often expensive to acquire large sets of training data. Experiments were performed on the challenging Hollywood2 dataset containing 89 movies. The results validate the effectiveness of our approach, achieving a mean average precision of 26% with just 10 examples per action


Yu-Gang Jiang
Zhenguo Li
Shih-Fu Chang

BibTex Reference

   Author = {Jiang, Yu-Gang and Li, Zhenguo and Chang, Shih-Fu},
   Title = {Modeling Scene and Object Contexts for Human Action Retrieval with Few Examples},
   Journal = {IEEE Transactions on Circuits and Systems for Video Technology},
   Volume = {21},
   Pages = {674--681},
   Month = {May},
   Year = {2011}

EndNote Reference [help]

Get EndNote Reference (.ref)


For problems or questions regarding this web site contact The Web Master.

This document was translated automatically from BibTEX by bib2html (Copyright 2003 © Eric Marchand, INRIA, Vista Project).