%0 Conference Proceedings
%F hsu05visual
%A Hsu, Winston
%A Chang, Shih-Fu
%T Visual Cue Cluster Construction via Information Bottleneck 			Principle and Kernel Density Estimation
%B International Conference on Content-Based Image and Video Retrieval (CIVR)
%C Singapore
%X 			Recent research in video analysis has shown a promising direction, 			in which mid-level features (e.g., people, anchor, indoor) are 				abstracted from low-level features (e.g., color, texture, motion, 						etc.) and used for discriminative classification of semantic 				labels. However, in most systems, such mid-level features are 				selected manually. In this paper, we propose an 				information-theoretic framework, visual cue cluster construction 				(VC3), to automatically discover adequate mid-level features. The 				problem is posed as mutual information maximization, through which 				optimal cue clusters are discovered to preserve the highest 				information about the semantic labels. We extend the Information 				Bottleneck framework to high-dimensional continuous features and 				further propose a projection method to map each video into 				probabilistic memberships over all the cue clusters. The biggest 				advantage of the proposed approach is to remove the dependence on 				the manual process in choosing the mid-level features and the huge 				labor cost involved in annotating the training corpus for training 				the detector of each mid-level feature. The proposed VC3 framework 				is general and effective, leading to exciting potential in solving 				other problems of semantic video analysis. When tested in news 				video story segmentation, the proposed approach achieves promising 				performance gain over representations derived from conventional 				clustering techniques and even the mid-level features selected 				manually. 		
%U http://www.ee.columbia.edu/dvmm/publications/05/hsu05visual.pdf
%D 2005