PUBLICATIONS

 

Graphical models and machine learning approaches for computer vision and multimedia retrieval:

 

Dong-Qing Zhang and Shih-Fu Chang, "A Generative-Discriminative Hybrid Method for Multi-View Object Detection", in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2006 (PDF). Proofs and more details (PDF).

 

Dong-Qing Zhang and Shih-Fu Chang, "Statistical Part-Based Models: Theory and Applications in Image Similarity, Object Detection and Region Labeling", PhD Thesis, Graduate School of Arts and Sciences, Columbia University, 2005 (PDF).

 

Dong-Qing Zhang and Shih-Fu Chang, "Learning Random Attributed Relational Graph for Part-based Object Detection", submitted to conference (ADVENT Technical Report #212-2005-6) (PDF).

 

Dong-Qing Zhang and Shih-Fu Chang, "Detecting Image Near-Duplicate by Stochastic Attributed Relational Graph Matching with Learning", in Proceedings of the ACM Conference on Multimedia (ACM MM) 2004. (PS), (PDF). (Oral Presentation). Extended technical report referred to by the paper (PS/PDF).

 

Dong-Qing Zhang and Shih-Fu Chang, "Learning to Detect Scene Text Using a Higher-order MRF with Belief Propagation", IEEE Workshop on Learning in Computer Vision and Pattern Recognition, in conjunction with CVPR (LCVPR) 2004, Washington DC, June 2004. (PDF)

 

Dong-Qing Zhang, Ching-Yung Lin, Shih-Fu Chang and John R. Smith, "Semantic Video Clustering Across Sources Using Bipartite Spectral Clustering", in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) 2004, Taipei, Taiwan, June 2004. (PDF)

 

Visual text detection and recognition for visual indexing:

 

DongQing Zhang and Shih-Fu Chang, "A Bayesian Framework for Fusing Multiple Word Knowledge Models in Videotext Recognition", in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2003. (PDF) (Poster)

 

DongQing Zhang, Belle L. Tseng, Ching-Yung Lin and Shih-Fu Chang, "Accurate Overlay Text Extraction for Digital Video Analysis", in Proceedings of the IEEE International Conference on Information Technology: Research and Education (ITRE) 2003. (PDF)

Belle L. Tseng, Ching-Yung Lin and DongQing Zhang, "Improved Text Overlay Detection in Videos Using a Fusion-Based Classifier", in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) 2003.

B. Adams, A. Amir, C. Dorai, S. Ghosal, G. Iyengar, A. Jaimes, C.L. Lang, C.-Y. Lin, A. Natsev, M. Naphade, C. Neti, H.J. Nock, H.H. Permuter, R. Singh, J.R. Smith, S. Srinivasan, B.L. Tseng, T.V. Ashwin, D. Zhang, "IBM Research TREC-2002 Video Retrieval System", TREC Video Retrieval Track Workshop, Washington D.C., 2003.

DongQing Zhang and Shih-Fu Chang, "Event Detection in Baseball Video Using Superimposed Caption Recognition", in Proceedings of the ACM Conference on Multimedia (ACM MM), Juan-les-Pins, France, December 1-6, 2002. (PDF) (Poster)

DongQing Zhang and Shih-Fu Chang, "General and Domain-specific Techniques for Detecting and Recognizing Superimposed Text in Video", in Proceedings of the International Conference on Image Processing (ICIP), Rochester, New York, USA, 2002. (PDF)


Others:

Dongqing Zhang and Dan Ellis, "Detecting sound events in basketball video archive", Speech & Audio Processing class project report, 2001. (PDF)

Dongqing Zhang, Yan Guo, Jiankang Wu (Singapore), "Facial Expression Recognition using VQ-HMM", Proceedings of the 4th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2000/ISAS 2000), July 23-26, 2000, Orlando, Florida.

Dongqing Zhang, Yan Guo, Jing Zhong, Jiankang Wu, "Real-time Mouth Detection Using Corner Vector Detection and Parabola Models", Proceedings of the 5th Asian Conference on Computer Vision (ACCV 2000), January 8-11, 2000, Taipei, Taiwan.