References for
EE E6850 Visual Information Systems, Fall 2002

Electronical copies of the following reference papers will be available through the linked files when possible. Otherwise, a physical copy of the papers will be placed in the class paper folder available at the Engineering Library reference desk.

Reference Books:

Michael S. Lew (Editor), Principles of Visual Information Retrieval, ISBN 1-85233-381-2, Springer-Verlag, 2001.

Guojun Lu, Multimedia Database Management Systems, ISBN 0-89006-342-7, 1999, Artech House, Inc.

Mark T. Maybury (Editor), Intelligent Multimedia Information Retrieval, ISBN 0-262-63179-2, AAAI Press/The MIT Press, 1997.

Ralf Steinmetz and Klara Nahrstedt, Multimedia: Computing, Communications and Applications, ISBN 0-13-324435-0, Prentice Hall, New Jersey, 1995.

Lecture 1:

Overview

S.-F. Chang, Q. Huang, T. Huang, A. Puri, and B. Shahraray, "Multimedia Search and retrieval," invited book chapter in Advances in Multimedia: Systems, Standards, and Networks, New York: Marcel Dekker, 1999. (pdf file)

S.-F. Chang and H. Sundaram, Structural and Semantic Analysis of Video, ICME 2000, New York, New York, July 28-Aug 2, 2000. (pdf file)

Example Systems (demos)

F. Mintzer, et al, “Populating the Hermitage Museum Web Site,” Communications of the ACM, August 2001, Vol. 44, No. 8, pp. 53-60.  (http://www.hermitagemuseum.org)
(pdf file)

S.-F. Chang, W. Chen, H.J. Meng, H. Sundaram, and D. Zhong, "VideoQ-An Automatic Content-Based Video Search System Using Visual Cues," ACM Multimedia Conference, Nov. 1997, Seattle, WA. also Columbia  University/CTR Technical Report, CTR-TR #478-97-12. (pdf file, demo: http://www.ctr.columbia.edu/videoq)

J. R. Smith and S.-F. Chang, "Visually Searching the Web for Content," IEEE Multimedia Magazine, Summer, Vol. 4 No. 3, pp.12-20, 1997. (also Columbia U. CU/CTR Technical Report #459-96-25). (pdf file)

J. R. Smith and S.-F. Chang, An Image and Video Search Engine for the World-Wide Web, Proceedings, IS&T/SPIE Symposium on Electronic Imaging: Science and Technology (EI'97) - Storage and Retrieval for Image and Video Databases V, San Jose, CA, February 1997.
(pdf file)
(demo http://www.ctr.columbia.edu/webseek)

Lecture 2:

Compression Standards and others

D. Le Gall, "MPEG: A Video Compression Standard for Multimedia Applications," Communications of ACM, April 1991, Vol 34, No. 4, pp. 46-58.  (a nice introduction to MPEG-1)
(pdf file)

Atul Puri and Tsuhan Chen (eds.), Multimedia Systems, Standards, and Networks, ISBN 0-8247-9303-X, Marcel Dekker Inc. 2000. (this book includes a collection of chapters about ITU-T H.32x, H.263, MPEG-4 and MPEG-7.)
(a chapter about H.263+ is included in the library reserve folder)

Barry Haskell, Atul Puri, and Arun Netravali, Digital Video: An Introduction to MPEG-2, ISBN 0-412-08411-2, 1997, Chapman and Hall.

Lecture #3:

Multimedia Metadata Standard

Digital Still Camera Image File Format Standard (Exchangeable image file format for Digital Still Cameras: Exif) - Version 2.1
http://www.exif.org

DIG35 Image Metadata Standard
http://www.i3a.org/i_dig35.html

S.-F. Chang, T. Sikora and A. Puri, "Overview of the MPEG-7 Standard," IEEE Transactions on Circuits and Systems for Video Technology, special issue on MPEG-7, June 2001. (pdf file)

Introduction to MPEG-7 (v2), Document: ISO/IEC JTC1/SC29/WG11 N3751. Oct. 2000.
(pdf file)

MPEG-21 Overview, July 2001, Document: ISO/IEC JTC1/SC29/WG11 N4318. (pdf file)
 

Lecture #4:

Syntactic Video Analysis: Shot Segmentation

U. Gargi, R. Kasturi, and S. H. Strayer, “Performance Characterization of Video-Shot-Change Detection Methods”, IEEE Transactions on Circuits and System for Video Technology,Vol 10, No. 1 Feb 2000.
(pdf file)

J.S. Boreczky and L.A. Rowe, " A Comparison of Video Shot Boundary Detection Techniques," Journal of Electronic Imaging, 5(2), April, 1996, pp. 122-128. Also
appeared in Storage & Retrieval for Image and Video Databases IV,, I.K. Sethi, and R.C.
Jain, Editors, Proc. SPIE 2670, pp. 170-179 (1996). (pdf file)

Hong.Jiang Zhang, Atreyi Kankanhalli, Stephen W. Smoliar, "Automatic Partitioning of Full-Motion Video," ACM Multimedia Systems Journal, 1993, pp.10-28. (pdf file)

D. Zhong and S.-F. Chang, Video Shot Detection Combining Multiple Visual Features, Columbia University ADVENT Technical Report #092, Dec. 27th 2000. (pdf file)

Irena Koprinska , Sergio Carrato, "Temporal Video Segmentation: A Survey,"
Institute for Information Technologies, Department of Electrical Engineering, Acad. G. Bonchev Str., Bl. 29A, 1113 Sofia, Bulgaria, contact e-mail: irena@iinf.bas.bg. (ps file available at NEC siteseer)
(here is a converted pdf file)
 

Lecture #5:

Syntactic Video Analysis: Keyframe Seclection and Summarization

D. Zhong, H. Zhang and S.-F. Chang, “Clustering Methods for Video Browsing and Annotation”, IS&T/SPIE Symposium on Storage and Retrieval for Image and Video Database, San Jose, February 1996.
(pdf file)

H.J. Zhang, C.Y. Low, S.W. Smoliar and J.H. Wu,"Video Parsing, Retrieval and Browsing: An integrated and content-based solution. In Proc. of the ACM Multimedia Conference, pages 15--24, 1995.
(link) (pdf file)

Special Issue on Visual Information Management, Communications of the ACM,  December 1997, Vol. 40, No.12. (link) (pdf file)

M. Christel, A. Hauptmann, A. Warmack and S. Crosby, "Adjustable Filmstrips and Skims as Abstractions for a Digital Video Library," IEEE Advances in Digital Libraries Conference, Baltimore, MD, May 1999.
(pdf file)

Yeo, B.-L., and Yeung, M.M. Retrieving and Visualizing Video, Communications of ACM, 40, 12 (Dec. 1997), pp. 43-52. (link) (local link)
 

Lecture #6 #7 #8  Content-Based Image Retrieval

J. R. Smith and S.-F. Chang, "VisualSEEk: a Fully Automated Content-Based Image Query System, Proceedings", ACM Multimedia '96 Conference, Boston, MA, November 1996. (pdf file)

M. Flickher, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani, J. Hafner, D. Lee, D. Petkovicand D. Steele, and P. Yanker. Query by image and video content: The qbic system. In IEEE Computer, volume 38, pages 23-31, 1995.

Christos Faloutsos, Ron Barber, Myron Flickner, Wayne Niblack, Dragutin Petkovic, and William Equitz. Efficient and effective querying by image content. J. of Intelligent Information Systems, 3(3/4):231-- 262, July 1994. (QBIC System) (pdf file)

S.-F. Chang and J. R. Smith, "Extracting Multi-Dimensional Signal Features for Content-Based Visual Query", Proceedings, SPIE Symposium on Visual Communications and Image Processing (VCIP'95), May 1995. (pdf file)

Sikora, T., "The MPEG-7 visual standard for content description-an overview," IEEE Transactions on Circuits and Systems for Video Technology, Volume: 11 Issue: 6 , Page(s): 696 -702, June 2001. (pdf file) (local link)

Manjunath, B.S.; Ohm, J.-R.; Vasudevan, V.V.; Yamada, A., "Color and texture descriptors," IEEE Transactions on Circuits and Systems for Video Technology, Volume: 11 Issue: 6 , Page(s): 703 -715, June 2001. (pdf file) (local link)

Bober, M., "MPEG-7 visual shape descriptors," IEEE Transactions on Circuits and Systems for Video Technology, Volume: 11 Issue: 6 , Page(s): 716 -719, June 2001. (pdf file) (local link)

Jeannin, S.; Divakaran, A., "MPEG-7 visual motion descriptors,"  IEEE Transactions on Circuits and Systems for Video Technology, Volume: 11 Issue: 6 , Page(s): 720 -724, June 2001. (pdf file) (local link)

M. Swain and D. Ballard, "Color Indexing," International Journal of Computer Vision, &:1, pp. 11--32, 1991.

C. Faloutsos and K.-I Lin, "FastMap: a Fast Algorithm for Indexing, Data-Mining and Visualization of Traditional and Multimedia Datasets," Proc. of ACM-SIGMOD, pp. 163-174, San Jose, CA, May, 1995.

J. Hafner, H. S. Sawhney, W. Equitz, M. Flickner and W. Niblack ,"Efficient Color Histogram Indexing for Quadratic Form Distance Functions", IEEE Trans. PAMI, July, 1995.

A. Guttman, "R-Trees: A Dynamic Index Structure for Spatial Seraching", ACM SIGMOD, June 1984, pp. 47-57.

M. Christel, A. Hauptmann, A. Warmack and S. Crosby, "Adjustable Filmstrips and Skims as Abstractions for a Digital Video Library," IEEE Advances in Digital Libraries Conference, Baltimore, MD, May 1999.

W. Chen and Shih-Fu Chang, VISMAP: An Interactive Image/Video Retrieval System Using Visualization and Concept Maps, IEEE International Conference on Image Processing, Greece, Oct. 2001.

Michael G. Christel, "Visual digests for news video libraries," ACM Multimedia 1999, Nov. Orlando, FL.

Rui, Y., Huang, T. S., Ortega, M., , Mehrotra, S., "Relevance feedback: A power tool in interactive content-based image retrieval,"  IEEE Trans. on Circuits and Systems for Video Technology 8(5) (Sep. 1998):644—655.

Buckley C. and Salton G., “Optimization of Relevance Feedback Weights,” ACM SIGIR 1995.

Yossi Rubner, Carlo Tomasi, and Leonidas J. Guibas, “A Metric for Distributions with Applications to Image Databases,” ICCV, Bombay, India, 1998. (Earth Mover's Distance Metric)

Smith, J.R.: Image retrieval evaluation. In: IEEE Workshop on Content-based Access of Image and Video Libraries, Santa Barbara, California (1998)
 

Lecture #10 and #11 Multimedia Security and Watermarking

H. Yu, D. Kundur, and C.-Y. Lin, “Spies, Thieves, and Lies: The Battle for
Multimedia in the Digital Era,” IEEE Multimedia, Vol.8, No. 3, July 2001.

I. J. Cox, J. Kilian, F. T. Leighton and T. Shamoon, “Secure Spread Spectrum
Watermarking for Multimedia,” IEEE Trans. on Image Processing, Vol. 6, No.
12, Dec. 1997.

G. W. Braudaway, K. A. Magerlein, and F. Mintzer, “Protecting Publicly
Available Images with a Visible Image Watermark,” SPIE Optical Security
and Counterfeit Deterrence Techniques, Vol. 2659, Jan. 1996.

C.-Y. Lin and S.-F. Chang, “Semi-Fragile Watermarking for Authenticating
JPEG Visual Content,” SPIE Security and Watermarking in Multimedia
Contents II, Vol. 3971, Jan. 2000.

F. A. P. Petitcolas, R. J. Anderson, and M. G. Kuhn, “Information Hiding: A
Survey,” Proceedings of the IEEE, Vol. 87, No. 7, July 1999.

J. A. Bloom, I. J. Cox, T. Kalker, J.-P. Linnartz, M. L. Miller and B. Traw,
“Copy Protection for DVD Video,” Proceedings of the IEEE, Vol. 87, No. 7,
July 1999.

Resources Links
Multimedia Authentication Resources:
http://www.ctr.columbia.edu/~cylin/auth

Watermarking mailing list:
http://www.watermarkingworld.org

Information Hiding and Digital Watermarking:
http://www.cl.cam.ac.uk/~fapp2/steganography
 

Lecture #12 Visualization and Abstraction, Content Adaptation for Ubiquitous Media Access

W. Chen and Shih-Fu Chang, VISMAP: An Interactive Image/Video Retrieval System Using Visualization and Concept Maps, IEEE International Conference on Image Processing, Greece, Oct. 2001.

A. Fox, E. Brewer, S. Gribble, and E. Amir, “Adapting to Network and Client Variability via On-Demand Dynamic Distillation,” ASPLOS, 1996.

J. R. Smith, R. Mohan, and C.-S. Li, “Scalable Multimedia Delivery for Pervasive Computing,” ACM Multimedia Conference, Orlando, FL, Oct. 1999.

T.-L. Pham, G. Schneider and S. Goose,  A situated computing framework for mobile and ubiquitous multimedia access using small screen and composite devices,“ ACM Multimedia Conference, Oct. 2000, Los Angeles, CA.

S.-F. Chang, D. Zhong, and R. Kumar, Real-Time Content-Based Adaptive Streaming of Sports Video, Columbia University ADVENT Technical Report #121, July 2001. Also IEEE Workshop on Content-Based Access to Video/Image Library, Hawaii, Dec. 2001.

H. Sundaram and S.-F Chang, Condensing Computable Scenes using Visual Complexity and Film Syntax Analysis, IEEE Conference on Multimedia and Exhibition, Tokyo, Japan, Aug. 22-25, 2001.

Lecture #13  Digital Rights Management

W3C Workshop on Digital Rights Management for the Web, Jan. 2001, INRIA - Sophia-Antipolis, France, http://www.w3.org/2000/12/drm-ws/

Ulrich Kohl, Jeffrey Lotspiech, and Marc A. Kaplan, "Safeguarding Digital Library Contents and Users- Protecting Documents Rather Than Channels,“ D-Lib Magazine, September 1997. http://www.dlib.org/dlib/

Renato Iannella, "Digital Rights Management (DRM) Architectures," D-Lib Magazine June 2001 Volume 7 Number 6. http://www.dlib.org/dlib/

Thomas C. Rindfleisch, “Privacy, information technology, and health care,” Communications of the ACM, Volume 40 ,  Issue 8  (August 1997).

(the following are some old references)

Image Query: Systems

J. R. Bach, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, R. Humphrey, R.C. Jain and C. Shu, "Virage image search engine: an open framework for image management", Symposium on Electronic Imaging: Science and Technology -- Storage & Retrieval for Image and Video Databases IV, IS&T/SPIE, Feb. 1996.

M. Flickner, et al, "Query by Image and Video Content: The QBIC System," IEEE Computer Magazine, Sep. 1995, Vol.28, No.9, pp. 23-32.

J. R. Smith and S.-F. Chang, "VisualSEEk: A Fully Automated Content-Based Image Query System," ACM Multimedia Conference, Boston, MA, Nov. 1996.
(demo http://www.ctr.columbia.edu/VisualSEEk)
(ftp://ftp.ctr.columbia.edu/CTR-Research/advent/public/papers/96/smith96f.ps)

A. Pentland, R.W. Picard, and S. Sclaroff, "Photobook Content-Based Manipulation of Image Databases," MIT Media Lab Perceptual Computing, TR No. 255, also Intern. J. of Computer Vision, 18(3), pp. 233-254 (1996).

D. A. Forsyth, J. Malik, M. M. Fleck, H. Greenspan, T. Leung, S. Belongie, C. Carson, C. Bregler, "Finding Pictures of Objects in Large Collections of Images," U. C. Berkeley, Dept. of EECS, CS Division, Technical Report, CSD-96-905, June 1996.

UCSB DL

CMU Informedia

IBM Satellite Image DL
http://maya.ctr.columbia.edu:8080/

Other References:

R. Mehrotra and J. E. Gary, "Similar-Shape Retrieval in Shape Data Management," IEEE Computer Magazine, Sep. 1995, Vol.28, No.9, pp. 57-62.

J.R. Bach, S. Paul, and R. Jain, "A Visual Information Management System for the Interactive Retrieval of Faces," IEEE Trans. on Knowledge and Data Engineering, Vol. 5, No. 4, Aug. 1993, pp. 619-628.

R.W. Picard and T.P. Minka, "Vision Texture for Annotation," Journal of Multimedia Systems, Vol. 3, pp. 3-14, 1995. (also MIT Media Lab. Perceptual Computing Section Technical Report, No. 302.)

H. Tamura, S. Mori, and T. Yamawaki,"Textual features corresponding to visual perception," I.E.E.E. Transactions on Systems, Man, and Cybernetics, vol. SMC-8, No. 6 1978.
 

Video Indexing

H.S. Sawhney, S. Ayer, and M. Gorkani, "Model-Based 2D and 3D Dominant Motion Estimation for Mosaicking and Video Representation," Proc. Fifth Int'l Conf. Computer Vision, 1995.

S. W. Smoliar and H. Zhang, "Content-Based Video Indexing and Retrieval", IEEE Multimedia Magazine, Summer, 1994.

B. L. Yeo and B. Liu", Rapid Scene Analysis on Compressed Videos", IEEE Transactions on Circuits and Systems for Video Technology, Dec. 1995.
 

Fast Indexing

A. Guttman, "R-Trees: A Dynamic Index Structure for Spatial Seraching", ACM SIGMOD, June 1984, pp. 47-57.

C. Faloutsos and K.-I Lin, "FastMap: a Fast Algorithm for Indexing, Data-Mining and Visualization of Traditional and Multimedia Datasets," Proc. of ACM-SIGMOD, pp. 163-174, San Jose, CA, May, 1995.

J. Hafner, H. S. Sawhney, W. Equitz, M. Flickner and W. Niblack ,"Efficient Color Histogram Indexing for Quadratic Form Distance Functions", IEEE Trans. PAMI, July, 1995.
 

Spatial Indexing and Queries

S.K. Chang, Q.Y. Shi, and C.W. Yan, "Iconic Indexing by 2D Strings," IEEE Trans. on PAMI, 9(3):413-428, 1987.

V. Gudivada and V. Raghavan, "Design and Evaluation of Algorithms for Image Retrieval by Spatial Similarity," ACM Trans. on Information Systems, April 1995.

E. G.M. Petrakis, C. Faloutsos, "Similarity Searching in Large Image Databases," Tech. Report CS-TR-3388, Dept. of Computer Science, Univ. of Maryland, December 1994.
 

Manipulation and Editing

J. Meng and S.-F. Chang, "CVEPS: A Compressed Video Editing and Parsing System," ACM Multimedia Conference, Boston, MA, Nov. 1996.

Thomas D.C. Little and Arif Ghafoor, "Spatio-Temporal Composition of Distributed Multimedia Objects for Value-Added Networks," IEEE Computer Magazine, pp. 42-50, Oct. 1991.

B. Smith, Rivle?
 

Security and Authentication

Gary L. Friedman, "The Trustworthy Digital Camera: Restoring Credibility to the Photographic Image," IEEE Trans. on Consumer Electronics, Vol. 39, No. 4, Nov. 1993.

Ingemar J. Cox, Joe Kilian, Tom Leighton, Talal Shamoon, "Secure Spread Spectrum Watermarking For Multimedia," NEC Research Institute Technical Report 95-10.

Jian Zhao, Eckhard Koch, "Embedding Robust Labels into Images for Copyright Protection," Proc. Intern. Congress on Intellectual Property Rights for Specialized Information, Knowledge, and New Technologies, Vienna, Austria, Aug. 95

G. W. Braudaway, K. A. Magerlein, and F. Mintzer , "Protecting Publicly-Available Images with a Visible Image Watermark," IBM Research Division, Technical Report, RC 20336 (89918) 1/15/96.
 

Video Storage

D.J. Gemmell, H.M. Vin, D.D. Kandlur, P.V. Rangan, "Multimedia Storage Servers: A Tutorial," IEEE Computer, Vol. 28, No. 5, pp. 40-51, May 1995.

H. Vin and P. Rangan, "Designing a Multi-User HDTV Storage Server," IEEE Journal on Selected Areas in Communications, Vol. 11, No. 1, Jan 1993.

S. Ghandeharizadeh and C. Shahabi, "On Multimedia Repositories Personal Computers, and Hierarchical Storage Systems," ACM 2nd Multimedia Conference, San Francisco, CA, Oct, 1994.

S. Paek, P. Bocheck, and S.-F. Chang, "Scalable MPEG2 Video Servers with Heterogeneous QoS on Parallel Disk Arrays," IEEE Workshop on Network and Operating System Support for Digital Audio and Video, `95, Durham, New Hampshire, April 1995.

E. Chang and A. Zakhor, "Admissions Control and Data Placement for VBR Video Servers," Proc. IEEE Intern. Conf. on Image Processing , Austin, TX, Sep. 1994.

Asit Dan, Dan Dias, Rajat Mukherjee, Christos Polyzois, Dinkar Sitaram, Renu Tewari, "Buffering and Caching in Large-Scale Video Servers," IEEE CompCom '95.

Dan, A., D. Sitaram, and P. Shahabuddin, "Scheduling Policies for an On-Demand Video Server with Batching," 2nd Annual ACM Multimedia Conference and Exposition, San Francisco, CA, October, 1994.

S. Berson, L. Golubchik, R.R. Muntz, "Fault Tolerant Design of Multimedia Servers," ACM SIGMOD 1995.
 

WWW Applications

J. R. Smith and S.-F. Chang, "An Image and Video Search Engine for the World-Wide Web," IS&T/SPIE Symposium on Electronic Imaging: Science and Technology - Storage & Retrieval for Image and Video Databases V, San Jose, CA, February 1997.
http://www.ctr.columbia.edu/webseek

C. Frankel, M. Swain, and V. Athitsos, "WebSeer: An Image Search Engine for the World Wide Web", University of Chicago Department of Computer Science Technical Report TR-96-14, July 31, 1996.
http://webseer.cs.uchicago.edu/

Interpix. Image Surfer. http://www.interpix.com/