

Seungyup Paek, John R. Smith. Detecting Image Purpose in World-Wide Web Documents. In IS&T/SPIE Symposium on Electronic Imaging: Science and Technology (EI'98), San Jose, CA, January 1998.


Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


The number of World-Wide Web (WWW) documents available to users of the Internet is growing at an incredible rate. It is therefore becoming increasingly important to develop systems that aid users in searching, filtering, and retrieving information from the Internet. Currently, only a few prototype systems catalog and index images in Web documents. To greatly improve the cataloging and indexing of images on the Web, we have developed a prototype rule-based system that detects the content images in Web documents. Content images are images associated with the main content of a Web document, as opposed to the multitude of other images that appear in Web documents for different purposes, such as decorative, advertisement, and logo images. We present a system that uses decision tree learning to induce the rules of the content image detection system automatically. The system uses visual features, text-related features, and the document context of images in concert for fast and effective content image detection in Web documents. We have evaluated the system on more than 1200 images collected from 4 different Web sites and have achieved an overall classification accuracy of 84%.
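As a rough illustration of the decision tree rule induction mentioned in the abstract, the following Python sketch induces threshold rules by information gain and reads them back as if/then rules. It is not the authors' system: the feature names (area, aspect, offsite_link), the toy training rows, and all function names are assumptions made up for this example, and the abstract does not say which features or learner the paper actually used.

```python
from collections import Counter
from math import log2

# Hypothetical toy data, invented for illustration only (not from the paper).
# area = width * height in pixels, aspect = width / height,
# offsite_link = 1 if the image links to another site.
TRAIN = [
    ({"area": 120000, "aspect": 1.3, "offsite_link": 0}, "content"),
    ({"area": 90000, "aspect": 0.8, "offsite_link": 0}, "content"),
    ({"area": 60000, "aspect": 1.5, "offsite_link": 0}, "content"),
    ({"area": 28080, "aspect": 7.8, "offsite_link": 1}, "other"),  # banner-like
    ({"area": 400, "aspect": 1.0, "offsite_link": 0}, "other"),    # bullet-like
    ({"area": 2000, "aspect": 20.0, "offsite_link": 0}, "other"),  # rule-like
    ({"area": 8000, "aspect": 2.0, "offsite_link": 0}, "other"),   # logo-like
]
FEATURES = ["area", "aspect", "offsite_link"]


def entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def best_split(rows, labels, features):
    """Return (gain, feature, threshold) with the highest information gain."""
    base = entropy(labels)
    best = None
    for f in features:
        values = sorted({r[f] for r in rows})
        # Candidate thresholds are midpoints between adjacent distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            remainder = (len(left) * entropy(left)
                         + len(right) * entropy(right)) / len(labels)
            gain = base - remainder
            if best is None or gain > best[0]:
                best = (gain, f, t)
    return best


def build_tree(rows, labels, features, depth=0, max_depth=3):
    """Grow a tree of (feature, threshold, left, right) tuples; leaves are labels."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or depth == max_depth:
        return majority
    split = best_split(rows, labels, features)
    if split is None or split[0] <= 0:
        return majority
    _, f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (
        f, t,
        build_tree([r for r, _ in left], [y for _, y in left],
                   features, depth + 1, max_depth),
        build_tree([r for r, _ in right], [y for _, y in right],
                   features, depth + 1, max_depth),
    )


def classify(tree, row):
    """Walk the tree from the root until a leaf label is reached."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree


def rules(tree, path=()):
    """Flatten the tree into (conditions, label) pairs, one per leaf."""
    if not isinstance(tree, tuple):
        return [(path, tree)]
    f, t, left, right = tree
    return (rules(left, path + (f"{f} <= {t}",))
            + rules(right, path + (f"{f} > {t}",)))


tree = build_tree([r for r, _ in TRAIN], [y for _, y in TRAIN], FEATURES)

if __name__ == "__main__":
    for conditions, label in rules(tree):
        print("IF " + " AND ".join(conditions) + " THEN " + label)
```

On this toy data a single size threshold already separates the two classes, so the induced rule set is trivial. A realistic setting would need many more labeled images and would draw on the text-related and document-context features that the abstract mentions alongside the visual ones.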


Seungyup Paek
John R. Smith

BibTeX Reference

   Author = {Paek, Seungyup and Smith, John R.},
   Title = {Detecting Image Purpose in World-Wide Web Documents},
   BookTitle = {IS&T/SPIE Symposium on Electronic Imaging: Science and Technology (EI'98)},
   Address = {San Jose, CA},
   Month = {January},
   Year = {1998}



