Columbia Photographic Images and Photorealistic Computer Graphics Dataset Tian-Tsong Ng, Shih-Fu Chang, Jessie Hsu, Martin Pepeljugoski¤ fttng,sfchang,yfhsug@ee.columbia.edu, [email protected] Department of Electrical Engineering Columbia University ADVENT Technical Report #205-2004-5 Feb 2005 Abstract Passive-blind image authentication is a new area of research.  A suitable dataset for experimentation and comparison of new techniques is important for the progress of the new research area.  In response to the need for a new dataset, the Columbia Photographic Images and Photorealistic Computer Graphics Dataset is made open for the passive-blind image authentication research community. The dataset is composed of four component image sets, i.e., the Photorealistic Com- puter Graphics  Set, the Personal Photographic Image  Set, the Google Image  Set, and the Recaptured Computer Graphics  Set. This dataset, available from http://www.ee.columbia.edu/trustfoto, will be for those who work on the photographic images versus photorealistic com- puter graphics classification problem, which is a subproblem of the passive-blind image authentication research.  In this report, we de- scribe the design and the implementation of the dataset. The report will also serve as a user guide for the dataset. 1   Introduction Digital watermarking [1] has been an active area of research since a decade ago. Various fragile [2, 3, 4, 5] or semi-fragile watermarking algorithms [6, 7, 8, 9] has been proposed for the image content authentication and the detection of image tampering.  In addition, authentication signature [10, ¤This work was done when Martin spent his summer in our research group 1