Here we introduce a web image dataset created by Lab for Media Search in National University of Singapore. The dataset includes: (1) 269,648 images and the associated tags from Flickr, with a total number of 5,018 unique tags; (2) six types of low-level features extracted from these images, including 64-D color histogram, 144-D color correlogram, 73-D edge direction histogram, 128-D wavelet texture, 225-D block-wise color moments and 500-D bag of words based on SIFT descriptions; and (3) ground-truth for 81 concepts that can be used for evaluation. Based on this dataset, we identify several research issues on web image annotation and retrieval. We also provide the baseline results for web image annotation by learning from the tags using the traditional k-NN algorithm. The benchmark results show that it is possible to learn models from these data to help general image retrieval.
The dataset, including low-level features, groundtruth, tags, concept list, image list and original urls, can be obtained through following links. The exif and geo-info for the images from a fraction of the dataset are available here. For more detailed descriptions of the dataset, see specification below.
We also provide (1) a light version of NUS-WIDE dataset, named NUS-WIDE-LITE, (2) one object image dataset, named NUS-WIDE-OBJECT and (3) one scene image dataset, named NUS-WIDE-SCENE. The specifications of these three subsets are detailed in the paper.
For any question regarding the dataset, please contact Mr. Xindi Shang (shangxin AT comp.nus.edu.sg). The raw images can be obtained via sending a request email to us. Specifically, the researchers interested in the dataset should download and fill up the Agreement and Disclaimer Form and send it back to us. We will then email you the instructions to download the images at our discretion.
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yan-Tao Zheng. "NUS-WIDE: A Real-World Web Image Database from National University of Singapore", ACM International Conference on Image and Video Retrieval. Greece. Jul. 8-10, 2009. [pdf] [Bibitex entry]
Each row in every low-level feature file corresponds to the image in the corresponding row of Imagelist.
We separate the dataset into two parts, the first part contains 161,789 images for training and the second part contains 107,859 images for evaluation. So each low-level feature file is separated into two files which include 161,789 and 107,859 rows of features respectively.
AllLabels : this fold includes 81 files corresponding to 81 concepts respectively. For a certain concept, every row in the file corresponds to the groundtruth of the image in the corresponding row of Imagelist.txt. TrainTestLabels : every groundtruth file is separated into two parts according to the separation of the dataset. Thus this fold includes 162 files corresponding to the training and testing groundtruth of the 81 concepts.
AllTags81 : this file includes the labels extracted from the tags associated to the images for the 81 concepts. Each column represents a concept and each row includes the 81 labels extracted from the associated tags for the corresponding images. AllTags81(i, j) = 1 means the associated tags of the i-th image include the j-th concept in Concepts81.txt. Otherwise, AllTags81(i, j) = 0 means the associated tags of the i-th image does not include the j-th concept in Concepts81.txt.
Train_Tags81, Test_Tags81 : the file AllTags81 is separated into two parts according to the separation of the dataset. AllTags1k : this file includes the labels extracted from the tags associated to the images for the top 1,000 concepts in the Fianl_Tag_List.
Train_Tags1k, Test_Tags1k : the file AllTags1k is separated into two parts according to the separation of the dataset. AllTags : each row includes the raw tags crawled from www.flickr.com for the image in the corresponding row of Imagelist.txt. Fianl_Tag_List : the tag list includes 5,018 tags extracted from the associated tags of the dataset. The tags are sorted according to their frequencies.
Concepts81 : this file includes the 81 concepts in alphabetical order.
Imagelist : the list of raw images extracted from www.flickr.com
TrainImagelist : the list of images for training.
TestImagelist : the list of images for testing.