Skip to content

Latest commit

 

History

History

data_preparation

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

Dataset Preparation

1. synImageNet291

Overview

Synthesized paired images in 291 classes.

Description

This dataset is synthesized for the training of our content encoder, containing paired data. The details of the dataset generation:

We use the official BigGAN-deep-128 model on TF Hub to generate correlated images associated by random latent codes for each of the 291 domains including dogs, wild animals, birds and vehicles. Their class indexes in the original ImageNet 1000 classes are 7∼20, 22, 23, 80∼102, 104, 105, 106, 127, 128, 129, 131∼145, 151∼299, 330∼378, 380, 382∼388, 407, 436, 468, 511, 555, 586, 609, 627, 654, 656, 675, 717,734, 751, 757, 779, 803, 817, 829, 847, 856, 864, 866, 867, 874. We apply truncation trick to the latent codes, and obtain 3K images with a truncation threshold of 0.5 and 3K images with a truncation threshold of 1.0. After filtering low quality ones, we finally obtain 655 images per domain that are linked across all domains, 600 of which are used for training.

Download

synImageNet291 can be downloaded from Google Drive or Baidu Cloud (access code: cvpr)

Content

Unzip the downloaded synImageNet291.zip, which is in the following folder structure:

synImageNet291
|--Train
   |--7_bird
      <600 training images in the class 7 of ImageNet>
   |--8_bird
      <600 training images in the class 8 of ImageNet>
   ...
   |--874_vehicle
      <600 training images in the class 8874 of ImageNet>   
|--Test
   |--7_bird
      <55 testing images in the class 7 of ImageNet>   
   ...
   
synImageNet291_mask
|--Train
   |--7_bird
      <600 instance segmentation maps of the correponding images in synImageNet291>
   ...
|--Test
   |--7_bird
      <55 instance segmentation maps of the correponding images in synImageNet291>   
   ...

Images with the same filename are generated from the same latent code. For example, synImageNet291/Train/7_bird/0.jpg, synImageNet291/Train/158_dog/0.jpg and synImageNet291/Train/271_otheranimal/0.jpg are paired images, containing three kinds of animals sharing very similar pose.

2. ImageNet291

Description

This dataset is synthesized for the training of our content encoder. It also serves as the training and testing data for the tasks of Dog2Bird and Bird2Car in our paper. The details of the dataset generation:

For each domain X, we first calculate the mean style feature S_X of the images in X from synImageNet291. The style feature is defined as the channel-wise mean of the conv5_2 feature of pre-trained VGG. Then, we apply HTC to detect and crop the object regions in the domain X of ImageNet. Small objects are filtered. The remaining images are ranked based on the similarity between their style features and S_X . We finally select the top 800 images to eliminate outliers, with 600 images for training and 50 images for testing.

Download

For instance segmentation maps of the images in ImageNet291, they can be downloaded from Google Drive or Baidu Cloud (access code: cvpr). The link also provides the instance segmentation maps of the face images in CelebHA in the ImageNet291_mask/train/1001_face/ and ImageNet291_mask/test/1001_face/

For the images of ImageNet291, we provide the script to build ImageNet291 from the original ImageNet training set.

python generate_ImageNet291.py
  • Download CelebA-HQ and put the 29000 training images and 1000 testing images to ImageNet291/train/1001_face/ and ImageNet291/test/1001_face/, respectively.

Content

The folder structure of ImageNet291 is the same as that of synImageNet291.