Victorian400 Dataset for Colorizing Victorian Illustrations | by Hoyeol Kim
Dataset: Victorian400 (Kaggle) (521mb)
Here is example code for loading the Victorian400 dataset:
_URL = URL = '~/Victorian400.tar.gz'
path_to_zip = tf.keras.utils.get_file('Victorian400.tar.gz', origin=_URL, extract=True)
path1 = os.path.join(os.path.dirname(path_to_zip), 'Victorian400/')
This dataset was originally created to colorize Charles Dickens’ illustrations. Due to a lack of data, however, the scope of this dataset has been expanded to all nineteenth century illustrations. Colorful images from the nineteenth century were selected to be used in the dataset, which were converted to black and white images for training data.
This dataset contains three different folders: original, resized, and gray. Images in the ‘original’ folder were curated for significant features, such as faces and bodies. In the 'resized' folder, the original images were resized to fit the 256*256 dimensions needed for the process of deep learning. Based on the resized images, I created the 'gray' folder, which contains black and white images converted from the colorful resized images.
The Victorian400 dataset was tested with pix2pix developed by Isola et al.
Please use the following reference to cite the dataset:
@article{Victorian400,
title = {Victorian400: Colorizing Victorian Illustrations},
author = {Kim, Hoyeol},
journal = {International Journal of Humanities and Arts Computing},
publisher = {Edinburgh University Press},
volume = {15},
number = {1-2},
year = {2021},
pages = {186-202},
doi = {10.3366/ijhac.2021.0269},
}
or
Kim, Hoyeol, "Victorian400: Colorizing Victorian Illustrations." International Journal of Humanities and Arts Computing, vol. 15, no. 1-2, Oct. 2021, pp. 186-202. doi: 10.3366/ijhac.2021.0269.
In recognition of my work on the Victorian400 dataset, I received a $563 Project Development Grant from the Center of Digital Humanities Research, as well as Graduate Merit funding for $400 by the Department of English, Texas A&M University.