Skip to content

Dataset Zoo

JiankangDeng edited this page Nov 1, 2018 · 30 revisions

Face Recognition Training Datasets

CASIA-Webface (10K ids/0.5M images) [1]

baidu

dropbox

CelebA (10K ids/0.2M images) [2]

UMDFace (8K ids/0.37M images) [3]

VGG2 (9K ids/3.31M images) [4]

baidu

dropbox

MS1M-IBUG (85K ids/3.8M images) [5,6]

baidu

dropbox

MS1M-ArcFace (85K ids/5.8M images) [5,7] (Recommend)

baidu

dropbox

Asian-Celeb (94K ids/2.8M images)[8] (Recommend)

baidu

dropbox

DeepGlint (181K ids/6.75M images) [8] (Recommend)

baidu

dropbox

IMDB-Face (59K ids/1.7M images) [9]

Celeb500k (500K ids/50M images) [10]

MegaFace (672K ids/4.7M images) [11]

Face Recognition Validation Datasets

CFP-FP (500 ids/7K images/7K pairs)[12]

AgeDB-30 (570 ids/12,240 images/6K pairs)[13,6]

LFW (5749 ids/13233 images/6K pairs)[14]

CALFW (5749 ids/13233 images/6K pairs)[15]

CPLFW (5749 ids/13233 images/6K pairs)[16]

Face Recognition Image Test Datasets

MegaFace

IJB (IJB-B, IJB-C)

TrillionPairs

Face Recognition Video Test Datasets

YTF

IQIYI

Reference

[1] Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li. Learning Face Representation from Scratch. arXiv:1411.7923, 2014.

[2] Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang. Deep Learning Face Attributes in the Wild, ICCV, 2015.

[3] Bansal Ankan, Nanduri Anirudh, Castillo Carlos D, Ranjan Rajeev, Chellappa, Rama. UMDFaces: An Annotated Face Dataset for Training Deep Networks, arXiv:1611.01484v2, 2016.

[4] Qiong Cao, Li Shen, Weidi Xie, Omkar M. Parkhi, Andrew Zisserman. VGGFace2: A dataset for recognising faces across pose and age. FG, 2018.

[5] Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao. Ms-celeb-1m: A dataset and benchmark for large-scale face recognition. ECCV, 2016.

[6] Jiankang Deng, Yuxiang Zhou, Stefanos Zafeiriou. Marginal loss for deep face recognition, CVPRW, 2017.

[7] Jiankang Deng, Jia Guo, Stefanos Zafeiriou. Arcface: Additive angular margin loss for deep face recognition, arXiv:1801.07698, 2018.

[8] http://trillionpairs.deepglint.com/

[9] Wang Fei, Chen Liren, Li Cheng, Huang Shiyao, Chen Yanjie, Qian Chen, Loy, Chen Change. The Devil of Face Recognition is in the Noise, ECCV, 2018.

[10] Cao Jiajiong, Li Yingming, Zhang Zhongfei, Celeb-500K: A Large Training Dataset for Face Recognition, ICIP, 2018.

[11] Nech Aaron, Kemelmacher-Shlizerman Ira, Level Playing Field For Million Scale Face Recognition, CVPR, 2017.

[12] Sengupta Soumyadip, Chen Jun-Cheng, Castillo Carlos, Patel Vishal M, Chellappa Rama, Jacobs David W, Frontal to profile face verification in the wild, WACV, 2016.

[13] Moschoglou, Stylianos and Papaioannou, Athanasios and Sagonas, Christos and Deng, Jiankang and Kotsia, Irene and Zafeiriou, Stefanos, Agedb: the first manually collected, in-the-wild age database, CVPRW, 2017.

[14] Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller. Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments, 2007.

[15] Zheng Tianyue, Deng Weihong, Hu Jiani, Cross-age lfw: A database for studying cross-age face recognition in unconstrained environments, arXiv:1708.08197, 2017.

[16] Zheng, Tianyue, and Weihong Deng. Cross-Pose LFW: A Database for Studying Cross-Pose Face Recognition in Unconstrained Environments, 2018.

Clone this wiki locally