Welcome to Cross-Age LFW (CALFW) database, a renovation of Labeled Faces in the Wild (LFW), the de facto standard testbed for unconstraint face verification.
Labeled Faces in the Wild (LFW) database has been widely utilized as the benchmark of unconstrained face verification and due to big data driven machine learning methods, the performance on the database approaches nearly 100%. However, we argue that this accuracy may be too optimistic. Besides different poses, illuminations, occlusions and expressions, cross-age face is another challenge in face recognition yet LFW does not pay much attention on it. Thereby we construct a Cross-Age LFW (CALFW) which deliberately searches and selects 3,000 positive face pairs with age gaps to add aging process intra-class variance. Negative pairs with same gender and race are also selected to reduce the influence of attribute difference between positive/negative pairs. We evaluate several metric learning and deep learning methods on the new database. Compared to the accuracy on LFW, the accuracy drops about 10\%-17\% on CALFW. There are three motivations behind the construction of CALFW benchmark as follows:
1.Establishing a relatively more difficult database to evaluate the performance of real world face verification so the effectiveness of several face verification methods can be fully justified.
2.CALFW emphasizes age gap of positive pairs to further enlarge intra-class variance and still considers other intra-class variations. Also, negative pairs are deliberately selected to avoid different gender or race. CALFW considers both the large intra-class variance and the tiny inter-class variance simultaneously.
3.Maintaining the data size, the face verification protocol which provides a 'same/different' benchmark and the same identities in LFW, so one can easily apply CALFW to evaluate the performance of face verification.
Compared to the positive pairs in LFW, the age gaps of positive pairs in CALFW is larger. This shows we successfully add aging process to intra-class variations. Also, in LFW, age gaps of most positive pairs are less than 10 years while that of most negative pairs are larger than 10 years, in CALFW, there is no clear boundary to distinguish the two kinds of pairs, so age gap can not be a big influence on face verification in CALFW.
CALFW is collected by crowdsourcing efforts to seek the pictures of people in LFW with age gap as large as possible on the Internet. Compared to LFW, the positive pairs in CALFW contain obvious age difference.
Compared to LFW, the negative pairs in CALFW have same gender and race, which reduces the influence of attribute difference between positive pairs and negative pairs in face verification.
We dedicate to maintain the protocols, dataset size, and the identities in each fold of LFW database in order to encourage fair and meaningful comparisons. You can find more information about standard LFW protocol in Labeled Faces in the Wild (LFW).
We expect CALFW could promote algorithms to make reliable verification judgement, and close the large gap between the reported performance on benchmarks and performance on real world tasks.
We evaluate several metric learning and deep learning methods on the new CALFW database and their accuracy drops about 10% to 17% compared to the corresponding LFW performance.
Verification accuracy(%) under image-unrestricted setting using labeled outside data.
Verification accuracy(%) under image-restricted setting with no outside data using LBP feature.
You can download all the resources in CALFW. There are three folds in the website link. Feature fold contains the LBP feature we use in baseline results. Images fold contains all the images (similarity_align.rar contains images aligned using similarity transform and images&landmarks.rar contains images which are not alighed and landmarks) txts fold contains all the files including identity list, image name list and 6,000 pairs list. Please cite as Reference.