Labeled Faces in the Wild Home

Welcome to Labeled Faces in the Wild, a database of face photographs
designed for studying the problem of unconstrained face recognition.
The dataset contains more than 13,000 images of faces collected from
the web. Each face has been labeled with the name of the person
pictured. 1680 of the people pictured have two or more distinct photos
in the data set. The only constraint on these faces is that they were
detected by the Viola-Jones face detector. More details can be found
in the technical report below.
Related: Faces in Real-Life Images workshop at the European Conference on Computer Vision, run by Erik Learned-Miller, Andras Ferencz, and Frederic Jurie.
last updated: 2010/04/11 14:17 ESTRelated: Faces in Real-Life Images workshop at the European Conference on Computer Vision, run by Erik Learned-Miller, Andras Ferencz, and Frederic Jurie.
change log
- Mailing list:
- If you wish to receive announcements regarding any changes made to the LFW database, please send email to majordomo@cs.umass.edu with the message body: "subscribe lfw" on a single line.
- Alphabetically by first name:
[A)[Alf)[Ang)[B)[Bin)[C)[Che)[Col)[D)[Daw)[Don)[E)[Eri)[F)[G)[Goe)[H) [I)[J)[Jav)[Jes)[Joh)[Jos)[K)[Kim)[L)[Lil)[M)[Mark)[Mel)[Mik)[N)[O)[P) [Per)[Q)[R)[Ric)[Rog)[S)[Sha)[Ste)[T)[Tim)[U)[V)[W)[X)[Y)[Z) - Alphabetically by first name, only people with more than one image:
[A][B][C][D][E][F][G][H][I][J][K][L][M][N][O][P][Q][R][S][T][U][V][W][X][Y][Z] - Alphabetically by last name:
[A][B][C][D][E][F][G][H][I][J][K][L][M][N][O][P][Q][R][S][T][U][V][W][X][Y][Z] - By number of images per person:
[1 A-E][1 F-J][1 K-O][1 P-T][1 U-Z][2][3][4][5][6-10][11+] - Single page of all names (no thumbnails)
Download the database:
- All images as gzipped tar file (173MB, md5sum ac79dc88658530a91423ebbba2b07bf3)
- All images aligned with funneling (233MB, md5sum 1b42dfed7d15c9b2dd63d5e5840c86ad)
- All images aligned with commercial face alignment software (LFW-a - Taigman, Wolf, Hassner)
- Subset of images - people with name starting with A (14MB)
as zip file - Subset of images - George_W_Bush (individual person with most images) (6.9MB)
as zip file - All names (with number of images for given name) as text file
- README - information on file formats and directory structure
- Training, Validation, and Testing:
-
- View 1:
- For development purposes, we recommend using the below
training/testing split, which was generated randomly and independently
of the splits for 10-fold cross validation, to avoid unfairly
overfitting to the sets above during development. For instance, these
sets may be viewed as a model selection set and a validation set. See
the tech report below for more details.
Explore the sets: [training][test]
Download the sets: pairsDevTrain.txt, pairsDevTest.txt, peopleDevTrain.txt, peopleDevTest.txt - View 2:
- As a benchmark for comparison, we suggest reporting performance as
10-fold cross validation using splits we have randomly generated.
Explore the sets: [1][2][3][4][5][6][7][8][9][10]
Download the sets: pairs.txt, people.txt
For details on how the sets were created, please refer to the tech report below.
- Results:
- Accuracy and ROC curves for various methods available on results page.
- 13233 images
- 5749 people
- 1680 people with two or more images
- Errata:
-
The following is a list of known errors in LFW. Due to the small number of such errors,
the database will be left as is (without corrections) to avoid confusion.
It is important that users of the database provide their algorithms with the database as is, i.e. without correcting the errors below, since previous results published for the database did not have the advantage of correcting for these errors.
Note: unless stated otherwise below, any error in a matched pair will mean that the label ("matched") is wrong. Any error in a mismatched pair, even with the person having the wrong identity, will generally be correct (the label of "mismatched" will still be correct).
- Recep_Tayyip_Erdogan_0004 is incorrect (it is an image of Abdullah Gul).
This image appears only in one matched pair in the training set of View 1. - Janica_Kostelic_0001 is incorrect (it is an image of Anja Paerson).
This image appears in one matched pair in the test set of View 1, and in one matched pair and one mismatched pair (with Don_Carcieri_0001) in fold 1 of View 2. - Bart_Hendricks_0001 is incorrect (it is a duplicate image of Ricky_Ray_0001). This image appears in two mismatched pairs in the training set of View 1, and one mismatched pair in fold 2 of View 2. (None of the mismatched pairs are with Ricky_Ray.)
- Carlos_Beltran_0001 is incorrect (it is a duplicate image of Raul_Ibanez_0001). This image appears in one mismatched pair in the test set of View 1, and one mismatched pair in fold 5 of View 2. (None of the mismatched pairs are with Raul_Ibanez.)
- Emmy_Rossum_0001 is incorrect (it is a duplicate image of Eva_Amurri_0001). This image appears in one mismatched pair in the test set of View 1 (the mismatched pair is not with Eva_Amurri).
- Recep_Tayyip_Erdogan_0004 is incorrect (it is an image of Abdullah Gul).
- Reference:
- Please cite as:
Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller.
Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments.
University of Massachusetts, Amherst, Technical Report 07-49, October, 2007.
[pdf]
BibTeX entry:@TechReport{LFWTech, author = {Gary B. Huang and Manu Ramesh and Tamara Berg and Erik Learned-Miller}, title = {Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments}, institution = {University of Massachusetts, Amherst}, year = 2007, number = {07-49}, month = {October}}
- Contact:
- Questions and comments can be sent to:
Gary Huang - gbhuang@cs.umass.edu
- Support:
- The building of the LFW database was supported by NSF CAREER Award number 0546666.
- Change History:
-
- 2010/04/17
- Added Cao et al.* to results page.
- 2010/02/08
- Added Ruiz-del-Solar et al.* and unsupervised (no training data) results to results page.
- 2009/10/26
- Added Kumar et al.* to results page.
- 2009/09/24
- Added link to LFW-a, LFW images aligned with commercial face alignment software, from Taigman, Wolf, and Hassner, under downloads.
- 2009/09/02
- Added Wolf et al.* to results page.
- 2009/08/03
- Added Taigman et al.* to results page.
- 2009/07/02
- Added Guillaumin et al.* to results page.
- 2009/06/24
- Added Carlos_Beltran_0001 and Emmy_Rossum_0001 labeling errors to Errata.
- 2009/04/02
- Added Pinto et al.* to results page.
- 2009/02/04
- Added Bart_Hendricks_0001 labeling error to Errata.
- 2008/07/01
- Updated LFW technical report with proper reference for VidTIMIT:
C. Sanderson.
Biometric Person Recognition: Face, Speech and Fusion.
VDM-Verlag, 2008.
ISBN 978-3-639-02769-3
- 2008/06/12
- Added Errata section and listed two known labeling errors.
- 2008/02/04
- Added funneled images and super-pixels images
to person pages.
Made all funneled images available as single downloadable file.
- 2008/01/25
- Added results page with numbers for method of Nowak and Jurie, CVPR 2007.
- 2007/11/21
- Added revised version of technical report.
- 2007/11/19
- Added technical report to page.
- 2007/11/15
- Added mailing list and change history to page.
