LFW Home

Welcome to Labeled Faces in the Wild, a database of face photographs
designed for studying the problem of unconstrained face recognition.
The database contains more than 13,000 images of faces collected from
the web. Each face has been labeled with the name of the person
pictured. 1680 of the people pictured have two or more distinct photos
in the database. The only constraint on these faces is that they were
detected by the Viola-Jones face detector. More details can be found in
the technical report below.
last updated: 2007/11/21 1:30 PM EST (change log)
- Mailing list:
- If you wish to receive announcements regarding any changes made to the LFW database, please send email to majordomo@cs.umass.edu with the message body: "subscribe lfw" on a single line.
- Alphabetically by first name:
[A][Alf][Ang][B][Bin][C][Che][Col][D][Daw][Don][E][Eri][F][G][Goe][H][I][J][Jav][Jes][Joh][Jos][K][Kim][L][Lil][M][Mark][Mel][Mik][N][O][P][Per][Q][R][Ric][Rog][S][Sha][Ste][T][Tim][U][V][W][X][Y][Z]
- Alphabetically by first name, only people with more than one image:
[A][B][C][D][E][F][G][H][I][J][K][L][M][N][O][P][Q][R][S][T][U][V][W][X][Y][Z]
- Alphabetically by last name:
[A][B][C][D][E][F][G][H][I][J][K][L][M][N][O][P][Q][R][S][T][U][V][W][X][Y][Z]
- By number of images per person:
[1 A-E][1 F-J][1 K-O][1 P-T][1 U-Z][2][3][4][5][6-10][11+]
- Single page of all names (no thumbnails)
Download the database:
- All images as gzipped tar file (173MB, md5sum ac79dc88658530a91423ebbba2b07bf3)
as zip file
- All images aligned with funneling (233MB, md5sum 1b42dfed7d15c9b2dd63d5e5840c86ad)
as zip file
- Subset of images - people with name starting with A (14MB)
as zip file
- Subset of images - George_W_Bush (individual person with most images) (6.9MB)
as zip file
- All names (with number of images for given name) as text file
- README - information on file formats and directory structure
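The md5sum values listed above can be used to check that a download arrived intact. The following is a minimal sketch in Python, assuming the archive has already been saved locally. The helper names, the dictionary keys, and the local path in the usage comment are hypothetical and not part of the LFW distribution.

```python
import hashlib

# Checksums copied from the download list on this page.
# The dictionary keys are illustrative labels, not official file names.
EXPECTED_MD5 = {
    "all_images": "ac79dc88658530a91423ebbba2b07bf3",
    "all_images_funneled": "1b42dfed7d15c9b2dd63d5e5840c86ad",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected_hex.strip().lower()


# Example usage (hypothetical local path):
# ok = matches_checksum("downloads/lfw_archive", EXPECTED_MD5["all_images"])
```

A mismatch usually means the download was truncated or corrupted, in which case the file should be fetched again.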
- Training, Validation, and Testing:
- View 1:
- For development purposes, we recommend using the training/testing split
below, which was generated randomly and independently of the splits for
10-fold cross validation, to avoid unfairly overfitting to the 10-fold
cross validation sets during development. For instance, the View 1
training and test sets may be viewed as a model selection set and a
validation set. See the tech report below for more details.
Explore the sets: [training][test]
Download the sets: pairsDevTrain.txt, pairsDevTest.txt, peopleDevTrain.txt, peopleDevTest.txt
- View 2:
- As a benchmark for comparison, we suggest reporting performance as
10-fold cross validation using splits we have randomly generated.
Explore the sets: [1][2][3][4][5][6][7][8][9][10]
Download the sets: pairs.txt, people.txt
For details on how the sets were created, please refer to the tech report below.
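The 10-fold protocol for View 2 can be sketched as follows: each fold is held out once for testing while the other nine are used for training, and the per-fold accuracies are then summarized. This is only an illustration in Python. The `train_fn` and `eval_fn` callbacks are hypothetical placeholders for a user's own method, and reporting the mean together with the standard error of the mean is an assumption here; the tech report is the authority on the exact reporting convention.

```python
import math


def cross_validate(folds, train_fn, eval_fn):
    """Hold out each fold in turn and return one accuracy per fold.

    folds    -- list of folds, each a list of examples (e.g. image pairs)
    train_fn -- hypothetical callback: training examples -> model
    eval_fn  -- hypothetical callback: (model, test examples) -> accuracy
    """
    accuracies = []
    for i, test_fold in enumerate(folds):
        # Training data is every example that is not in the held-out fold.
        train_data = [ex for j, fold in enumerate(folds) if j != i for ex in fold]
        model = train_fn(train_data)
        accuracies.append(eval_fn(model, test_fold))
    return accuracies


def summarize_folds(accuracies):
    """Return (mean accuracy, standard error of the mean) across folds."""
    n = len(accuracies)
    if n < 2:
        raise ValueError("need at least two folds to estimate a standard error")
    mean = sum(accuracies) / n
    # Sample variance (n - 1 in the denominator), then standard error.
    variance = sum((a - mean) ** 2 for a in accuracies) / (n - 1)
    return mean, math.sqrt(variance / n)
```

With the ten View 2 folds loaded into `folds`, `summarize_folds(cross_validate(folds, train_fn, eval_fn))` gives a single pair of numbers to report.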
- Results:
- Accuracy and ROC curves for various methods available on results page.
- 13233 images
- 5749 people
- 1680 people with two or more images
- Errata:
The following is a list of known errors in LFW. Due to the small number of such errors,
the database will be left as is (without corrections) to avoid confusion.
It is important that users of the database provide their algorithms with the database as is, i.e. without correcting the errors below, since previous results published for the database did not have the advantage of correcting for these errors.
Note: unless stated otherwise below, any error in a matched pair means that the label ("matched") is wrong. An error in a mismatched pair generally leaves the label correct: even though one person has the wrong identity, the pair is still "mismatched".
- Recep_Tayyip_Erdogan_0004 is incorrect (it is an image of Abdullah Gul).
This image appears only in one matched pair in the training set of View 1.
- Janica_Kostelic_0001 is incorrect (it is an image of Anja Paerson).
This image appears in one matched pair in the test set of View 1, and in one matched pair and one mismatched pair (with Don_Carcieri_0001) in fold 1 of View 2.
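The rule in the note above can be expressed as a small check, which may help when interpreting per-pair results. This Python sketch only flags pairs and does not alter or correct the database, which should still be used as is. The function names are hypothetical, the identifier format (person name, underscore, image number) is inferred from the identifiers listed in the errata, and `Janica_Kostelic_0002` in the usage comment is a made-up identifier for illustration.

```python
# Image identifiers taken from the errata list above.
KNOWN_MISLABELED = {
    "Recep_Tayyip_Erdogan_0004",
    "Janica_Kostelic_0001",
}


def split_image_id(image_id):
    """Split an identifier such as 'Don_Carcieri_0001' into (name, number).

    Assumes the image number follows the last underscore.
    """
    name, _, number = image_id.rpartition("_")
    return name, int(number)


def matched_label_suspect(image_a, image_b, is_matched):
    """Return True if a 'matched' label is wrong because of a known erratum.

    Mismatched pairs are never flagged: per the note above, their label
    is generally still correct even when one identity is wrong.
    """
    touches_erratum = image_a in KNOWN_MISLABELED or image_b in KNOWN_MISLABELED
    return is_matched and touches_erratum


# Example usage:
# matched_label_suspect("Janica_Kostelic_0001", "Janica_Kostelic_0002", True)
# matched_label_suspect("Janica_Kostelic_0001", "Don_Carcieri_0001", False)
```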
- Reference:
- Please cite as:
Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller.
Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments.
University of Massachusetts, Amherst, Technical Report 07-49, October, 2007.
[pdf]
- Contact:
- Questions and comments can be sent to:
Gary Huang - gbhuang@cs.umass.edu
- Support:
- The building of the LFW database was supported by NSF CAREER Award number 0546666.
- Change History:
- 2008/07/01
- Updated LFW technical report with proper reference for VidTIMIT:
C. Sanderson.
Biometric Person Recognition: Face, Speech and Fusion.
VDM-Verlag, 2008.
ISBN 978-3-639-02769-3
- 2008/06/12
- Added Errata section and listed two known labeling errors.
- 2008/02/04
- Added funneled images and superpixel images to person pages.
Made all funneled images available as a single downloadable file.
- 2008/01/25
- Added results page with numbers for method of Nowak and Jurie, CVPR 2007.
- 2007/11/21
- Added revised version of technical report.
- 2007/11/19
- Added technical report to page.
- 2007/11/15
- Added mailing list and change history to page.
