Facebook Research at Facebook

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Publication
Conference on Computer Vision and Pattern Recognition (CVPR)24 June 2014

Abstract

In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify. We revisit both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a face representation from a nine-layer deep neural network. This deep network involves more than 120 million parameters using several locally connected layers without weight sharing, rather than the standard convolutional layers. Thus we trained it on the largest facial dataset to-date, an identity labeled dataset of four million facial images belonging to more than 4,000 identities.

The learned representations coupling the accurate model-based alignment with the large facial database generalize remarkably well to faces in unconstrained environments, even with a simple classifier. Our method reaches an accuracy of 97.35% on the Labeled Faces in the Wild (LFW) dataset, reducing the error of the current state of the art by more than 27%, closely approaching human-level performance.

Resources

Download Paper

Related Publications

Deep multi-scale video prediction beyond mean square error
Learning to predict future images from a video sequence involves the construction of an internal representation that models the image evolution accurately,...
by Michael Mathieu, Camille Couprie, Yann LeCunICLR 2016May
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
In recent years, supervised learning with convolutional networks (CNNs) has seen huge adoption in computer vision applications. Comparatively, unsupervised...
by Alec Radford, Luke Metz, Soumith Chintala2016 International Conference on Learning RepresentationsJanuary
Simple bag-of-words baseline for visual question answering
We describe a very simple bag-of-words baseline for visual question answering. This baseline concatenates the word features from the question and CNN features...
by Bolei Zhou, Yuandong Tian, Sainbayar Sukhbaatar, Arthur Szlam, Rob FergusArXiv PrePrintDecember 2015

Related Blog Posts

Facebook AI Research Launches Partnership Program
by Serkan Piantino, Florent Perronnin25 February

Join Us

Do you want to help more than a billion people all over the world connect and share?

View Open Positions

Code

Learn about our open source tools and technologies, our challenging scaling experiences, and more.

Go to Facebook Code