TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Face Verification	BTS3.1	MCN (Arcface)	TAR @ FAR=0.01	0.3941	# 5
Face Verification	BTS3.1	NAN (Arcface)	TAR @ FAR=0.01	0.3901	# 6
Face Verification	BTS3.1	NAN (Adaface)	TAR @ FAR=0.01	0.5444	# 2
Face Identification	DroneSURF	NAN (Adaface)	Rank1	80.21	# 2
Face Verification	IJB-A	NAN	TAR @ FAR=0.01	94.10%	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-aggregation-network-for-video-face/face-verification-on-bts3-1)](https://paperswithcode.com/sota/face-verification-on-bts3-1?p=neural-aggregation-network-for-video-face)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-aggregation-network-for-video-face/face-identification-on-dronesurf)](https://paperswithcode.com/sota/face-identification-on-dronesurf?p=neural-aggregation-network-for-video-face)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-aggregation-network-for-video-face/face-verification-on-ijb-a)](https://paperswithcode.com/sota/face-verification-on-ijb-a?p=neural-aggregation-network-for-video-face)`

Neural Aggregation Network for Video Face Recognition

CVPR 2017 · Jiaolong Yang, Peiran Ren, Dong-Qing Zhang, Dong Chen, Fang Wen, Hongdong Li, Gang Hua

This paper presents a Neural Aggregation Network (NAN) for video face recognition. The network takes as input a face video or face image set of a person, containing a variable number of face images, and produces a compact, fixed-dimension feature representation for recognition. The network is composed of two modules. The feature embedding module is a deep Convolutional Neural Network (CNN) that maps each face image to a feature vector. The aggregation module consists of two attention blocks that adaptively aggregate the feature vectors into a single feature lying inside the convex hull they span. Because of the attention mechanism, the aggregation is invariant to the image order. NAN is trained with a standard classification or verification loss without any extra supervision signal, and we found that it automatically learns to favor high-quality face images while down-weighting low-quality ones such as blurred, occluded, and improperly exposed faces. Experiments on the IJB-A, YouTube Face, and Celebrity-1000 video face recognition benchmarks show that it consistently outperforms naive aggregation methods and achieves state-of-the-art accuracy.
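The aggregation step described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: it shows a single attention block only, and the function name `attention_aggregate`, the `query` vector, and the random stand-in features are assumptions made for illustration. The weights come from a softmax, so they are non-negative and sum to one, which is what places the result inside the convex hull of the inputs and makes it independent of their order.

```python
import numpy as np


def attention_aggregate(features, query):
    """Aggregate K feature vectors (shape K x D) into one D-dimensional vector.

    Hypothetical single attention block: each feature is scored by its dot
    product with `query`, and the scores are turned into softmax weights.
    """
    scores = features @ query              # one score per image, shape (K,)
    scores = scores - scores.max()         # shift for numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum()      # non-negative, sums to 1
    aggregated = weights @ features        # convex combination, shape (D,)
    return aggregated, weights


# Stand-in data: 5 "frames" with 8-dimensional embeddings (random, not real faces).
rng = np.random.default_rng(0)
features = rng.normal(size=(5, 8))
query = rng.normal(size=8)

aggregated, weights = attention_aggregate(features, query)

# Shuffling the frames leaves the aggregated feature unchanged.
perm = rng.permutation(5)
aggregated_perm, _ = attention_aggregate(features[perm], query)
```

In a trained model the query would be learned, so that sharp, well-exposed frames receive larger weights than blurred or occluded ones; here it is random and only demonstrates the mechanics.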

Code

No code implementations yet.

Tasks

Face Identification

Face Recognition

Face Verification

Datasets

IJB-A DroneSURF BTS3.1

Results from the Paper

Ranked #2 on Face Identification on DroneSURF

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Face Verification	BTS3.1	MCN (Arcface)	TAR @ FAR=0.01	0.3941	# 5
Face Verification	BTS3.1	NAN (Arcface)	TAR @ FAR=0.01	0.3901	# 6
Face Verification	BTS3.1	NAN (Adaface)	TAR @ FAR=0.01	0.5444	# 2
Face Identification	DroneSURF	NAN (Adaface)	Rank1	80.21	# 2

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
Face Verification	IJB-A	NAN	TAR @ FAR=0.01	94.10%	# 7
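
The TAR @ FAR=0.01 metric in the tables above is the true accept rate at a decision threshold where the false accept rate is at most 1%. The following is a minimal sketch of one common way to compute it from similarity scores. It is not the evaluation protocol of any benchmark listed here, and the function name `tar_at_far` and the toy scores are assumptions for illustration.

```python
import numpy as np


def tar_at_far(genuine, impostor, far_target=0.01):
    """Return (TAR, achieved FAR, threshold) for similarity scores.

    A pair is accepted when its score is strictly above the threshold.
    The threshold is set so that the fraction of accepted impostor pairs
    does not exceed `far_target`.
    """
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)

    # Number of impostor pairs we may accept without exceeding the target FAR.
    n_allowed = int(np.floor(far_target * impostor.size + 1e-9))

    # Sort impostor scores from highest to lowest; accepting only scores
    # strictly above the (n_allowed + 1)-th highest admits at most n_allowed.
    impostor_desc = np.sort(impostor)[::-1]
    if n_allowed < impostor.size:
        threshold = impostor_desc[n_allowed]
    else:
        threshold = -np.inf

    tar = float(np.mean(genuine > threshold))
    far = float(np.mean(impostor > threshold))
    return tar, far, threshold


# Toy scores (made up): 100 impostor pairs and 4 genuine pairs.
impostor_scores = np.linspace(0.0, 0.99, 100)
genuine_scores = [0.5, 0.995, 1.2, 2.0]

tar, far, threshold = tar_at_far(genuine_scores, impostor_scores, far_target=0.01)
```

Note that the tables report this metric both as a fraction (for example 0.5444) and as a percentage (94.10%); the sketch returns a fraction.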

Methods

No methods listed for this paper.
