Loss Functions in Face Recognition

Author: 幽默的人生就是悲催基_129 | Source: Internet | 2024-10-08 18:21

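This article summarizes the various loss functions used in face recognition.

Background

For classification, the most common loss is the softmax loss (cross-entropy over softmax outputs), where softmax is $f(x_i) = \frac{e^{x_i}}{\sum_j e^{x_j}}$. Its counterpart, called "hardmax" here, is the plain normalization $f(x_i) = \frac{x_i}{\sum_j x_j}$. Compared with hardmax, softmax converges more easily and reaches a near one-hot output more readily, because the exponential amplifies the differences among scores. Softmax encourages the features of different classes to separate, but it does not push them far apart. Face recognition needs intra-class distances to be small while inter-class distances stay large. Most existing face recognition losses are built on either L2 (Euclidean) distance or cosine distance.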
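To make the comparison concrete, here is a minimal sketch in plain Python that applies both formulas to the same scores. The function names `softmax` and `linear_normalize` are illustrative only, and the scores are assumed to be positive so that the plain normalization is well defined.

```python
import math


def softmax(scores):
    """Exponentiate and normalize; the max is subtracted for numerical stability."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear_normalize(scores):
    """The "hardmax" formula from the text: each score divided by the sum."""
    total = sum(scores)
    return [s / total for s in scores]


scores = [1.0, 2.0, 5.0]
print(linear_normalize(scores))  # [0.125, 0.25, 0.625]
print(softmax(scores))           # approximately [0.017, 0.047, 0.936]
```

With the same inputs, softmax puts far more mass on the largest score, which is the sense in which it approaches one-hot more readily.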

Contrastive Loss

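The core idea is to randomly draw two samples from the training set: if they belong to the same class, make their distance as small as possible; otherwise, push them as far apart as possible. The loss function is:

$L=\frac{1}{2N}\sum_{n=1}^N y_n\cdot||a_n-b_n||^2+(1-y_n)\cdot \max(margin-||a_n-b_n||,0)^2$

Here $y_n$ indicates whether the n-th pair belongs to the same class (1 if it does, 0 otherwise). The drawback is clear: a margin must be specified for every pair of different-class samples, and that margin is fixed, so the embedding space is fixed as well and cannot undergo distortion. Triplet loss does not fix the margin in this way, since it only constrains the relative gap between the positive and negative distances. For contrastive loss, selecting hard examples usually leads to faster convergence.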

https://github.com/delijati/pytorch-siamese/blob/master/contrastive.py

import torch


class ContrastiveLoss(torch.nn.Module):
    """
    Contrastive loss function.
    Based on:
    """

    def __init__(self, margin=1.0):
        super(ContrastiveLoss, self).__init__()
        self.margin = margin

    def check_type_forward(self, in_types):
        # Expect two (batch, dim) embedding tensors and one (batch,) label tensor.
        assert len(in_types) == 3
        x0_type, x1_type, y_type = in_types
        assert x0_type.size() == x1_type.shape
        assert x1_type.size()[0] == y_type.shape[0]
        assert x1_type.size()[0] > 0
        assert x0_type.dim() == 2
        assert x1_type.dim() == 2
        assert y_type.dim() == 1

    def forward(self, x0, x1, y):
        self.check_type_forward((x0, x1, y))
        # Euclidean distance between the two embeddings of each pair
        diff = x0 - x1
        dist_sq = torch.sum(torch.pow(diff, 2), 1)
        dist = torch.sqrt(dist_sq)
        # Hinge term: only pairs closer than the margin are penalized
        mdist = self.margin - dist
        dist = torch.clamp(mdist, min=0.0)
        # y = 1 for same-class pairs, y = 0 for different-class pairs
        loss = y * dist_sq + (1 - y) * torch.pow(dist, 2)
        loss = torch.sum(loss) / 2.0 / x0.size()[0]
        return loss
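As a sanity check on the formula, the following sketch recomputes the same quantity by hand in plain Python, without PyTorch. The function name `contrastive_loss` and the three toy pairs are made up for illustration; the arithmetic follows the `forward` method above.

```python
import math


def contrastive_loss(pairs, margin=1.0):
    """pairs: list of (a, b, y); y = 1 for a same-class pair, 0 otherwise."""
    total = 0.0
    for a, b, y in pairs:
        dist_sq = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
        dist = math.sqrt(dist_sq)
        # Same-class pairs are pulled together; different-class pairs are
        # pushed apart only while they are closer than the margin.
        total += y * dist_sq + (1 - y) * max(margin - dist, 0.0) ** 2
    return total / (2.0 * len(pairs))


pairs = [
    ([0.0, 0.0], [0.6, 0.8], 1),  # same class, distance 1.0: contributes 1.0
    ([0.0, 0.0], [0.3, 0.4], 0),  # different class, distance 0.5: (1 - 0.5)^2 = 0.25
    ([0.0, 0.0], [3.0, 4.0], 0),  # different class, distance 5.0: beyond the margin, 0
]
print(contrastive_loss(pairs))  # about 0.2083, i.e. 1.25 / 6
```

The third pair shows the role of the margin: once a different-class pair is farther apart than the margin, it no longer contributes to the loss.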

Triplet Loss

FaceNet (Schroff et al.) uses a Euclidean embedding space to measure how similar or different two faces are. The loss minimizes the distance between faces of the same identity and maximizes the distance between faces of different identities.

$Loss=\sum_{j=1}^{N}\max(||f(x_j^a)-f(x_j^p)||_2^2-||f(x_j^a)-f(x_j^n)||_2^2+\gamma, 0)$

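Here f() is the embedding function, a is the anchor sample, p is a positive sample, n is a negative sample, and $\gamma$ is the margin enforced between positive and negative pairs.

The loss corresponds to the following constraint:

$||f(x_j^a)-f(x_j^p)||_2^2+\gamma\lt||f(x_j^a)-f(x_j^n)||_2^2$

Only the triplets that violate this constraint matter, because only they contribute to training. Ideally, for each anchor we would pick the nearest negative and the farthest positive. Selecting triplets is tricky, however: searching the whole dataset for the maximum and minimum is impractical because the cost is too high. The paper proposes two approaches:

1. Offline: every n steps, use the most recent network to select the required samples from a subset of the data.

2. Online: select them within each mini-batch.

The authors chose the second.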

https://github.com/adambielski/siamese-triplet/blob/master/losses.py

import torch.nn as nn
import torch.nn.functional as F


class TripletLoss(nn.Module):
    """
    Triplet loss
    Takes embeddings of an anchor sample, a positive sample and a negative sample
    """

    def __init__(self, margin):
        super(TripletLoss, self).__init__()
        self.margin = margin

    def forward(self, anchor, positive, negative, size_average=True):
        # Squared Euclidean distances, one value per triplet in the batch
        distance_positive = (anchor - positive).pow(2).sum(1)  # .pow(.5)
        distance_negative = (anchor - negative).pow(2).sum(1)  # .pow(.5)
        # Hinge: only triplets that violate the margin constraint contribute
        losses = F.relu(distance_positive - distance_negative + self.margin)
        return losses.mean() if size_average else losses.sum()
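The online, in-batch selection can be sketched as follows in plain Python. This is a simplified "batch hard" variant that takes the farthest positive and the nearest negative for each anchor; it is an assumption made for illustration, not necessarily the exact selection rule of the paper or of the repository above, and the name `batch_hard_triplet_loss` is hypothetical.

```python
def squared_distance(u, v):
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v))


def batch_hard_triplet_loss(embeddings, labels, margin=0.2):
    """Mean hinge loss over anchors, each paired with its farthest positive
    and nearest negative inside the mini-batch."""
    losses = []
    for i, (anchor, label) in enumerate(zip(embeddings, labels)):
        pos = [squared_distance(anchor, e)
               for j, (e, lab) in enumerate(zip(embeddings, labels))
               if lab == label and j != i]
        neg = [squared_distance(anchor, e)
               for e, lab in zip(embeddings, labels) if lab != label]
        if not pos or not neg:
            continue  # this anchor cannot form a triplet inside the batch
        losses.append(max(max(pos) - min(neg) + margin, 0.0))
    return sum(losses) / len(losses) if losses else 0.0


embeddings = [[0.0, 0.0], [0.1, 0.0], [1.0, 0.0], [1.0, 0.1]]
labels = [0, 0, 1, 1]
print(batch_hard_triplet_loss(embeddings, labels))  # 0.0: no triplet violates the margin
```

Because the search is limited to one mini-batch, the cost is quadratic in the batch size rather than in the dataset size, which is what makes the online option practical.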

Center Loss

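Center loss minimizes intra-class variation while keeping the features of different classes separable (it is trained jointly with the softmax loss, which provides the separation):

$L_c = \frac{1}{2} \sum_{i=1}^m||x_i-c_{y_i}||^2$, where $c_{y_i}$ is the center of class $y_i$ and is updated together with the network.

The gradient and the center update are derived as follows: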

$\frac{\partial L_c}{\partial x_i} = x_i - c_{y_i}$

$\Delta c_j = \frac{\sum_{i=1}^m \delta(y_i=j)\cdot (c_j-x_i)}{1+\sum_{i=1}^m\delta(y_i=j)}$
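A small numeric sketch of this update in plain Python may help. The step `c_j <- c_j - alpha * delta_c_j` and the rate `alpha` are not stated in the text above and are assumed here, following the usual formulation of center loss; the helper names are illustrative.

```python
def center_deltas(features, labels, centers):
    """Return delta_c_j for every class j in the mini-batch formula above."""
    dim = len(next(iter(centers.values())))
    deltas = {}
    for j, c_j in centers.items():
        members = [x for x, y in zip(features, labels) if y == j]
        summed = [sum(c_j[d] - x[d] for x in members) for d in range(dim)]
        # The "1 +" in the denominator keeps the update finite for classes
        # that do not appear in the mini-batch.
        deltas[j] = [s / (1 + len(members)) for s in summed]
    return deltas


def update_centers(centers, deltas, alpha=0.5):
    """c_j <- c_j - alpha * delta_c_j (alpha is an assumed center learning rate)."""
    return {j: [c - alpha * d for c, d in zip(centers[j], deltas[j])]
            for j in centers}


features = [[1.0, 1.0], [3.0, 1.0], [0.0, 4.0]]
labels = [0, 0, 1]
centers = {0: [0.0, 0.0], 1: [0.0, 0.0]}
deltas = center_deltas(features, labels, centers)
print(deltas[1])                            # [0.0, -2.0]
print(update_centers(centers, deltas)[1])   # [0.0, 1.0]
```

Each center moves toward the mean of its own class's features in the batch, and a class absent from the batch keeps its center unchanged.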

https://github.com/KaiyangZhou/pytorch-center-loss

L-Softmax Loss

The inner product between a sample and a class weight vector can be decomposed into a magnitude part and an angular part:

$W_cx = ||W_c||_2||x||_2\cos(\theta_c)$

The softmax cross-entropy loss can therefore be written as:

$L_i=-\log \left(\frac{e^{||W_{y_i}||\,||x_i||\cos(\theta_{y_i})}}{\sum_je^{||W_j||\,||x_i||\cos(\theta_{j})}}\right)$
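The rewriting can be checked numerically. The sketch below, in plain Python with made-up weights and a made-up feature vector, computes the loss once from the raw inner products and once from the norm-times-cosine form; the helper names are illustrative.

```python
import math


def norm(v):
    return math.sqrt(sum(vi * vi for vi in v))


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def log_sum_exp(logits):
    top = max(logits)
    return top + math.log(sum(math.exp(z - top) for z in logits))


def loss_from_dot(weights, x, target):
    """Standard softmax cross-entropy with logits W_j . x (no bias)."""
    logits = [dot(w, x) for w in weights]
    return log_sum_exp(logits) - logits[target]


def loss_from_angles(weights, x, target):
    """Same loss with each logit written as ||W_j|| * ||x|| * cos(theta_j)."""
    logits = []
    for w in weights:
        cos_theta = dot(w, x) / (norm(w) * norm(x))
        logits.append(norm(w) * norm(x) * cos_theta)
    return log_sum_exp(logits) - logits[target]


weights = [[1.0, 0.0], [0.0, 2.0]]
x = [3.0, 1.0]
print(loss_from_dot(weights, x, target=0))
print(loss_from_angles(weights, x, target=0))  # same value up to rounding
```

Writing the logit this way is what allows the later losses to change only the angular factor while leaving the magnitudes alone.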

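For the original softmax in the binary case, a sample x from class 1 is classified correctly when:

$W_1^Tx > W_2^Tx$, i.e. $||W_1||_2||x||_2\cos(\theta_1)\gt||W_2||_2||x||_2\cos(\theta_2)$

Since $\cos$ is monotonically decreasing on $\left[ 0,\pi \right]$, the classification can be made harder by requiring, for a positive integer m and $0 \le \theta_1 \le \frac{\pi}{m}$:

$||W_1||_2||x||_2\cos(\theta_1)\ge||W_1||_2||x||_2\cos(m\theta_1) \gt ||W_2||_2||x||_2\cos(\theta_2)$

The larger m is, the smaller the admissible range of $\theta_1$ becomes for the same $W_2$ and x, and the stricter the classification. The learned features of a class are therefore pulled closer to its weight vector W, which reduces intra-class distance. At the same time the gap between classes widens, which increases inter-class distance. In essence, separability is improved by constraining the decision margin!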
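A small sketch in plain Python illustrates how the stricter condition shrinks the admissible angle. To isolate the angular effect it assumes $||W_1|| = ||W_2||$, so the norms cancel; that equality and the function names are assumptions made for this example only.

```python
import math


def accepts_softmax(theta1, theta2):
    """Original rule with equal norms: cos(theta1) > cos(theta2)."""
    return math.cos(theta1) > math.cos(theta2)


def accepts_l_softmax(theta1, theta2, m):
    """Stricter rule: cos(m * theta1) > cos(theta2), for 0 <= theta1 <= pi / m."""
    if not 0.0 <= theta1 <= math.pi / m:
        raise ValueError("theta1 must lie in [0, pi/m] for this form")
    return math.cos(m * theta1) > math.cos(theta2)


theta2 = math.radians(60)
for deg in (10, 25, 40):
    theta1 = math.radians(deg)
    print(deg, accepts_softmax(theta1, theta2), accepts_l_softmax(theta1, theta2, m=2))
# 10 True True
# 25 True True
# 40 True False
```

With m = 2 and the second class at 60 degrees, the sample must lie within 30 degrees of its own weight vector instead of 60, which is the tighter decision margin described above.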

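This finally gives the L-Softmax loss:

$L_i=-\log \left(\frac{e^{||W_{y_i}||\,||x_i||\psi(\theta_{y_i})}}{e^{||W_{y_i}||\,||x_i||\psi(\theta_{y_i})}+\sum_{j\ne y_i}e^{||W_j||\,||x_i||\cos(\theta_{j})}}\right)$

where $\psi(\theta)=(-1)^k\cos(m\theta)-2k$ for $\theta\in\left[\frac{k\pi}{m},\frac{(k+1)\pi}{m}\right]$ and $k\in[0,m-1]$.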

A-Softmax Loss

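A-Softmax is similar to L-Softmax, except that it normalizes the weights W so that each has an L2 norm of 1. Classification then depends only on the angle between the feature vector and W! Training is made harder, and separability improved, by restricting the admissible range of that angle.

$L_{ang}=\frac{1}{N}\sum_{i} -\log (\frac{e^{||x_i||\cos(m\theta_{y_i,i})}}{e^{||x_i||\cos(m\theta_{y_i,i})}+\sum_{j\ne y_i}e^{||x_i||\cos(\theta_{j,i})}})$

The margin is applied to the target class of every sample, so every class becomes harder to fit!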

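One further refinement: because $\cos(m\theta)$ is monotonic only on $[0,\frac{\pi}{m}]$, it is replaced by a function $\psi$ that decreases monotonically over the whole of $[0,\pi]$:

$L_{ang}=\frac{1}{N}\sum_{i} -\log (\frac{e^{||x_i||\psi(\theta_{y_i,i})}}{e^{||x_i||\psi(\theta_{y_i,i})}+\sum_{j\ne y_i}e^{||x_i||\cos(\theta_{j,i})}})$

where $\psi(\theta_{y_i,i})=(-1)^k \cos(m\theta_{y_i,i})-2k$, with $\theta_{y_i,i}\in\left[\frac{k\pi}{m},\frac{(k+1)\pi}{m}\right]$ and $k\in[0,m-1]$.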
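The piecewise function and the resulting loss for a single sample can be sketched in plain Python as below. The interval rule used to pick k (theta in [k*pi/m, (k+1)*pi/m], k from 0 to m-1) follows the usual definition and is stated here as an assumption; the function names and the example angles are illustrative.

```python
import math


def psi(theta, m):
    """psi(theta) = (-1)^k * cos(m * theta) - 2k, theta in [k*pi/m, (k+1)*pi/m]."""
    k = min(int(theta * m / math.pi), m - 1)  # clamp so theta = pi maps to k = m - 1
    return (-1) ** k * math.cos(m * theta) - 2 * k


def a_softmax_loss(x_norm, thetas, target, m=4):
    """Loss for one sample, given ||x|| and its angle to each unit-norm class weight."""
    logits = [x_norm * (psi(t, m) if j == target else math.cos(t))
              for j, t in enumerate(thetas)]
    top = max(logits)
    return top + math.log(sum(math.exp(z - top) for z in logits)) - logits[target]


thetas = [math.radians(30), math.radians(80)]
plain = a_softmax_loss(1.0, thetas, target=0, m=1)   # m = 1: ordinary softmax on cos(theta)
margin = a_softmax_loss(1.0, thetas, target=0, m=4)  # same angles, angular margin applied
print(plain, margin)  # the second value is larger
```

The sample sits at the same angle in both calls; only the margin changes, so the higher loss for m = 4 is the extra pressure to shrink the angle to the target class.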

References

Schroff F, Kalenichenko D, Philbin J. FaceNet: A unified embedding for face recognition and clustering[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015: 815-823.

Wen Y, Zhang K, Li Z, et al. A discriminative feature learning approach for deep face recognition[C]//European Conference on Computer Vision. Springer, Cham, 2016: 499-515.

Hadsell R, Chopra S, LeCun Y. Dimensionality reduction by learning an invariant mapping[C]//2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06). IEEE, 2006: 1735-1742.

Liu W, Wen Y, Yu Z, et al. SphereFace: Deep hypersphere embedding for face recognition[C]//The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017, 1: 1.

Liu W, Wen Y, Yu Z, et al. Large-Margin Softmax Loss for Convolutional Neural Networks[C]//ICML. 2016: 507-516.

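Reposted from: https://zhuanlan.zhihu.com/p/42793251

This article comes from 博客园 (cnblogs); author: 海_纳百川. When reposting, please cite the original link: https://www.cnblogs.com/chentiao/p/16367781.html. Contact us for removal in case of infringement.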
