当前位置: 开发笔记 > 编程语言 > 正文

BaptisteChopin2021HumanMotionPredictionUsingManifoleAwareWasserteinGANIEEE

作者：mobiledu2502871567 | 来源：互联网 | 2024-10-21 00:15

#HumanMotionPredictionUsingManifold-AwareWassersteinGAN#paper1.paper-info1.1MetadataAuthor

# Human Motion Prediction Using Manifold-Aware Wasserstein GAN #paper

1. paper-info

1.1 Metadata

Author:: [[Baptiste Chopin]], [[Naima Otberdout]], [[Mohamed Daoudi]], [[Angela Bartolo]]

作者机构::

Keywords:: #HMP , #GAN

Journal:: #IEEE

Date:: [[2021-07-16]]

状态:: #Done

链接::http://arxiv.org/abs/2105.08715

修改时间:: 2022.11.22

1.2. Abstract

Human motion prediction aims to forecast future human poses given a prior pose sequence. The discontinuity of the predicted motion and the performance deterioration in long-term horizons are still the main challenges encountered in current literature. In this work, we tackle these issues by using a compact manifold-valued representation of human motion. Specifically, we model the temporal evolution of the 3D human poses as trajectory, what allows us to map human motions to single points on a sphere manifold. To learn these non-Euclidean representations, we build a manifold-aware Wasserstein generative adversarial model that captures the temporal and spatial dependencies of human motion through different losses. Extensive experiments show that our approach outperforms the state-of-the-art on CMU MoCap and Human 3.6M datasets. Our qualitative results show the smoothness of the predicted motions.

2. Introduction

领域：human motion prediction
- 问题：时间维度，如何使得预测的动作序列更加顺滑？空间维度，预测的动作更加真实？

之前的方法：RNN-based、CNN-based、GNN-based

作者的方法：
1. 将人体动作当做轨迹，并且将这些轨迹映射到流行的单个紧凑点。能够使动作序列更加smoothness。
2. 由于传统的GNN无法处理流式数据，于是作者提出了a manifold-aware Wasserstein Generative Adversaial Network(WGAN)
3. 提出了多种损失函数来优化该网络，因为GAN很难train。

3. Methods

人体结构表示: 3D坐标

\(P_t=[x_1(t),y_1(t),z_1(t),...,x_k(t),y_k(t),z_k(t)]\): 某一时刻的姿势表示。

\(k\): 关节点数。

\(t\): 时刻

作者沿用[2]，将人体骨架序列看做每个关节点的轨迹，从而将人体动作预测问题看做：给定一定序列的连续帧，去预测下一帧的点对应的曲线。将曲线描述为\(\alpha(t)\) ，问题变为：给定\(\alpha(t)_{t=1,,,\tau}\),预测\(\alpha(t)_{t=\tau+1,,,T}\)

为了能够更好的学习和建模\(\alpha(t)\)，作者采用了square-root velocity function[1]，该方法能够将曲线映射到一个超球面，得到一个超平面内的点(不同行为标签对应的轨迹产生的点之间的距离是不同的)。该方法的定义如下：

\[q(t)=\frac{\dot{\alpha}(t)}{\sqrt{\|\dot{\alpha}(t)\|}}

\]

\(||.||\)操作表示为欧几里得2范数。

该方法在已知\(\alpha(0)\)的情况下，可以通过以下function还原\(\alpha(t)\)：

\[\alpha(t)=\int_{0}^{t}\|q(s)\| q(s) d s+\alpha(0)

\]

为了更好的处理，将这个超球面的半径设置为1.

\(\mathcal{C}=\{q: \mathcal{I}->\mathcal{R}^n| ||q||=1\} \subset \mathcal{L}^2(\mathcal{I}, \mathcal{R}^n)\) ：表示一个动作在超球面的映射。

由于该超空间不同于欧式空间，传统的GNN方法不能够处理该类数据，于是作者提出了新的GNN--PredictiveMA-WGAN

3.1 Network Architecture

Fig.1. 网络结构

Source:

PredictiveMA-WGAN由一个predictor\(\mathcal{G}\) 和discriminator\(\mathcal{D}\) 组成，对抗式学习。

During the training of these networks, we iteratively map the SRVF data back and forth to the tangent space using the exponential and the logarithm maps, defined in a particular point on the hypersphere.

在训练阶段，使用在超球面上的特定点中定义的指数和对数映射，以迭代的方式将SRVF数据来回映射到切线空间，然后输入到网络中。

\(\mathcal{G}\) ：

36864输出通道的全连接层。

5个上采样块(512， 256，128，64，1)，上采样块由nearest-neighbor upsampling跟着一个卷积块(3x3, stride 1)和ReLU function 构成。

\(\mathcal{D}\)：

3个下采样块(64,32,16输出通道)

Conv layer(3x3 stride 1)

batch normalization

ReLU

3.2 Loss Function

Adverasrial loss

\[\begin{aligned}

\mathcal{L}_{a}=& \mathbb{E}_{q_{T} \sim \mathbb{P}_{q_{T}}}\left[\mathcal{D}\left(\log _{\mu}\left(q_{T}\right)\right)\right] \\

&-\mathbb{E}_{\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right) \sim \mathbb{P}_{\boldsymbol{q}_{T}}}\left[\mathcal{D}\left(\log _{\mu}\left(\exp _{\mu}\left(\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right)\right)\right)\right]\right.\\

&+\lambda \mathbb{E}_{\widetilde{q} \sim \mathbb{P}_{\widetilde{q}}}\left[\left(\left\|\nabla_{\widetilde{q}} \mathcal{D}(\widetilde{q})\right\|-1\right)^{2}\right],

\end{aligned}\]

Reconstruction loss

\[\mathcal{L}_{r}=\left\|\log _{\mu}\left(\exp _{\mu}\left(\mathcal{G}\left(\log _{\mu}\left(q_{\tau}\right)\right)\right)\right)-\log _{\mu}\left(q_{T}\right)\right\|_{1}

\]

Skeleton integrity loss

\[\mathcal{L}_{s}=\frac{1}{m} \frac{1}{\tau} \sum_{i=1}^{m} \sum_{t=1}^{\tau} \Delta\left(P_{i, t}, \hat{P}_{i, t}\right)

\]

\[\Delta\left(G_{i}, G_{j}\right)=\operatorname{tr}\left(G_{i}\right)+\operatorname{tr}\left(G_{j}\right)-2 \sum_{i=1}^{3} \sigma_{i}

\]

这里解释没有看懂，具体看论文。

Bone length loss

\[\mathcal{L}_{b}=\frac{1}{m} \frac{1}{\tau} \frac{1}{B} \sum_{i}^{m} \sum_{t=1}^{\tau} \sum_{j}^{B}\left\|b_{i, j, t}-\hat{b}_{i, j, t}\right\|

\]

Bone Length loss

\[\mathcal{L}_{b}=\frac{1}{m} \frac{1}{\tau} \frac{1}{B} \sum_{i}^{m} \sum_{t=1}^{\tau} \sum_{j}^{B}\left\|b_{i, j, t}-\hat{b}_{i, j, t}\right\|

\]

Global loss

\[\mathcal{L}=\beta_{1} \mathcal{L}_{a}+\beta_{2} \mathcal{L}_{r}+\beta_{3} \mathcal{L}_{s}+\beta_{4} \mathcal{L}_{b}

\]

4. Experiments

dataset
- Human3.6m
- CMU Motion Capture

metric
- MPJPE

\[\operatorname{Err}=\sqrt{\frac{1}{\Delta t} \frac{1}{k} \sum_{t=\tau+1}^{\tau+\Delta t} \sum_{j=1}^{k}\left\|p_{t, j}-\hat{p}_{t, j}\right\|^{2}}

\]

总结

与[2]的思路大致一致，将人体动作序列看过关节点的轨迹，在[2]中，利用DCT编码处理得到轨迹特征，然后利用GCN处理空间信息，在本文中，作者利用Square Root Velocity Function[1]将轨迹mapping 到一个超球体内，一条轨迹对应一点，然后将这些点投影到超球体的切线空间中，得到GAN网络模型的输入，最后训练GAN得到输出。

reference

[1] A. Srivastava, E. Klassen, S. H. Joshi and I. H. Jermyn, "Shape Analysis of Elastic Curves in Euclidean Spaces," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, no. 7, pp. 1415-1428, July 2011, doi: 10.1109/TPAMI.2010.184.

[2]Mao W, Liu M, Salzmann M, et al. Learning trajectory dependencies for human motion prediction[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 9489-9497.

io
metadata

推荐阅读

instance
深入解析：OpenShift Origin环境下的Kubernetes Spark Operator

本文探讨了如何在OpenShift Origin平台上利用Kubernetes Spark Operator来管理和部署Apache Spark集群与应用。作为Radanalytics.io项目的一部分，这一开源工具为大数据处理提供了强大的支持。 ... [详细]

蜡笔小新 2024-12-19 14:07:35
email
Docker 自定义网络配置详解

本文详细介绍如何在 Docker 中自定义网络设置，包括网关和子网地址的配置。通过具体示例展示如何创建和管理自定义网络，以及容器间的通信方式。 ... [详细]

蜡笔小新 2024-12-16 20:26:24
uri
深入理解Java MySQL数据库连接池实现

尽管利用Apache Commons DBCP等工具可以轻松构建数据库连接池，但本文详细解析了数据库连接池的工作机制，提供了详尽的注释，帮助开发者深入理解其内部运作。这不仅有助于提高数据库操作的效率，还能增强应用程序的稳定性和性能。 ... [详细]

蜡笔小新 2024-12-15 22:02:58
get
Windows环境下部署Kubernetes Dashboard指南

本指南详细介绍了如何在Windows系统中部署Kubernetes Dashboard，包括下载最新配置文件、修改服务类型以支持NodePort访问、下载所需镜像并启动Dashboard服务等步骤。 ... [详细]

蜡笔小新 2024-12-13 18:18:00
filter
FFPlay 字幕与LRC歌词播放指南

本文详细介绍了不同媒体容器支持的字幕格式，以及如何使用FFPlay和FFMPEG进行字幕和LRC歌词的播放与转换。涵盖的内容包括字幕显示方法、字体配置、字幕流选择等。 ... [详细]

蜡笔小新 2024-12-13 13:13:40
get
使用GDAL创建GeoTIFF栅格文件

本文介绍了如何利用GDAL库在C#中创建一个带有地理参考信息的GeoTIFF文件，包括定义地理变换参数和坐标系统等关键步骤。 ... [详细]

蜡笔小新 2024-12-12 13:02:30
get
构建Snowflake中的近实时数据摄取管道

探索如何在Snowflake中构建高效的近实时数据摄取管道，利用其内外表特性及Snowpipe服务，实现数据的快速、稳定加载。 ... [详细]

蜡笔小新 2024-12-09 17:40:26
filter
将SQL ResultSet作为Scala流处理 - Handling SQL ResultSet as a Scala Stream

当从数据库查询返回一个（仅向前，只读）ResultSet时，该ResultSet表现得像一个数据库行列表。本文探讨了如何将ResultSet转换成Scala流，以支持高效的数据处理操作。 ... [详细]

蜡笔小新 2024-12-07 08:25:35
get
深入解析 TiDB Binlog：Pump Storage 实现详解（上）

本文作者赵一霖，将继续探讨 TiDB Binlog 系统中 Pump Storage 的实现细节，包括其核心功能如持久化存储、数据排序及配对等。 ... [详细]

蜡笔小新 2024-12-06 20:33:31
get
深入解析com.datastax.driver.core.Metadata中的triggerOnUserTypeChanged方法

本文详细介绍了Java库com.datastax.driver.core.Metadata中的triggerOnUserTypeChanged方法的使用场景和代码实现，通过多个实际案例帮助开发者更好地理解和应用此方法。 ... [详细]

蜡笔小新 2024-12-06 20:12:21
instance
解决 Oracle Data Guard Broker 不可用问题

本文探讨了在尝试禁用Data Guard配置时遇到的ORA-16525错误，该错误表明Data Guard Broker尚未可用，并提供了详细的日志分析和解决方案。 ... [详细]

蜡笔小新 2024-12-04 16:47:57
hash
Kafka组件详解及工作原理

本文介绍了Apache Kafka的核心组件及其工作方式，包括生产者(Producer)、消费者(Consumer)、主题(Topic)、代理(Broker)、分区(Partition)、消费者组(Consumer Group)和偏移量(Offset)，并探讨了这些组件之间的交互机制。 ... [详细]

蜡笔小新 2024-12-03 16:04:46
hash
深入探讨配置文件的管理与优化

尽管配置文件的重要性不言而喻，但其管理和安全性问题却常被忽视。本文将详细讨论配置文件的不同管理策略及其优缺点。 ... [详细]

蜡笔小新 2024-12-01 19:35:41
rsa
WPF 应用程序在 Windows XP 上的视频撕裂问题

本文讨论了一款使用 C# 开发的 WPF 应用程序，在 Windows XP 系统上运行时出现的视频撕裂现象及其可能的解决方案。 ... [详细]

蜡笔小新 2024-11-30 21:39:53
filter
使用与示例：Java中的JCheckBoxMenuItem.setMnemonic()方法

本文介绍了如何在Java中使用`JCheckBoxMenuItem.setMnemonic()`方法，并提供了多个实际应用的代码示例。 ... [详细]

蜡笔小新 2024-11-29 10:10:37

mobiledu2502871567

这个家伙很懒，什么也没留下！

Tags | 热门标签

RankList | 热门文章