
PyTorch: Encoder-RNN|LSTM|GRU

Author: Aero-Maxwell | Source: Internet | 2023-08-21 09:38

Preface: This article, compiled by 编程笔记, covers PyTorch: Encoder-RNN|LSTM|GRU and is intended as a reference. -柚子皮-

RNN

Parameters
input_size – The number of expected features in the input x

hidden_size – The number of features in the hidden state h

num_layers – Number of recurrent layers, i.e. how many RNNs are stacked. E.g., setting num_layers=2 would mean stacking two RNNs together to form a stacked RNN, with the second RNN taking in outputs of the first RNN and computing the final results. Default: 1

nonlinearity – The non-linearity to use. Can be either 'tanh' or 'relu'. Default: 'tanh'

bias – If False, then the layer does not use bias weights b_ih and b_hh. Default: True

batch_first – If True, then the input and output tensors are provided as (batch, seq, feature). Default: False

dropout – If non-zero, introduces a Dropout layer on the outputs of each RNN layer except the last layer, with dropout probability equal to dropout. Default: 0

bidirectional – If True, becomes a bidirectional RNN. Default: False

Note: The sequence length is dynamic. It is not a constructor parameter; it is determined by the input tensor passed in at call time.
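To illustrate the note above, here is a minimal sketch in which one nn.RNN instance is called with two different sequence lengths. The sizes (10, 20, 2 layers, batch of 3) simply mirror the example used later in this post; the sequence lengths 5 and 12 are arbitrary choices for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# seq_len does not appear anywhere in the constructor.
rnn = nn.RNN(input_size=10, hidden_size=20, num_layers=2)

# The same module handles inputs of different lengths (5 and 12 steps).
short_out, short_hn = rnn(torch.randn(5, 3, 10))   # (seq_len, batch, input_size)
long_out, long_hn = rnn(torch.randn(12, 3, 10))

print(short_out.shape, short_hn.shape)  # torch.Size([5, 3, 20]) torch.Size([2, 3, 20])
print(long_out.shape, long_hn.shape)    # torch.Size([12, 3, 20]) torch.Size([2, 3, 20])
```

Only the first dimension of output follows the input length; h_n keeps the same shape regardless.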

Inputs: input, h_0
input shape: input of shape (seq_len, batch, input_size): tensor containing the features of the input sequence. The input can also be a packed variable length sequence. See torch.nn.utils.rnn.pack_padded_sequence() or torch.nn.utils.rnn.pack_sequence() for details.

[https://blog.csdn.net/zwqjoy/article/details/86490098]
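As a rough sketch of the packed-sequence input mentioned above, the following pads a batch to a common length, packs it with pack_padded_sequence, and unpacks the result with pad_packed_sequence. The lengths [5, 3, 2] and the random data are made up for illustration.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

torch.manual_seed(0)
rnn = nn.RNN(input_size=10, hidden_size=20)

# Padded batch: (seq_len=5, batch=3, input_size=10).
# True lengths are sorted in descending order, as required by the default enforce_sorted=True.
padded = torch.randn(5, 3, 10)
lengths = torch.tensor([5, 3, 2])

packed = pack_padded_sequence(padded, lengths)
packed_out, hn = rnn(packed)                         # output is also a PackedSequence
output, out_lengths = pad_packed_sequence(packed_out)

print(output.shape)   # torch.Size([5, 3, 20])
print(out_lengths)    # tensor([5, 3, 2])

# h_n holds the state at each sequence's last valid step, not at the padded end.
print(torch.allclose(hn[0, 1], output[2, 1]))  # True
```

Positions past a sequence's true length come back as zeros in the unpacked output.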

h0 shape: h_0 of shape (num_layers * num_directions, batch, hidden_size): tensor containing the initial hidden state for each element in the batch. Defaults to zero if not provided. If the RNN is bidirectional, num_directions should be 2, else it should be 1. h0 supplies the initial state for every RNN layer, so its first dimension must match the num_layers (times num_directions) of the RNN.
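The h_0 shape rule and its zero default can be checked with a short sketch. The two-layer bidirectional configuration below is an assumption chosen so that num_layers * num_directions is 4.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

num_layers, num_directions, batch, hidden_size = 2, 2, 3, 20
rnn = nn.RNN(10, hidden_size, num_layers=num_layers, bidirectional=True)
x = torch.randn(5, batch, 10)

# Explicit zero initial state: (num_layers * num_directions, batch, hidden_size).
h0 = torch.zeros(num_layers * num_directions, batch, hidden_size)

out_default, hn_default = rnn(x)       # h_0 omitted, defaults to zeros
out_zeros, hn_zeros = rnn(x, h0)

print(h0.shape)                                  # torch.Size([4, 3, 20])
print(torch.allclose(out_default, out_zeros))    # True
print(torch.allclose(hn_default, hn_zeros))      # True
```

Passing an h_0 whose first dimension is not num_layers * num_directions raises an error.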

Outputs: output, h_n
output of shape (seq_len, batch, num_directions * hidden_size): tensor containing the output features (h_t) from the last layer of the RNN, for each t. If a torch.nn.utils.rnn.PackedSequence has been given as the input, the output will also be a packed sequence. For the unpacked case, the directions can be separated using output.view(seq_len, batch, num_directions, hidden_size), with forward and backward being direction 0 and 1 respectively. Similarly, the directions can be separated in the packed case. In the usual unrolled diagram, this is the output drawn at the top of the RNN.
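The direction split described above can be sketched as follows for a single-layer bidirectional RNN (the sizes are illustrative). The forward direction finishes at the last time step and the backward direction finishes at the first, which is why they line up with h_n as shown.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

seq_len, batch, hidden_size = 5, 3, 20
rnn = nn.RNN(10, hidden_size, bidirectional=True)
output, hn = rnn(torch.randn(seq_len, batch, 10))
print(output.shape)  # torch.Size([5, 3, 40])

# Split the last dimension into (num_directions, hidden_size).
directions = output.view(seq_len, batch, 2, hidden_size)
forward_out = directions[:, :, 0, :]    # direction 0: forward
backward_out = directions[:, :, 1, :]   # direction 1: backward

# The forward pass ends at t = seq_len - 1; the backward pass ends at t = 0.
print(torch.allclose(forward_out[-1], hn[0]))   # True
print(torch.allclose(backward_out[0], hn[1]))   # True
```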

h_n of shape (num_layers * num_directions, batch, hidden_size): tensor containing the hidden state for t = seq_len. Like output, the layers can be separated using h_n.view(num_layers, num_directions, batch, hidden_size). In the usual unrolled diagram, this is the output at the right-hand end of the RNN; a bidirectional RNN also has one at the left-hand end.
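A short sketch of separating h_n by layer and direction, assuming a two-layer bidirectional RNN. Concatenating the top layer's two final states into one vector per sequence is a common convention shown here for illustration, not something the original text prescribes.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

num_layers, num_directions, batch, hidden_size = 2, 2, 3, 20
rnn = nn.RNN(10, hidden_size, num_layers=num_layers, bidirectional=True)
output, hn = rnn(torch.randn(5, batch, 10))

# (num_layers * num_directions, batch, hidden) -> (num_layers, num_directions, batch, hidden)
hn_by_layer = hn.view(num_layers, num_directions, batch, hidden_size)
top_forward = hn_by_layer[-1, 0]    # last layer, forward direction
top_backward = hn_by_layer[-1, 1]   # last layer, backward direction

# One fixed-size summary vector per sequence (illustrative convention).
summary = torch.cat([top_forward, top_backward], dim=1)
print(summary.shape)  # torch.Size([3, 40])

# The top layer's final states also appear inside output.
print(torch.allclose(top_forward, output[-1, :, :hidden_size]))   # True
print(torch.allclose(top_backward, output[0, :, hidden_size:]))   # True
```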

For the full list of parameters and return values, see [https://pytorch.org/docs/stable/generated/torch.nn.RNN.html#torch.nn.RNN]

Example

import torch
import torch.nn as nn

rnn = nn.RNN(10, 20, 2)        # (input_size, hidden_size, num_layers)
input = torch.randn(5, 3, 10)  # (seq_len, batch, input_size)
h0 = torch.randn(2, 3, 20)     # (num_layers * num_directions, batch, hidden_size)
output, hn = rnn(input, h0)
print(output.size(), hn.size())  # torch.Size([5, 3, 20]) torch.Size([2, 3, 20])

LSTM

For the full list of parameters and return values, see [https://pytorch.org/docs/stable/generated/torch.nn.LSTM.html#torch.nn.LSTM]

Example

import torch
import torch.nn as nn

rnn = nn.LSTM(10, 20, 2)       # (input_size, hidden_size, num_layers)
input = torch.randn(5, 3, 10)  # (seq_len, batch, input_size)
h0 = torch.randn(2, 3, 20)     # (num_layers * num_directions, batch, hidden_size)
c0 = torch.randn(2, 3, 20)     # same shape as h0: the initial cell state
output, (hn, cn) = rnn(input, (h0, c0))  # output: (seq_len, batch, num_directions * hidden_size)
print(output.size(), hn.size(), cn.size())  # torch.Size([5, 3, 20]) torch.Size([2, 3, 20]) torch.Size([2, 3, 20])
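A related point that often causes confusion is batch_first. The sketch below, using the same sizes as the LSTM example, suggests that batch_first=True changes only the layout of input and output; h_n and c_n keep the (num_layers * num_directions, batch, hidden_size) layout. The initial states are omitted here, so they default to zeros.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

lstm = nn.LSTM(10, 20, 2, batch_first=True)
x = torch.randn(3, 5, 10)          # (batch, seq_len, input_size)
output, (hn, cn) = lstm(x)         # (h_0, c_0) omitted: both default to zeros

print(output.shape)  # torch.Size([3, 5, 20])  -> (batch, seq_len, hidden_size)
print(hn.shape)      # torch.Size([2, 3, 20])  -> still (num_layers, batch, hidden_size)
print(cn.shape)      # torch.Size([2, 3, 20])

# For a unidirectional LSTM, the last time step of output is the top layer's h_n.
print(torch.allclose(output[:, -1, :], hn[-1]))  # True
```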

GRU

gru = nn.GRU(embed_size, hidden_size, n_layers, dropout=dropout, bidirectional=True)

For the full list of parameters, see: [https://pytorch.org/docs/stable/generated/torch.nn.GRU.html#gru]
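The GRU constructor call above looks like the recurrent core of a seq2seq encoder. A minimal sketch of how such an encoder might be wrapped is given below. The class name EncoderRNN is taken from the title; vocab_size, the nn.Embedding layer, and summing the two directions are assumptions added for illustration and are not stated in the original.

```python
import torch
import torch.nn as nn


class EncoderRNN(nn.Module):
    """Hypothetical encoder: token ids -> embeddings -> bidirectional GRU."""

    def __init__(self, vocab_size, embed_size, hidden_size, n_layers=1, dropout=0.0):
        super().__init__()
        self.hidden_size = hidden_size
        self.embedding = nn.Embedding(vocab_size, embed_size)  # assumed, not in the original
        self.gru = nn.GRU(embed_size, hidden_size, n_layers,
                          dropout=dropout, bidirectional=True)

    def forward(self, src, hidden=None):
        embedded = self.embedding(src)                 # (seq_len, batch, embed_size)
        outputs, hidden = self.gru(embedded, hidden)   # (seq_len, batch, 2 * hidden_size)
        # Assumed design choice: sum forward and backward halves to keep hidden_size.
        outputs = outputs[:, :, :self.hidden_size] + outputs[:, :, self.hidden_size:]
        return outputs, hidden


torch.manual_seed(0)
encoder = EncoderRNN(vocab_size=100, embed_size=8, hidden_size=16, n_layers=2, dropout=0.1)
src = torch.randint(0, 100, (7, 3))   # (seq_len, batch) of token ids
outputs, hidden = encoder(src)
print(outputs.shape)  # torch.Size([7, 3, 16])
print(hidden.shape)   # torch.Size([4, 3, 16])
```

Concatenating the two directions instead of summing them is equally common; it simply doubles the feature size seen by whatever consumes the encoder outputs.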

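Example

import torch
import torch.nn as nn

rnn = nn.GRU(2, 4, 2, bidirectional=True)  # (input_size, hidden_size, num_layers)
input = torch.randn(2, 2, 2)               # (seq_len, batch, input_size)
h0 = torch.randn(4, 2, 4)                  # (num_layers * num_directions, batch, hidden_size)
output, hn = rnn(input, h0)
print(output)
print(hn)
print(output.size(), hn.size())  # torch.Size([2, 2, 8]) torch.Size([4, 2, 4])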

from: -柚子皮-

ref: [LSTM和GRU原理及pytorch代码，输入输出大小说明] (LSTM and GRU principles and PyTorch code, with input/output sizes explained)
