SessionRunHook in TensorFlow

Author: qaoxiuzcwhyx | Source: Internet | 2023-08-13 13:13

Contents

- 1. Why do we need Hooks?
- 2. What are Hooks used for?
- 3. Which Hooks are built into TF?
- 4. How do you write a custom Hook in TF?
- 5. How do you use Hooks?
  - 5.1 Using Hooks in MonitoredTrainingSession
  - 5.2 Using Hooks in Estimator
  - 5.3 Using Hooks in slim
- 6. How do Hooks work?
- 7. A look inside a built-in Hook
- 8. References

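1. Why do we need Hooks?

SessionRunHook extends the behavior of session.run() in the high-level APIs that wrap the session for you. Because those APIs own the calls to session.run(), a hook is the way to inject your own logic around each call.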

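2. What are Hooks used for?

SessionRunHooks are useful for tracking training, reporting progress, requesting early stopping, and similar tasks.

SessionRunHooks follow the observer pattern. Their design revolves around four key points in time:

- before the session is used
- before each call to session.run()
- after each call to session.run()
- before the session is closed

A SessionRunHook encapsulates a reusable, composable piece of computation that piggybacks on a call to session.run(). A hook can add any ops or tensors to fetch, as well as feeds, to the run() call, and once the call completes successfully it receives the outputs it requested. A hook may add ops to the graph in its begin() method, but note that the graph is finalized once begin() has been called.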

3. Which Hooks are built into TF?

TensorFlow ships with a number of built-in Hooks, including:

- StopAtStepHook: requests a stop based on global_step.
- CheckpointSaverHook: saves checkpoints.
- LoggingTensorHook: logs the values of one or more tensors.
- NanTensorHook: monitors a loss tensor and stops training if it becomes NaN (by default by raising an error).
- SummarySaverHook: saves summaries to a summary writer.

4. How do you write a custom Hook in TF?

The previous section covered the prebuilt Hooks, which handle a number of common tasks. If none of them meets your needs, writing a custom Hook is a good option.

Here is a template for a custom Hook:

    import tensorflow as tf


    class ExampleHook(tf.train.SessionRunHook):

        def begin(self):
            # You can add ops to the graph here.
            print('Starting the session.')
            self.your_tensor = ...

        def after_create_session(self, session, coord):
            # When this is called, the graph is finalized and
            # ops can no longer be added to the graph.
            print('Session created.')

        def before_run(self, run_context):
            print('Before calling session.run().')
            return tf.train.SessionRunArgs(self.your_tensor)

        def after_run(self, run_context, run_values):
            # run_values.results holds the value fetched for self.your_tensor.
            print('Done running one step. The value of my tensor: %s'
                  % (run_values.results,))
            if you_need_to_stop_loop:  # Placeholder: substitute your own stop condition.
                run_context.request_stop()

        def end(self, session):
            print('Done with the session.')

The template above comes from the official documentation. Below is a Hook I wrote that sets the learning rate:

    import tensorflow as tf


    class _LearningRateSetterHook(tf.train.SessionRunHook):
        """Sets learning_rate based on global step."""

        def begin(self):
            self._global_step_tensor = tf.train.get_or_create_global_step()
            # The tensor is looked up by name, so give the learning-rate op
            # the name 'learning_rate' when you define it.
            self._lrn_rate_tensor = tf.get_default_graph().get_tensor_by_name(
                'learning_rate:0')
            self._lrn_rate = 0.1  # Learning rate for stage 1.

        def before_run(self, run_context):
            return tf.train.SessionRunArgs(
                self._global_step_tensor,  # Asks for global step value.
                feed_dict={self._lrn_rate_tensor: self._lrn_rate})  # Sets learning rate.

        def after_run(self, run_context, run_values):
            train_step = run_values.results
            if train_step < 10000:
                pass
            elif train_step < 20000:
                self._lrn_rate = 0.01  # Learning rate for stage 2.
            elif train_step < 30000:
                self._lrn_rate = 0.001  # Learning rate for stage 3.
            else:
                self._lrn_rate = 0.0001  # Learning rate for stage 4.
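The schedule this Hook encodes can be checked separately from TensorFlow. The sketch below restates the same step boundaries as a plain Python function; the names `BOUNDARIES`, `RATES`, and `learning_rate_for_step` are made up for illustration and are not part of the Hook or of TensorFlow. Note that the Hook picks the rate in after_run(), so a newly selected rate is only fed on the following run() call.

```python
import bisect

# Step boundaries and rates copied from the Hook above.
BOUNDARIES = [10000, 20000, 30000]
RATES = [0.1, 0.01, 0.001, 0.0001]


def learning_rate_for_step(train_step):
    """Returns the rate the Hook selects after observing `train_step`."""
    # bisect_right counts how many boundaries are <= train_step,
    # which is the index of the matching stage.
    return RATES[bisect.bisect_right(BOUNDARIES, train_step)]


for step in (0, 9999, 10000, 19999, 20000, 30000):
    print(step, learning_rate_for_step(step))
```

Keeping the schedule in a small pure function like this makes the boundaries easy to unit test before wiring them into a Hook.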

5. How do you use Hooks?

In the high-level APIs that wrap a session, Hooks let us extend the behavior of the session.run() calls those APIs make.

First, which high-level APIs wrap a session? They include, but are not limited to:

- tf.train.MonitoredTrainingSession
- tf.estimator.Estimator
- tf.contrib.slim

5.1 Using Hooks in `MonitoredTrainingSession`

    with tf.train.MonitoredTrainingSession(hooks=your_hooks, ...) as mon_sess:
        while not mon_sess.should_stop():
            mon_sess.run(your_fetches)

5.2 Using Hooks in `Estimator`

The train, evaluate, and predict methods of tf.estimator.Estimator all accept Hooks through their hooks argument.

These are the signatures of those methods:

    # Here est is an Estimator instance.

    # Training
    est.train(input_fn, hooks=None, steps=None, max_steps=None, saving_listeners=None)

    # Evaluation
    est.evaluate(input_fn, steps=None, hooks=None, checkpoint_path=None, name=None)

    # Prediction
    est.predict(input_fn, predict_keys=None, hooks=None, checkpoint_path=None, yield_single_examples=True)

5.3 Using Hooks in `slim`

Slim is an excellent high-level API in TensorFlow that greatly simplifies building, training, and evaluating models.

To be continued.

6. How do Hooks work?

From writing a custom Hook, we know that a Hook has five methods: begin, after_create_session, before_run, after_run, and end.

The following pseudocode shows how Hooks are driven:

    # Pseudocode
    call hooks.begin()
    sess = tf.Session()
    call hooks.after_create_session()
    while not stop is requested:
        call hooks.before_run()
        try:
            results = sess.run(merged_fetches, feed_dict=merged_feeds)
        except (errors.OutOfRangeError, StopIteration):
            break
        call hooks.after_run()
    call hooks.end()
    sess.close()

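Note: if sess.run() raises OutOfRangeError or StopIteration, hooks.after_run() is not called, but hooks.end() is still called. If sess.run() raises any other exception, neither hooks.after_run() nor hooks.end() is called.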
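The control flow in the pseudocode can be exercised without TensorFlow. The sketch below is a plain-Python stand-in, not TensorFlow code: `RecordingHook`, `run_with_hooks`, `step_fn`, and `max_steps` are hypothetical names, `step_fn` plays the role of sess.run(), and a step limit replaces the "stop is requested" check. It only illustrates which callbacks fire when the step function raises.

```python
class RecordingHook:
    """Stand-in for a SessionRunHook that records which callbacks fire."""

    def __init__(self):
        self.calls = []

    def begin(self):
        self.calls.append("begin")

    def after_create_session(self):
        self.calls.append("after_create_session")

    def before_run(self):
        self.calls.append("before_run")

    def after_run(self, result):
        self.calls.append("after_run")

    def end(self):
        self.calls.append("end")


def run_with_hooks(step_fn, hooks, max_steps):
    """Mimics the pseudocode loop; step_fn stands in for sess.run()."""
    for h in hooks:
        h.begin()
    for h in hooks:
        h.after_create_session()
    for _ in range(max_steps):
        for h in hooks:
            h.before_run()
        try:
            result = step_fn()
        except StopIteration:
            # Input exhausted: after_run() is skipped, end() still runs.
            break
        for h in hooks:
            h.after_run(result)
    for h in hooks:
        h.end()


# Case 1: the input runs out after two steps.
batches = iter([10, 20])
hook = RecordingHook()
run_with_hooks(lambda: next(batches), [hook], max_steps=5)
print(hook.calls)


# Case 2: any other exception propagates, so end() is never reached.
def failing_step():
    raise ValueError("unexpected failure")


failed_hook = RecordingHook()
try:
    run_with_hooks(failing_step, [failed_hook], max_steps=5)
except ValueError:
    pass
print(failed_hook.calls)
```

In case 1 the recorded calls finish with a third "before_run" followed directly by "end"; in case 2 the record stops at the first "before_run". A practical consequence is that cleanup placed in end() cannot be relied on to run after an unexpected failure.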

7. A look inside a built-in Hook

There are quite a few prebuilt Hooks. Here we take tf.train.StopAtStepHook as an example to see how a built-in Hook is written.

    # Definition of tf.train.StopAtStepHook
    class StopAtStepHook(tf.train.SessionRunHook):
        """Hook that requests stop at a specified step."""

        def __init__(self, num_steps=None, last_step=None):
            """Initializes a `StopAtStepHook`.

            This hook requests stop after either a number of steps have been
            executed or a last step has been reached. Only one of the two options
            can be specified.

            If `num_steps` is specified, it indicates the number of steps to execute
            after `begin()` is called. If instead `last_step` is specified, it
            indicates the last step we want to execute, as passed to the
            `after_run()` call.

            Args:
              num_steps: Number of steps to execute.
              last_step: Step after which to stop.

            Raises:
              ValueError: If one of the arguments is invalid.
            """
            if num_steps is None and last_step is None:
                raise ValueError("One of num_steps or last_step must be specified.")
            if num_steps is not None and last_step is not None:
                raise ValueError("Only one of num_steps or last_step can be specified.")
            self._num_steps = num_steps
            self._last_step = last_step

        def begin(self):
            self._global_step_tensor = tf.train.get_or_create_global_step()
            if self._global_step_tensor is None:
                raise RuntimeError("Global step should be created to use StopAtStepHook.")

        def after_create_session(self, session, coord):
            if self._last_step is None:
                global_step = session.run(self._global_step_tensor)
                self._last_step = global_step + self._num_steps

        def before_run(self, run_context):  # pylint: disable=unused-argument
            return tf.train.SessionRunArgs(self._global_step_tensor)

        def after_run(self, run_context, run_values):
            global_step = run_values.results + 1
            if global_step >= self._last_step:
                # Check latest global step to ensure that the targeted last step is
                # reached. global_step read tensor is the value of global step
                # before running the operation. We're not sure whether current
                # session.run incremented the global_step or not. Here we're
                # checking it.
                step = run_context.session.run(self._global_step_tensor)
                if step >= self._last_step:
                    run_context.request_stop()
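The difference between num_steps and last_step matters most when training resumes from a checkpoint. The sketch below restates the argument checks from `__init__` and the arithmetic from `after_create_session()` as one plain function. `resolve_last_step` and `start_step` are hypothetical names, and folding the two methods into a single function is a simplification for illustration, not how the Hook itself is structured.

```python
def resolve_last_step(start_step, num_steps=None, last_step=None):
    """Returns the global step at which the Hook would request a stop."""
    if num_steps is None and last_step is None:
        raise ValueError("One of num_steps or last_step must be specified.")
    if num_steps is not None and last_step is not None:
        raise ValueError("Only one of num_steps or last_step can be specified.")
    if last_step is not None:
        # An absolute target, independent of where the session starts.
        return last_step
    # Mirrors after_create_session(): the budget is counted from the
    # global step read when the session is created.
    return start_step + num_steps


# Resuming at global step 1000:
print(resolve_last_step(1000, num_steps=500))   # relative budget
print(resolve_last_step(1000, last_step=1200))  # absolute target
```

With num_steps=500 the run started at step 1000 stops at step 1500, whereas last_step=1200 stops at step 1200 regardless of where the run started.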

8. References

SessionRunHook source code: link
The tf.train.SessionRunHook() class explained: link
Hook? An introduction to tf.train.SessionRunHook() [recommended]: link

Note: reposting is welcome, but please credit the source:
https://blog.csdn.net/u014061630/article/details/82998116
