tensorflow使用gpu

作者：王言 | 来源：互联网 | 2023-09-25 20:41

支持的设备在一套标准的系统上通常有多个计算设备.TensorFlow支持CPU和GPU这两种设备.我们用指定字符串strings来标识这些设备.比如:“cpu:0”:机器中的CP

支持的设备
在一套标准的系统上通常有多个计算设备. TensorFlow 支持 CPU 和 GPU 这两种设备. 我们用指定字符串 strings 来标识这些设备. 比如:

“/cpu:0”: 机器中的 CPU
“/gpu:0”: 机器中的 GPU, 如果你有一个的话.
“/gpu:1”: 机器中的第二个 GPU, 以此类推…
如果一个 TensorFlow 的 operation 中兼有 CPU 和 GPU 的实现, 当这个算子被指派设备时, GPU 有优先权. 比如matmul中 CPU 和 GPU kernel 函数都存在. 那么在 cpu:0 和 gpu:0 中, matmul operation 会被指派给 gpu:0 .

记录设备指派情况
为了获取你的 operations 和 Tensor 被指派到哪个设备上运行, 用 log_device_placement 新建一个 session, 并设置为 True.

新建一个 graph.
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

运行这个 op.
print sess.run(c)
你应该能看见以下输出:
Device mapping:
/job:localhost/replica:0/task:0/gpu:0 -> device: 0, name: Tesla K40c, pci bus
id: 0000:05:00.0
b: /job:localhost/replica:0/task:0/gpu:0
a: /job:localhost/replica:0/task:0/gpu:0
MatMul: /job:localhost/replica:0/task:0/gpu:0
[[ 22. 28.]
[ 49. 64.]]
手工指派设备
如果你不想使用系统来为 operation 指派设备, 而是手工指派设备, 你可以用 with tf.device 创建一个设备环境, 这个环境下的 operation 都统一运行在环境指定的设备上.

新建一个graph.
with tf.device(‘/cpu:0’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

运行这个op.
print sess.run(c)
你会发现现在 a 和 b 操作都被指派给了 cpu:0.
Device mapping:
/job:localhost/replica:0/task:0/gpu:0 -> device: 0, name: Tesla K40c, pci bus
id: 0000:05:00.0
b: /job:localhost/replica:0/task:0/cpu:0
a: /job:localhost/replica:0/task:0/cpu:0
MatMul: /job:localhost/replica:0/task:0/gpu:0
[[ 22. 28.]
[ 49. 64.]]
在多GPU系统里使用单一GPU
如果你的系统里有多个 GPU, 那么 ID 最小的 GPU 会默认使用. 如果你想用别的 GPU, 可以用下面的方法显式的声明你的偏好:

新建一个 graph.
with tf.device(‘/gpu:2’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

运行这个 op.
print sess.run(c)
如果你指定的设备不存在, 你会收到 InvalidArgumentError 错误提示:
InvalidArgumentError: Invalid argument: Cannot assign a device to node ‘b’:
Could not satisfy explicit device specification ‘/gpu:2’
[Node: b &＃61; Const[dtype&＃61;DT_FLOAT, value&＃61;Tensor

新建一个 graph.
with tf.device(‘/gpu:2’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(
allow_soft_placement&＃61;True, log_device_placement&＃61;True))

运行这个 op.
print sess.run(c)
使用多个 GPU
如果你想让 TensorFlow 在多个 GPU 上运行, 你可以建立 multi-tower 结构, 在这个结构里每个 tower 分别被指配给不同的 GPU 运行. 比如:

新建一个 graph.
c &＃61; []
for d in [‘/gpu:2’, ‘/gpu:3’]:
with tf.device(d):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3])
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2])
c.append(tf.matmul(a, b))
with tf.device(‘/cpu:0’):
sum &＃61; tf.add_n(c)

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

运行这个op.
print sess.run(sum)
你会看到如下输出:
Device mapping:
/job:localhost/replica:0/task:0/gpu:0 -> device: 0, name: Tesla K20m, pci bus
id: 0000:02:00.0
/job:localhost/replica:0/task:0/gpu:1 -> device: 1, name: Tesla K20m, pci bus
id: 0000:03:00.0
/job:localhost/replica:0/task:0/gpu:2 -> device: 2, name: Tesla K20m, pci bus
id: 0000:83:00.0
/job:localhost/replica:0/task:0/gpu:3 -> device: 3, name: Tesla K20m, pci bus
id: 0000:84:00.0
Const_3: /job:localhost/replica:0/task:0/gpu:3
Const_2: /job:localhost/replica:0/task:0/gpu:3
MatMul_1: /job:localhost/replica:0/task:0/gpu:3
Const_1: /job:localhost/replica:0/task:0/gpu:2
Const: /job:localhost/replica:0/task:0/gpu:2
MatMul: /job:localhost/replica:0/task:0/gpu:2
AddN: /job:localhost/replica:0/task:0/cpu:0
[[ 44. 56.]
[ 98. 128.]]
cifar10 tutorial 这个例子很好的演示了怎样用GPU集群训练.

推荐阅读

search
编写有趣的VBScript恶作剧脚本

本文将介绍如何编写一些有趣的VBScript脚本，这些脚本可以在朋友之间进行无害的恶作剧。通过简单的代码示例，帮助您了解VBScript的基本语法和功能。 ... [详细]

蜡笔小新 2024-12-28 09:46:23
header
深入解析Spring Cloud Ribbon负载均衡机制

本文详细介绍了Spring Cloud中的Ribbon组件如何实现服务调用的负载均衡。通过分析其工作原理、源码结构及配置方式，帮助读者理解Ribbon在分布式系统中的重要作用。 ... [详细]

蜡笔小新 2024-12-27 16:01:25
process
深入解析 Spring Security 用户认证机制

本文将详细介绍 Spring Security 中用户登录认证的核心流程，重点分析 AbstractAuthenticationProcessingFilter 和 AuthenticationManager 的工作原理。通过理解这些组件的实现，读者可以更好地掌握 Spring Security 的认证机制。 ... [详细]

蜡笔小新 2024-12-25 16:00:21
bash
Dockerfile 编写与 Docker 网络配置详解

本文详细介绍了 Dockerfile 的编写方法及其在网络配置中的应用，涵盖基础指令、镜像构建与发布流程，并深入探讨了 Docker 的默认网络、容器互联及自定义网络的实现。 ... [详细]

蜡笔小新 2024-12-27 17:31:41
text
使用 SQLiteJDBC 和 HikariCP 实现 Java 程序连接 SQLite 数据库

本文介绍了如何通过 Maven 依赖引入 SQLiteJDBC 和 HikariCP 包，从而在 Java 应用中高效地连接和操作 SQLite 数据库。文章提供了详细的代码示例，并解释了每个步骤的实现细节。 ... [详细]

蜡笔小新 2024-12-26 17:34:42
eval
TensorFlow 2.0 实战：多层感知机（MLP）网络入门

本教程详细介绍了如何使用 TensorFlow 2.0 构建和训练多层感知机（MLP）网络，涵盖回归和分类任务。通过具体示例和代码实现，帮助初学者快速掌握 TensorFlow 的核心概念和操作。 ... [详细]

蜡笔小新 2024-12-22 19:56:15
install
CentOS 6.8 上安装 Oracle 10.2.0.1 的常见问题及解决方案

本文记录了在 CentOS 6.8 系统上安装 Oracle 10.2.0.1 数据库时遇到的问题及解决方法，包括依赖库缺失、操作系统版本不兼容、用户权限不足等问题。 ... [详细]

蜡笔小新 2024-12-20 17:19:23
ip
配置SecureCRT以显示Linux终端颜色

本文介绍如何配置SecureCRT以正确显示Linux终端的颜色，并解决中文显示问题。通过简单的步骤设置，可以显著提升使用体验。 ... [详细]

蜡笔小新 2024-12-19 18:30:14
php
深入浅出TensorFlow数据读写机制

本文详细介绍TensorFlow中的数据读写操作，包括TFRecord文件的创建与读取，以及数据集（dataset）的相关概念和使用方法。 ... [详细]

蜡笔小新 2024-12-19 16:23:17
header
Feign远程调用请求头丢失问题分析与解决方案

本文详细探讨了在微服务架构中，使用Feign进行远程调用时出现的请求头丢失问题，并提供了具体的解决方案。重点讨论了单线程和异步调用两种场景下的处理方法。 ... [详细]

蜡笔小新 2024-12-19 10:17:16
text
JSP核心知识点解析与实践

本文详细介绍了JSP（Java Server Pages）的九大内置对象及其功能，探讨了JSP与Servlet之间的关系及差异，并提供了实际编码示例。此外，还讨论了网页开发中常见的编码转换问题以及JSP的两种页面跳转方式。 ... [详细]

蜡笔小新 2024-12-18 23:42:09
php
MySQL锁机制详解

本文深入探讨了MySQL中的锁机制，包括表级锁、行级锁以及元数据锁，通过实例详细解释了各种锁的工作原理及其应用场景。同时，文章还介绍了如何通过锁来优化数据库性能，避免常见的并发问题。 ... [详细]

蜡笔小新 2024-12-18 14:24:14
spring
Spring 4.2.5 中 Bean 在 ContextRefreshedEvent 上未启用事务代理的问题

本文探讨了一个特定于 Spring 4.2.5 的问题，即在应用上下文刷新事件（ContextRefreshedEvent）触发时，带有 @Transactional 注解的 Bean 未能正确代理事务。该问题在 Spring 4.1.9 版本中正常运行，但在升级至 4.2.5 后出现异常。 ... [详细]

蜡笔小新 2024-12-18 09:47:38
spring
Transforming the Future of Virtual Worlds

Explore how Matterverse is redefining the metaverse experience, creating immersive and meaningful virtual environments that foster genuine connections and economic opportunities. ... [详细]

蜡笔小新 2024-12-28 09:44:49
php
使用Vultr云服务器和Namesilo域名搭建个人网站

本文详细介绍了如何通过Vultr云服务器和Namesilo域名搭建一个功能齐全的个人网站，包括购买、配置服务器以及绑定域名的具体步骤。文章还提供了详细的命令行操作指南，帮助读者顺利完成建站过程。 ... [详细]

蜡笔小新 2024-12-26 16:36:34

王言

这个家伙很懒，什么也没留下！

Tags | 热门标签

RankList | 热门文章

tensorflow使用gpu

新建一个 graph. a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’) b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’) c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True. sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个graph. with tf.device(‘/cpu:0’): a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’) b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’) c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True. sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个 graph. with tf.device(‘/gpu:2’): a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’) b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’) c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True. sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个 graph. with tf.device(‘/gpu:2’): a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’) b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’) c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True. sess &＃61; tf.Session(config&＃61;tf.ConfigProto( allow_soft_placement&＃61;True, log_device_placement&＃61;True))

运行这个 op. print sess.run(c) 使用多个 GPU 如果你想让 TensorFlow 在多个 GPU 上运行, 你可以建立 multi-tower 结构, 在这个结构 里每个 tower 分别被指配给不同的 GPU 运行. 比如:

新建session with log_device_placement并设置为True. sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个 graph.
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个graph.
with tf.device(‘/cpu:0’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个 graph.
with tf.device(‘/gpu:2’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))

新建一个 graph.
with tf.device(‘/gpu:2’):
a &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[2, 3], name&＃61;’a’)
b &＃61; tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape&＃61;[3, 2], name&＃61;’b’)
c &＃61; tf.matmul(a, b)

新建 session with log_device_placement 并设置为 True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(
allow_soft_placement&＃61;True, log_device_placement&＃61;True))

运行这个 op.
print sess.run(c)
使用多个 GPU
如果你想让 TensorFlow 在多个 GPU 上运行, 你可以建立 multi-tower 结构, 在这个结构里每个 tower 分别被指配给不同的 GPU 运行. 比如:

新建session with log_device_placement并设置为True.
sess &＃61; tf.Session(config&＃61;tf.ConfigProto(log_device_placement&＃61;True))