
Loading the MNIST Handwritten Digit Dataset in Python

Author: 版中凌菱 | Source: Internet | 2023-10-12 19:20


The MNIST dataset is small, so it is commonly used as a first training set when getting started with machine learning.

It consists of four files:

train-images-idx3-ubyte: training set images

train-labels-idx1-ubyte: training set labels

t10k-images-idx3-ubyte: test set images

t10k-labels-idx1-ubyte: test set labels

The training set contains 60000 examples, and the test set 10000 examples. The data is stored in binary files, and the images are black-and-white (8-bit grayscale).
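To make the binary layout concrete, here is a minimal sketch of how the header of such a file can be decoded. It assumes the IDX layout used by the MNIST files: two zero bytes, one data-type byte (0x08 for unsigned byte), one byte giving the number of dimensions, then one 4-byte big-endian size per dimension. The helper name `parse_idx_header` is hypothetical, and the header bytes are synthetic rather than read from a real file.

```python
import struct


# Hypothetical helper, not part of the original article.
def parse_idx_header(buf):
    """Return (dtype_code, dims, data_offset) for an IDX byte string."""
    # Bytes 0-1 are zero, byte 2 is the data type code (0x08 = unsigned byte),
    # byte 3 is the number of dimensions.
    zero, dtype_code, ndim = struct.unpack('>HBB', buf[:4])
    if zero != 0:
        raise ValueError('not an IDX file')
    # Each dimension size is a 4-byte big-endian unsigned int.
    dims = struct.unpack('>' + 'I' * ndim, buf[4:4 + 4 * ndim])
    return dtype_code, dims, 4 + 4 * ndim


# Synthetic header shaped like train-images-idx3-ubyte: 60000 images of 28x28.
header = struct.pack('>IIII', 0x00000803, 60000, 28, 28)
print(parse_idx_header(header))  # (8, (60000, 28, 28), 16)
```

The returned offset (16 for image files, 8 for label files) is where the raw pixel or label bytes begin.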

The code below loads the dataset:

import os
import struct

import numpy as np
import matplotlib.pyplot as plt


def load_mnist():
    '''
    Load the MNIST data.
    http://yann.lecun.com/exdb/mnist/
    60000 training examples
    10000 test examples

    Arguments:
        none

    Return:
        train_images, test_images: n*m arrays, n is the sample count,
            m is the feature count, which is 28*28
        train_labels, test_labels: class label for each image (0-9)
    '''
    root_path = '/home/cc/deep_learning/data_sets/mnist'
    # Note: these names use '.' before 'idx', while the list above uses '-';
    # adjust them to match the files on disk.
    train_labels_path = os.path.join(root_path, 'train-labels.idx1-ubyte')
    train_images_path = os.path.join(root_path, 'train-images.idx3-ubyte')
    test_labels_path = os.path.join(root_path, 't10k-labels.idx1-ubyte')
    test_images_path = os.path.join(root_path, 't10k-images.idx3-ubyte')

    with open(train_labels_path, 'rb') as lpath:
        # '>' denotes big-endian byte order
        # 'I' denotes a 4-byte unsigned int
        magic, n = struct.unpack('>II', lpath.read(8))
        # np.float was removed from NumPy; np.float64 is the same type
        train_labels = np.fromfile(lpath, dtype=np.uint8).astype(np.float64)

    with open(train_images_path, 'rb') as ipath:
        magic, num, rows, cols = struct.unpack('>IIII', ipath.read(16))
        # the 16-byte header has been consumed, so the rest is pixel data
        loaded = np.fromfile(ipath, dtype=np.uint8)
        train_images = loaded.reshape(num, rows * cols).astype(np.float64)

    with open(test_labels_path, 'rb') as lpath:
        magic, n = struct.unpack('>II', lpath.read(8))
        test_labels = np.fromfile(lpath, dtype=np.uint8).astype(np.float64)

    with open(test_images_path, 'rb') as ipath:
        magic, num, rows, cols = struct.unpack('>IIII', ipath.read(16))
        loaded = np.fromfile(ipath, dtype=np.uint8)
        # cast to float64 so the test images match the training images
        test_images = loaded.reshape(num, rows * cols).astype(np.float64)

    return train_images, train_labels, test_images, test_labels
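Because all four files share the same header layout, the four near-identical blocks in `load_mnist` could be folded into one reader. The following is only a sketch under that assumption, not the author's method: `read_idx` is a hypothetical helper, it handles unsigned-byte files only, and the round-trip check uses a tiny synthetic file instead of the real dataset.

```python
import os
import struct
import tempfile

import numpy as np


def read_idx(path):
    """Read an unsigned-byte IDX file into a NumPy array (hypothetical helper)."""
    with open(path, 'rb') as f:
        zero, dtype_code, ndim = struct.unpack('>HBB', f.read(4))
        if zero != 0 or dtype_code != 0x08:
            raise ValueError('expected an unsigned-byte IDX file: %s' % path)
        # One 4-byte big-endian size per dimension.
        dims = struct.unpack('>' + 'I' * ndim, f.read(4 * ndim))
        data = np.frombuffer(f.read(), dtype=np.uint8)
    return data.reshape(dims)


# Round-trip check on a tiny synthetic file (2 images of 3x3 pixels).
pixels = np.arange(18, dtype=np.uint8).reshape(2, 3, 3)
tmp_path = os.path.join(tempfile.mkdtemp(), 'tiny-images-idx3-ubyte')
with open(tmp_path, 'wb') as f:
    f.write(struct.pack('>IIII', 0x00000803, 2, 3, 3))
    f.write(pixels.tobytes())

images = read_idx(tmp_path)
# Flatten each image into one row, the same layout load_mnist returns.
flat = images.reshape(len(images), -1).astype(np.float64)
print(images.shape, flat.shape)  # (2, 3, 3) (2, 9)
```

Reading the dimensions from the header, rather than hard-coding 784, also means a truncated or mismatched file fails at the reshape instead of silently producing misaligned images.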

Now let's see what the images look like:

def test_mnist_data():
    '''
    Just to check the data.

    Argument:
        none

    Return:
        none
    '''
    train_images, train_labels, test_images, test_labels = load_mnist()
    fig, ax = plt.subplots(nrows=2, ncols=5, sharex=True, sharey=True)
    ax = ax.flatten()
    for i in range(10):
        img = train_images[i].reshape(28, 28)
        ax[i].imshow(img, cmap='Greys', interpolation='nearest')
        print('corresponding labels = %d' % train_labels[i])
    # without show(), no window appears when this runs as a script
    plt.show()


if __name__ == '__main__':
    test_mnist_data()

The output is a 2×5 grid showing the first ten training digits, with their labels printed to the console.
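When no plotting window is available, a text rendering can serve as the same sanity check. This is a minimal sketch: the helper name `ascii_digit` and the threshold of 128 are assumptions chosen for illustration, and the demo uses a synthetic 4×4 array instead of a real digit.

```python
import numpy as np


def ascii_digit(row, rows=28, cols=28, threshold=128):
    """Render one flattened image as text (hypothetical helper)."""
    img = np.asarray(row).reshape(rows, cols)
    # '#' marks pixels at or above the threshold, '.' marks the rest.
    return '\n'.join(
        ''.join('#' if v >= threshold else '.' for v in line) for line in img
    )


# Synthetic 4x4 "image" with a bright diagonal, standing in for a real digit.
demo = np.zeros((4, 4), dtype=np.uint8)
np.fill_diagonal(demo, 255)
print(ascii_digit(demo.reshape(-1), rows=4, cols=4))
```

With the real data, `print(ascii_digit(train_images[0]))` would print the first training digit using the default 28×28 shape.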

