[Deep Learning] Andrew Ng NetEase Open Course Exercises (class1 week4)

Author: 不分手得恋爱假的_457 | Source: Internet | 2023-10-10 16:14

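Overview

The class1 week3 task was to implement a neural network with a single hidden layer. This task implements a deep, fully connected neural network with L layers. The key point is essentially the same as in week3: work out the dimensions of every parameter correctly.

Key variables:

m: number of training examples
n[l]: number of units in layer l; the input is treated as layer 0
Square-bracket superscript [l]: layer l
Parenthesis superscript (i): the i-th example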

$$X = \left[\begin{matrix}\vdots & \vdots & & \vdots \\x^{(1)} & x^{(2)} & \cdots & x^{(m)} \\\vdots & \vdots & & \vdots\end{matrix}\right]_{(n[0], m)}$$

$$W^{[l]} = \left[\begin{matrix}\cdots & w^{[l] T}_1 & \cdots \\\cdots & w^{[l] T}_2 & \cdots \\ & \vdots & \\\cdots & w^{[l] T}_{n[l]} & \cdots\end{matrix}\right]_{(n[l], n[l-1])}$$

$$b^{[l]} = \left[\begin{matrix}b^{[l]}_1 \\b^{[l]}_2 \\\vdots \\b^{[l]}_{n[l]}\end{matrix}\right]_{(n[l], 1)}$$

$$A^{[l]}=\left[\begin{matrix}\vdots & \vdots & & \vdots \\a^{[l](1)} & a^{[l](2)} & \cdots & a^{[l](m)} \\\vdots & \vdots & & \vdots\end{matrix}\right]_{(n[l], m)}$$

$$Z^{[l]}=\left[\begin{matrix}\vdots & \vdots & & \vdots \\z^{[l](1)} & z^{[l](2)} & \cdots & z^{[l](m)} \\\vdots & \vdots & & \vdots\end{matrix}\right]_{(n[l], m)}$$
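As a quick sanity check on the dimensions above, the following sketch builds random parameters for a small network and records the shape of every matrix. The sizes `layer_dims = [5, 4, 3, 1]` and `m = 10` are hypothetical values chosen only for illustration, and ReLU is used in every layer just to keep the example short.

```python
import numpy as np

# Hypothetical sizes for illustration: n[0]=5, n[1]=4, n[2]=3, n[3]=1
layer_dims = [5, 4, 3, 1]
m = 10  # number of training examples

rng = np.random.default_rng(0)
X = rng.standard_normal((layer_dims[0], m))  # shape (n[0], m)

shapes = {}
A = X
for l in range(1, len(layer_dims)):
    W = rng.standard_normal((layer_dims[l], layer_dims[l - 1]))  # (n[l], n[l-1])
    b = np.zeros((layer_dims[l], 1))                             # (n[l], 1)
    Z = W @ A + b         # b is broadcast across the m columns -> (n[l], m)
    A = np.maximum(0, Z)  # an element-wise activation keeps the shape (n[l], m)
    shapes[l] = (W.shape, b.shape, Z.shape, A.shape)
    print(l, shapes[l])
```

The bias has only one column, so NumPy broadcasting is what makes `W @ A + b` valid for any number of examples.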

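Key formulas for the deep neural network:

Forward propagation

$$Z^{[l]}=W^{[l]}A^{[l-1]}+b^{[l]}$$
$$A^{[l]}=g^{[l]}(Z^{[l]})$$

When l < L, $g^{[l]}$ is the ReLU function.
When l = L, $g^{[L]}$ is the sigmoid function.
In other words, the output layer uses sigmoid as its activation function and all other layers use ReLU.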
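The two forward formulas and the choice of activation can be sketched in a few lines. This is a minimal illustration, not the course implementation: the helper name `forward`, the toy sizes in `dims`, and the random parameter values are all assumptions made for the example.

```python
import numpy as np

def forward(X, params):
    """Forward pass: ReLU for layers 1..L-1, sigmoid for layer L."""
    L = len(params) // 2  # each layer contributes one W and one b
    A = X
    for l in range(1, L + 1):
        Z = params["W" + str(l)] @ A + params["b" + str(l)]
        if l < L:
            A = np.maximum(0, Z)          # hidden layers: ReLU
        else:
            A = 1.0 / (1.0 + np.exp(-Z))  # output layer: sigmoid
    return A

# Toy parameters (hypothetical values, for illustration only)
rng = np.random.default_rng(1)
dims = [3, 4, 1]
params = {}
for l in range(1, len(dims)):
    params["W" + str(l)] = rng.standard_normal((dims[l], dims[l - 1])) * 0.1
    params["b" + str(l)] = np.zeros((dims[l], 1))

X = rng.standard_normal((3, 6))  # 3 features, 6 examples
AL = forward(X, params)
print(AL.shape)
```

Because the last layer is a sigmoid, every entry of `AL` lies strictly between 0 and 1 and can be read as a probability.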

Backward propagation

$$ dZ^{[l]} = dA^{[l]} * g^{[l]\prime}(Z^{[l]})$$
$$ dW^{[l]} = \frac{\partial \mathcal{L} }{\partial W^{[l]}} = \frac{1}{m} dZ^{[l]} A^{[l-1] T}$$
$$ db^{[l]} = \frac{\partial \mathcal{L} }{\partial b^{[l]}} = \frac{1}{m} \sum_{i = 1}^{m} dZ^{[l](i)}$$
$$ dA^{[l-1]} = \frac{\partial \mathcal{L} }{\partial A^{[l-1]}} = W^{[l] T} dZ^{[l]}$$

Here * denotes element-wise multiplication.
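One way to gain confidence in the backward formulas is to compare them with finite differences. The sketch below does this for a single sigmoid layer trained with the cross-entropy cost. The helper `cost`, the toy data, and the step size `eps` are assumptions made for the example.

```python
import numpy as np

def cost(W, b, A_prev, Y):
    """Cross-entropy cost of one sigmoid layer (illustrative helper)."""
    A = 1.0 / (1.0 + np.exp(-(W @ A_prev + b)))
    m = Y.shape[1]
    return float(-np.sum(Y * np.log(A) + (1 - Y) * np.log(1 - A)) / m)

rng = np.random.default_rng(0)
A_prev = rng.standard_normal((3, 5))          # 3 inputs, 5 examples
Y = np.array([[1, 0, 1, 1, 0]], dtype=float)
W = rng.standard_normal((1, 3)) * 0.5
b = np.zeros((1, 1))
m = Y.shape[1]

# Analytic gradients, following the backward propagation formulas
Z = W @ A_prev + b
A = 1.0 / (1.0 + np.exp(-Z))
dA = -(Y / A - (1 - Y) / (1 - A))
dZ = dA * A * (1 - A)                         # sigmoid'(Z) = A * (1 - A)
dW = dZ @ A_prev.T / m
db = np.sum(dZ, axis=1, keepdims=True) / m
dA_prev = W.T @ dZ

# Numerical gradients by central differences
eps = 1e-6
dW_num = np.zeros_like(W)
for j in range(W.shape[1]):
    W_plus, W_minus = W.copy(), W.copy()
    W_plus[0, j] += eps
    W_minus[0, j] -= eps
    dW_num[0, j] = (cost(W_plus, b, A_prev, Y) - cost(W_minus, b, A_prev, Y)) / (2 * eps)
db_num = (cost(W, b + eps, A_prev, Y) - cost(W, b - eps, A_prev, Y)) / (2 * eps)

max_err = max(np.max(np.abs(dW - dW_num)), abs(db[0, 0] - db_num))
print(max_err)
```

If the analytic and numerical gradients disagree by more than a very small amount, the usual culprits are a missing 1/m factor or a transposed matrix.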

Initializing dAL:

dAL = - (np.divide(Y, AL) - np.divide(1 - Y, 1 - AL))
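This expression is the derivative of the cross-entropy loss with respect to the output activation. As a short worked derivation (assuming the sigmoid output layer described above, so that $g^{[L]\prime}(Z^{[L]}) = A^{[L]}(1-A^{[L]})$), combining it with the sigmoid derivative gives a much simpler form for the output layer:

$$dA^{[L]} = \frac{\partial \mathcal{L}}{\partial A^{[L]}} = -\left(\frac{Y}{A^{[L]}} - \frac{1-Y}{1-A^{[L]}}\right)$$
$$dZ^{[L]} = dA^{[L]} * A^{[L]}\left(1-A^{[L]}\right) = -\left(Y\left(1-A^{[L]}\right) - (1-Y)A^{[L]}\right) = A^{[L]} - Y$$

The code in this post computes the two factors separately, which gives the same result as long as no entry of $A^{[L]}$ is exactly 0 or 1.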

Cost computation

$$J = -\frac{1}{m} \sum\limits_{i = 1}^{m} \left(y^{(i)}\log\left(a^{[L] (i)}\right) + (1-y^{(i)})\log\left(1- a^{[L](i)}\right)\right)$$
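The cost formula can be evaluated directly with element-wise operations. The sketch below is one possible variant, not the version used in this post: the function name `cross_entropy_cost` and the clipping constant `eps` are assumptions, added so that `log(0)` cannot occur when a prediction saturates at exactly 0 or 1.

```python
import numpy as np

def cross_entropy_cost(AL, Y, eps=1e-12):
    """Mean binary cross-entropy; eps is a hypothetical clipping constant."""
    m = Y.shape[1]
    AL = np.clip(AL, eps, 1 - eps)  # keep predictions away from exactly 0 and 1
    return float(-np.sum(Y * np.log(AL) + (1 - Y) * np.log(1 - AL)) / m)

AL = np.array([[0.8, 0.9, 0.4]])  # predicted probabilities
Y = np.array([[1, 1, 0]])         # true labels
cost = cross_entropy_cost(AL, Y)
print(cost)  # about 0.2798, i.e. -(ln 0.8 + ln 0.9 + ln 0.6) / 3

# A saturated prediction still yields a finite cost thanks to the clipping
saturated = cross_entropy_cost(np.array([[1.0, 0.0]]), np.array([[0, 1]]))
print(np.isfinite(saturated))
```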

Code for the deep fully connected neural network:

Key functions:

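# Initialize the parameters; the weights of each layer are initialized randomly
# Input layer_dims holds the number of units in each layer
# Output parameters is a dictionary; the initial parameters of each layer are available as parameters['W' + str(l)] and parameters['b' + str(l)]
parameters = initialize_parameters_deep(layer_dims)


# Linear forward propagation
# Computes Z of the current layer from Z = W*A_prev + b; linear_cache = (A_prev, W, b)
Z, linear_cache = linear_forward(A_prev, W, b)


# Linear + activation forward propagation
# Computes A = g(Z) = g(W*A_prev + b), where linear_activation_cache = (linear_cache, activation_cache) = ((A_prev, W, b), (Z))
A, linear_activation_cache = linear_activation_forward(A_prev, W, b, activation = "sigmoid")


# Full forward pass through all L layers; AL is the final output and caches holds the cache of every layer
AL, caches = L_model_forward(X, parameters)


# Linear backward propagation
# Using the backward propagation formulas above and the cached values, derives dA_prev, dW, db from dZ
dA_prev, dW, db = linear_backward(dZ, linear_cache)


# Linear + activation backward propagation
# Computes dA_prev, dW, db from linear_backward and the derivative of the activation function
dA_prev, dW, db = linear_activation_backward(dA, linear_activation_cache, activation = "sigmoid")


# Backward propagation through all L layers
# grads holds the gradients of every layer, stored as grads["dA" + str(l)], grads["dW" + str(l)], grads["db" + str(l)]
grads = L_model_backward(AL, Y, caches)


# Update the parameters using the learning rate
parameters = update_parameters(parameters, grads, 0.1)


# Full model function; loops for the given number of iterations, calling the forward and backward propagation functions above
parameters = L_layer_model(train_x, train_y, layers_dims, learning_rate = 0.0075, num_iterations = 2500, print_cost = True)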

Complete code:

import numpy as np
import matplotlib.pyplot as plt
import h5py


def sigmoid(Z):
    """
 Implements the sigmoid activation in numpy
 
 Arguments:
 Z -- numpy array of any shape
 
 Returns:
 A -- output of sigmoid(z), same shape as Z
 cache -- returns Z as well, useful during backpropagation
 """
    
    A = 1/(1+np.exp(-Z))
    cache = Z
    
    return A, cache

def relu(Z):
    """
 Implement the RELU function.

 Arguments:
 Z -- Output of the linear layer, of any shape

 Returns:
 A -- Post-activation parameter, of the same shape as Z
 cache -- Z, stored for computing the backward pass efficiently
 """
    
    A = np.maximum(0,Z)
    
    assert(A.shape == Z.shape)
    
    cache = Z 
    return A, cache


def relu_backward(dA, cache):
    """
 Implement the backward propagation for a single RELU unit.

 Arguments:
 dA -- post-activation gradient, of any shape
 cache -- 'Z' where we store for computing backward propagation efficiently

 Returns:
 dZ -- Gradient of the cost with respect to Z
 """
    
    Z = cache
    dZ = np.array(dA, copy=True) # just converting dz to a correct object.
    
    # When z <= 0, you should set dz to 0 as well. 
    dZ[Z <= 0] = 0
    
    assert (dZ.shape == Z.shape)
    
    return dZ

def sigmoid_backward(dA, cache):
    """
 Implement the backward propagation for a single SIGMOID unit.

 Arguments:
 dA -- post-activation gradient, of any shape
 cache -- 'Z' where we store for computing backward propagation efficiently

 Returns:
 dZ -- Gradient of the cost with respect to Z
 """
    
    Z = cache
    
    s = 1/(1+np.exp(-Z))
    dZ = dA * s * (1-s)
    
    assert (dZ.shape == Z.shape)
    
    return dZ


def load_data():
    train_dataset = h5py.File('datasets/train_catvnoncat.h5', "r")
    train_set_x_orig = np.array(train_dataset["train_set_x"][:]) # your train set features
    train_set_y_orig = np.array(train_dataset["train_set_y"][:]) # your train set labels

    test_dataset = h5py.File('datasets/test_catvnoncat.h5', "r")
    test_set_x_orig = np.array(test_dataset["test_set_x"][:]) # your test set features
    test_set_y_orig = np.array(test_dataset["test_set_y"][:]) # your test set labels

    classes = np.array(test_dataset["list_classes"][:]) # the list of classes
    
    train_set_y_orig = train_set_y_orig.reshape((1, train_set_y_orig.shape[0]))
    test_set_y_orig = test_set_y_orig.reshape((1, test_set_y_orig.shape[0]))
    
    return train_set_x_orig, train_set_y_orig, test_set_x_orig, test_set_y_orig, classes


def initialize_parameters_deep(layer_dims):
    """
 Arguments:
 layer_dims -- python array (list) containing the dimensions of each layer in our network
 
 Returns:
 parameters -- python dictionary containing your parameters "W1", "b1", ..., "WL", "bL":
 Wl -- weight matrix of shape (layer_dims[l], layer_dims[l-1])
 bl -- bias vector of shape (layer_dims[l], 1)
 """
    
    np.random.seed(1)
    parameters = {}
    L = len(layer_dims)            # number of layers in the network

    for l in range(1, L):
        parameters['W' + str(l)] = np.random.randn(layer_dims[l], layer_dims[l-1]) / np.sqrt(layer_dims[l-1]) #*0.01
        parameters['b' + str(l)] = np.zeros((layer_dims[l], 1))
        
        assert(parameters['W' + str(l)].shape == (layer_dims[l], layer_dims[l-1]))
        assert(parameters['b' + str(l)].shape == (layer_dims[l], 1))

        
    return parameters


def linear_forward(A, W, b):
    """
 Implement the linear part of a layer's forward propagation.

 Arguments:
 A -- activations from previous layer (or input data): (size of previous layer, number of examples)
 W -- weights matrix: numpy array of shape (size of current layer, size of previous layer)
 b -- bias vector, numpy array of shape (size of the current layer, 1)

 Returns:
 Z -- the input of the activation function, also called pre-activation parameter 
 cache -- a python tuple containing "A", "W" and "b" ; stored for computing the backward pass efficiently
 """
    
    Z = W.dot(A) + b
    
    assert(Z.shape == (W.shape[0], A.shape[1]))
    cache = (A, W, b)
    
    return Z, cache

def linear_activation_forward(A_prev, W, b, activation):
    """
 Implement the forward propagation for the LINEAR->ACTIVATION layer

 Arguments:
 A_prev -- activations from previous layer (or input data): (size of previous layer, number of examples)
 W -- weights matrix: numpy array of shape (size of current layer, size of previous layer)
 b -- bias vector, numpy array of shape (size of the current layer, 1)
 activation -- the activation to be used in this layer, stored as a text string: "sigmoid" or "relu"

 Returns:
 A -- the output of the activation function, also called the post-activation value 
 cache -- a python tuple containing "linear_cache" and "activation_cache";
 stored for computing the backward pass efficiently
 """
    
    if activation == "sigmoid":
        # Inputs: "A_prev, W, b". Outputs: "A, activation_cache".
        Z, linear_cache = linear_forward(A_prev, W, b)
        A, activation_cache = sigmoid(Z)
    
    elif activation == "relu":
        # Inputs: "A_prev, W, b". Outputs: "A, activation_cache".
        Z, linear_cache = linear_forward(A_prev, W, b)
        A, activation_cache = relu(Z)
    
    assert (A.shape == (W.shape[0], A_prev.shape[1]))
    cache = (linear_cache, activation_cache)

    return A, cache

def L_model_forward(X, parameters):
    """
 Implement forward propagation for the [LINEAR->RELU]*(L-1)->LINEAR->SIGMOID computation
 
 Arguments:
 X -- data, numpy array of shape (input size, number of examples)
 parameters -- output of initialize_parameters_deep()
 
 Returns:
 AL -- last post-activation value
 caches -- list of caches containing:
 every cache of linear_relu_forward() (there are L-1 of them, indexed from 0 to L-2)
 the cache of linear_sigmoid_forward() (there is one, indexed L-1)
 """

    caches = []
    A = X
    L = len(parameters) // 2                  # number of layers in the neural network
    
    # Implement [LINEAR -> RELU]*(L-1). Add "cache" to the "caches" list.
    for l in range(1, L):
        A_prev = A 
        A, cache = linear_activation_forward(A_prev, parameters['W' + str(l)], parameters['b' + str(l)], activation = "relu")
        caches.append(cache)
    
    # Implement LINEAR -> SIGMOID. Add "cache" to the "caches" list.
    AL, cache = linear_activation_forward(A, parameters['W' + str(L)], parameters['b' + str(L)], activation = "sigmoid")
    caches.append(cache)
    
    assert(AL.shape == (1,X.shape[1]))
            
    return AL, caches

def compute_cost(AL, Y):
    """
 Implement the cost function defined by equation (7).

 Arguments:
 AL -- probability vector corresponding to your label predictions, shape (1, number of examples)
 Y -- true "label" vector (for example: containing 0 if non-cat, 1 if cat), shape (1, number of examples)

 Returns:
 cost -- cross-entropy cost
 """
    
    m = Y.shape[1]

    # Compute loss from aL and y.
    cost = (1./m) * (-np.dot(Y,np.log(AL).T) - np.dot(1-Y, np.log(1-AL).T))
    
    cost = np.squeeze(cost)      # To make sure your cost's shape is what we expect (e.g. this turns [[17]] into 17).
    assert(cost.shape == ())
    
    return cost

def linear_backward(dZ, cache):
    """
 Implement the linear portion of backward propagation for a single layer (layer l)

 Arguments:
 dZ -- Gradient of the cost with respect to the linear output (of current layer l)
 cache -- tuple of values (A_prev, W, b) coming from the forward propagation in the current layer

 Returns:
 dA_prev -- Gradient of the cost with respect to the activation (of the previous layer l-1), same shape as A_prev
 dW -- Gradient of the cost with respect to W (current layer l), same shape as W
 db -- Gradient of the cost with respect to b (current layer l), same shape as b
 """
    A_prev, W, b = cache
    m = A_prev.shape[1]

    dW = 1./m * np.dot(dZ,A_prev.T)
    db = 1./m * np.sum(dZ, axis = 1, keepdims = True)
    dA_prev = np.dot(W.T,dZ)
    
    assert (dA_prev.shape == A_prev.shape)
    assert (dW.shape == W.shape)
    assert (db.shape == b.shape)
    
    return dA_prev, dW, db

def linear_activation_backward(dA, cache, activation):
    """
 Implement the backward propagation for the LINEAR->ACTIVATION layer.
 
 Arguments:
 dA -- post-activation gradient for current layer l 
 cache -- tuple of values (linear_cache, activation_cache) we store for computing backward propagation efficiently
 activation -- the activation to be used in this layer, stored as a text string: "sigmoid" or "relu"
 
 Returns:
 dA_prev -- Gradient of the cost with respect to the activation (of the previous layer l-1), same shape as A_prev
 dW -- Gradient of the cost with respect to W (current layer l), same shape as W
 db -- Gradient of the cost with respect to b (current layer l), same shape as b
 """
    linear_cache, activation_cache = cache
    
    if activation == "relu":
        dZ = relu_backward(dA, activation_cache)
        dA_prev, dW, db = linear_backward(dZ, linear_cache)
        
    elif activation == "sigmoid":
        dZ = sigmoid_backward(dA, activation_cache)
        dA_prev, dW, db = linear_backward(dZ, linear_cache)
    
    return dA_prev, dW, db

def L_model_backward(AL, Y, caches):
    """
 Implement the backward propagation for the [LINEAR->RELU] * (L-1) -> LINEAR -> SIGMOID group
 
 Arguments:
 AL -- probability vector, output of the forward propagation (L_model_forward())
 Y -- true "label" vector (containing 0 if non-cat, 1 if cat)
 caches -- list of caches containing:
 every cache of linear_activation_forward() with "relu" (there are (L-1) of them, indexes from 0 to L-2)
 the cache of linear_activation_forward() with "sigmoid" (there is one, index L-1)
 
 Returns:
 grads -- A dictionary with the gradients
 grads["dA" + str(l)] = ... 
 grads["dW" + str(l)] = ...
 grads["db" + str(l)] = ... 
 """
    grads = {}
    L = len(caches) # the number of layers
    m = AL.shape[1]
    Y = Y.reshape(AL.shape) # after this line, Y is the same shape as AL
    
    # Initializing the backpropagation
    dAL = - (np.divide(Y, AL) - np.divide(1 - Y, 1 - AL))
    
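    # Lth layer (SIGMOID -> LINEAR) gradients. Inputs: "dAL, current_cache". Outputs: "grads["dA" + str(L)], grads["dW" + str(L)], grads["db" + str(L)]"
    # Note: grads["dA" + str(l)] holds the dA_prev returned by layer l, i.e. the gradient with respect to A^[l-1].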
    current_cache = caches[L-1]
    grads["dA" + str(L)], grads["dW" + str(L)], grads["db" + str(L)] = linear_activation_backward(dAL, current_cache, activation = "sigmoid")
    
    for l in reversed(range(L-1)):
        # lth layer: (RELU -> LINEAR) gradients.
        current_cache = caches[l]
        dA_prev_temp, dW_temp, db_temp = linear_activation_backward(grads["dA" + str(l + 2)], current_cache, activation = "relu")
        grads["dA" + str(l + 1)] = dA_prev_temp
        grads["dW" + str(l + 1)] = dW_temp
        grads["db" + str(l + 1)] = db_temp

    return grads

def update_parameters(parameters, grads, learning_rate):
    """
 Update parameters using gradient descent
 
 Arguments:
 parameters -- python dictionary containing your parameters 
 grads -- python dictionary containing your gradients, output of L_model_backward
 
 Returns:
 parameters -- python dictionary containing your updated parameters 
 parameters["W" + str(l)] = ... 
 parameters["b" + str(l)] = ...
 """
    
    L = len(parameters) // 2 # number of layers in the neural network

    # Update rule for each parameter. Use a for loop.
    for l in range(L):
        parameters["W" + str(l+1)] = parameters["W" + str(l+1)] - learning_rate * grads["dW" + str(l+1)]
        parameters["b" + str(l+1)] = parameters["b" + str(l+1)] - learning_rate * grads["db" + str(l+1)]
        
    return parameters


def L_layer_model(X, Y, layers_dims, learning_rate = 0.0075, num_iterations = 3000, print_cost=False):#lr was 0.009
    """
 Implements a L-layer neural network: [LINEAR->RELU]*(L-1)->LINEAR->SIGMOID.
 
 Arguments:
 X -- data, numpy array of shape (num_px * num_px * 3, number of examples)
 Y -- true "label" vector (containing 1 if cat, 0 if non-cat), of shape (1, number of examples)
 layers_dims -- list containing the input size and each layer size, of length (number of layers + 1).
 learning_rate -- learning rate of the gradient descent update rule
 num_iterations -- number of iterations of the optimization loop
 print_cost -- if True, it prints the cost every 100 steps
 
 Returns:
 parameters -- parameters learnt by the model. They can then be used to predict.
 """

    np.random.seed(1)
    costs = []                         # keep track of cost
    
    # Parameters initialization.
    parameters = initialize_parameters_deep(layers_dims)
    
    # Loop (gradient descent)
    for i in range(0, num_iterations):

        # Forward propagation: [LINEAR -> RELU]*(L-1) -> LINEAR -> SIGMOID.
        AL, caches = L_model_forward(X, parameters)
        
        # Compute cost.
        cost = compute_cost(AL, Y)
    
        # Backward propagation.
        grads = L_model_backward(AL, Y, caches)
 
        # Update parameters.
        parameters = update_parameters(parameters, grads, learning_rate)
                
        # Print and record the cost every 100 iterations
        if print_cost and i % 100 == 0:
            print("Cost after iteration %i: %f" % (i, cost))
            costs.append(cost)
            
    # plot the cost
    plt.plot(np.squeeze(costs))
    plt.ylabel('cost')
    plt.xlabel('iterations (per hundreds)')
    plt.title("Learning rate =" + str(learning_rate))
    plt.show()
    
    return parameters


def predict(X, y, parameters):
    """
 This function is used to predict the results of a L-layer neural network.
 
 Arguments:
 X -- data set of examples you would like to label
 y -- true labels for X, of shape (1, number of examples); used only to report the accuracy
 parameters -- parameters of the trained model
 
 Returns:
 p -- predictions for the given dataset X
 """
    
    m = X.shape[1]
    n = len(parameters) // 2 # number of layers in the neural network
    p = np.zeros((1,m))
    
    # Forward propagation
    probas, caches = L_model_forward(X, parameters)

    
    # convert probas to 0/1 predictions
    for i in range(0, probas.shape[1]):
        if probas[0,i] > 0.5:
            p[0,i] = 1
        else:
            p[0,i] = 0
    
    #print results
    #print ("predictions: " + str(p))
    #print ("true labels: " + str(y))
    print("Accuracy: "  + str(np.sum((p == y)/m)))
        
    return p


train_x_orig, train_y, test_x_orig, test_y, classes = load_data()
# Reshape the training and test examples 
train_x_flatten = train_x_orig.reshape(train_x_orig.shape[0], -1).T   # The "-1" makes reshape flatten the remaining dimensions
test_x_flatten = test_x_orig.reshape(test_x_orig.shape[0], -1).T
# Standardize data to have feature values between 0 and 1.
train_x = train_x_flatten/255.
test_x = test_x_flatten/255.
layers_dims = [12288, 20, 7, 5, 1]

parameters = L_layer_model(train_x, train_y, layers_dims, learning_rate = 0.0075, num_iterations = 2500, print_cost = True)
predictions_train = predict(train_x, train_y, parameters)
pred_test = predict(test_x, test_y, parameters)
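The element-by-element loop inside `predict` above can also be written as a single vectorized comparison. The following is a minimal sketch of that alternative. The helper names `predict_labels` and `accuracy` and the toy probabilities are hypothetical and not part of the original code.

```python
import numpy as np

def predict_labels(probas, threshold=0.5):
    """Turn probabilities of shape (1, m) into 0/1 predictions in one step."""
    return (probas > threshold).astype(float)

def accuracy(p, y):
    """Fraction of predictions that match the true labels."""
    return float(np.mean(p == y))

# Toy values for illustration only
probas = np.array([[0.9, 0.2, 0.51, 0.5]])
y = np.array([[1, 0, 0, 0]])

p = predict_labels(probas)
acc = accuracy(p, y)
print(p)    # [[1. 0. 1. 0.]]
print(acc)  # 0.75
```

As in the loop version, a probability of exactly 0.5 is mapped to 0 because the comparison is strict.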
