For personal notes only
A pattern recognition problem
There is a large amount of "weakly labeled" data online, e.g. tweets tagged with hashtags.
Can we use this unlabeled data to improve our classifier?
- labeled data
- unlabeled data
- some applications
  - image classification (images are easy to obtain, e.g. from Flickr)
  - protein function prediction
  - document classification
  - part-of-speech tagging
- semi-supervised classification
- semi-supervised regression: similar, but with a continuous outcome measure
- semi-supervised clustering: using some labels to improve a clustering solution
- measuring how much the unlabeled data actually helps
Content
self-learning
One of the earliest studies on SSL (Hartley & Rao 1968):
• Maximum likelihood, trying all possible labelings (!)
(the problem with unlabeled data is the combinatorial explosion of possible label assignments)
More feasible suggestion (McLachlan 1975):
• Start with supervised solution
• Label unlabeled objects using this classifier
• Retrain the classifier, treating the predicted labels as if they were true labels
Also known as self-training, self-labeling or pseudo-labeling
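The loop above can be sketched in code. This is a hedged illustration, not the lecture's specific method: it uses a simple nearest-mean classifier as the base model, and the confidence threshold and iteration cap are arbitrary choices.

```python
import numpy as np

def fit_centroids(X, y):
    """Nearest-mean classifier: one centroid per class."""
    classes = np.unique(y)
    return classes, np.array([X[y == c].mean(axis=0) for c in classes])

def predict_proba(X, centroids):
    """Softmax over negative squared distances to the centroids."""
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    logits = -d2
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

def self_train(X_lab, y_lab, X_unlab, threshold=0.9, max_iter=10):
    """Self-training: fit, pseudo-label confident unlabeled points, refit."""
    X, y, pool = X_lab, y_lab, X_unlab
    classes, centroids = fit_centroids(X, y)
    for _ in range(max_iter):
        if len(pool) == 0:
            break
        proba = predict_proba(pool, centroids)
        keep = proba.max(axis=1) >= threshold     # only confident predictions
        if not keep.any():
            break
        X = np.vstack([X, pool[keep]])
        y = np.concatenate([y, classes[proba[keep].argmax(axis=1)]])
        pool = pool[~keep]                        # shrink the unlabeled pool
        classes, centroids = fit_centroids(X, y)
    return classes, centroids
```

Note that pseudo-labels added in early iterations are never revisited, which is exactly why self-training can reinforce its own mistakes.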
Self-learning ≈ expectation maximization (EM)
- Linear Discriminant Analysis (LDA)
$$p(X, y; \theta) = \prod_{i=1}^{L} \left[\pi_0 N(x_i; \mu_0, \Sigma)\right]^{1-y_i} \left[\pi_1 N(x_i; \mu_1, \Sigma)\right]^{y_i}$$
Both classes share the covariance matrix $\Sigma$; $N(x_i; \mu_c, \Sigma)$ is the Gaussian density for class $c$.

- LDA + unlabeled data
$$p(X, y, X_u, h; \theta) = \prod_{i=1}^{L} \left[\pi_0 N(x_i; \mu_0, \Sigma)\right]^{1-y_i} \left[\pi_1 N(x_i; \mu_1, \Sigma)\right]^{y_i} \times \prod_{i=1}^{U} \left[\pi_0 N(x_i; \mu_0, \Sigma)\right]^{1-h_i} \left[\pi_1 N(x_i; \mu_1, \Sigma)\right]^{h_i}$$
But we do not know $h$… Marginalize it out! Since each $h_i$ is a discrete label, the integral reduces to a sum:
$$p(X, y, X_u; \theta) = \sum_{h} p(X, y, X_u, h; \theta)$$
LDA + unlabeled data
$$\prod_{i=1}^{L} \left[\pi_0 N(x_i; \mu_0, \Sigma)\right]^{1-y_i} \left[\pi_1 N(x_i; \mu_1, \Sigma)\right]^{y_i} \times \prod_{i=1}^{U} \sum_{c=0}^{1} \pi_c N(x_i; \mu_c, \Sigma)$$
Like LDA plus a Gaussian mixture with the same parameters
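The EM updates for this LDA-plus-mixture model can be sketched as follows. This is an illustrative implementation under stated assumptions (two classes, uniform 0.5 initialization of the hidden responsibilities, a fixed iteration count); the E-step computes soft labels $h_i$ for the unlabeled points, and the M-step refits priors, class means, and the shared covariance using both labeled and soft-labeled data.

```python
import numpy as np

def ssl_lda_em(X_l, y_l, X_u, n_iter=50):
    """EM for LDA + unlabeled data: two Gaussians with a shared covariance."""
    d = X_l.shape[1]
    R_l = np.eye(2)[y_l]                    # labeled responsibilities: fixed one-hot
    R_u = np.full((len(X_u), 2), 0.5)       # unlabeled: start uninformative
    X = np.vstack([X_l, X_u])
    for _ in range(n_iter):
        # --- M-step: update priors, means, pooled covariance ---
        R = np.vstack([R_l, R_u])
        Nk = R.sum(axis=0)
        pi = Nk / Nk.sum()
        mu = (R.T @ X) / Nk[:, None]
        Sigma = np.zeros((d, d))
        for c in range(2):
            D = X - mu[c]
            Sigma += (R[:, c, None] * D).T @ D
        Sigma /= len(X)
        # --- E-step: responsibilities for unlabeled points only ---
        # (the shared Sigma makes the log-det term cancel between classes)
        inv = np.linalg.inv(Sigma)
        logp = np.array([
            np.log(pi[c]) - 0.5 * np.einsum('ij,jk,ik->i',
                                            X_u - mu[c], inv, X_u - mu[c])
            for c in range(2)]).T
        logp -= logp.max(axis=1, keepdims=True)
        R_u = np.exp(logp)
        R_u /= R_u.sum(axis=1, keepdims=True)
    return pi, mu, Sigma, R_u
```

The labeled responsibilities stay clamped to the observed labels throughout, which is what distinguishes this from fully unsupervised mixture fitting.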
EM algorithm
• The log of a sum makes direct optimization difficult
• Change the goal: find a local maximum of this function instead
EM algorithm: finding a lower bound
The idea is to construct a lower bound that touches the objective function exactly at the current parameter estimate, and to choose the tightest such lower bound available.
Jensen’s inequality
If $f(x)$ is concave, then $f(E[X]) \geq E[f(X)]$
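Concretely, applying Jensen's inequality with the concave $\log$ gives the standard EM lower bound (a textbook derivation, not specific to these notes): for any distribution $q(h)$ over the hidden labels,

```latex
\log p(X, y, X_u; \theta)
  = \log \sum_h q(h)\,\frac{p(X, y, X_u, h; \theta)}{q(h)}
  \;\ge\; \sum_h q(h)\,\log \frac{p(X, y, X_u, h; \theta)}{q(h)}
```

Equality holds when $q(h) = p(h \mid X, y, X_u; \theta)$ (the E-step), so the bound touches the objective at the current $\theta$; the M-step then maximizes the bound over $\theta$.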
Does unlabeled data help?
In a graphical-model view: $\theta_X \rightarrow X$, $X \rightarrow Y$, and $\theta_{Y|X} \rightarrow Y$. Unlabeled data only carries information about $\theta_X$, so it can help only if the model ties the marginal distribution of $X$ to the decision function.
Self-learning and EM conclusions
• For generative models:
• Integrate out the missing variables
• Difficult optimization problem can often be “solved” efficiently using
expectation maximization
• Only guaranteed to improve performance asymptotically, if the model is
correct
• Self-learning is a closely related technique that is applicable to any classifier
• Related: co-training (multi-view learning)
• Use labels predicted by other view(s) as newly labeled objects
Low-density assumption
Low-density assumption conclusion
• “Natural” extension for the SVM
• Local minima may be a problem
• Lots of work on optimization
• My experience: quite sensitive to parameter settings
• Other low-density approaches:
• Entropy Regularization (Grandvalet & Bengio 2005)
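The entropy-regularization objective is easy to state in code: the usual cross-entropy on labeled data plus a weighted entropy penalty on the unlabeled predictions, so the classifier is pushed toward confident outputs, i.e. decision boundaries in low-density regions. A minimal sketch (the weight `lam` and input shapes are assumptions, not from the original paper):

```python
import numpy as np

def entropy_reg_loss(p_lab, y_lab, p_unlab, lam=0.5, eps=1e-12):
    """Cross-entropy on labeled data + lam * mean predictive entropy
    on unlabeled data. p_lab, p_unlab: (n, k) class-probability arrays."""
    # supervised term: negative log-likelihood of the true labels
    ce = -np.mean(np.log(p_lab[np.arange(len(y_lab)), y_lab] + eps))
    # unsupervised term: entropy of the predictions on unlabeled points
    ent = -np.mean(np.sum(p_unlab * np.log(p_unlab + eps), axis=1))
    return ce + lam * ent
```

Minimizing the second term rewards confident (low-entropy) predictions on unlabeled points; with `lam=0` the loss reduces to plain supervised cross-entropy.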
Manifold assumption
- manifold regularization
- consistency regularization
$$\Vert f(x; w) - g(x'; w^t) \Vert^2$$
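This consistency penalty can be sketched as follows: the student model $f$ sees an unlabeled input $x$, a teacher model $g$ sees a perturbed copy $x'$, and their squared output difference is averaged. Everything here is an assumption for illustration (Gaussian input noise as the perturbation, generic prediction functions); in common instantiations such as Mean Teacher, $w^t$ is an exponential moving average of the student weights $w$.

```python
import numpy as np

def consistency_loss(f_student, f_teacher, X_unlab, noise_std=0.1, rng=None):
    """Mean squared distance between student predictions on x and
    teacher predictions on a noise-perturbed x'.
    f_student / f_teacher map an (n, d) array to an (n, k) array."""
    rng = rng if rng is not None else np.random.default_rng()
    X_pert = X_unlab + rng.normal(0.0, noise_std, X_unlab.shape)  # x' = x + noise
    diff = f_student(X_unlab) - f_teacher(X_pert)
    return np.mean(np.sum(diff ** 2, axis=1))
```

No labels appear anywhere in this term, which is what lets it be computed on the unlabeled pool and added to the supervised loss.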
Semi-Supervised Conclusion
• Unlabeled data is often available
• Semi-supervised learning attempts to use it to improve the classifier
• Often worthwhile, but it does not come for free
• Modeling time
• Computational cost
• Remember: an unlabeled object is less valuable than a labeled one
• Labeling a few more objects can be more effective
• Remember the goal: transductive or inductive?