Reposted from the blog of Jindong Wang (王晋东)
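Transfer learning, put simply, means using existing knowledge to learn new knowledge. Its core is finding the similarity between what is already known and what is to be learned; the Chinese idiom 举一反三 (inferring many cases from one) captures the idea. Because learning the target domain from scratch is too costly, we turn to related existing knowledge to help learn the new knowledge faster. For example, someone who already plays Chinese chess can learn international chess by analogy; someone who can write Java programs can learn C# by analogy; someone who has learned English can learn French by analogy; and so on. Everything in the world shares some commonality. The core problem of transfer learning is how to find these similarities in a principled way and then use them as a bridge for learning new knowledge.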
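Figure 1: Transfer labeling across different locations and different sensors. Given the WiFi signal at point A in one room and the corresponding human activity, how can we label the Bluetooth signal at point C in another room?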
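Specifically, in transfer learning the existing knowledge is called the source domain, and the new knowledge to be learned is called the target domain. Transfer learning studies how to transfer knowledge from the source domain to the target domain. In machine learning in particular, it studies how to apply an existing model to a new domain that is different but related. Traditional machine learning assumes that the data distribution, the feature dimensionality, and the model output stay fixed; when any of these changes, its models are not flexible enough and their results degrade. Transfer learning relaxes these assumptions: even when the data distribution, feature dimensionality, or model output changes, it makes systematic use of source-domain knowledge to build a better model of the target domain. In addition, when labeled data is scarce, transfer learning can use labeled data from a related domain to label the target data.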
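The notions of source domain and target domain can be written more formally. The notation below follows the convention of the Pan and Yang survey cited in this post. The symbols are not used in the original text and are introduced here only as a sketch.

```latex
\begin{aligned}
\text{domain: } & \mathcal{D} = \{\mathcal{X},\, P(X)\}, \qquad
\text{task: } \mathcal{T} = \{\mathcal{Y},\, f(\cdot)\} \\
\text{given: } & (\mathcal{D}_S, \mathcal{T}_S),\ (\mathcal{D}_T, \mathcal{T}_T)
\ \text{with}\ \mathcal{D}_S \neq \mathcal{D}_T \ \text{or}\ \mathcal{T}_S \neq \mathcal{T}_T \\
\text{goal: } & \text{improve the target predictor } f_T(\cdot)
\ \text{using knowledge from } \mathcal{D}_S \text{ and } \mathcal{T}_S
\end{aligned}
```

Read this way, a change in $P(X)$ is a shift in the data distribution, a change in the feature space $\mathcal{X}$ is a change in feature dimensionality, and a change in $\mathcal{Y}$ or $f$ is a change in the model output.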
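Figure 2: How transfer learning differs from traditional machine learning. (a) Traditional machine learning builds a separate model for each learning task; (b) transfer learning uses data from the source domain to transfer knowledge to the target domain and build the model. Figure from: Sinno Jialin Pan and Qiang Yang, A survey on transfer learning. IEEE TKDE 2010.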
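By learning approach, transfer learning can be divided into instance-based, feature-based, model-based, and relation-based transfer. Instance-based transfer reuses labeled source-domain samples by weighting them. Feature-based transfer maps the source and target domains into a common space (or maps one of them into the other's space) and minimizes the distance between the two domains. Model-based transfer combines the source- and target-domain models with the samples to adjust the model parameters. Relation-based transfer learns the relationships between concepts in the source domain and then carries them over to the target domain by analogy.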
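As a rough illustration of the instance-based idea (weighting labeled source samples), the sketch below estimates a weight for each source sample as the ratio of target density to source density, using simple one-dimensional histograms. The function name `importance_weights`, the `bin_width` parameter, and the toy numbers are all assumptions made for this example. Practical methods use more robust density-ratio estimators.

```python
from collections import Counter


def importance_weights(source_x, target_x, bin_width=1.0):
    """Estimate w(x) = p_target(x) / p_source(x) for each source sample.

    Densities are approximated by histogram bin frequencies, which is
    only reasonable for low-dimensional toy data like this example.
    """
    def bin_of(x):
        return int(x // bin_width)

    src_counts = Counter(bin_of(x) for x in source_x)
    tgt_counts = Counter(bin_of(x) for x in target_x)
    n_src, n_tgt = len(source_x), len(target_x)

    weights = []
    for x in source_x:
        b = bin_of(x)
        p_src = src_counts[b] / n_src  # > 0, since x itself falls in bin b
        p_tgt = tgt_counts[b] / n_tgt  # 0 if no target sample falls in bin b
        weights.append(p_tgt / p_src)
    return weights


# Hypothetical toy data: the target domain is shifted to the right.
source_x = [0.5, 0.6, 1.5, 2.5]
target_x = [1.2, 1.7, 2.1, 2.9]
weights = importance_weights(source_x, target_x)
print(weights)
```

Source samples that fall where the target domain has no data get weight 0, while samples in regions the target domain emphasizes get weights above 1. A weighted learner trained on the source labels would then focus on the samples most relevant to the target domain.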
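In theory, transfer learning can be performed between any two domains. However, if the source and target domains are not similar enough, the transfer result will be poor, a situation known as negative transfer. For example, someone who can ride a bicycle can learn to ride an electric bike by analogy, but learning to drive a car by analogy with cycling is far-fetched. Finding source and target domains that are as similar as possible is the most important prerequisite for the whole transfer process.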
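One simple way to get a first impression of how similar two domains are is to compare the means of their feature vectors. The sketch below computes the squared Euclidean distance between the two domain means, which equals the (biased) squared maximum mean discrepancy under a linear kernel. The function names and the toy feature vectors are hypothetical. A small value does not guarantee that transfer will help, since this measure compares only first-order statistics.

```python
def mean_vector(samples):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(samples)
    dim = len(samples[0])
    return [sum(s[d] for s in samples) / n for d in range(dim)]


def linear_mmd(source, target):
    """Squared distance between domain means (linear-kernel MMD^2)."""
    mu_s = mean_vector(source)
    mu_t = mean_vector(target)
    return sum((a - b) ** 2 for a, b in zip(mu_s, mu_t))


# Hypothetical toy features for three domains, echoing the analogy above.
bicycle = [[1.0, 0.0], [3.0, 2.0]]
e_bike = [[2.0, 1.0], [2.0, 3.0]]
car = [[8.0, 9.0], [10.0, 11.0]]

print(linear_mmd(bicycle, e_bike))  # small distance: a plausible source
print(linear_mmd(bicycle, car))     # large distance: negative transfer is more likely
```

With several candidate source domains, such a score could be used to rank them before any transfer is attempted.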
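Representative researchers in transfer learning include Professor Qiang Yang of the Hong Kong University of Science and Technology, Sinno Jialin Pan of Nanyang Technological University, and Wenyuan Dai (戴文渊), CEO of 4Paradigm. The representative reference is A survey on transfer learning by Sinno Jialin Pan and Qiang Yang.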
Author's website: http://jd92.wang.
[References]
[1] Pan S J, Yang Q. A survey on transfer learning[J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(10): 1345-1359.
[2] Introduction to Transfer Learning: http://jd92.wang/assets/files/l03_transferlearning.pdf.
[3] Qiang Yang: http://www.cse.ust.hk/~qyang/.
[4] Sinno Jialin Pan: http://www.ntu.edu.sg/home/sinnopan/.
[5] Wenyuan Dai: https://scholar.google.com/citations?user=AGR9pP0AAAAJ&hl=zh-CN.
Applications of transfer learning
20191222 NeurIPS-19 workshop Sim-to-Real Domain Adaptation For High Energy Physics
20191222 arXiv Transfer learning in hybrid classical-quantum neural networks
20191214 arXiv Unsupervised Transfer Learning via BERT Neuron Selection
20191214 arXiv Transfer Learning-Based Outdoor Position Recovery with Telco Data
20191214 NeurIPS-19 workshop Cross-Language Aphasia Detection using Optimal Transport Domain Adaptation
20191201 arXiv A Transfer Learning Method for Goal Recognition Exploiting Cross-Domain Spatial Features
20191201 AAAI-20 Zero-Resource Cross-Lingual Named Entity Recognition
20191125 arXiv Attention Privileged Reinforcement Learning For Domain Transfer
20191124 Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin
20191115 arXiv Instance-based Transfer Learning for Multilingual Deep Retrieval
20191115 arXiv Unsupervised Pre-training for Natural Language Generation: A Literature Review
20191115 AAAI-20 Unsupervised Domain Adaptation on Reading Comprehension
20191113 arXiv Open-Ended Visual Question Answering by Multi-Modal Domain Adaptation
20191113 arXiv NegBERT: A Transfer Learning Approach for Negation Detection and Scope Resolution
20191113 AAAI-20 TANDA: Transfer and Adapt Pre-Trained Transformer Models for Answer Sentence Selection
20191111 NeurIPS-19 workshop Transfer Learning in 4D for Breast Cancer Diagnosis using Dynamic Contrast-Enhanced Magnetic Resonance Imaging
20191111 BigData-19 Deep Transfer Learning for Thermal Dynamics Modeling in Smart Buildings
20191111 arXiv Unsupervised Domain Adaptation of Contextual Embeddings for Low-Resource Duplicate Question Detection
20191111 arXiv Towards Domain Adaptation from Limited Data for Question Answering Using Deep Neural Networks
20191111 arXiv Teacher-Student Training for Robust Tacotron-based TTS
20191111 arXiv SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
20191111 arXiv Change your singer: a transfer learning generative adversarial framework for song to song conversion
20191111 arXiv Transfer Learning in Spatial-Temporal Forecasting of the Solar Magnetic Field
20191111 arXiv Deep geometric knowledge distillation with graphs
20191101 arXiv Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning
20191101 Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) Task
20191029 arXiv NER Models Using Pre-training and Transfer Learning for Healthcare
20191029 WACV-20 Progressive Domain Adaptation for Object Detection
20191029 WSDM-20 Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
20191017 arXiv Evolution of transfer learning in natural language processing
20191017 arXiv Unsupervised Domain Adaptation Meets Offline Recommender Learning
20191017 Transfer Learning for Algorithm Recommendation
20191015 arXiv Emotion Recognition in Conversations with Transfer Learning from Generative Conversation Modeling
20191015 WSDM-20 DDTCDR: Deep Dual Transfer Cross Domain Recommendation
20191011 NeurIPS-19 Unified Language Model Pre-training for Natural Language Understanding and Generation
20191011 ICIP-19 Cross-modal knowledge distillation for action recognition
20191011 NeurIPS-19 workshop Language Transfer for Early Warning of Epidemics from Social Media
20191008 arXiv, ICCV-19 demo Cross-Domain Complementary Learning with Synthetic Data for Multi-Person Part Segmentation
20191008 arXiv Transfer Brain MRI Tumor Segmentation Models Across Modalities with Adversarial Networks
20191008 ICONIP-19 Semi-Supervised Domain Adaptation with Representation Learning for Semantic Segmentation across Time
20191008 arXiv Noise as Domain Shift: Denoising Medical Images by Unpaired Image Translation
20190926 arXiv Restyling Data: Application to Unsupervised Domain Adaptation
20190916 ISWC-19 Cross-dataset deep transfer learning for activity recognition
20190912 MICCAI workshop Multi-Domain Adaptation in Brain MRI through Paired Consistency and Adversarial Learning
20190909 IJCAI-FML-19 FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare
20190829 EMNLP-19 Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks
20190829 EMNLP-19 Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings
20190828 MICCAI-19 workshop Cross-modality Knowledge Transfer for Prostate Segmentation from CT Scans
20190828 ICCV-19 workshop Unsupervised Deep Feature Transfer for Low Resolution Image Classification
20190828 arXiv VAE-based Domain Adaptation for Speaker Verification
20190821 arXiv Shallow Domain Adaptive Embeddings for Sentiment Analysis
20190813 IJAIT Transferring knowledge from monitored to unmonitored areas for forecasting parking spaces
20190809 ICASSP-19 Cross-lingual Text-independent Speaker Verification using Unsupervised Adversarial Discriminative Domain Adaptation
20190809 IJCAI-19 Progressive Transfer Learning for Person Re-identification
20190809 NeurIPS-18 MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models
20190802 arXiv Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning
20190729 MICCAI-19 Annotation-Free Cardiac Vessel Segmentation via Knowledge Transfer from Retinal Images
20190703 arXiv Disentangled Makeup Transfer with Generative Adversarial Network
20190703 arXiv Applying Transfer Learning To Deep Learned Models For EEG Analysis
20190626 arXiv A Novel Deep Transfer Learning Method for Detection of Myocardial Infarction
20190517 PHM-19 Domain Adaptive Transfer Learning for Fault Diagnosis
20190515 ACL-19 Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies
20190509 arXiv Unsupervised Domain Adaptation using Generative Adversarial Networks for Semantic Segmentation of Aerial Images
20190508 arXiv Text2Node: a Cross-Domain System for Mapping Arbitrary Phrases to a Taxonomy
20190508 arXiv On Transfer Learning For Chatter Detection in Turning Using Wavelet Packet Transform and Empirical Mode Decomposition
20190416 arXiv Deep Transfer Learning for Single-Channel Automatic Sleep Staging with Channel Mismatch
20190415 PAKDD-19 Targeted Knowledge Transfer for Learning Traffic Signal Plans
20190415 PAKDD-19 Knowledge Graph Rule Mining via Transfer Learning
20190415 PAKDD-19 Adaptively Transfer Category-Classifier for Handwritten Chinese Character Recognition
20190415 PAKDD-19 Multi-task Learning for Target-Dependent Sentiment Classification
20190415 PAKDD-19 Spatial-Temporal Multi-Task Learning for Within-Field Cotton Yield Prediction
20190415 PAKDD-19 Passenger Demand Forecasting with Multi-Task Convolutional Recurrent Neural Networks
20190409 arXiv Unsupervised Domain Adaptation for Multispectral Pedestrian Detection
20190408 arXiv Unsupervised Domain Adaptation of Contextualized Embeddings: A Case Study in Early Modern English
20190408 USENIX-19 Transfer Learning for Performance Modeling of Deep Neural Network Systems
20190403 arXiv Transfer Learning for Clinical Time Series Analysis using Deep Neural Networks
20190403 arXiv Med3D: Transfer Learning for 3D Medical Image Analysis
20190401 arXiv Cross-Subject Transfer Learning in Human Activity Recognition Systems using Generative Adversarial Networks
20190305 arXiv Unsupervised Domain Adaptation Learning Algorithm for RGB-D Staircase Recognition
20190221 arXiv Transfusion: Understanding Transfer Learning with Applications to Medical Imaging
20190123 arXiv Transfer Learning and Meta Classification Based Deep Churn Prediction System for Telecom Industry
20190123 arXiv Cold-start Playlist Recommendation with Multitask Learning
20190123 arXiv Adapting Convolutional Neural Networks for Geographical Domain Shift
20190117 NeurIPS-18 workshop Transfer Learning for Prosthetics Using Imitation Learning
20190115 IJAERS Weightless Neural Network with Transfer Learning to Detect Distress in Asphalt
20190115 arXiv Disease Knowledge Transfer across Neurodegenerative Diseases
20190111 ICMLA-18 Supervised Transfer Learning for Product Information Question Answering
20190102 arXiv High Quality Monocular Depth Estimation via Transfer Learning
20181230 arXiv The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA
20181230 arXiv Domain-Aware Generalized Zero-Shot Learning
20181225 arXiv A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
20181225 arXiv A General Approach to Domain Adaptation with Applications in Astronomy
20181225 arXiv An Integrated Transfer Learning and Multitask Learning Approach for Pharmacokinetic Parameter Prediction
20181221 arXiv Deep Transfer Learning for Static Malware Classification
20181221 arXiv PnP-AdaNet: Plug-and-Play Adversarial Domain Adaptation Network with a Benchmark at Cross-modality Cardiac Segmentation
20181220 arXiv Domain Adaptation for Reinforcement Learning on the Atari
20181220 arXiv Deep UL2DL: Channel Knowledge Transfer from Uplink to Downlink
20181219 NER-19 Transfer Learning in Brain-Computer Interfaces with Adversarial Variational Autoencoders
20181219 ICCPS-19 Simulation to scaled city: zero-shot policy transfer for traffic control via autonomous vehicles
20181218 arXiv Transfer learning to model inertial confinement fusion experiments
20181214 arXiv Bridging the Generalization Gap: Training Robust Models on Confounded Biological Data
20181214 BioCAS-19 ECG Arrhythmia Classification Using Transfer Learning from 2-Dimensional Deep CNN Features
20181214 LAK-19 Transfer Learning using Representation Learning in Massive Online Open Courses
20181214 DVPBA-19 Considering Race a Problem of Transfer Learning
20181213 arXiv Multichannel Semantic Segmentation with Unsupervised Domain Adaptation
20181212 arXiv 3D Scene Parsing via Class-Wise Adaptation
20181212 arXiv Secure Federated Transfer Learning
20181206 NeurIPS-18 workshop Towards Continuous Domain adaptation for Healthcare
20181206 NeurIPS-18 workshop A Hybrid Instance-based Transfer Learning Method
20181205 arXiv Learning from a tiny dataset of manual annotations: a teacher/student approach for surgical phase recognition
20181204 arXiv From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts
20181128 arXiv Cross-domain Deep Feature Combination for Bird Species Classification with Audio-visual Data
20181128 NeurIPS-18 workshop Multi-Task Generative Adversarial Network for Handling Imbalanced Clinical Data
20181128 WACV-19 CNN based dense underwater 3D scene reconstruction by transfer learning using bubble database
20181127 NeurIPS-18 workshop Predicting Diabetes Disease Evolution Using Financial Records and Recurrent Neural Networks
20181123 NeurIPS-18 workshop Population-aware Hierarchical Bayesian Domain Adaptation
20181121 arXiv Transferrable End-to-End Learning for Protein Interface Prediction
20181121 NSFREU-18 Transfer Learning with Deep CNNs for Gender Recognition and Age Estimation
20181121 arXiv Distribution Discrepancy Maximization for Image Privacy Preserving
20181120 arXiv Spatial-temporal Multi-Task Learning for Within-field Cotton Yield Prediction
20181117 arXiv Unsupervised domain adaptation for medical imaging segmentation with self-ensembling
20181117 arXiv Performance Estimation of Synthesis Flows cross Technologies using LSTMs and Transfer Learning
20181117 AAAI-19 GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition
20181115 AAAI-19 Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents
20181114 arXiv A Framework of Transfer Learning in Object Detection for Embedded Systems
20181107 ICONIP-18 Transductive Learning with String Kernels for Cross-Domain Text Classification
20181012 arXiv Bird Species Classification using Transfer Learning with Multistage Training
20181012 arXiv Survival prediction using ensemble tumor segmentation and transfer learning
20181012 ICMLA-18 Virtual Battery Parameter Identification using Transfer Learning based Stacked Autoencoder
20180912 PervasiveHealth-18 Transfer Learning and Data Fusion Approach to Recognize Activities of Daily Life
20180912 ICIP-18 Adversarial Domain Adaptation with a Domain Similarity Discriminator for Semantic Segmentation of Urban Areas
20180912 arXiv Tensor Alignment Based Domain Adaptation for Hyperspectral Image Classification
20180909 arXiv Driving Experience Transfer Method for End-to-End Control of Self-Driving Cars
20180909 arXiv Deep Learning for Domain Adaption: Engagement Recognition
20180904 EMBC-18 Multi-Cell Multi-Task Convolutional Neural Networks for Diabetic Retinopathy Grading
20180904 ICPR-18 Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks
20180826 ISPRS journal Deep multi-task learning for a geographically-regularized semantic segmentation of aerial images
20180823 ICPR-18 Multi-task multiple kernel machines for personalized pain recognition from functional near-infrared spectroscopy brain signals
20180821 arXiv Unsupervised adversarial domain adaptation for acoustic scene classification
20180819 arXiv Transfer Learning and Organic Computing for Autonomous Vehicles
20180819 arXiv Transfer Learning for Brain-Computer Interfaces: An Euclidean Space Data Alignment Approach
20180801 arXiv Multimodal Deep Domain Adaptation
20180801 arXiv Rank and Rate: Multi-task Learning for Recommender Systems
20180801 MICCAI-18 Leveraging Unlabeled Whole-Slide-Images for Mitosis Detection
20180801 ECCV-18 DOCK: Detecting Objects by transferring Common-sense Knowledge
20180801 ECCV-18 A Zero-Shot Framework for Sketch-based Image Retrieval
20180731 ICANN-18 Metric Embedding Autoencoders for Unsupervised Cross-Dataset Transfer Learning
20180705 arXiv Transfer learning for adapting autonomous driving to different weather conditions: Modular Vehicle Control for Transferring Semantic Information to Unseen Weather Conditions using GANs
20180627 arXiv Infection prediction with transfer learning: Domain Adaptation for Infection Prediction from Symptoms Based on Data from Different Study Designs and Contexts
20180627 arXiv Generative models for pose transfer: Generative Models for Pose Transfer
20180622 arXiv Cross-domain face recognition for bank authentication systems: Cross-Domain Deep Face Matching for Real Banking Security Systems
20180621 arXiv Transfer learning for classifying corneal tissue: Transfer Learning with Human Corneal Tissues: An Analysis of Optimal Cut-Off Layer
20180621 arXiv Transfer learning for reinforcement learning via image translation: Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
20180615 Interspeech-18 A comprehensive study of many classes of methods applied to speech recognition: A Study of Enhancement, Augmentation, and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
20180615 Interspeech-18 Speech recognition in conversations: Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition
20180614 arXiv Cross-dataset person re-ID: Cross-dataset Person Re-Identification Using Similarity Preserved Generative Adversarial Networks
20180614 arXiv Transfer learning for multi-speaker text-to-speech: Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
20180613 SIGIR-18 Multi-task learning for recommender systems: Explainable Recommendation via Multi-Task Learning in Opinionated Text Data
20180613 CVPR-18 Cross-dataset VQA: Cross-Dataset Adaptation for Visual Question Answering
20180612 ICASSP-18 Transfer learning for low-resource sentiment classification: Semi-supervised and Transfer learning approaches for low resource sentiment classification
20180612 KDD-18 Multi-task learning for mining ICU patient data: Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU
20180610 CEIG-17 Transfer learning for illustration classification: Transfer Learning for Illustration Classification
20180610 BioNLP-18 Transfer learning for patient entity classification: Embedding Transfer for Low-Resource Medical Named Entity Recognition: A Case Study on Patient Mobility
20180610 MICCAI-18 Transfer learning for prostate image classification: Adversarial Domain Adaptation for Classification of Prostate Histopathology Whole-Slide Images
20180610 arXiv Transfer learning for coffee crop classification: A Comparative Study on Unsupervised Domain Adaptation Approaches for Coffee Crop Mapping
20180605 arXiv Transfer learning for chest X-ray segmentation: Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-ray Segmentation
20180604 arXiv Multiple sclerosis detection with CNN-based transfer learning: One-shot domain adaptation in multiple sclerosis lesion segmentation using convolutional neural networks
20180530 MNRAS Detecting galaxy mergers with transfer learning: Using transfer learning to detect galaxy mergers
20180529 arXiv Transfer learning for facial expression recognition: Meta Transfer Learning for Facial Emotion Recognition
20180524 KDD-18 Transferring people's IDs with a transfer learning approach: Learning and Transferring IDs Representation in E-commerce
20180519 arXiv Object detection with transfer learning at 200 frames per second: Object detection at 200 Frames Per Second
20180519 arXiv Sign language recognition with transfer learning: Optimization of Transfer Learning for Sign Language Recognition Targeting Mobile Platform
20180516 ACL-18 Adversarial transfer learning for public-opinion analysis during crises: Domain Adaptation with Adversarial Training and Graph Embeddings
20180504 arXiv Heart disease detection and classification with transfer learning: ECG Heartbeat Classification: A Deep Transferable Representation
20180427 CVPR-18(workshop) Deep transfer learning for person re-identification: Adaptation and Re-Identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-Identification
20180426 arXiv Transfer learning for medical named entity recognition: Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition
20180425 arXiv A deep network that combines bagging and dropping for transfer: A New Channel Boosted Convolution Neural Network using Transfer Learning
20180425 arXiv Transfer learning applied to natural language tasks: Dropping Networks for Transfer Learning
20180421 arXiv A deep transfer network with joint distribution adaptation for fault diagnosis in industrial production: Deep Transfer Network with Joint Distribution Adaptation: A New Intelligent Fault Diagnosis Framework for Industry Application
20180419 arXiv Cross-domain recommender systems: CoNet: Collaborative Cross Networks for Cross-Domain Recommendation
20180413 arXiv Cross-modal retrieval: Cross-Modal Retrieval with Implicit Concept Association
20180410 arXiv Crime-scene image matching with transfer learning: Cross-Domain Image Matching with Deep Feature Maps
20180408 ASRU-18 Speech recognition with the domain separation network from transfer learning: Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition
20180408 arXiv Transfer learning for handwriting recognition on small datasets: Boosting Handwriting Text Recognition in Small Databases with Transfer Learning
20180404 arXiv Object detection with transfer learning: Transferring Common-Sense Knowledge for Object Detection
20180402 arXiv Transfer learning for cancer detection: Improve the performance of transfer learning without fine-tuning using dissimilarity-based multi-view learning for breast cancer histology images
Transfer learning for activity recognition
https://arxiv.org/pdf/1411.1792.pdf
https://www.nature.com/articles/nature21056.epdf?referrer_access_token=_snzJ5POVSgpHutcNN4lEtRgN0jAjWel9jnR3ZoTv0NXpMHRAJy8Qn10ys2O4tuP9jVts1q2g1KBbk3Pd3AelZ36FalmvJLxw1ypYW0UxU7iShiMp86DmQ5Sh3wOBhXDm9idRXzicpVoBBhnUsXHzVUdYCPiVV0Slqf-Q25Ntb1SX_HAv3aFVSRgPbogozIHYQE3zSkyIghcAppAjrIkw1HtSwMvZ1PXrt6fVYXt-dvwXKEtdCN8qEHg0vbfl4_m&tracking_referrer=edition.cnn.com