热门标签 | HotTags
当前位置:  开发笔记 > 编程语言 > 正文

文字检测与识别资源

综述[2015-PAMI-Overview]TextDetectionandRecognitioninImagery:ASurvey[paper][2014-Front.Comput.
综述

[2015-PAMI-Overview]Text Detection and Recognition in Imagery: A Survey[paper]

 

[2014-Front.Comput.Sci-Overview]Scene Text Detection and Recognition: Recent Advances and Future Trends[paper]

 

自然场景文字检测

[2017-arXiv]R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection[paper]


[2017-CVPR]EAST: An Efficient and Accurate Scene Text Detector [paper]


[2017-arXiv]Cascaded Segmentation-Detection Networks for Word-Level Text Spotting[paper]


[2017-arXiv]Deep Direct Regression for Multi-Oriented Scene Text Detection[paper]

 

[2017-CVPR]Detecting oriented text in natural images by linking segments [paper]


[2017-CVPR]Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection[paper]


[2017-arXiv]Arbitrary-Oriented Scene Text Detection via Rotation Proposals [paper]


[2017-AAAI]TextBoxes: A Fast Text Detector with a Single Deep Neural Network[paper][code]



[2016-arXiv]Accurate Text Localization in Natural Image with Cascaded Convolutional TextNetwork [paper]



 

[2016-arXiv]DeepText : A Unified Framework for Text Proposal Generation and Text Detectionin Natural Images [paper] [data]


 

[2016-arXiv]TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild [paper] [code]


 

[2016-arXiv] SceneText Detection via Holistic, Multi-Channel Prediction [paper]


 

[2016-CVPR] CannyText Detector: Fast and Robust Scene Text Localization Algorithm [paper]


 

[2016-CVPR]Synthetic Data for Text Localisation in Natural Images [paper] [data][code]


 

[2016-ECCV]Detecting Text in Natural Image with Connectionist Text Proposal Network[paper][demo][code]


[2016-TIP]Text-Attentional Convolutional Neural Networks for Scene Text Detection [paper]



 

[2016-IJDAR]TextCatcher: a method to detect curved and challenging text in natural scenes[paper]


 

[2016-CVPR]Multi-oriented text detection with fully convolutional networks [paper]


 

[2015-TPRMI]Real-time Lexicon-free Scene Text Localization and Recognition[paper]


 

[2015-CVPR]Symmetry-Based Text Line Detection in Natural Scenes[paper][code]


 

[2015-ICCV]FASText: Efficient unconstrained scene text detector[paper][code]

 


 

[2015-D.PhilThesis] Deep Learning for Text Spotting [paper]

 

[2015 ICDAR]Object Proposals for Text Extraction in the Wild [paper] [code]


 

[2014-ECCV] Deep Features for Text Spotting [paper] [code] [model] [GitXiv]


 

[2014-TPAMI] Word Spotting and Recognition with Embedded Attributes [paper] [homepage] [code]


 

[2014-TPRMI]Robust Text Detection in Natural Scene Images[paper]


 

[2014-ECCV] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [paper]


 

[2013-ICCV] Photo OCR: Reading Text in Uncontrolled Conditions[paper]


[2012-CVPR]Real-time scene text localization and recognition[paper][code]


 

[2010-CVPR]Detecting Text in Natural Scenes with Stroke Width Transform [paper] [code]


 

自然场景文字识别

[2017-AAAI-网络图片]Detection and Recognition of Text Embedded in Online Images via Neural Context Models[paper][project]


[2017-arvix 文档识别] Full-Page TextRecognition : Learning Where to Start and When to Stop[paper]


[2016-AAAI]Reading Scene Text in Deep Convolutional Sequences [paper]


 

[2016-IJCV]Reading Text in the Wild with Convolutional Neural Networks [paper] [demo] [homepage]


 

[2016-CVPR]Recursive Recurrent Nets with Attention Modeling for OCR in the Wild [paper]


 

[2016-CVPR] Robust Scene Text Recognition with Automatic Rectification [paper]


 

[2016-NIPs] Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data[paper]



[2015-CoRR] AnEnd-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition [paper] [code]


 

[2015-ICDAR]Automatic Script Identification in the Wild[paper]


 


 

[2015-ICLR] Deep structured output learning for unconstrained text recognition [paper]


 

[2014-NIPS]Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition [paperhomepage] [model]


 


 

[2014-TIP] A Unified Framework for Multi-Oriented Text Detection and Recognition [paper]


 

[2012-ICPR]End-to-End Text Recognition with Convolutional Neural Networks [paper] [code] [SVHN Dataset]


 


 

数据集

 

COCO-Text (ComputerVision Group, Cornell) 2016

63,686images, 173,589 text instances, 3 fine-grained text attributes.

Task:text location and recognition

COCO-Text API

Synthetic Data for Text Localisation in Natural Image (VGG)2016

         800k thousand images

         8 million synthetic word instances

         download

Synthetic Word Dataset (Oxford, VGG) 2014

9million images covering 90k English words

Task:text recognition, segmentation

download

IIIT 5K-Words 2012

5000images from Scene Texts and born-digital (2k training and 3k testing images)

Eachimage is a cropped word image of scene text with case-insensitive labels

Task:text recognition

download

StanfordSynth(Stanford, AI Group) 2012

Smallsingle-character images of 62 characters (0-9, a-z, A-Z)

Task:text recognition

download

MSRA Text Detection 500 Database(MSRA-TD500) 2012

500 natural images(resolutions of the images vary from 1296x864 to 1920x1280)

Chinese,English or mixture of both

Task:text detection

Street View Text (SVT) 2010

350 high resolution images (average size 1260 × 860) (100 images for training and 250 images for testing)

Onlyword level bounding boxes are provided with case-insensitive labels

Task:text location

KAIST Scene_Text Database 2010

3000images of indoor and outdoor scenes containing text

Korean,English (Number), and Mixed (Korean + English + Number)

Task:text location, segmentation and recognition

Chars74k 2009

Over74K images from natural images, as well as a set of synthetically generatedcharacters

Smallsingle-character images of 62 characters (0-9, a-z, A-Z)

Task:text recognition

ICDARBenchmark Datasets

Dataset

Discription

Competition Paper

ICDAR 2015

1000 training images and 500 testing images

paper 

link

ICDAR 2013

229 training images and 233 testing images

paper 

link

ICDAR 2011

229 training images and 255 testing images

paper 

link

ICDAR 2005

1001 training images and 489 testing images

paper 

link

ICDAR 2003

181 training images and 251 testing images(word level and character level)

paper 

link

 

开源库

 

Tesseract: c++ based tools for documents analysis and OCR,support 60+ languages [code]

 

Ocropy: Python-based tools for document analysis and OCR [code]

 

CLSTM : A small C++ implementation of LSTM networks,focused on OCR [code]

 

Convolutional Recurrent Neural Network,Torch7 based [code]

 

Attention-OCR: Visual Attention based OCR [code]

 

Umaru: An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm [code]

 

 

其他

 

DeepFont:Identify Your Font from An Image[paper]

 

Writer-independent Feature Learning for Offline Signature Verification using Deep Convolutional Neural Networks[paper]

 

End-to-End Interpretation of the French Street Name Signs Dataset [paper] [code]

 

Extracting text from an image using Ocropus [blog]

 

手写字识别

[2016-arXiv]Drawingand Recognizing Chinese Characters with Recurrent Neural Network [paper]

 

Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition [paper]

 

Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition [paper]

 

High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps [paper] [github]

 

DeepHCCR:Offline Handwritten Chinese Character Recognition based on GoogLeNet and AlexNet (With CaffeModel) [code]

 

如何用卷积神经网络CNN识别手写数字集?[blog][blog1][blog2] [blog4] [blog5] [code6]

 

Scan,Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTMAttention [paper]

 

MLPaint:the Real-Time Handwritten Digit Recognizer [blog][code][demo]

 

caffe-ocr: OCR with caffe deep learning framework [code] (单字分类器)

 

牌照等识别


ReadingCar License Plates Using Deep Convolutional Neural Networks and LSTMs  [paper]

 

Numberplate recognition with Tensorflow [blog] [code]

 

end-to-end-for-plate-recognition[code]

 

ApplyingOCR Technology for Receipt Recognition[blog][mirror]

 

破解验证码


[2017-Arvix]Using Synthetic Data to Train NeuralNetworks is Model-Based Reasoning[paper]


Using deep learning to break a Captcha system [blog] [code]

 

Breakingreddit captcha with 96% accuracy [blog] [code]

 

I'mnot a human: Breaking the Google reCAPTCHA [paper]

 

NeuralNet CAPTCHA Cracker [slides] [code] [demo]

 

Recurrentneural networks for decoding CAPTCHAS [blog] [code] [demo]

 

Readingirctc captchas with 95% accuracy using deep learning [code]

 

端到端的OCR:基于CNN的实现 [blog]

 

IAm Robot: (Deep) Learning to Break Semantic Image CAPTCHAs [paper]

 

参考

[1]http://handong1587.github.io/deep_learning/2015/10/09/ocr.html

[2]https://github.com/chongyangtao/Awesome-Scene-Text-Recognition



推荐阅读
  • 本文介绍了在处理不规则数据时如何使用Python自动提取文本中的时间日期,包括使用dateutil.parser模块统一日期字符串格式和使用datefinder模块提取日期。同时,还介绍了一段使用正则表达式的代码,可以支持中文日期和一些特殊的时间识别,例如'2012年12月12日'、'3小时前'、'在2012/12/13哈哈'等。 ... [详细]
  • 在Android开发中,使用Picasso库可以实现对网络图片的等比例缩放。本文介绍了使用Picasso库进行图片缩放的方法,并提供了具体的代码实现。通过获取图片的宽高,计算目标宽度和高度,并创建新图实现等比例缩放。 ... [详细]
  • 本文详细介绍了使用C#实现Word模版打印的方案。包括添加COM引用、新建Word操作类、开启Word进程、加载模版文件等步骤。通过该方案可以实现C#对Word文档的打印功能。 ... [详细]
  • 带添加按钮的GridView,item的删除事件
    先上图片效果;gridView无数据时显示添加按钮,有数据时,第一格显示添加按钮,后面显示数据:布局文件:addr_manage.xml<?xmlve ... [详细]
  • 如何自行分析定位SAP BSP错误
    The“BSPtag”Imentionedintheblogtitlemeansforexamplethetagchtmlb:configCelleratorbelowwhichi ... [详细]
  • 阿里Treebased Deep Match(TDM) 学习笔记及技术发展回顾
    本文介绍了阿里Treebased Deep Match(TDM)的学习笔记,同时回顾了工业界技术发展的几代演进。从基于统计的启发式规则方法到基于内积模型的向量检索方法,再到引入复杂深度学习模型的下一代匹配技术。文章详细解释了基于统计的启发式规则方法和基于内积模型的向量检索方法的原理和应用,并介绍了TDM的背景和优势。最后,文章提到了向量距离和基于向量聚类的索引结构对于加速匹配效率的作用。本文对于理解TDM的学习过程和了解匹配技术的发展具有重要意义。 ... [详细]
  • 本文讨论了如何在不使用SearchBar display controller的情况下,单独使用SearchBar并捕获其textChange事件。作者介绍了实际状况,即左侧SliderMenu中的SearchBar需要在主页TableView中显示搜索结果。然后,作者提供了解决方案和步骤,帮助读者实现这一功能。 ... [详细]
  • Excel数据处理中的七个查询匹配函数详解
    本文介绍了Excel数据处理中的七个查询匹配函数,以vlookup函数为例进行了详细讲解。通过示例和语法解释,说明了vlookup函数的用法和参数的含义,帮助读者更好地理解和运用查询匹配函数进行数据处理。 ... [详细]
  • ASP.NET2.0数据教程之十四:使用FormView的模板
    本文介绍了在ASP.NET 2.0中使用FormView控件来实现自定义的显示外观,与GridView和DetailsView不同,FormView使用模板来呈现,可以实现不规则的外观呈现。同时还介绍了TemplateField的用法和FormView与DetailsView的区别。 ... [详细]
  • Android开发实现的计时器功能示例
    本文分享了Android开发实现的计时器功能示例,包括效果图、布局和按钮的使用。通过使用Chronometer控件,可以实现计时器功能。该示例适用于Android平台,供开发者参考。 ... [详细]
  • 深度学习中的Vision Transformer (ViT)详解
    本文详细介绍了深度学习中的Vision Transformer (ViT)方法。首先介绍了相关工作和ViT的基本原理,包括图像块嵌入、可学习的嵌入、位置嵌入和Transformer编码器等。接着讨论了ViT的张量维度变化、归纳偏置与混合架构、微调及更高分辨率等方面。最后给出了实验结果和相关代码的链接。本文的研究表明,对于CV任务,直接应用纯Transformer架构于图像块序列是可行的,无需依赖于卷积网络。 ... [详细]
  • 突破MIUI14限制,自定义胶囊图标、大图标样式,支持任意APP
    本文介绍了如何突破MIUI14的限制,实现自定义胶囊图标和大图标样式,并支持任意APP。需要一定的动手能力和主题设计师账号权限或者会主题pojie。详细步骤包括应用包名获取、素材制作和封包获取等。 ... [详细]
  • Python爬虫中使用正则表达式的方法和注意事项
    本文介绍了在Python爬虫中使用正则表达式的方法和注意事项。首先解释了爬虫的四个主要步骤,并强调了正则表达式在数据处理中的重要性。然后详细介绍了正则表达式的概念和用法,包括检索、替换和过滤文本的功能。同时提到了re模块是Python内置的用于处理正则表达式的模块,并给出了使用正则表达式时需要注意的特殊字符转义和原始字符串的用法。通过本文的学习,读者可以掌握在Python爬虫中使用正则表达式的技巧和方法。 ... [详细]
  • EzPP 0.2发布,新增YAML布局渲染功能
    EzPP发布了0.2.1版本,新增了YAML布局渲染功能,可以将YAML文件渲染为图片,并且可以复用YAML作为模版,通过传递不同参数生成不同的图片。这个功能可以用于绘制Logo、封面或其他图片,让用户不需要安装或卸载Photoshop。文章还提供了一个入门例子,介绍了使用ezpp的基本渲染方法,以及如何使用canvas、text类元素、自定义字体等。 ... [详细]
  • node.jsurlsearchparamsAPI哎哎哎 ... [详细]
author-avatar
love_xiao奇
这个家伙很懒,什么也没留下!
PHP1.CN | 中国最专业的PHP中文社区 | DevBox开发工具箱 | json解析格式化 |PHP资讯 | PHP教程 | 数据库技术 | 服务器技术 | 前端开发技术 | PHP框架 | 开发工具 | 在线工具
Copyright © 1998 - 2020 PHP1.CN. All Rights Reserved | 京公网安备 11010802041100号 | 京ICP备19059560号-4 | PHP1.CN 第一PHP社区 版权所有