本文介绍了除了对图像进行降采样的二进制网格外,我还可以将哪些功能用于手写OCR?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

您好,我一直在研究有关哪些功能对我的手写OCR分类神经网络有好处的研究论文.我是一个初学者,所以我只是拍摄了手写字符的图像,在其周围制作了一个边框,然后将其调整为15x20的二进制图像.所以这意味着我有300个要素的输入层.从我在google上发现的论文(其中大多数都是相当古老的)来看,方法确实有所不同.仅用图像的二进制网格,我的精度就不错,但是我想知道是否有人可以使用其他功能来提高精度.甚至只是指向正确的方向.我真的很感激!

Hi I have been searching though research papers on what features would be good for me to use in my handwritten OCR classifying neural network. I am a beginner so I have been just taking the image of the handwritten character, made a bounding box around it, and then resize it into a 15x20 binary image. So this means i have an input layer of 300 features. From the papers i have found on google (most of which are quite old) the methods really vary. My accuracy is not bad with just a binary grid of the image, but I was wondering if anyone had other features I could use to boost my accuracy. Or even just pointing me in the right direction. I would really appreciate it!

谢谢,扎克

推荐答案

我尚未阅读有关此主题的任何实际论文,但我的建议是提高创造力.使用您认为可能会帮助分类器识别数字的任何东西.

I haven't read any actual papers on this topic, but my advice would be to get creative. Use anything you could think of that might help the classifier identify numbers.

我的第一个想法是尝试通过修改后的滑动窗口"算法(滑动/旋转线?)来尝试识别图像中的线条",或者尝试尝试识别出最适合的线条"图像(以帮助分类器响应斜体或书写样式的变化).确实,但是,如果您使用的是神经网络,它应该在没有人工帮助的情况下进行此类操作(这就是全部内容!)

My first thought would be to try and identify "lines" in the image, maybe via a modified "sliding window" algorithm (sliding/rotating line?), or to try and identify a "line of best fit" to the image (to help the classifier respond to changes in italicism or writing style). Really though, if you're using a neural network, it should be picking up on these sorts of things without your manual help (that's the whole point of them!)

我将首先关注网络的结构和拓扑,以尝试提高性能,并且仅在无法以其他方式获得令人满意的性能时才担心其他功能.另外,您可以尝试改善现有功能,确保字符在图像中居中,或者尝试使用一种算法来倾斜斜体字符以使其垂直?

I would focus first on the structure and topology of your net to try and improve performance, and worry about additional features only if you cannot get satisfactory performance some other way. Also you could try improving the features you already have, make sure the character is centered in the image, maybe try an algorithm to skew italicised characters to make them vertical?

以我的经验,这些事情通常并没有帮助,但是您可能会很幸运并遇到可以改善您的网络的问题:)

In my experience these sorts of things don't often help, but you could get lucky and run into one that improves your net :)

这篇关于除了对图像进行降采样的二进制网格外,我还可以将哪些功能用于手写OCR?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

09-15 02:50