本文介绍了Tesseract手写输入法,受字典训练的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我在文本文件中有一个单词词典,用换行符分隔.而且我想使用Tesseract识别笔迹,并在文本文件中输出最接近的匹配行.

I have a dictionary of words in a text file, separated by newlines. And I want to recognize the handwriting using Tesseract, and output the nearest matching line in the text file.

这是我第一次使用Tesseract,它已经在我的项目工作区中,我只需要训练数据.

This is the first time I'll be using Tesseract, and it's already in my project workspace, I just need the training data.

是否可以训练Tesseract来做到这一点?

Is it possible to train Tesseract to do this?

推荐答案

可以训练tesseract识别笔迹.以下是说明: https://tesseract-ocr.github.io/tessdoc/培训-Tesseract

It's possible to train tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract

但是不要期望效果很好.学者们通常获得的准确性结果高达90%.这是单词数字.因此,如果您的用例可以处理至少1/10个错误,那么这可能对您有用.

But don't expect very good results. Academics have typically gotten accuracy results topping out about 90%. Here are a couple references for words and numbers. So if your use case can deal with at least 1/10 errors, this might work for you.

这篇关于Tesseract手写输入法,受字典训练的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

09-11 14:41