What algorithm does OCR character recognition use?

The general OCR routine is like this

1. First detect and extract the text area.

2. Then we use radon hough transform to correct the text.

3. Segmentation of single-line text image with projection histogram.

Finally, a single line of OCR.

There are two main ideas for single-line OCR.

The first is the need to split characters.

There are many ways to divide characters. The most common method is to use the extreme point of the projection histogram as the candidate segmentation point, and use classifier+beam search to search the best segmentation point.

After finding the segmentation point, the traditional method is feature engineering+classifier for a single character. The general process is-> gray scale; Binary->; Corrected image->; Feature extraction (various methods such as pca lbp, etc. )->; Quantifiers (quantifiers are generally SVM· ANN Ann, etc.).

Now CNN (Convolutional Neural Network) can largely avoid feature engineering.

The second is that you don't need to split characters.

Another point is end-to-end identification, but only if you need a large number of labeled data sets. This method can output the character sequence continuously without dividing the image.

Multi-label classification can be used for short lengths. Such as license plate, verification code. Here, I tried the multi-label classification of license plates. End-to-end recognition of undivided characters in license plate recognition.

This is the way Google uses to identify street view house numbers.

Paper: How to be a qualified coal mine safety inspector?

How to distinguish the preface and abstract of a paper, and ask for a detailed explanation

Model essay on college physics papers

Wu Mengting's paper

Will the graduation certificate be cancelled if the paper fails to pass the sampling inspection?

Paper on dust removal device

Application of epoxy resin adhesive? Study on filler selection

Clinical malnutrition of inpatients

How many times do you need to have an ultrasound examination during pregnancy? Prenatal ultrasound examination strategy is coming.

What are the general contents of the first debate, the second debate, the third debate and the fourth debate respectively?