OCR text recognition software refers to the use of OCR? Optical Character Recognition (OCR) technology is a software that directly converts the text content in pictures and photos into editable text. Software can convert pictures into editable text. Support JPG, PNG, GIF, BMP, DOC and other image formats.
Extended data:
Due to the popularity and wide application of scanners, OCR software only needs to provide an interface with scanners and use scanner driver software. Therefore, OCR software mainly consists of the following parts.
1, image input, preprocessing:
Image input: For different image formats, there are different storage formats and different compression methods. Pretreatment: It mainly includes binarization, denoising and tilt correction.
2, binarization:
Most of the pictures taken by the camera are color images, which contain a huge amount of information. For the content of the picture, we can simply divide it into foreground and background. In order to make the computer recognize characters faster and better, we need to process the color image first, so that the image only has foreground information and background information. We can simply define that the foreground information is black and the background information is white, which is a binary image.
3. Noise removal:
For different documents, we can define noise differently, and denoising according to the characteristics of noise is called denoising.
4. Tilting is more active:
Because ordinary users are more casual when taking photos of documents, the photos taken will inevitably tilt, which requires text recognition software to correct.
5. The process of dividing a document picture into paragraphs and lines is called layout analysis. Because of the diversity and complexity of actual documents, there is no fixed optimal cutting model.
6, character cutting:
Due to the limitation of photographing conditions, characters often stick together and break pens, which greatly limits the performance of the recognition system, which requires the word cutting function of the character recognition software.
7, character recognition:
This research has been done for a long time, with template matching in the early stage and feature extraction in the later stage. Due to the displacement of characters, stroke thickness, broken strokes, adhesion, rotation and other factors, the difficulty of feature extraction is greatly affected.
8. Page recovery:
People hope that the recognized characters will still be arranged like the original document pictures, and the paragraphs, positions and orders will remain unchanged, and they will be output to word documents, pdf documents and so on. This process is called page recovery.
9. Post-processing and proofreading:
According to the relationship between specific language contexts, it is post-processing to correct the recognition results. The purpose of developing an OCR character recognition software system is very simple, that is, to transform the image and keep the graphics in the image. If there is a table, the data in the table and the characters in the image will become computer characters.
It can reduce the storage of image data, reuse and analyze the recognized characters, and of course save the manpower and time of keyboard input.
Baidu encyclopedia -OCR character recognition
Baidu Encyclopedia -ocr Text Recognition Software