2. If the text cannot be scanned, it has to be typed into the computer word by word. Paper-checking systems run only as online (cloud) services; there is no manual checking system.
3. Recognition system: Character recognition generally consists of three stages: information acquisition, information analysis and processing, and information classification and discrimination.
4. Information acquisition: The gray levels of the characters on the paper are converted into electrical signals and input into the computer. Acquisition is carried out by the paper-feeding mechanism and the photoelectric conversion device of the character recognition machine; such devices include flying-spot scanners, cameras, photosensitive elements, and laser scanners.
5. Information analysis and processing: The converted electrical signal is cleaned of the noise and interference caused by print quality, paper quality (uniformity, stains, etc.), or writing instruments, and is normalized in size, skew, shade, and stroke thickness.
6. Information classification and discrimination: The denoised and normalized character information is classified and identified, and the recognition result is output.
7. Character recognition methods: Character recognition methods fall into three basic categories: statistical, logical-decision, and syntactic. The commonly used methods are template matching and geometric feature extraction.
(1) Template matching compares the input character with each given standard character (template), computes the similarity between the input character and each template, and takes the category with the greatest similarity as the recognition result.
(2) Geometric feature extraction extracts geometric features of a character, such as endpoints, bifurcation points, concave and convex parts, horizontal, vertical, and inclined line segments, and closed loops, and obtains the recognition result from the positions and relationships of these features. Because it uses structural information, this method is also suitable for handwritten characters with large deformation.
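The stages and methods above can be illustrated with a minimal Python sketch. It is not taken from any real character recognition machine: the 5x5 templates, the SAMPLE bitmap, the cell-agreement similarity measure, and all function names are assumptions made for illustration. The sketch normalizes the size of a binary character image, picks the template with the greatest similarity, and counts endpoints and bifurcation points with the crossing-number rule, which is one common way to find such features on one-pixel-wide strokes.

```python
# Hypothetical 5x5 one-pixel-stroke templates (1 = ink); illustrative only.
TEMPLATES = {
    "T": ["11111", "00100", "00100", "00100", "00100"],
    "L": ["10000", "10000", "10000", "10000", "11111"],
    "O": ["11111", "10001", "10001", "10001", "11111"],
}
SIZE = 5

# Hypothetical scanned input: a larger, offset "T" with one smudge pixel.
SAMPLE = [
    "000000000000",
    "011111111110",
    "011111111110",
    "000001100000",
    "000001100000",
    "000101100000",  # the extra 1 at column 3 is the smudge
    "000001100000",
    "000001100000",
    "000001100000",
    "000001100000",
    "000001100000",
    "000000000000",
]


def to_grid(rows):
    """Turn rows of '0'/'1' characters into lists of integers."""
    return [[int(ch) for ch in row] for row in rows]


def normalize(grid, size=SIZE):
    """Crop to the ink bounding box, then resample to size x size (nearest neighbour)."""
    ink = [(r, c) for r, row in enumerate(grid) for c, v in enumerate(row) if v]
    if not ink:
        return [[0] * size for _ in range(size)]
    r0, r1 = min(r for r, _ in ink), max(r for r, _ in ink)
    c0, c1 = min(c for _, c in ink), max(c for _, c in ink)
    h, w = r1 - r0 + 1, c1 - c0 + 1
    return [[grid[r0 + i * h // size][c0 + j * w // size] for j in range(size)]
            for i in range(size)]


def similarity(a, b):
    """Fraction of cells on which two equally sized binary grids agree."""
    cells = [x == y for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b)]
    return sum(cells) / len(cells)


def recognize(grid):
    """Return (label, score) of the template most similar to the normalized input."""
    norm = normalize(grid)
    scores = {label: similarity(norm, to_grid(rows)) for label, rows in TEMPLATES.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


def stroke_features(grid):
    """Count (endpoints, bifurcation points) using the crossing number of each ink pixel."""
    # The 8 neighbours in clockwise order, starting from the pixel above.
    ring = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]
    rows, cols = len(grid), len(grid[0])

    def at(r, c):
        return grid[r][c] if 0 <= r < rows and 0 <= c < cols else 0

    endpoints = bifurcations = 0
    for r in range(rows):
        for c in range(cols):
            if not grid[r][c]:
                continue
            around = [at(r + dr, c + dc) for dr, dc in ring]
            # Crossing number: background-to-ink transitions around the ring.
            crossings = sum(1 for k in range(8) if not around[k] and around[(k + 1) % 8])
            if crossings == 1:
                endpoints += 1
            elif crossings >= 3:
                bifurcations += 1
    return endpoints, bifurcations


print(recognize(to_grid(SAMPLE)))
print({label: stroke_features(to_grid(rows)) for label, rows in TEMPLATES.items()})
```

With these assumed inputs, the smudged sample is still matched to "T", and the three templates yield different endpoint and bifurcation counts, which is the kind of structural information the second method relies on. Note that this naive size normalization ignores aspect ratio and that the crossing-number rule assumes thinned strokes; a practical system would need skew correction, denoising, and thinning first.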
Further information:
1. Paper detection service:
(1) The paper detection service, also known as paper duplicate checking, is a computer software system that detects academic misconduct (including plagiarism, forgery, tampering, improper authorship attribution, multiple submissions, etc.) in papers (including dissertations, academic papers, published papers, paper titles, scientific research achievements, and student compositions).
2. With graduation season approaching, universities have announced that students' graduation theses will be checked for plagiarism. Students found to have plagiarized will not be able to graduate on time.
3. With the wide use of anti-plagiarism software, a tug-of-war has begun between teachers and students in colleges and universities. Recently a new business has emerged: a large number of sellers on Taobao offer a "paper detection service" and claim that they use the same detection nodes as the universities and obtain the same results.
4. Most of the anti-plagiarism software used in colleges and universities is the "Academic Misconduct Detection System" developed by CNKI (China National Knowledge Infrastructure, also rendered as China HowNet). The Taobao sellers claim to use the CNKI system.
In fact, CNKI provides the "anti-plagiarism software" to its users free of charge. Its official website specifically emphasizes that the system is free only for institutional users such as universities, scientific research institutions, and publishing units, and is not for individual users.
Baidu Encyclopedia-Paper Detection Service
Baidu Encyclopedia-Character Recognition