The rules and principles for duplicate checking of HowNet are as follows:
1, HowNet duplicate checking is continuous 13 characters will be judged as duplicates, and 13 characters are equivalent to 6-7 Chinese characters. Some students will ask why the opening of a paragraph is repeated in only three words. That's because it is composed of 13 characters at the end of the previous paragraph, and so is the red repetition at the end.
2. Hownet duplicate checking system has a threshold. There will be errors in detecting the same article at the same time. Errors will occur when the structure and content of the article change. However, the overall results generally have little error.
Extended data:
Rules and principles of duplicate checking of HowNet papers;
1. Because of the most advanced fuzzy algorithm, if the overall structure and outline are disrupted, it may lead to the inconsistency between the first and second red detection of articles in the same place, or the part that was not marked as red in the first detection will be marked as red in the second detection. Therefore, try to change the sentence pattern when modifying the repeated content, so as not to disturb the original overall outline and structure of the paper.
2. After the whole paper is uploaded, the system will automatically detect the chapter information of the paper according to the directory generated by the article, and then the system will detect the chapter of the paper, so that the copy ratio of each single chapter can be obtained. The directory is gray and does not participate in the text detection; Otherwise, it will be automatically segmented and detected according to 10000 or so characters. At the same time, the directory may be detected as text, and if it is duplicated, it will be marked as red.
3. China Knowledge Network has set a threshold for the sensitivity of this duplicate checking system, which is 5%. In terms of paragraphs, plagiarism or quotation below 5% cannot be detected, which is common in clauses or small concepts in large paragraphs. For example, if the detection paragraph 1 has 10000 words, a single document with less than 500 words will not be detected.
In fact, here also tells the students a modification method, that is, never choose an article to quote from paragraph plagiarism, try to choose as many documents as possible, and intercept a few words from one article, so that it will not be found out.
4. How to detect plagiarism in a paper? The condition of hownet paper detection is that 13 words with continuous similarity or plagiarism will be marked as red, but the prerequisite in 3 must be met: that is, the total number of words in a document you quoted or plagiarized must reach more than 5% in each detection paragraph before it can be detected as red.
5. Hownet detection system will automatically identify references, and references will not participate in text detection. In addition, if excluded, the references in the HowNet test report are displayed in gray font, indicating that they did not participate in the test. Of course, if the format of the reference is completely correct and standardized, this will be automatically excluded.
Otherwise, references will be detected as text, which will cause all references to be marked in red. Higher grades!
6. The overall upload of hownet papers, PDF or Word format may affect the test results. Because uploading a PDF for testing, PDF will have one more text conversion process than Word, which may disturb your original correct table of contents and reference format, and the table of contents and reference format will be disordered, resulting in system recognition errors and being marked as red.
Especially those papers with English catalogues and most English references, English accounts for a high number of words. If English is marked in red, the total score will be greatly increased.
7. Try to quote the whole paragraph. If you quote a sentence or two, HowNet system can't identify the specific sentence in which article you quoted. So try to quote long paragraphs. And the references must be exactly the same.
Baidu Encyclopedia-Paper Coincidence