Current location - Education and Training Encyclopedia - Graduation thesis - Why is the repetition rate of paperpass much higher than that of HowNet?
Why is the repetition rate of paperpass much higher than that of HowNet?
The network resource database of PaperPass is larger than HowNet, the network resources are very complex, and the algorithm of PaperPass is not as advanced as HowNet. The algorithm of PaperPass detection system is strict, but it does not mean that it is the most accurate.

The algorithm threshold of the duplicate checking system of HowNet is 3%, and the duplicate checking of papers is mainly based on the accuracy, so when choosing the duplicate checking system, you need to know what detection system is used for graduation schools and publishing magazines first. PaperPass red plagiarism is serious, yellow plagiarism is slight, only green security; The red plagiarism, yellow quotation and repetition rate of HowNet are the proportion of repeated words to the total number of words.

Double check rule

1, same duplicate check

The rules of paper duplicate checking will certainly include the same duplicate checking. If there are 13 words in a paper that are the same as those in other papers, they are repeated words according to the residue. So generally speaking, if you want to quote some sentences, but the content is long, you should replace the sentence patterns and synonyms yourself.

2. Fuzzy duplicate checking of paragraphs and sentences

At present, the rules of duplicate checking can be checked intelligently and fuzzily. Even if you add some related words to a sentence, you can check out which document the sentence comes from and have a certain degree of relevance. Therefore, we must pay special attention to the basic treatment of paper. Don't just change the order of paragraphs or sentences, it's basically meaningless.

Step 3 double-check the form

In fact, for papers with duplicate tables, the duplicate checking rule depends more on whether there are a lot of duplicates in the content and format of the table. Although many people may just copy the format of the table, the content is completely different from others and will be repeated. Why? Because your table format is too similar, if more than 80% of the contents are the same, it is likely to be counted as duplication, so you should use the table carefully.