HowNet and PaperPass gave very different results. What should I do?
Let me ask you a question first. Was the HowNet check you ran the final-version check or a split-up (segmented) check?

The final version of HowNet refers to:

Junior college and undergraduate: HowNet PMLC, which includes the joint comparison database of college students' papers.

Master's and doctoral: the HowNet VIP5.2 system, which includes the joint comparison database of academic papers.

If you ran the final-version HowNet check, whether you need to revise depends on which of two situations applies:

1. You wrote most of it yourself.

2. You copied a lot, but HowNet didn't detect it.

In the first case, there is no need to change anything. Submit it directly to the school.

In the second case, it is recommended to revise it again, because the paper has to pass not only the school's duplicate check but also the eyes of your tutor and the blind review panel! Even where HowNet has not indexed a source in time, tutors and blind reviewers with years of experience can spot plagiarized content at a glance.

Then why is there such a big gap between PP and HowNet?

This is because: PP and HowNet have different databases and algorithms!

But it's hard to say which of the two will come out higher ~

Look at an example: PP 19%, HowNet 86%.

Because the databases and algorithms of PP and HowNet differ, the content they flag as duplicated and the similarity they assign are not necessarily the same.

Don't believe it? Well, here is hard evidence!

(The upper part is the detection result of HowNet, and the lower part is the detection result of paperpass.)

The sentence in the red box above.

"As the core of personal moral quality, social responsibility is highly unified with personal values, which requires individuals to organically combine self-development with social development, coordinate development, realize self-value in the process of serving and contributing to society, and pursue and realize happiness in life."

In the HowNet detection system, it is not flagged as duplicated.

In PaperPass, it was judged "slightly similar".

For the purpose of passing the duplicate check, HowNet did not flag it, so this passage can in fact be left unchanged. If you wrote it yourself, leave it alone. But if it really is plagiarized, it's better to revise it honestly. The eyes of the tutor and the blind review panel are still very sharp.

Look at another example:

College students' sense of self-responsibility includes "the consciousness of self-survival and self-development, which specifically refers to cherishing their own lives, caring for their physical and mental health, enriching their spiritual life, and having clear goals and life pursuits; Study hard, improve your self-cultivation and actively pursue a valuable life; Be responsible for your words and deeds and fulfill your obligations to improve your life realm." [1]

Please note that this is a quotation. HowNet handles it normally and judges it a repeated citation. But in PaperPass, this single complete quotation gets three different judgments!

Non-repetition (green): college students' sense of self-responsibility includes; Care about your physical and mental health, enrich your spiritual life, and have clear goals and life pursuits; Study hard, improve your self-cultivation and actively pursue a valuable life.

Slightly similar (orange): responsible for your words and deeds and fulfilling your obligations, improving your life realm, etc.

Severe similarity (red): the sense of responsibility for self-survival and self-development, specifically, cherishing one's own life.

I really want to ask the programmers behind PaperPass: what is your detection principle? Is it really this arbitrary?

Time to combine theory with practice (got your stools ready?)

1. Very different databases

As can be seen from the above table, the database of HowNet is undoubtedly more complete and detailed.

Its unique joint comparison database of academic papers, used for master's and doctoral dissertations, contains almost all the master's and doctoral dissertations tested before the latest update (roughly, those from half a year ago or earlier);

Its unique joint comparison database of college students' papers includes almost all the papers tested previously (roughly, those from 1 year ago or earlier).

Note: "tested" is the key word. Whether or not it was the final version submitted to the school, a paper is included as long as it was tested.

PaperPass has only five databases.

Of course, we can't judge whose database data is bigger from the number of databases.

However, judging from practical experience, HowNet is more academic, which is why it has become the partner duplicate-checking system for more than 90% of universities in China. PP is a commercial product; it cannot obtain high-quality academic content and data in large quantities, so it can only fill its database from Internet resources.

Because the HowNet and PP databases differ, the test results would differ even if their algorithms were identical.

And their algorithms are different too.
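To see why the database alone matters, here is a toy sketch. It is not how HowNet or PP actually work: the character n-gram overlap measure, the function names `ngrams` and `similarity`, and both sample databases are assumptions made up for illustration. The same scoring function is run against two different databases and returns two different scores for the same sentence.

```python
def ngrams(text: str, n: int = 3) -> set[str]:
    """Return the set of character n-grams in text (toy fingerprint, an assumption)."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def similarity(sentence: str, database: list[str], n: int = 3) -> float:
    """Highest share of the sentence's n-grams found in any single database document."""
    grams = ngrams(sentence, n)
    if not grams:
        return 0.0
    return max(
        (len(grams & ngrams(doc, n)) / len(grams) for doc in database),
        default=0.0,  # an empty database can never report a match
    )


sentence = "social responsibility is highly unified with personal values"

# Hypothetical databases: one contains the source sentence, the other does not.
academic_db = ["social responsibility is highly unified with personal values"]
web_db = ["an unrelated page about cooking noodles"]

print(similarity(sentence, academic_db))  # same algorithm ...
print(similarity(sentence, web_db))       # ... different database, different score
```

If the source you borrowed from is only in one system's database, only that system can flag it, whatever the algorithm.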

2. Different algorithms

Relatively speaking, the principle of PP detection is more strict (empirically speaking).

In PaperPass, any passage with a similarity above 40% is judged to be duplicated, and that is what needs to be revised.

What does 40% similarity look like? Look at this example:

This sentence is judged "slightly similar". However, the same sentence is split into two parts, and the two parts get different similarity scores (which is really strange):

The similarity of the first half sentence is 45%, and that of the second half sentence is 53%. And the sources of similarity are different.
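The 40% rule can be written down as a tiny filter. This is only a sketch of the threshold as described in this post, not PP's real code: the names `PP_THRESHOLD` and `flag_segments` are hypothetical, and the third segment with 19% is made up for contrast. How PP computes the percentages themselves stays a mystery.

```python
PP_THRESHOLD = 40.0  # percent; the cut-off quoted in this post, not an official constant


def flag_segments(segments: list[tuple[str, float]],
                  threshold: float = PP_THRESHOLD) -> list[str]:
    """Return the text of every segment whose similarity is above the threshold."""
    return [text for text, similarity in segments if similarity > threshold]


# The two halves of the example sentence, plus one invented low-similarity segment.
segments = [
    ("first half of the sentence", 45.0),
    ("second half of the sentence", 53.0),
    ("some other sentence", 19.0),  # hypothetical value for contrast
]

print(flag_segments(segments))
```

Both halves land above the cut-off, so both would need revising, even though each half matched a different source.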

The detection principle of PP is really a mystery ... but obviously there are similar "core keywords". Still, as mentioned above, can a term like "social responsibility" really be replaced by other words? It seems not!

Let's take a look at the serious similarity in PP:

Where did 71% come from? Are there too many keywords piled up here? No idea. Readers who know the answer are welcome to leave a comment ~

The algorithm of HowNet is much clearer. A passage is either plagiarism or a repeated citation.

What counts as plagiarism? A run of 13 consecutive identical or similar words is marked in red, and that is plagiarism.

Identical:

Similar: (This shows that deleting part of the plagiarized content, or simply inserting a few other words in the middle of it to break up the run of 13 consecutive similar words, does not work!)

As for HowNet's handling of citations, well ... all that can be said is that it behaves normally in most cases: passages with footnotes or citation marks are treated as repeated citations and marked green in the HowNet report.
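Here is a rough sketch of the "13 consecutive" rule, covering only the identical case. It is not HowNet's real algorithm. It assumes the unit is a character and that matching is exact, and the names `flagged_runs` and `min_run` are made up for illustration. Because it only does exact matching, inserting a word would break a run in this sketch, which is precisely what does not work against HowNet's "similar" matching.

```python
def flagged_runs(text: str, source: str, min_run: int = 13) -> list[str]:
    """Return runs of at least min_run consecutive characters of text found verbatim in source."""
    runs = []
    i = 0
    while i < len(text):
        k = 0
        # Extend the match one character at a time while it still occurs in source.
        while i + k < len(text) and text[i:i + k + 1] in source:
            k += 1
        if k >= min_run:
            runs.append(text[i:i + k])  # long enough: would be marked red
            i += k
        else:
            i += 1  # too short to count: move on
    return runs


# Hypothetical strings: 16 shared characters in one case, 12 in the other.
source = "xx" + "abcdefghijklmnop" + "yy"
copied = "0123" + "abcdefghijklmnop" + "4567"
short_copy = "0123" + "abcdefghijkl" + "4567"

print(flagged_runs(copied, source))      # the 16-character run is reported
print(flagged_runs(short_copy, source))  # 12 characters stays under the limit
```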

Now let's get back to the point. Can PaperPass be used as a reference for checking duplication and reducing it?

If you write a lot yourself, be prepared that the PP test result is higher than HowNet.

If you copy too much, be prepared that the PP test score is lower than HowNet.

Fortunately, PP has a user-friendly feature: adding a self-built library. When testing with PP, upload all the papers you borrowed from to the PP back end as a self-built library, then submit your paper for checking. More than 99% of the content taken from the self-built library will be flagged as duplicated, and the rest depends on PP's free play.

Finally, revise according to the test results, and there should basically be no big problem!

However, because the PP algorithm behaves oddly and is unfriendly to keywords and technical terms, it is not recommended to chase a very low PP duplication rate; getting it to roughly within 20% is enough.

If your PP result is currently 34%, another round of revision is recommended!


That's all! I heard that everyone who liked this post passed their final paper! Hehe ~
