A similarity rate above 30% has serious consequences: in mild cases graduation is postponed, and in severe cases the degree is revoked. After years of hard study at university, having a degree revoked is a heavy price to pay.
But detection software is, after all, a rule-based mechanism built by people around a detection algorithm. Once the mechanism is understood, simple modifications are enough to pass the check.
This article is compiled from information collected on the Internet. I have sorted out the most important parts for your reference.
Paper plagiarism detection algorithm:
1. Paragraphs and file format
Papers are generally uploaded as a whole. After upload, the detection software first splits the paper into segments, so the format of the final manuscript has a large influence on the similarity rate. Depending on how paragraphs are divided, short passages of a few dozen words may go undetected. Splitting the text into more short paragraphs can therefore lower the similarity rate.
2. Comparison database
Detection mostly matches against published theses, journal papers, and conference papers; some databases also include articles from the Internet. Many books are not in the comparison database. A friend of mine once took a large amount of text from a research monograph and it was not detected, so this approach still seems to work.
3. Chapter reordering
Many students change the order of chapters or stitch together passages from different articles, but this has little effect on the detection result. Plagiarism detection experts therefore advise against assuming that copying from a few, or even a few dozen, articles will pass.
4. Marked references
How does detection software distinguish quoting other people's articles from copying them? It is actually quite simple. In the paper, quoted passages carry reference markers, but the software treats all matched text in the same way. The software's threshold is generally set at 1%. For example, for a 5,000-word article, 1% is 50 words. If more than 50 words match, they are counted as plagiarism even if references are added.
5. Matching length
The detection system is strict: any match of more than 20 units of text is considered plagiarism, provided the threshold described in point 4 on marked references is also met.
Paper plagiarism modification methods:
First, change the wording. Technical terms can be kept, but other words should be replaced with synonyms wherever possible.
Second, change how things are expressed, for example by inverting sentences or switching between active and passive voice. Break up the paragraph order: split the original paragraphs when copying and reorganize them.
These methods can effectively reduce the similarity rate.
Here are a few examples for your reference:
Example a:
In this paper, the construction of HFS is studied by using the genetic algorithm combining integer coding and real coding as the objective function. The chromosome coding method and the corresponding genetic operation method proposed in this paper can realize the global random optimization of the research object. The research on the standard examples of automobile series shows that this method has high calculation repeatability and efficiency.
Modify a:
In this paper, the construction of HFS problem is studied. Genetic algorithm is combined with integer and real number coding, and the objective function is to maximize the utilization rate of equipment. The chromosome coding method and the corresponding genetic algorithm operation in this paper can effectively improve the global search ability of the algorithm. Through the study of some benchmark examples, the effectiveness of the algorithm in this paper is verified, and it has high computational repeatability and high operation efficiency.
Example b:
Due to the strong regionality of real estate commodities, real estate development enterprises usually need to set up project companies when investing in different regions, and at this time they will be faced with the choice of establishing branches or subsidiaries. A subsidiary is an independent legal person, but a branch is not. They differ in tax benefits. The subsidiary is an independent legal person, regarded as a taxpayer in the established area, and usually bears the same comprehensive tax payment obligations as other companies in the area; Branches are not independent legal entities, and they are not regarded as taxpayers in the place where they are established, and only bear limited tax obligations. The profit and loss of the branch company should be calculated with the head office.
Modify b:
When real estate development enterprises invest in different regions, they need to set up project companies because of the strong regionality of such commodities. At this point, enterprises need to choose whether to establish branches or subsidiaries. The main difference is that a subsidiary is an independent legal person, but a branch is not. Secondly, in terms of tax benefits, because a branch is not an independent legal person, it is not regarded as a taxpayer in the area where it is established and bears only limited tax obligations. The head office must count the profits and losses of its branches together with its own. A subsidiary is an independent legal person, is regarded as a taxpayer in its region, and must bear the same comprehensive tax obligations as other companies in the region.
There are no other ways to revise flagged text beyond these. Students are advised to get familiar with the references they read, then close them and write in their own words, so that the wording is not heavily influenced by the references.
Some students have raised a question here: the system their school uses is the HowNet (CNKI) academic misconduct detection system, not the Wanfang Data check that can be bought on Taobao for a few dollars.
In fact, the algorithms of the detection systems do not differ much; what differs is the databases. If you have not copied too much, there is no need to fear any system. If you did copy, revise the article against the detection report once you have it.
After copying, change the passages flagged as similar: keep the core meaning but express it in different words.
First, the principles of duplicate checking.
1. HowNet checks a dissertation as a whole document, and the format may affect the result. Submit the manuscript in its final format to minimize this effect; passages of a few dozen words may not be detected. For papers over 30,000 words this effect can be ignored.
The comparison databases include: the online publishing database of Chinese academic journals, the full-text databases of Chinese doctoral dissertations and of outstanding master's theses, the full-text database of important national conferences, the full-text database of important Chinese newspapers, the full-text database of Chinese patents, the personal comparison database, and other comparison databases. Some books are not in the HowNet library and cannot be detected.
2. After the paper is uploaded, the system automatically detects its chapter information. If the paper has an automatically generated table of contents, the system checks it chapter by chapter; otherwise it splits the paper into segments automatically.
3. Some students report that they explicitly quoted or copied paragraphs or sentences from other documents, yet these were not detected. This is normal. HowNet sets a sensitivity threshold for this detection system, which is 5%, measured per detection paragraph: plagiarism or quotation below 5% is not detected. This typically happens with clauses or small concepts inside large paragraphs. For example, if detection paragraph 1 has 10,000 words, fewer than 500 words taken from any single document will not be detected. This also suggests a revision method: never draw a whole paragraph from a single article; draw on as many documents as possible and take only a few words from each, so that it will not be found.
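The threshold arithmetic in point 3 can be sketched in Python. This is only an illustration of the 5% rule as described above, not HowNet's actual implementation; the function name `sources_reaching_threshold` and the per-source word counts are hypothetical.

```python
from typing import Dict


def sources_reaching_threshold(
    paragraph_words: int,
    overlap_by_source: Dict[str, int],
    threshold_percent: int = 5,  # the 5% figure quoted in point 3
) -> Dict[str, int]:
    """Return the sources whose overlap reaches the threshold for one detection paragraph."""
    if paragraph_words <= 0:
        raise ValueError("paragraph_words must be positive")
    # Integer comparison avoids floating-point rounding at the boundary.
    return {
        source: words
        for source, words in overlap_by_source.items()
        if words * 100 >= paragraph_words * threshold_percent
    }


# Hypothetical overlap counts for a 10,000-word detection paragraph.
overlaps = {"document_a": 499, "document_b": 500, "document_c": 1200}
print(sources_reaching_threshold(10_000, overlaps))
```

With these made-up numbers, only the sources contributing 500 words or more reach the 5% line, which matches the 10,000-word example in the text.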
4. How is plagiarism detected in a paper? HowNet marks text in red when 13 consecutive words are similar or copied, but the precondition in point 3 must be met: the total you quoted or copied from document A must reach 5% of the detection paragraph.
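The two conditions in point 4 (a run of 13 matching units, plus the 5% precondition) can be illustrated with a short Python sketch. It rests on several assumptions that are not confirmed by the source: the unit is taken to be a single character, matching is exact rather than "similar", and the whole submission is treated as one detection paragraph. The names `flagged_spans` and `marked_red` are hypothetical.

```python
from typing import List, Tuple


def flagged_spans(submission: str, source: str, min_run: int = 13) -> List[Tuple[int, int]]:
    """Return merged (start, end) spans of `submission` covered by runs of at
    least `min_run` characters that also occur verbatim in `source`."""
    # Every window of min_run characters that appears in the source.
    windows = {source[i:i + min_run] for i in range(len(source) - min_run + 1)}
    spans: List[Tuple[int, int]] = []
    for i in range(len(submission) - min_run + 1):
        if submission[i:i + min_run] in windows:
            start, end = i, i + min_run
            if spans and start <= spans[-1][1]:
                # Overlaps or touches the previous span: extend it.
                spans[-1] = (spans[-1][0], max(spans[-1][1], end))
            else:
                spans.append((start, end))
    return spans


def marked_red(submission: str, source: str, min_run: int = 13,
               threshold_percent: int = 5) -> List[Tuple[int, int]]:
    """Apply the precondition: report spans only if the matched total
    reaches threshold_percent of the submission length."""
    spans = flagged_spans(submission, source, min_run)
    matched = sum(end - start for start, end in spans)
    if matched * 100 < len(submission) * threshold_percent:
        return []
    return spans


source_text = "the quick brown fox jumps over the lazy dog"
print(marked_red("xx the quick brown fox yy", source_text))
```

A real system would also need to handle near matches and segmentation into detection paragraphs, which this sketch deliberately leaves out.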
Second, seven quick methods for passing the duplicate check
Method 1: Translate foreign literature
Consult foreign literature in your research field, especially from high-level journals such as Science, Nature, and Water Resources, then translate the theoretical explanations into Chinese and put them in your own paper.
Advantages: 1. Everyone's language habits differ, so translations are bound to differ too. Even if different people translate the same paragraph, the results will not register as duplicates. 2. Reading foreign literature improves your English and broadens your professional horizons.
Disadvantages: This is hard for students with weak English, especially weak professional English.
Method 2: Rewording
Rewrite the words in other people's papers: change the sentence structure, switch between active and passive voice, change keywords, or add and remove words. Of course, a classic sentence should be quoted as a quotation.
Advantages: 1. After the text is modified, according to HowNet's program and algorithm, nothing is marked red as long as no 13 consecutive words and keywords repeat. 2. You will know every word and sentence of the paper by heart and be completely at ease in the thesis defense.
Disadvantages: Word-by-word revision is time-consuming and laborious.
Method 3: Trim the beginning and end and change the word order in the middle
Replace the beginning and end of a passage from someone else's paper, keep the middle, and turn what remains into passive sentences. The sentence pattern and structure then change, and once you have fixed any language defects yourself, the duplication is avoided.
Advantages: Convenient and quick; large sections can be modified at once.
Disadvantages: If your writing skills are weak, this is hard work and can take half a day.
Method 4: Convert text to pictures
Turn the text from other people's papers into pictures and put them in your own paper. At present the HowNet duplicate checking system can only check text, not pictures or tables, so the check is avoided.
Advantages: More convenient and faster than changing sentence order.
Disadvantages: If overused, it is obvious that whole pages are full of pictures, and it reduces the word count of the paper.
Method 5: Insert a document
Insert some quoted text into the paper as an embedded Word document.
Advantages: Even better than Method 4, because the embedded document can be edited again later, whereas a picture is inconvenient to modify.
Disadvantages: None found yet.
Method 6: Insert spaces
Insert spaces between all the words in the article, then reduce the width of the spaces to the minimum. Because duplicate checking is based on words, the spaces break up the words and the text slips past the system.
Advantages: Based on the principle of the duplicate checking system, so highly reliable.
Disadvantages: The workload is huge. It can be done with macros, but you need to learn how to write macros.
Method 7: Write it yourself
Write your own paper, without copying and pasting original text, and cite quotations correctly.
Advantages: You basically never need to worry about failing the duplicate check, even if the system's threshold is lowered.
Disadvantages: If there is a disadvantage, it is that a few more brain cells may die by the time the thesis is finished. Ha ha.
Detailed description of the HowNet system's calculation standard:
1. After reading the introduction to this system, I have a question. The system is good at recognizing copied text, but what about other content, such as data and charts? Is it unable to detect those?
Among all kinds of academic misconduct, copying text is the most common and serious, and detection of it has reached a high level. Detection of plagiarism and tampering in charts, formulas, and data is under development and has made great progress. You are welcome to keep following the progress of this detection system and to offer further criticism and constructive suggestions.
2. In this system, anything below 39% is displayed in yellow. Does that mean it is within the tolerable limit? I recently read in the news that a Shanghai University teacher's National Social Science Fund project was cancelled because two of his published papers were found to be 25% and 30% plagiarized. Please specify where the warning line is.
The percentage only describes the proportion of overlapping words in the checked document; it does not by itself indicate plagiarism. It can only be said that the larger the percentage, the more overlapping words and the greater the likelihood of plagiarism. Whether it is plagiarism, and how severe, must be decided by experts after review.
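To make the point concrete that the reported figure is only a ratio, here is a minimal Python sketch. The function name and the word counts are hypothetical, and how the system actually counts overlapping words is not described in the source.

```python
def overlap_percentage(total_words: int, overlapping_words: int) -> float:
    """Share of the checked document's words that overlap other documents, in percent."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    if not 0 <= overlapping_words <= total_words:
        raise ValueError("overlapping_words must be between 0 and total_words")
    return 100.0 * overlapping_words / total_words


# Hypothetical example: 2,000 overlapping words in an 8,000-word paper.
print(overlap_percentage(8000, 2000))
```

The number says how much text overlaps, not why it overlaps; properly quoted material and copied material contribute to it in the same way, which is why the answer above leaves the judgment to expert review.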
3. How do you prevent the dissertation academic misconduct detection system from becoming a tool for personal revenge?
This is something we take seriously. At present the detection system is available only to institutional users, under a strict management process. Technically, we have also taken various measures to prevent malicious use as far as possible, including strict identity authentication at login.
4. The minimum detection unit is a sentence, so if one or two words in each sentence are changed, can it no longer be detected?
We handle this case as well, with a sentence similarity algorithm. Sentences do not have to be identical to be judged the same. There are sentence-level similarity algorithms for sentences and paragraph-level similarity algorithms for paragraphs, and these are the basis for calculating whether a document or paragraph is similar to other documents.
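The answer above does not say which similarity algorithms are used. As a rough illustration of the idea only, the following Python sketch uses the standard library's `difflib.SequenceMatcher` as a stand-in for the sentence-level measure and averages best matches for a paragraph-level score. The function names, the sentence splitter, and the averaging rule are all assumptions, not the system's actual method.

```python
import re
from difflib import SequenceMatcher
from typing import List


def split_sentences(text: str) -> List[str]:
    """Naive splitter on common Western and Chinese sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"[.!?。！？]+", text) if s.strip()]


def sentence_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; 1.0 means the two sentences are identical."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def paragraph_similarity(paragraph: str, other: str) -> float:
    """Average, over sentences in `paragraph`, of the best match found in `other`."""
    sentences = split_sentences(paragraph)
    candidates = split_sentences(other)
    if not sentences or not candidates:
        return 0.0
    best = [max(sentence_similarity(s, c) for c in candidates) for s in sentences]
    return sum(best) / len(best)


original = "the cat sat on the mat"
edited = "the cat sat on a mat"
print(sentence_similarity(original, edited))
```

Under a measure of this kind, changing one or two words still leaves a high score, which is the behavior the answer describes: sentences need not be identical to be judged the same.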
5. Suppose I took a passage from a book and cited the book, but an earlier article in the database took the same passage from the same book, so the system matches my text to that article. Is this academic plagiarism?
The detection system cannot conclude whether something is plagiarism; that is decided by manual review. In the situation you describe, experts will make the appropriate judgment. Our system only provides clues and evidence so that reviewers can quickly grasp the relevant information about the checked document.
6. How authoritative is the HowNet detection system?
The academic misconduct detection system does not draw conclusions. It does not characterize the checked document; it only shows the similarities between the checked document and other published documents and lists objective facts. Whether the checked document constitutes academic misconduct requires final examination and confirmation by experts.
Paper duplicate checking and revision methods:
1. Within a cited passage, do not use a period before the reference marker. If a period appears, the text before it is counted as copied (although I would consider it cited). So try to use semicolons until the cited passage ends. Some people put the reference marker after the period, which is wrong; it should go before the period.
2. You can convert the text into a table and hide the border of the table.
3. If you read a lot of foreign-language material and what you cite is your own translation of it, then personally I think endnotes are not needed and you can keep the source to yourself, because duplicate checking only matches characters and cannot match Chinese against English.
4. Duplicate checking is a matching process that works sentence by sentence. If a sentence is repeated, it is easily judged as a duplicate, so:
If it really is a classic sentence, mark it with a superscript endnote pointing to the references, or frame the quoted content with the original author's name and quotation marks. Anything in quotation marks is treated as a citation.
If it is an ordinary quotation, pad the original sentence by restoring the omitted subject, predicate, and so on. Every extra word is a win. You can also use the "horizontal knife" method: cut some sentence elements and replace them with pronouns. Or use the foreign-name method: if a foreign name appears in Chinese in the original, use the English name directly; if it appears in English, use the Chinese name. Find every such name and convert it.
Deliberately add parenthetical notes next to some English abbreviations. In short, every sentence can be changed; even adding or removing one word is a win.
Pay special attention to punctuation and change it: for example, turn a long complex sentence into two or more simple sentences. Apply this flexibly.
It is almost impossible to write a paper entirely from scratch, and citing a lot of other people's work shows that you have strong synthesis skills and have read widely. This is a process of learning and summarizing.
Above all, don't let your supervisor fault you on page layout; that is the least worthwhile way to lose. Supervisors hate irregular typesetting: they are responsible only for the content, but they cannot bear to see their student fail because of typesetting problems.
5. The following method was tried by a friend of mine and definitely works. Select the other people's text together with some of your own (block or rectangular selection), copy it, create an empty file on the desktop, paste the content into the file, save it, and close it. Then select the file's icon, copy it, and paste it directly at the desired position in your paper. It appears as a picture that cannot be edited. This operation actually inserts the file as an embedded object, which is why it is displayed as a picture.
To sum up the above:
Duplicate checking is a matching process that works sentence by sentence. If a sentence is repeated, it is easily judged as a duplicate, so:
1) If it really is a classic sentence, mark it with a superscript endnote pointing to the references.
2) If it is an ordinary quotation, pad the original sentence by restoring all the omitted subjects and predicates. Every extra word is a win.
3) You can also use the "horizontal knife" method: cut some sentence elements and replace them with pronouns.
4) Or use the foreign-name method: if a foreign name appears in Chinese in the original, use the English name directly; if it appears in English, use the Chinese name. Find every such name and convert it.
5) Deliberately add parenthetical notes next to some English abbreviations. In short, every sentence can be changed; even adding or removing one word is a win.
6) Within a cited passage, do not use a period before the reference marker. If a period appears, the text before it is counted as plagiarized (although I would consider it cited), so try to use semicolons until the cited passage ends. Some people put the reference marker after the period, which is wrong; it should go before the period.
7) Text can be converted into tables, which are basically never flagged as copied. When text and tables become graphics, they are clear at a glance and will never be flagged as duplicate content.
School requirements for the paper after duplicate checking and revision:
1. Thesis title: should be accurate, concise, eye-catching, and novel.
2. Table of contents: a brief list of the main sections of the paper. (Short papers do not need a table of contents.)
3. Abstract: a digest of the article's main content, which should be short and pithy. It may be as short as a few dozen words and should preferably not exceed 300 words.
4. Keywords or subject terms: Keywords are selected from the title, abstract, and body of the paper and are words that substantively express its central content. Computer systems use keywords to index the content of papers, so that information systems can collect them and readers can retrieve them. Generally 3 to 8 keywords are chosen per paper, placed on a new line at the bottom left of the abstract.
Subject terms are standardized words. To determine them, identify the paper's subject and convert it into standardized terms from the subject thesaurus according to the indexing and collocation rules.
5. Body of the paper:
(1) Introduction: also called the preface or foreword, it opens the paper. It should generally state the author's intention, explain the purpose and significance of the topic, and indicate the scope of the paper. It should be short, concise, and to the point.
(2) Main text: the main body of the paper, which should include the thesis, the supporting evidence, the argumentation process, and the conclusion. It consists of the following parts:
A. Raising the problem: the thesis;
B. Analyzing the problem: evidence and argumentation;
C. Solving the problem: argumentation methods and steps;
D. Conclusion.
6. References: the main documents listed at the end of the paper that were consulted or cited in writing it. References should be listed on a new page following GB 7714-87, "Description Rules of References at the End of Documents".
English: Title - Author - Publication information (edition, publisher, publication date); Author - Title - Publication information.
The requirements for the listed references are:
(1) The listed references should be official publications, so that readers can verify them.
(2) The listed references should give the serial number, the title of the work or article, the author, and the publication information.