1.Daya duplicate checking method:
Daya is a detection system based on similarity analysis. Similar literature mainly includes periodicals, papers and other types. By comparing the inspection document with the document, we can get the inspection HTML report and PDF report. Daya detection system can automatically exclude references, quotations and other contents. Daya supports single test and batch test, and will be used in cooperation with universities.
Daya focuses on books and journal articles. Books and periodical papers are copied too much, and the duplicate checking rate is naturally high, because he belongs to the superstar group. If you copy a lot of master's and doctoral dissertations, the natural duplicate checking rate is very low. Wang Zhi covers millions of master's and doctoral dissertations at home and abroad, which are plagiarized naturally and have a high duplicate checking rate.
Elegance is to detect the similarity with books, and HowNet is to detect the similarity of journal papers. However, only Daya can detect books in the detection system.
2. The standard of 2.Daya's repeated inspection is:
Text content analysis: using thesaurus matching technology and improved hash algorithm, the document content is analyzed to determine the similarity between documents;
2. Analyze the text structure: analyze the sentence structure of documents by lexical analysis technology, and exclude documents with similar articles but different structures;
3. Text style analysis: using natural language processing technology to analyze the grammatical features of documents and exclude documents with similar content and structure but different styles;
4. Text application analysis: using AI technology, according to the usage scenarios of documents, determine the repetition rate of documents, and exclude documents with similar content, structure and style but different uses.