Phần mềm phát hiện sao chép luận văn Đại học Cửu Long
Abstract
This paper proposes a plagiarism detection software, which based on the information retrieval techniques. The thesis plagiarism is commonly verified by two basic resources: local and online databases. The copy will be checked at sentence level because this is the most common copied form. To detect the plagiarism online, the system will initially run a query on the searching engine Google to find the documents where plagiarism may derive from; and then the Jaccard measure is used to compute the similarity between two sentences. In terms of local resources, the MongoDB database is used to store the inverse index and the Cosine measure is simultaneously used to compute the similarity between two senfences. The study results showed that the proposed software is effectively applied in reality.