Back close

Plagiarism detection in text documents using sentence bounded stop word n-grams

Publication Type : Journal Article

Publisher : Journal of Engineering Science and Technology, Taylor's University.

Source : Journal of Engineering Science and Technology, Taylor's University, Volume 11, Number 10, p.1403-1420 (2016)

Url : https://www.scopus.com/inward/record.uri?eid=2-s2.0-84992015478&partnerID=40&md5=b52344b110a2455bc9f3bd9dae09a12b

Campus : Coimbatore

School : School of Business

Year : 2016

Abstract : With the evolution of technologies like internet search engines and improved text editors, plagiarism has become a critical issue. Many works are already available in verbatim plagiarism detection which is a type of simple copy and paste plagiarism but when it comes to intelligent plagiarism the scenario becomes more complex. Intelligent plagiarism includes plagiarism through idea adoption, translation and text manipulations which is more challenging to deal with. The paper makes an attempt to detect intelligent plagiarism using the structural information within the document. This is done by the extraction of stop words, in contrast to the other methods that usually rely upon content words. The proposed method enhances this existing idea by including the rough sentence boundaries along with stop word profiles. Further this method is extended using the part of speech tags and finally the system is evaluated using sample documents from PAN- 2010 data set. The results are compared with the baseline approach and performance is evaluated based on standard PAN measures. © School of Engineering, Taylor’s University.

Cite this Research Publication : D. Gupta, Vani, K., and Leema, L. M., “Plagiarism detection in text documents using sentence bounded stop word n-grams”, Journal of Engineering Science and Technology, vol. 11, pp. 1403-1420, 2016.

Admissions Apply Now