Back close

A plagiarized source retrieval system developed using efficient download filtering and POS tagged query formulation with effective paragraph based chunking

Publication Type : Journal Article

Publisher : International Journal of Artificial Intelligence, CESER Publications

Source : International Journal of Artificial Intelligence, CESER Publications, Volume 14, Number 1, p.145-160 (2016)

Url : http://www.scopus.com/inward/record.url?eid=2-s2.0-84956916944&partnerID=40&md5=537c96a7cc8aa3015cf9069a9b4b1e16

Campus : Bengaluru

School : Department of Computer Science and Engineering, School of Engineering

Center : Center for Industrial Research and Innovation (ACIRI), E-Learning

Department : Computer Science, Mathematics

Year : 2016

Abstract : Source Retrieval is an important task of External Plagiarism Detection system which involves in identifying a set of candidate source documents for a given suspicious document. Not to lose any actual source document while reducing the size of the candidate source document set is crucial. This paper describes the approach of Source Retrieval task of External Plagiarism Detection System. The approach includes chunking of documents based on paragraphs along with Part-of- Speech tagging and an efficient download filtering method. The proposed system is evaluated against PAN 2011-12, PAN 2012-13 PAN 2014-15 Test Data Set and results are analysed and compared using standard PAN measures: Recall, Precision, F Measure, average number of queries and downloads. The proposed approach exhibited improved efficiency in PAN 2015 conducted by PAN CLEF Evaluation lab1, by acquiring highest values for F Measure and Precision along with least Downloads. The results are further improved by incorporating efficient query and download filtering mechanisms over the proposed system. The effect of the enhanced proposed system is also discussed and analysed in this paper. © 2016 CESER PUBLICATIONS.

Cite this Research Publication : NaRiya Ravi and Dr. Deepa Gupta, “A plagiarized source retrieval system developed using efficient download filtering and POS tagged query formulation with effective paragraph based chunking”, International Journal of Artificial Intelligence, vol. 14, pp. 145-160, 2016.

Admissions Apply Now