Back close

Investigating the impact of combined similarity metrics and POS tagging in extrinsic text plagiarism detection system

Publication Type : Conference Paper

Publisher : 2015 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2015

Source : 2015 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2015, Institute of Electrical and Electronics Engineers Inc., p.1578-1584 (2015)

Url : http://www.scopus.com/inward/record.url?eid=2-s2.0-84946223892&partnerID=40&md5=25722efefe2dfed7af9bdbb1773fa235

ISBN : 9781479987917

Keywords : Combined metrics, Computational linguistics, Extrinsic Plagiarism, Information science, Intellectual property, PoS tagging, Semantics, Single Metrics, Statistical tests, Syntactics, Vector space models, Vector spaces

Campus : Bengaluru

School : Department of Computer Science and Engineering, School of Engineering

Department : Computer Science, Mathematics

Year : 2015

Abstract : Plagiarism is an illicit act which has become a prime concern mainly in educational and research domains. This deceitful act is usually referred as an intellectual theft which has swiftly increased with the rapid technological developments and information accessibility. Thus the need for a system/ mechanism for efficient plagiarism detection is at its urgency. In this paper, an investigation of different combined similarity metrics for extrinsic plagiarism detection is done and it focuses on unfolding the importance of combined similarity metrics over the commonly used single metric usage in plagiarism detection task. Further the impact of utilizing part of speech tagging (POS) in the plagiarism detection model is analyzed. Different combinations of the four single metrics, Cosine similarity, Dice coefficient, Match coefficient and Fuzzy-Semantic measure is used with and without POS tag information. These systems are evaluated using PAN1 -2014 training and test data set and results are analyzed and compared using standard PAN measures, viz, recall, precision, granularity and plagdet-score. © 2015 IEEE.

Cite this Research Publication : Ka Vani, Dr. Deepa Gupta, Krishnaswamy D, Thampi S.M, Callegari C, Alcaraz Calero J.M, Takagi H, Mauri J.L, Meghanathan, N., Rodrigues, J., Bojkovic Z.S, ,, Wozniak, M., Sahni, S., Vinod M., Prasad, N. R., Que X., and Au E., “Investigating the impact of combined similarity metrics and POS tagging in extrinsic text plagiarism detection system”, in 2015 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2015, 2015, pp. 1578-1584.

Admissions Apply Now