Detecting phishing E-mail using machine learning techniques CEN-SecureNLP

Publisher : CEUR Workshop Proceedings

Campus : Coimbatore

School : School of Engineering

Center : Computational Engineering and Networking

Year : 2018

Abstract :

The number of unsolicited (phishing) emails is increasing rapidly day by day, which calls for a reliable framework to filter them out. In the proposed work, we develop a supervised classifier for distinguishing phishing emails from legitimate ones. Term frequency-inverse document frequency (tf-idf) matrices and Doc2Vec representations are built for legitimate and phishing emails. These representations are passed to various traditional machine learning classifiers for classification. The classifiers using the Doc2Vec representation performed better than those using the tf-idf representation. We therefore conclude that the Doc2Vec representation is more appropriate for detecting and classifying phishing and legitimate emails. Copyright © by the paper's authors.
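
As a rough illustration of the tf-idf half of the pipeline described in the abstract, the sketch below builds tf-idf vectors for a handful of made-up emails and labels a new message by comparing it with each class. It is not the authors' implementation: the toy emails, the whitespace tokenizer, the smoothed idf formula, the centroid-based classifier, and all function names are assumptions chosen to keep the example dependency-free. The Doc2Vec representation is not shown, since it would need a trained embedding model from an external library.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, with no stemming or stop words.
    return text.lower().split()


def fit_idf(docs):
    # Smoothed inverse document frequency computed over the training emails.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf_vector(doc, idf):
    # Sparse tf-idf vector scaled to unit length; unseen terms are ignored.
    counts = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else vec


def dot(a, b):
    # Dot product of two sparse vectors stored as dicts.
    return sum(v * b.get(t, 0.0) for t, v in a.items())


def centroid(vectors):
    # Mean of a list of sparse vectors.
    total = Counter()
    for vec in vectors:
        for t, w in vec.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}


def train(emails, labels):
    # Fit idf weights, then compute one mean tf-idf vector per class.
    idf = fit_idf(emails)
    by_label = {}
    for email, label in zip(emails, labels):
        by_label.setdefault(label, []).append(tfidf_vector(email, idf))
    return idf, {label: centroid(vecs) for label, vecs in by_label.items()}


def predict(email, idf, centroids):
    # Pick the class whose training emails are most similar on average.
    vec = tfidf_vector(email, idf)
    return max(centroids, key=lambda label: dot(vec, centroids[label]))


# Hypothetical toy data, for illustration only.
emails = [
    "verify your account password now",
    "urgent click link to confirm your bank password",
    "meeting agenda for project review tomorrow",
    "please find attached the project report",
]
labels = ["phishing", "phishing", "legitimate", "legitimate"]

idf, centroids = train(emails, labels)
print(predict("confirm your password urgent", idf, centroids))
print(predict("project report for review", idf, centroids))
```

Because every tf-idf vector is scaled to unit length, the score against a class centroid equals the average cosine similarity to that class's training emails. In practice a library vectorizer and one of the traditional classifiers mentioned in the abstract would replace the centroid rule, and any comparison with Doc2Vec would need a real labelled corpus.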

