Back close

F-test feature selection in Stacking ensemble model for breast cancer prediction

Publication Type : Conference Proceedings

Publisher : Elsevier

Source : Procedia Computer Science

Url : https://www.sciencedirect.com/science/article/pii/S1877050920311467

Campus : Amritapuri

School : School of Computing

Year : 2020

Abstract : Cancer data sets contains many details of patient information, out of which only a few attributes contribute in predicting the accurate stage of cancer. Certain attributes of the entire data set play a major role in deciding the type of cancer i.e. whether benign or malignant hence feature selection techniques are useful in such scenarios for retaining the relevant feature set. Moreover, in order to achieve our goal of predicting the accurate stage of cancer, we need an appropriate model which generally results in higher accuracy and ensemble model proves to be the best model for such scenarios. In this study, we are using the existing ensemble techniques along with a combination of supervised machine learning algorithms to develop a new model for breast cancer prediction. We are also using feature selection techniques to enhance the performance of the ensemble model. For this purpose, machine learning algorithms like Support Vector Machines, Naive Bayes, K-Nearest Neighbors, Logistics Regression and feature selection techniques like Variance threshold and f-test have been taken into consideration. To achieve higher accuracy for the ensemble model, bagging, boosting and stacking techniques are used.

Cite this Research Publication : Dhanya R, Paul Irene Rose, Akula Sai Sindhu, Sivakumar Madhumathi, Jyothisha J. Nair, F-test feature selection in Stacking ensemble model for breast cancer prediction, Procedia Computer Science, 2020.

Admissions Apply Now