Publication Type : Conference Proceedings
Publisher : Elsevier
Source : Procedia Computer Science
Url : https://www.sciencedirect.com/science/article/pii/S1877050920311467
Campus : Amritapuri
School : School of Computing
Year : 2020
Abstract : Cancer data sets contains many details of patient information, out of which only a few attributes contribute in predicting the accurate stage of cancer. Certain attributes of the entire data set play a major role in deciding the type of cancer i.e. whether benign or malignant hence feature selection techniques are useful in such scenarios for retaining the relevant feature set. Moreover, in order to achieve our goal of predicting the accurate stage of cancer, we need an appropriate model which generally results in higher accuracy and ensemble model proves to be the best model for such scenarios. In this study, we are using the existing ensemble techniques along with a combination of supervised machine learning algorithms to develop a new model for breast cancer prediction. We are also using feature selection techniques to enhance the performance of the ensemble model. For this purpose, machine learning algorithms like Support Vector Machines, Naive Bayes, K-Nearest Neighbors, Logistics Regression and feature selection techniques like Variance threshold and f-test have been taken into consideration. To achieve higher accuracy for the ensemble model, bagging, boosting and stacking techniques are used.
Cite this Research Publication : Dhanya R, Paul Irene Rose, Akula Sai Sindhu, Sivakumar Madhumathi, Jyothisha J. Nair, F-test feature selection in Stacking ensemble model for breast cancer prediction, Procedia Computer Science, 2020.