Classification Boosting in Imbalanced Data

Authors

DOI:

https://doi.org/10.22452/mjs.sp2019no2.4

Keywords:

Boosting, G-mean, Imbalanced classification, SMOTE

Abstract

Most existing classification approaches assume that the underlying training data set is evenly distributed. In imbalanced classification, however, the training examples of the majority class can far outnumber those of the minority class. This is a problem because a classifier tends to predict in favor of the larger of the two classes, which leads to underestimation of the minority class and distorts the performance evaluation criteria. One popular method recently used to rectify this is SMOTE-Boosting, which combines boosting with the SMOTE algorithm at the data level. This paper therefore reviews the method, focusing on the two-class problem. Judged by the G-mean criterion, the method showed better performance by taking advantage of the boosting algorithm. However, while boosting affects the classification performance of the base classifier by focusing on all data classes, the SMOTE algorithm alters only the minority class.
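To illustrate why G-mean is used as the performance criterion in imbalanced problems, the following is a minimal sketch and is not taken from the paper. It assumes the commonly used two-class definition of G-mean as the square root of sensitivity times specificity; the abstract itself does not spell out the formula. The toy labels and the names `g_mean`, `accuracy`, `always_majority`, and `mixed_prediction` are hypothetical and chosen only for illustration.

```python
import math


def g_mean(y_true, y_pred, positive=1):
    """G-mean for a two-class problem, assuming the common definition
    sqrt(sensitivity * specificity). `positive` marks the minority class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # Guard against an empty class so the function never divides by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


def accuracy(y_true, y_pred):
    """Plain accuracy: the fraction of labels predicted correctly."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


# Hypothetical toy data: 95 majority examples (0) and 5 minority examples (1).
y_true = [0] * 95 + [1] * 5

# A classifier that always predicts the majority class.
always_majority = [0] * 100

# A classifier that finds 4 of the 5 minority examples at the cost of
# 5 false alarms on the majority class.
mixed_prediction = [0] * 90 + [1] * 5 + [1] * 4 + [0] * 1

print(accuracy(y_true, always_majority), g_mean(y_true, always_majority))
print(accuracy(y_true, mixed_prediction), g_mean(y_true, mixed_prediction))
```

In this toy case the always-majority classifier has the higher accuracy but a G-mean of zero, because it never identifies a minority example. This is the kind of underestimation of the minority class that the abstract describes.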

Author Biographies

  • Sinta Septi Pangastuti, Sepuluh Nopember Institute of Technology

    Statistics Department, Institut Teknologi Sepuluh Nopember, Jl. Arif Rahman Hakim, Surabaya 60111, Indonesia; and Statistics Department, Faculty of Mathematics and Natural Sciences, Padjadjaran University, Jl. Raya Bandung-Sumedang Km. 21, Jatinangor 45363, Indonesia

  • Nur Iriawan, Sepuluh Nopember Institute of Technology

    Statistics Department, Institut Teknologi Sepuluh Nopember, Jl. Arif Rahman Hakim, Surabaya 60111, Indonesia

  • Wahyuni Suryaningtyas, Sepuluh Nopember Institute of Technology

    Statistics Department, Institut Teknologi Sepuluh Nopember, Jl. Arif Rahman Hakim, Surabaya 60111, Indonesia; and Mathematics Education Program Study, Faculty of Teacher Training and Education, Muhammadiyah University of Surabaya, Jl. Sutorejo No. 59, Surabaya 60113, Indonesia

Published

30-09-2019

How to Cite

Classification Boosting in Imbalanced Data. (2019). Malaysian Journal of Science (MJS), 38(Sp2), 36-45. https://doi.org/10.22452/mjs.sp2019no2.4
