A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator

Open Access

Abstract

Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.

Keywords

Funding Information

Universitas Pembangunan Nasional Veteran Yogyakarta

This publication has 16 references indexed in Scilit:

Biased support vector machine and weighted-smote in handling class imbalance problem
International Journal of Advances in Intelligent Informatics, 2018
An Improved kNN Based on Class Contribution and Feature Weighting
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2018
Resampling Imbalanced Class and the Effectiveness of Feature Selection Methods for Heart Failure Dataset
International Robotics & Automation Journal, 2018
Efficient kNN Classification With Different Numbers of Nearest Neighbors
IEEE Transactions on Neural Networks and Learning Systems, 2017
Predicting student personality based on a data-driven model from student behavior on LMS and social networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
An Improved KNN Algorithm Based on Kernel Methods and Attribute Reduction
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
ABO Blood Type and Personality Traits in Healthy Japanese Subjects
PLOS ONE, 2015
Simulation of pair programming using multi-agent and MBTI personality model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
Comparative Study of Recommendation Algorithms and Systems using WEKA
International Journal of Computer Applications, 2015
Efficient resampling methods for training support vector machines with imbalanced datasets
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2010

Cited by 2 articles