A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
Open Access
- 11 September 2020
- journal article
- Published by Institute of Research and Community Services Diponegoro University (LPPM UNDIP) in Jurnal Teknologi dan Sistem Komputer
- Vol. 8 (4), 276-283
- https://doi.org/10.14710/jtsiskom.2020.13625
Abstract
Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.Keywords
Funding Information
- Universitas Pembangunan Nasional Veteran Yogyakarta
This publication has 16 references indexed in Scilit:
- Biased support vector machine and weighted-smote in handling class imbalance problemInternational Journal of Advances in Intelligent Informatics, 2018
- An Improved kNN Based on Class Contribution and Feature WeightingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2018
- Resampling Imbalanced Class and the Effectiveness of Feature Selection Methods for Heart Failure DatasetInternational Robotics & Automation Journal, 2018
- Efficient kNN Classification With Different Numbers of Nearest NeighborsIEEE Transactions on Neural Networks and Learning Systems, 2017
- Predicting student personality based on a data-driven model from student behavior on LMS and social networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- An Improved KNN Algorithm Based on Kernel Methods and Attribute ReductionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- ABO Blood Type and Personality Traits in Healthy Japanese SubjectsPLOS ONE, 2015
- Simulation of pair programming using multi-agent and MBTI personality modelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- Comparative Study of Recommendation Algorithms and Systems using WEKAInternational Journal of Computer Applications, 2015
- Efficient resampling methods for training support vector machines with imbalanced datasetsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010