Development and Investigation of Cost-Sensitive Pruned Decision Tree Model for Improved Schizophrenia Diagnosis

  • Ephraim Nwoye Professor, Department of Biomedical Engineering, University of Lagos, Lagos, Nigeria
  • Wai Lok Woo Department of Computer and Information Sciences, Northumbria University, England, United Kingdom
  • Obinna Fidelis Department of Biomedical Technology, Federal University of Technology, Akure, Ondo State, Nigeria
  • Charles Umeh Department of Psychiatry, Lagos University Teaching Hospital, Idi-Araba, Lagos State, Nigeria
  • Bin Gao School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, China
Keywords: Schizophrenia, Decision Tree, Machine Learning, Naive-Bayes model

Abstract

Schizophrenia is often characterized by delusions, hallucinations, and other cognitive difficulties, affects approximately seventy million adults globally. This study presents a cost-sensitive pruned Decision Tree J48 model for fast and accurate diagnosis of Schizophrenia. The model implements supervised learning procedures with 10-fold cross-validation resampling method and utilizes unstructured filter to replace missing values in the data with the modal values of corresponding features. Features are selected using Pearson’s correlation on hot-coded data to detect redundancy in data. Cost matrix is designed to minimize the tendencies of the J48 algorithm to predict false negative outcomes. This consequently reduces the error of the model in diagnosing a Schizophrenia candidate as free from the disease. The model is found to significantly diagnose Schizophrenia with 78% accuracy, 89.7% sensitivity, 57.4% specificity and Area under the Receiver Operator Characteristic (ROC) curve of 0.895. The ROC curve is also seen to distinguish Schizophrenia from other conditions with similar symptoms. These results show the potential of machine-learning models for quick, effective diagnosis of schizophrenia.

Published
2020-10-30