A Modified Decision Tree and its Application to Assess Variable Importance

Francis Fuller Bbosa, Ronald Wesonga, Peter Nabende, Josephine Nabukenya

نتاج البحث: Conference contribution

ملخص

This paper presents an approach to further improve the data reduction abilities of the traditional C4.5 algorithm by integrating the information gain ratio and forward stepwise regression algorithms. Motivated by the fact that the traditional C4.5 algorithm utilizes a full set of antecedent attributes without taking into consideration irrelevant attributes which is a precursor to spurious predictive model estimates. This study aims to overcome this drawback by developing and evaluating the performance of an importance-based attribute selection algorithm called the C4.5-Forward Stepwise (C4.5-FS) for improving the data reduction abilities of the traditional C4.5 classifiers. Five datasets with dimensionality ranging from 6 to 10,000 attributes were employed to evaluate the model performance the goodness of fit for the modified and traditional C4.5 classifier was done using k-fold cross-validation based on a confusion matrix. Experimental results revealed that the C4.5-FS algorithm trained on fewer antecedent attributes improved the data reduction capabilities of the traditional C4.5 algorithm trained on a full set of antecedent attributes by achieving higher accuracy.

اللغة الأصليةEnglish
عنوان منشور المضيف2021 4th International Conference on Data Science and Information Technology, DSIT 2021
ناشرAssociation for Computing Machinery
الصفحات468-475
عدد الصفحات8
رقم المعيار الدولي للكتب (الإلكتروني)9781450390248
المعرِّفات الرقمية للأشياء
حالة النشرPublished - يوليو 23 2021
الحدث4th International Conference on Data Science and Information Technology, DSIT 2021 - Shanghai, China
المدة: يوليو ٢٣ ٢٠٢١يوليو ٢٥ ٢٠٢١

سلسلة المنشورات

الاسمACM International Conference Proceeding Series

Conference

Conference4th International Conference on Data Science and Information Technology, DSIT 2021
الدولة/الإقليمChina
المدينةShanghai
المدة٧/٢٣/٢١٧/٢٥/٢١

ASJC Scopus subject areas

  • ???subjectarea.asjc.1700.1712???
  • ???subjectarea.asjc.1700.1709???
  • ???subjectarea.asjc.1700.1707???
  • ???subjectarea.asjc.1700.1705???

بصمة

أدرس بدقة موضوعات البحث “A Modified Decision Tree and its Application to Assess Variable Importance'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا