Constraint Enforcement on Decision Trees: a Survey

Geraldin Nanfack; Paul Temple; Benoît Frénay

doi:10.1145/3506734

Constraint Enforcement on Decision Trees: a Survey

Geraldin Nanfack, Paul Temple, Benoît Frénay

Research output: Contribution to journal › Article › peer-review

Abstract

Decision trees have the particularity of being machine learning models that are visually easy to interpret and understand. Therefore, they are primarily suited for sensitive domains like medical diagnosis, where decisions need to be explainable. However, if used on complex problems, then decision trees can become large, making them hard to grasp. In addition to this aspect, when learning decision trees, it may be necessary to consider a broader class of constraints, such as the fact that two variables should not be used in a single branch of the tree. This motivates the need to enforce constraints in learning algorithms of decision trees. We propose a survey of works that attempted to solve the problem of learning decision trees under constraints. Our contributions are fourfold. First, to the best of our knowledge, this is the first survey that deals with constraints on decision trees. Second, we define a flexible taxonomy of constraints applied to decision trees and methods for their treatment in the literature. Third, we benchmark state-of-The art depth-constrained decision tree learners with respect to predictive accuracy and computational time. Fourth, we discuss potential future research directions that would be of interest for researchers who wish to conduct research in this field.

Original language	English
Article number	201
Number of pages	34
Journal	ACM Computing Surveys
Volume	54
Issue number	10
DOIs	https://doi.org/10.1145/3506734
Publication status	Published - 14 Sept 2022

Keywords

Decision trees
constraints
domain knowledge
explainability
fairness
interpretability
privacy

Access to Document

10.1145/3506734

https://dl.acm.org/doi/pdf/10.1145/3506734

Cite this

@article{6ac71ada287e425fa260ba8d4e80edc5,

title = "Constraint Enforcement on Decision Trees: a Survey",

abstract = "Decision trees have the particularity of being machine learning models that are visually easy to interpret and understand. Therefore, they are primarily suited for sensitive domains like medical diagnosis, where decisions need to be explainable. However, if used on complex problems, then decision trees can become large, making them hard to grasp. In addition to this aspect, when learning decision trees, it may be necessary to consider a broader class of constraints, such as the fact that two variables should not be used in a single branch of the tree. This motivates the need to enforce constraints in learning algorithms of decision trees. We propose a survey of works that attempted to solve the problem of learning decision trees under constraints. Our contributions are fourfold. First, to the best of our knowledge, this is the first survey that deals with constraints on decision trees. Second, we define a flexible taxonomy of constraints applied to decision trees and methods for their treatment in the literature. Third, we benchmark state-of-The art depth-constrained decision tree learners with respect to predictive accuracy and computational time. Fourth, we discuss potential future research directions that would be of interest for researchers who wish to conduct research in this field.",

keywords = "Decision trees, constraints, domain knowledge, explainability, fairness, interpretability, privacy",

author = "Geraldin Nanfack and Paul Temple and Beno{\^i}t Fr{\'e}nay",

note = "Funding Information: This work has been funded by the EOS-VeriLearn, project number 30992574 of the Fonds de la Recherche Scientifique (F.R.S-FNRS) in Belgium Publisher Copyright: {\textcopyright} 2022 Association for Computing Machinery.",

year = "2022",

month = sep,

day = "14",

doi = "10.1145/3506734",

language = "English",

volume = "54",

journal = "ACM Computing Surveys",

issn = "0360-0300",

publisher = "ACM Press",

number = "10",

}

TY - JOUR

T1 - Constraint Enforcement on Decision Trees

T2 - a Survey

AU - Nanfack, Geraldin

AU - Temple, Paul

AU - Frénay, Benoît

N1 - Funding Information: This work has been funded by the EOS-VeriLearn, project number 30992574 of the Fonds de la Recherche Scientifique (F.R.S-FNRS) in Belgium Publisher Copyright: © 2022 Association for Computing Machinery.

PY - 2022/9/14

Y1 - 2022/9/14

N2 - Decision trees have the particularity of being machine learning models that are visually easy to interpret and understand. Therefore, they are primarily suited for sensitive domains like medical diagnosis, where decisions need to be explainable. However, if used on complex problems, then decision trees can become large, making them hard to grasp. In addition to this aspect, when learning decision trees, it may be necessary to consider a broader class of constraints, such as the fact that two variables should not be used in a single branch of the tree. This motivates the need to enforce constraints in learning algorithms of decision trees. We propose a survey of works that attempted to solve the problem of learning decision trees under constraints. Our contributions are fourfold. First, to the best of our knowledge, this is the first survey that deals with constraints on decision trees. Second, we define a flexible taxonomy of constraints applied to decision trees and methods for their treatment in the literature. Third, we benchmark state-of-The art depth-constrained decision tree learners with respect to predictive accuracy and computational time. Fourth, we discuss potential future research directions that would be of interest for researchers who wish to conduct research in this field.

AB - Decision trees have the particularity of being machine learning models that are visually easy to interpret and understand. Therefore, they are primarily suited for sensitive domains like medical diagnosis, where decisions need to be explainable. However, if used on complex problems, then decision trees can become large, making them hard to grasp. In addition to this aspect, when learning decision trees, it may be necessary to consider a broader class of constraints, such as the fact that two variables should not be used in a single branch of the tree. This motivates the need to enforce constraints in learning algorithms of decision trees. We propose a survey of works that attempted to solve the problem of learning decision trees under constraints. Our contributions are fourfold. First, to the best of our knowledge, this is the first survey that deals with constraints on decision trees. Second, we define a flexible taxonomy of constraints applied to decision trees and methods for their treatment in the literature. Third, we benchmark state-of-The art depth-constrained decision tree learners with respect to predictive accuracy and computational time. Fourth, we discuss potential future research directions that would be of interest for researchers who wish to conduct research in this field.

KW - Decision trees

KW - constraints

KW - domain knowledge

KW - explainability

KW - fairness

KW - interpretability

KW - privacy

UR - http://www.scopus.com/inward/record.url?scp=85142478085&partnerID=8YFLogxK

U2 - 10.1145/3506734

DO - 10.1145/3506734

M3 - Article

SN - 0360-0300

VL - 54

JO - ACM Computing Surveys

JF - ACM Computing Surveys

IS - 10

M1 - 201

ER -

Constraint Enforcement on Decision Trees: a Survey

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this