Robustness of artificial neural network and discrete choice modelling in presence of unobserved variables

Morgane Dumont; Johan Barthelemy; Timoteo Carletti

Research output: Contribution in Book/Catalog/Report/Conference proceeding › Conference contribution

Abstract

Models are used to gain a better understanding of complex systems such as the evolution of a population, transportation demand, brain behaviour, election outcomes, or the propagation of a disease. System models should be precise and parsimonious. However, the observed variables cannot capture the total variation of the system precisely, because unobserved variables may also influence the system output. The unexplained variation caused by unobserved variables is therefore treated as noise in the model. Different models handle that noise in different ways. For instance, a linear regression assumes that the noise follows a normal distribution and explicitly incorporates it into the model formulation. Other models, such as a deterministic neural network, do not explicitly incorporate that noise. Several models can thus be applied to the same problem, and selecting the best one can be challenging. This research aims to highlight the effect of unobserved variables on the results of two types of simple yet widely used models: feedforward neural networks (FFNN) and logit discrete choice models (LDCM). The first application consists in modelling divorces in an agent-based microsimulation, the agents being the individuals of a given population. For each couple in the model, divorce is predicted from the characteristics of the couple (e.g. length of the marriage, age of the individuals). In this application, it is shown that the LDCM outperforms the neural network owing to the presence of possibly many unobserved variables. The second example is a model defined to predict the level of interaction between groundwater and quarry extensions. In this application, the value of every relevant variable is assumed to be known, i.e. the noise from unobserved variables is minimal. In this case, it is shown that both approaches perform well, but the FFNN performs slightly better than the LDCM.
We then investigate how model performance evolves when the noise increases, by removing variables from the model specifications. Finally, these two applications allow us to draw conclusions on the robustness of discrete choice models and artificial neural networks in the presence of unobserved variables.
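The variable-removal experiment described in the abstract can be illustrated with a minimal sketch. This is not the authors' code or data: the synthetic data generator, the coefficient values in `true_beta`, and every function name below are assumptions made for illustration. The sketch covers only a binary logit model (not the FFNN) and reports in-sample log-loss as variables are progressively hidden from the model.

```python
import math
import random


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def simulate(n, seed=0):
    """Synthetic binary outcome driven by three variables (all hypothetical)."""
    rng = random.Random(seed)
    true_beta = [1.5, -2.0, 1.0]  # assumed effects, chosen for illustration only
    X, y = [], []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0) for _ in true_beta]
        p = sigmoid(sum(b * v for b, v in zip(true_beta, x)))
        X.append(x)
        y.append(1 if rng.random() < p else 0)
    return X, y


def fit_logit(X, y, steps=200, lr=0.5):
    """Fit a binary logit (intercept + slopes) by batch gradient ascent."""
    n, k = len(X), len(X[0])
    w = [0.0] * (k + 1)  # w[0] is the intercept
    for _ in range(steps):
        grad = [0.0] * (k + 1)
        for xi, yi in zip(X, y):
            err = yi - sigmoid(w[0] + sum(wj * v for wj, v in zip(w[1:], xi)))
            grad[0] += err
            for j, v in enumerate(xi):
                grad[j + 1] += err * v
        w = [wj + lr * g / n for wj, g in zip(w, grad)]
    return w


def log_loss(w, X, y):
    """Mean negative log-likelihood; lower is better."""
    total = 0.0
    for xi, yi in zip(X, y):
        p = sigmoid(w[0] + sum(wj * v for wj, v in zip(w[1:], xi)))
        p = min(max(p, 1e-12), 1.0 - 1e-12)
        total -= math.log(p) if yi == 1 else math.log(1.0 - p)
    return total / len(y)


def drop_columns(X, keep):
    # Hiding a column turns that variable into an unobserved one.
    return [[row[j] for j in keep] for row in X]


def run_experiment(n=1000, seed=0):
    """Return {number of observed variables: in-sample log-loss}."""
    X, y = simulate(n, seed)
    results = {}
    for keep in ([0, 1, 2], [0, 1], [0]):
        Xk = drop_columns(X, keep)
        results[len(keep)] = log_loss(fit_logit(Xk, y), Xk, y)
    return results


if __name__ == "__main__":
    for k, loss in sorted(run_experiment().items(), reverse=True):
        print(f"{k} observed variable(s): log-loss = {loss:.3f}")
```

Under these assumptions the log-loss rises as variables are removed, which mirrors the sense in which unobserved variables act as noise. Comparing against a neural network, as the paper does, would require fitting a second model on the same reduced variable sets.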

Original language	English
Title of host publication	Proceedings - 22nd International Congress on Modelling and Simulation, MODSIM 2017
Subtitle of host publication	Modelling and Simulation Society of Australia and New Zealand
Editors	Geoff Syme, Darla Hatton MacDonald, Beth Fulton, Julia Piantadosi
Pages	480-486
Number of pages	7
ISBN (Electronic)	978-0-9872143-7-9
Publication status	Published - 1 Jan 2017

Publication series

Name	Proceedings - 22nd International Congress on Modelling and Simulation, MODSIM 2017

Keywords

Discrete choice modelling
Neural network
Unobserved variables

Access to Document

2017_DumontBarthelemyCarletti_article

https://www.mssanz.org.au/modsim2017/C6/dumont.pdf

Cite this

Dumont, M., Barthelemy, J., & Carletti, T. (2017). Robustness of artificial neural network and discrete choice modelling in presence of unobserved variables. In G. Syme, D. H. MacDonald, B. Fulton, & J. Piantadosi (Eds.), Proceedings - 22nd International Congress on Modelling and Simulation, MODSIM 2017: Modelling and Simulation Society of Australia and New Zealand (pp. 480-486). (Proceedings - 22nd International Congress on Modelling and Simulation, MODSIM 2017). https://www.mssanz.org.au/modsim2017/C6/dumont.pdf
