Synthetic population generation without a sample

Johan Barthelemy, Ph Toint

Research output: Contribution to journal, Article, peer-reviewed



The advent of microsimulation in the transportation sector has created the need for extensive disaggregated data concerning the population whose behavior is modeled. Because of the cost of collecting this data and the existing privacy regulations, this need is often met by the creation of a synthetic population on the basis of aggregate data. Although several techniques for generating such a population are known, they suffer from a number of limitations. The first is the need for a sample of the population for which fully disaggregated data must be collected, although such samples may not exist or may not be financially feasible. The second limiting assumption is that the aggregate data used must be consistent, a situation that is most unusual because these data often come from different sources and are collected, possibly at different moments, using different protocols. The paper presents a new synthetic population generator in the class of the Synthetic Reconstruction methods, whose objective is to obviate these limitations. It proceeds in three main successive steps: generation of individuals, generation of joint distributions for household types, and generation of households by gathering individuals. The main idea in these generation steps is to use data at the most disaggregated level possible to define joint distributions, from which individuals and households are randomly drawn. The method also makes explicit use of both continuous and discrete optimization and uses the χ² metric to estimate distances between estimated and generated distributions. The new generator is applied to construct a synthetic population of approximately 10,000,000 individuals and 4,350,000 households located in the 589 municipalities of Belgium. The statistical quality of the generated population is discussed using criteria extracted from the literature, and it is shown that the new population generator produces excellent results. © 2013 INFORMS.
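As a rough illustration of two ideas mentioned in the abstract, the sketch below draws individuals at random from a joint distribution and then measures the distance between the target and generated tables with a χ² statistic. It is not the authors' generator: the attribute names (age class, gender), the counts in `target`, the helper names `draw_individuals` and `chi2_distance`, and the Pearson form of the χ² statistic are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def chi2_distance(expected, observed):
    """Pearson-style chi-square distance between two count tables of equal shape.

    Assumption: the abstract does not give the exact chi-square formula used,
    so this is one common form, sum((O - E)^2 / E) over cells with E > 0.
    """
    expected = np.asarray(expected, dtype=float)
    observed = np.asarray(observed, dtype=float)
    mask = expected > 0  # skip cells whose expected count is zero
    return float(np.sum((observed[mask] - expected[mask]) ** 2 / expected[mask]))


def draw_individuals(joint, n, rng):
    """Draw n individuals from a joint distribution given as a table of counts.

    Returns a table of the same shape holding the number of drawn individuals
    per cell.
    """
    joint = np.asarray(joint, dtype=float)
    probs = (joint / joint.sum()).ravel()   # normalise counts to probabilities
    counts = rng.multinomial(n, probs)      # random draw of n individuals
    return counts.reshape(joint.shape)


# Hypothetical target table: rows are age classes, columns are genders.
target = np.array([[120, 130],
                   [300, 310],
                   [90, 50]])

rng = np.random.default_rng(0)
generated = draw_individuals(target, int(target.sum()), rng)
distance = chi2_distance(target, generated)
print(generated)
print(f"chi-square distance: {distance:.3f}")
```

A smaller distance indicates that the generated table is closer to the target; a table compared with itself gives a distance of exactly zero.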

Original language: English
Pages (from-to): 266-279
Number of pages: 14
Journal: Transportation Science
Issue number: 2
Publication status: Published - 1 May 2013

