Skip to main content

ORIGINAL RESEARCH article

Front. Appl. Math. Stat.
Sec. Statistics and Probability
Volume 9 - 2023 | doi: 10.3389/fams.2023.1150735

An implementation of Hurdle models for spatial count data. Study case: Civil war as a risk factor for development of childhood Leukemia in Colombia

  • 1Facultad de Medicina, Universidad Nacional de Colombia, Colombia
  • 2Departamento de estadística, Universidad Nacional de Colombia, Colombia
  • 3Instituto Técnico Profesional · Universidad Nacional Abierta y a Distancia - UNAD Colombia, Colombia

The final, formatted version of the article will be published soon.

Receive an email when it is updated
You just subscribed to receive the final version of the article

We propose a novel, efficient and powerful methodology to deal with overdispersion, excess zeros, heterogeneity and spatial correlation. It is based on the combination of Hurdle models and Spatial filtering Moran eigenvectors. Hurdle models are the best option to manage the presence of overdispersion and excess of zeros, separating the model into two parts: the first one models the probability of the zero value, and the second one models the probability of the non-zero values. Finally, gathering the spatial information in new covariates through spatial filtering Moran vector method, involves spatial correlation and spatial heterogeneity to improve the model fitting and explain spatial effects of variables that were not possible to measure. Thus, our proposal adapts usual regression models for count data so that it is possible to deal with phenomena where the usual theoretical assumptions, such as constant variance, independence and unique distribution are not fulfilled. In addition, this research shows how a prolonged armed-conflict can impact the health of children. The data includes children exposed to armed-conflict in Colombia, a country enduring a non-international armed-conflict lasting over 60 years. The findings indicate that children exposed to high levels of violence, as measured by the armed-conflict index, demonstrate a significant association with the incidence and mortality rate of LAP in children. This fact is illustrated here using one of the most catastrophic conditions in childhood, as is Pediatric Acute Leukemia (LAP). The association between armed conflict and LAP has its conceptual basis in the epidemiology literature, given that, the incidence and mortality rates of neoplastic diseases increase with exposure to toxic and chronic stress during gestation and childhood. Our methodology provides a valuable framework for complex data analysis and contributes to understanding the health implications in conflict-affected regions.

Keywords: spatial correlation, Moran eigenvector spatial filtering, excess zeros, Chronic stress exposure, Hurdle models on count data

Received: 24 Jan 2023; Accepted: 19 Sep 2023.

Copyright: © 2023 Montilla, Bohorquez and Renteria. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Prof. Martha P. Bohorquez, Universidad Nacional de Colombia, Departamento de estadística, Bogotá, Colombia