Predicting politicians' misconduct: Evidence from Colombia Academic Article

Doi

Overview
Research
Identity
View All

journal

Data and Policy

abstract

Corruption has pervasive effects on economic development and the well-being of the population. Despite being crucial and necessary, fighting corruption is not an easy task because it is a difficult phenomenon to measure and detect. However, recent advances in the field of artificial intelligence may help in this quest. In this article, we propose the use of machine-learning models to predict municipality-level corruption in a developing country. Using data from disciplinary prosecutions conducted by an anti-corruption agency in Colombia, we trained four canonical models (Random Forests, Gradient Boosting Machine, Lasso, and Neural Networks), and ensemble their predictions, to predict whether or not a mayor will commit acts of corruption. Our models achieve acceptable levels of performance, based on metrics such as the precision and the area under the receiver-operating characteristic curve, demonstrating that these tools are useful in predicting where misbehavior is most likely to occur. Moreover, our feature-importance analysis shows us which groups of variables are most important in predicting corruption.

authors

publication date

2022-11-14

edition

4

keywords

Anti-Corruption
Artificial intelligence
Colombia
Developing countries
Economics
Machine learning
Neural networks
Random forests
artificial intelligence
corruption
developing country
economics
evidence
learning
mayor
misbehavior
municipality
neural network
performance
politician
prosecution
recipient
well-being

Digital Object Identifier (DOI)

https://doi.org/10.1017/dap.2022.35