Predicting politicians' misconduct: Evidence from Colombia Academic Article

journal

  • Data and Policy

abstract

  • Corruption has pervasive effects on economic development and the well-being of the population. Despite being crucial and necessary, fighting corruption is not an easy task because it is a difficult phenomenon to measure and detect. However, recent advances in the field of artificial intelligence may help in this quest. In this article, we propose the use of machine-learning models to predict municipality-level corruption in a developing country. Using data from disciplinary prosecutions conducted by an anti-corruption agency in Colombia, we trained four canonical models (Random Forests, Gradient Boosting Machine, Lasso, and Neural Networks), and ensemble their predictions, to predict whether or not a mayor will commit acts of corruption. Our models achieve acceptable levels of performance, based on metrics such as the precision and the area under the receiver-operating characteristic curve, demonstrating that these tools are useful in predicting where misbehavior is most likely to occur. Moreover, our feature-importance analysis shows us which groups of variables are most important in predicting corruption.

publication date

  • 2022-11-14

edition

  • 4

keywords

  • Anti-Corruption
  • Artificial intelligence
  • Colombia
  • Developing countries
  • Economics
  • Machine learning
  • Neural networks
  • Random forests
  • artificial intelligence
  • corruption
  • developing country
  • economics
  • evidence
  • learning
  • mayor
  • misbehavior
  • municipality
  • neural network
  • performance
  • politician
  • prosecution
  • recipient
  • well-being