The hard problem of prediction for conflict prevention

Mueller, Hannes; Rauh, Christopher

Show metadata

Permalink

https://hdl.handle.net/1866/21631

Article [Version of Record]

Cahier_2019-2.pdf (1.725Mb)

Is part of

Cahier de recherche ; no. 2019-02.

Publisher(s)

Université de Montréal. Département de sciences économiques.

2019-04

Author(s)

Mueller, Hannes

Rauh, Christopher

Affiliation

Université de Montréal. Faculté des arts et des sciences. Département de sciences économiques

Abstract(s)

There is a rising interest in conflict prevention and this interest provides a strong motivation for better conflict forecasting. A key problem of conflict forecasting for prevention is that predicting the start of conflict in previously peaceful countries is extremely hard. To make progress in this hard problem this project exploits both supervised and unsupervised machine learning. Specifically, the latent Dirichlet allocation (LDA) model is used for feature extraction from 3.8 million newspaper articles and these features are then used in a random forest model to predict conflict. We find that several features are negatively associated with the outbreak of conflict and these gain importance when predicting hard onsets. This is because the decision tree uses the text features in lower nodes where they are evaluated conditionally on conflict history, which allows the random forest to adapt to the hard problem and provides useful forecasts for prevention.

Collections

Faculté des arts et des sciences – Département de sciences économiques - Travaux et publications [564]

This document disseminated on Papyrus is the exclusive property of the copyright holders and is protected by the Copyright Act (R.S.C. 1985, c. C-42). It may be used for fair dealing and non-commercial purposes, for private study or research, criticism and review as provided by law. For any other use, written authorization from the copyright holders is required.