CAST methodology
The detailed methodology for ACLED's Conflict Alert System (CAST) includes information on access, how the model was constructed, and how to understand the results.
The ACLED Conflict Alert System (CAST) is a global prediction tool that forecasts the number of political violence events that will occur in each upcoming rolling-four-week period, for the next six periods, in every country around the world. The tool predicts the ACLED Battles, Explosions/remote violence, and Violence against civilians event types at the first administrative division level (i.e., provinces), country and global levels. New predictions are published each week for the coming six periods, alongside accuracy metrics for previous forecasts.
Input and output units
The temporal unit of analysis is a four-week rolling period ending on Fridays, meaning the model is trained on these historical four-week periods to produce the forecast. The final period in the data includes the four consecutive weeks leading up to and including the most recent Friday in the ACLED data. The earlier periods extend back in time to January 2018 relative to this most recent period.
The data has a hierarchical spatial structure. The base spatial unit is the first administrative unit level ("ADMIN1," e.g., provinces). These ADMIN1 units are nested within country units, which are in turn nested within the global level. The model produces predictions at each level, which are then reconciled to ensure consistency across all three levels.
The structure of the target variables — or the types of events being forecasted — is also hierarchical, including three base event types as well as their total. These include the “Battles,” “Explosions/remote violence,” and “Violence against civilians” event types from the ACLED dataset. Their total is directly predicted and referred to as organized violence. The outcome is therefore a count of events per four-week period at a given spatial unit of analysis.
Input features
ACLED CAST relies on a variety of indicators to generate its predictions. Many of the indicators are derived from the ACLED dataset, including trends in recent violence, activity of key actor types, spillover violence across borders, and strategic developments like peace agreements, among others. In addition to the indicators derived from the ACLED dataset, CAST uses a variety of other inputs. For example, population estimates from WorldPop, as well as infant mortality rates from the Center for International Earth Science Information Network, act as proxies for relatively static subnational development, while indicators from V-Dem capture variation in political institutions across countries. A time trend variable is also included to capture unobserved time-varying predictors and is essential for capturing seasonality and, for instance, climate effects that vary throughout the year.
Model
The machine learning algorithm underpinning CAST is a light gradient boosted machine (LGBM) with a Tweedie objective function. As a tree-based algorithm, the LGBM flexibly accounts for the non-linear and interactive relationships among our features. Additionally, LGBM provides a strong balance between flexibility and regularization, enabling it to capture complex non-linear and interaction effects while controlling for overfitting. Compared to other tree-based alternatives, the LGBM is extremely efficient to train, making weekly tuning and updates feasible with minimal overhead. Finally, the Tweedie objective addresses the zero-inflated nature of the target event counts.
Hierarchical reconciliation
For each forecasted period, we generate predictions for every spatial unit and outcome in the data, referred to as “base forecasts.” These base forecasts are generated for each ADMIN1-outcome-period combination, each country-outcome-period combination, and globally for each outcome-period (e.g., 8 November to 5 December 2025 for Brazil-Battles). When it comes to aggregating these forecasts, however, there is no guarantee that the lower-level forecasts will correctly sum to the higher-level forecasts. For example, the ADMIN1 forecasts within a given country might not sum to the direct country-level forecast for that country.
Forecasts that do not correctly sum throughout the hierarchy are not coherent. We therefore apply a hierarchical reconciliation algorithm to enforce the aggregation logic inherent in our data. This ensures that the lower-level spatial units coherently sum to higher levels and the individual event types within each spatial unit coherently sum to the total organized violence forecast. More specifically, we apply MinTraceSparse reconciliation. This method has two key elements: 1) It ensures that aggregation constraints are respected (e.g., lower-level spatial units sum to higher levels; individual event types sum to organized violence); and 2) the adjustments made to the base forecast are minimized according to a weighted loss function.
Uncertainty
In addition to the reconciled forecasts, we also produce estimates of high and low scenarios based on the historical uncertainty for each unit. The reconciled forecast may be thought of as our expected estimate of future events, while the high (low) forecasts represent the estimate of the likely upper (lower) bound of likely events given the uncertain nature of conflict dynamics. Our approach draws on conformal inference but makes a few adjustments to better suit our needs.
As is standard in conformal prediction, we first define the expected coverage level, referred to as alpha. If alpha is 0.05, we expect 95% of observed event counts to fall within a forecast interval. We then execute a rolling time-series cross-validation loop to simulate how a tuned model would have performed historically for each unit in the hierarchy. For each iteration, we generate base forecasts and then apply the hierarchical reconciliation algorithm. We store the residuals (observed minus reconciled predicted values) for each step, which results in a distribution of historical forecast errors for each unit-forecast horizon combination. We then sort the residuals and identify upper and lower thresholds from this distribution that ensure the interval range contains 1-alpha historical errors. These values are then added to the point forecasts to generate upper and lower bounds.
We take a few steps to further calibrate the intervals. In some rare situations, the intervals derived from the distribution of residuals might not contain the point forecast. In this case, we identify all negative (positive) values available and select the median negative (positive) value rather than the quantile implied from alpha. If a given case has no negative (positive) values in its distribution of residuals, we make the uncertainty intervals symmetric. As a final means of calibration, we also smooth the intervals across the forecast horizon to reduce noise in the interval widths over time.
Change from the designated moving average
In the ACLED CAST dashboard, all forecasts are presented alongside the moving average for that country or administrative division. The time horizon of the moving average is customizable by the user. This comparison point is provided in order to contextualize the forecasts within the larger scope of a country or administrative division’s conflict environment. For example, if the 12-month moving average is selected, the dashboard will show the selected month’s forecast alongside the 12-month moving average for that country or administrative division. This shows, on average, how many Battles, Explosions/remote violence, or Violence against civilians events occurred in that area in the last 12 months. The time range for the moving average is customizable to allow for dynamic conflict environments; if, for example, a country has a recent spike in conflict events, the user may wish to set the moving average to just the last one to three months.
Summary
ACLED CAST is a hierarchical forecasting model that leverages machine learning to predict organized violence events at multiple spatial levels. The model incorporates a variety of features derived from ACLED data and other sources and employs hierarchical reconciliation to ensure consistency across predictions at different spatial units. It also quantifies high and low forecast scenarios on historical uncertainty for each unit.
All use of ACLED CAST must abide by the ACLED Terms and Conditions. If you wish to reproduce or republish a visual, graphic, or map from ACLED CAST (rather than creating an original image using raw data) for non-commercial purposes, please cite ACLED CAST using the following format:
ACLED, “ACLED Conflict Alert System,” accessed on DD Month Year. https://acleddata.com/conflict-alert-system/
Appendix A: Glossary
| NAME | DESCRIPTION |
|---|---|
| Battles | Violent interactions between two politically organized armed groups at a particular time and location. Battles can occur between armed and organized state, non-state, and external groups, and in any combination therein. There is no fatality minimum necessary for inclusion. Included as Battles (t-1) when used as a predictor. |
| Violence Against Civilians | Violent events where an organized armed group deliberately inflicts violence upon unarmed non-combatants (civilians). Included as Violence against civilians (t-1) when used as a predictor. |
| Explosions/ Remote Violence | One-sided violent events in which the tool for engaging in conflict creates asymmetry by taking away the ability of the target to respond. Included as Explosions/remote violence (t-1) when used as a predictor. |
| (t-1) | Represents the value of the accompanying predictor from the previous month. Example: Battles (t-1) represents the number of Battles events from the previous month. |
| Protests | A public demonstration in which the participants do not en- gage in violence, though violence may be used against them. Events include individuals and groups who peacefully demon- strate against a political entity, government institution, policy, group, tradition, businesses or other private institutions. In- cluded as Protests (t-1) when used as a predictor. |
| Riots | Violent events where demonstrators or mobs engage in disruptive acts, including but not limited to rock throwing, property destruction, etc. They may target other individuals, property, businesses, other rioting groups or armed actors. Included as Riots (t-1) when used as a predictor. |
| Excessive Force Against Protesters | Events where individuals are engaged in a peaceful protest and are targeted with violence by an actor leading to (or if it could lead to) serious/lethal injuries. Included as Excessive force against protesters (t-1) when used as a predictor. |
| Organized violence | Sum total of all organized violence event times in a given month (total of battles, violence against civilians, and explo- sions/remote violence). Included as the 6 month moving av- erage and standard deviation when used to generate predic- tors. |
| Fatalities | The number of reported fatalities which occurred during an event. Included as Fatalities (t-1) when used as a predictor. |
| Actor Concentration | A Herfindahl-Hirschman Index tracking the number of active conflict actors and for how many violent events each is responsible. Included as Actor concentration (t-1) when used as a predictor. |
| Actor Interactions | Interactions among the actors in an event, grouped by the types of actors engaged in the event. Group types include State Forces, Rebel Groups, Political Militias, Identity Militias, Rioters, Protesters, Civilians, and External/Other Forces. An event involving a state military and an armed rebel group, for example, would be a State Forces-Rebel interaction. Events with only one actor are coded as sole actions (e.g., State Forces sole action). Included as the Interaction-type (t-1) when used as predictors, where the variable represents the sum total of events within that actor type in the month prior. For example, State forces-rebel interactions (t-1) is the count of state military and rebel group interactions in an Admin1 the month before. |
| Violence in Neighbors | Battles, Explosions/Remote violence, and Violence against civilians events in neighboring Admin1 locations. Included as Violence in neighbors (t-1) when used as a predictor. |
| Strategic Developments | Contextually important information regarding the activities of violent groups that is not itself recorded as political violence, yet may trigger future events or contribute to political dynamics within and across states. Included as Strategic developments (t-1) when used as a predictor. |
| Agreements | Agreements between different actors (such as governments and rebel groups) within the previous six months. Examples include peace agreements/talks, ceasefires, evacuation deals, prisoner exchanges, negotiated territorial transfers, prisoner releases, surrenders, repatriations, etc. Included as Agreements (t-1) when used as a predictor. |
| Time Trend | A set of variables capturing many potential temporal trends, including those specific to years, months, and quarters, as well as a linear monthly trend since 2018. |
| Infant Mortality | Estimated Admin1-level infant mortality rates, derived from the Center for International Earth Science Information Network’s (CIESIN) Global Subnational Infant Mortality Rates rasters. |
| Population | Estimated Admin1-level population, derived from WorldPop population rasters |
| Relative Time Comparison | Time period in the past to compare forecasted violence with. |
| Predicted Change Category | Level of predicted change from selected time period. |
| Forecast Date | Month within the next six months to view forecast. |
| State Forces | Collective actors that are recognised to perform government functions, including military and police, over a given territory. Referred to as “Military” in interactions. |
| Rebel Groups | Political organizations whose goal is to counter an established national governing regime by violent acts. |
| Political Militias | Armed, organized groups with political goals that use violence to advance those goals. Unlike rebel groups, political militias generally do not actively seek to topple or replace the national government using violence, though some are organized in opposition to government authority. |
| Identity Militias | Armed and violent groups organized around a collective, common feature including community, ethnicity, region, re- ligion, or, in exceptional cases, livelihood. Therefore, identity militias captured in the ACLED dataset include those reported as tribal, clan, communal, ethnic, local, community, religious, and livelihood militias. |
| Rioters | Individuals who engage in violence during demonstrations and mob violence events. Violence can be directed against people, property, or both. |
| Protesters | Peaceful, unarmed demonstrators. Although protesters are nonviolent, they may be the targets of violence by other groups (e.g., security institutions, private security firms, or other armed actors). |
| Civilians | Civilians, in whatever number or association, are victims of violent acts within ACLED as they are, by definition, unarmed and, hence, vulnerable. Some normally armed actors may be coded as civilians if they are targeted with violence in situations where they are caught unarmed. |
| External/Other Forces | International organizations, state forces active outside of their main country of operation, private security firms and their armed employees, and hired mercenaries acting independently. |