Global data-driven prediction of fire activity

Di Giuseppe, Francesca; McNorton, Joe; Lombardi, Anna; Wetterhall, Fredrik

doi:10.1038/s41467-025-58097-7

Download PDF

Article
Open access
Published: 01 April 2025

Global data-driven prediction of fire activity

Nature Communications volume 16, Article number: 2918 (2025) Cite this article

14k Accesses
4 Citations
82 Altmetric
Metrics details

Subjects

Abstract

Recent advancements in machine learning (ML) have expanded the potential use across scientific applications, including weather and hazard forecasting. The ability of these methods to extract information from diverse and novel data types enables the transition from forecasting fire weather, to predicting actual fire activity. In this study we demonstrate that this shift is feasible also within an operational context. Traditional methods of fire forecasts tend to over predict high fire danger, particularly in fuel limited biomes, often resulting in false alarms. By using data on fuel characteristics, ignitions and observed fire activity, data-driven predictions reduce the false-alarm rate of high-danger forecasts, enhancing their accuracy. This is made possible by high quality global datasets of fuel evolution and fire detection. We find that the quality of input data is more important when improving forecasts than the complexity of the ML architecture. While the focus on ML advancements is often justified, our findings highlight the importance of investing in high-quality data and, where necessary create it through physical models. Neglecting this aspect would undermine the potential gains from ML-based approaches, emphasizing that data quality is essential to achieve meaningful progress in fire activity forecasting.

Evaluation of machine learning and deep learning algorithms for fire prediction in Southeast Asia

Article Open access 29 May 2025

Exploration of geo-spatial data and machine learning algorithms for robust wildfire occurrence prediction

Article Open access 28 March 2025

Wildfire spreading prediction using multimodal data and deep neural network approach

Article Open access 31 January 2024

Introduction

Advancements in machine learning (ML) have opened new possibilities in weather prediction in recent years^1,2,3,4,5. The use of ML technologies in place of physically based methods to predict weather has improved the accessibility and precision of forecasts⁶ and blurred the lines between physical and human-generated information. Models that adapt in real time to changing patterns of both physical variables⁷ and human behavior^8,9 are now capable of exploiting the information content locked in social interactions^10,11 and has the potential to improve the prediction of human-influenced natural hazards¹². This is especially important because humans have and still are modifying the environment on a large scale¹³.

The prediction of landscape fires can benefit from this data-driven revolution. Fire is a complex process influenced by various interconnected factors, including fuel composition, weather, and ignition¹⁴. Fires are predominantly human-induced in large parts of the world, making them inherently stochastic and challenging to predict. Even natural ignitions, such as those caused by lightning, present significant forecasting difficulties^15,16. As a result, most forecast models are specific to local regions and cannot be easily applied globally¹⁷. Predictive models have until now relied on the concept of danger ratings. Danger ratings are empirical metrics used in current global early warning systems to mark anomalous fire weather and potential fire behavior, provided that an ignition occurs. These early warning systems use observed weather data or forecasts from numerical weather prediction models^18,19 but do not take into account variations in fuel status and abundance. Fuel information is only available on a very local scale. By not including two crucial components, fuel and ignitions, current fire forecasts only inform on anomalous weather conditions rather than provide a reliable prediction of fire occurrence. Including fuel and ignition could improve the reliability of fire risk prediction.

Excluding fuel in fire forecasts can lead to an underestimation of fire severity, for example the Alexandroupolis fire in Greece, and the extended burning in western Amazonia in 2023²⁰. The unprecedented and uncontrollable urban fire that raged across Los Angeles in 2025 was in part driven by fire prone conditions, but also by an unusual accumulation of fuel from the preceding wet springs. Typically, global fire forecasts do not take into account fuel because of insufficient information on fuel availability and status. Furthermore, even with good observations of fuel availability and status it would be complex to establish a physical relationship between fuel variables and fire purely through a process-based analysis.

Significant advances have been made in the ability to observe wildfires from space with the increased availability and capability of remote sensing^21,22,23,24. Despite the challenges in detecting small fires²⁵ and decontaminating data from spurious signals²⁶, satellite data now provide a valuable global status of fire activity at increasingly detailed resolution. Satellite data have transformed the study of fire occurrence, patterns, trends, and controls, which would not have been possible using incomplete national or regional inventories^24,27,28,29. In addition, global fire activity observations provide the opportunity to employ data-driven modeling to proactively address the information gap in fire ignitions.

Machine learning is already widely applied in multiple areas of fire science, including fire management practices³⁰, early³¹ and long-term detection³², and crucially to aid firefighting operations on a very local scale^33,34,35. Global proof-of-concept AI predictive systems have mostly focused on reproducing burned areas. However, understanding the feasibility of a global, kilometer-scale fire activity prediction system is still in its early stages. An analysis of the skill of such a system, its limitations, and the key factors driving its performance is still lacking. In recent years, the European Centre for Medium-Range Weather Forecasts (ECMWF) established a system to predict the probability of wildfires occurring anywhere on the planet at least a week in advance³⁶. This paper reflects on ECMWF’s operational data-driven fire predictions, which have been running since 2023, and analyzes in details the extensive fire events that burned large parts of Canada in 2023 and the devastating 2025 fire in Los Angeles, California. We evaluate how skillful this completely data-centric system has been, and determine where this predictability mainly comes from-whether it is the training data or the data-driven approach itself. The key question is: do current forecasts lack skill because of the quality of input data, or because traditional physical models lack the complexity to capture wildfire processes? Understanding this fundamental issue is critical not only to confirm trust in ML methods but also to understand the importance of good quality training data.

Results

Data as a source of fire prediction

The occurrence and severity of landscape fires are typically explained by the fire triangle, which considers three key factors: ignition source (either man-made or natural, such as lightning), fuel (abundance, condition, and continuity), and weather (wind, temperature, and moisture conditions)³⁷.

To evaluate the importance of data compared to the complexity of machine learning frameworks, three models with increasing complexity (random forest, XGBoost and neural networks)³⁶ were used in a set of ablation experiments, progressively incorporating additional data sources during model training (see Table 1). Designing a clear set of experiments to pinpoint the importance of each data source is challenging, particularly given the acknowledged complexity of interactions between fuel, weather, and ignition. For example, vegetation abundance and moisture levels are influenced by weather conditions, while local vegetation can also modulate weather patterns. Consequently, the contributions of fuel, weather and ignition to fire activity are not entirely independent of each other. To account for the possible combinations of these controls, we trained several data-driven models using various combinations of factors from the fire triangle (Fig. 1). This included pairing weather with fuel, fuel with ignition, and ignition with weather. Additionally, models were trained using each control in isolation and, finally, incorporating all three controls together.

Table 1 Datasets used for training: Nineteen predictors are grouped into three categories: weather, fuel, and ignition

Full size table

**Fig. 1: Skills of the data-driven fire prediction.**

Validating the prediction skill on intrinsically stochastic processes is challenging, with no suitable single metric. Therefore, we used a selection of skill scores to assess prediction skill when using different controls and how they contribute to the global prediction of fire activities (see details in the “Methods” subsection “Validation metrics”). The consistently low probability predictions (<5%) results in a strong penalization in probabilistic scores like Brier and Logloss even when fire events occur, as these metrics are designed to penalize the distance from a deterministic-like prediction of 100% probability in case of a hit. To provide a generalized assessment, we averaged across all skill scores, even recognizing that they are different in nature and not uncorrelated. The best-performing ML architecture is the intermediate-complexity model, XGBoost, which significantly outperforms the simpler random forest while offering performance equivalent to the more complex neural network. This indicates that a deep layered infrastructure does not provide additional accuracy for classification problems such as fire ignitions. Combining all data provides the best prediction of fire activity for all infrastructures, both globally and regionally. From an ideal starting point where all sources of information are included, a 30% degradation of skill is expected when only weather or ignition is considered, and 15% when only fuel is considered. This is not surprising as fuel includes weather-driven variables like fuel moisture, making it the most relevant factor. Any combination of two controls increases the quality of the prediction, resulting in a degradation of only 3–7% of the maximum achievable skill. The improvement achieved by incorporating additional data into the training process outweighs the gains obtained from transitioning from a medium-complexity to a high-complexity architecture.

There are regional differences, especially where fire activity is inhibited (Sahara) or mostly driven by lightning ignitions (eastern Australia)¹⁵. In the latter ignitions play a relevant role, and even our simple representation based on lightning forecasts and static maps of population and road density provides the best possible input for a fire forecasting system. Regions for which traditional fire weather indices were designed, such as northwestern North America, still select weather as the most relevant control to explain fire activity, confirming the weather-limited regime of forested areas. However, the fuel data derived through the physical model of McNorton and Di Giuseppe³⁸ is the most relevant control in isolation, not only in fuel-limited ecosystem such as the Mediterranean region and western Africa, but also in tropical regions in South America and globally. Fuel is here represented as a combination of fuel abundance and fuel moisture content, and changes are mainly driven by weather condition and human influence on land use and land cover. Fire activity in regions controlled by fuel account for the bulk of fire activity globally and are the largest source of carbon emissions (e.g. ref. ³⁹).

Predicting fire activity rather then fire danger

One of the most significant limitations of traditional fire weather indices is their tendency to consistently predict high fire danger in regions with low fuel availability⁴⁰. For example, FWI often indicates extreme fire danger in desert areas where extreme temperatures and low moisture persist for most of the year, despite the absence of fires due to insufficient vegetation. Such areas are typically masked out in applications using fixed land-cover datasets. In fact, when comparing the climatology of fire danger for 2023 with the actual fire activity recorded for that year, the FWI highlights extreme values in vast desert regions such as the Sahel, the Tibetan plateau, and the Gobi desert, where no fire activity was recorded (Fig. 2). A data-driven approach trained directly on observed fire activity forecasts fire activity rather than fire danger, which to some extent addresses this issue. The model avoids the false predictions characteristic of traditional indices by learning the locations of barren areas from the climatological absence of fire activity (Fig. 2). Importantly, this learning process is significantly improved by expanding the training dataset. When weather variables are the sole input, the inhibitory factors related to the lack of recorded fire activity stem only from the target dataset. However, incorporating additional information on fuel availability and ignition sources during training enables the forecasting system to better reproduce the global climatology of fire activity. This underscores the substantial predictive improvements that can be achieved by integrating relevant data during the training phase.

**Fig. 2: Predicted yearly climatology of data-driven and fire weather indices.**

Predicting extremes

Another interesting aspect is the prediction of extremes and model confidence when making high probability predictions (Fig. 3). As fires are sporadic events subject to favorable conditions but ignited through stochastic triggers, they are systematically over-predicted in deterministic process-based forecasting systems (also visible in Fig. 2). The inherently probabilistic nature of the data-driven approach is therefore advantageous. In this logical framework, accuracy means that the predicted probability should match the frequency of the events, a metric typically referred to as the reliability of the forecast. The frequency of observing a single fire in any given year within a 9 km grid box is expected to be low (1 time in 365 days would lead to an average ~0.3% probability for each day), so we also expect a low probability to be the correct forecast. At low probabilities, all data-driven models perform equally well globally with a prediction probability that matches the observed frequency (Fig. 3). However, the weather-only data-driven model tends to over-predict the forecasts for high probabilities. Including information on fuel availability and sources of ignition not only substantially improves the predictions in many regions, but also increases the model confidence in predicting higher probabilities. This becomes a key aspect when these systems are considered for operational early warning systems. For example, a false alarm could result in investing resources when not needed and a forecast miss could result in more severe consequences, such as not issuing warnings and a lack of preparation for wildfire events.

**Fig. 3: Reliability of the data-driven fire prediction.**

Real-time application for a data-driven fire prediction system

The use of data-driven prediction for fire activity is showcased in two events: one in California in 2025 and another in Canada in 2023.

Starting on January 7, 2025, a series of catastrophic wildfires devastated the Los Angeles metropolitan area and surrounding regions. These fires were fueled by extremely low humidity, dry conditions, and hurricane-force Santa Ana winds, which reached speeds of 160 km/h in some areas. The wildfires claimed several lives, destroyed or damaged thousands of residential and commercial structures, and forced nearly 200,000 people to evacuate. The most severe impacts were caused by the two largest fires: the Palisades fire and Eaton fire, which remained uncontrolled for days. Traditional fire weather indices had indicated that persistent anomalous and dry conditions for the winter season and the incoming katabatic winds would create ideal anomalous conditions and warnings had been issued over the regions. However, FWI indicated widespread danger conditions (Fig. 4). The data-driven model including all the variables shows better localization of areas where fires ignited (Fig. 4). This better skill stems from the inclusion of information on fuel abundance and status as well as proximity to human activities. In fact, the severity of the fires was linked to a phenomenon known as “hydroclimate whiplash"⁴¹ which is believed to be exacerbated by climate change. This phenomenon occurs when anomalous wet conditions are followed by very dry conditions. The wet spell promotes vegetation growth and a rapid increase in fuel availability. During the subsequent dry period, moisture is quickly depleted, leaving the fuel highly flammable and often resulting in severe burning. California experienced two very wet spring seasons preceding the 2025 fires, which contributed to an enhanced availability of fuel in the region. These anomalies created a distinct pattern of increased flammability at the wildland–urban interface. This pattern was effectively captured by the ML model, which accounted for the anomalous fuel conditions but remained undetected by traditional FWI metrics.

**Fig. 4: California data-driven fire prediction for 2025.**

The 2023 fire season in Canada began earlier than usual, mirroring a recurring trend of spring fires in the northern prairie provinces attributed to early spring drying and the emergence of overwintering smoldering fires from warmer winters^20,42,43. Notably, British Columbia witnessed its first wildfire evacuation of the season in mid-April, while Nova Scotia’s capital Halifax experienced the largest wildfire on record, prompting the evacuation of over 16,000 individuals in late May⁴⁴ (Fig. 5 upper panels). In June, Quebec encountered two lightning outbreaks, igniting hundreds of new fires, and resulting in a record-high burned area of 4300 km² ²⁰ (Fig. 5 lower panels).

**Fig. 5: Canada data-driven fire prediction for 2023.**

Given the Canadian wildfire season in 2023 was significantly more widespread than any seen in our training data, we hypothesize if a data-driven approach can perform under conditions it was not trained for, and if it can be useful for real-time applications? Our analysis shows that both versions of the forecast model, a standard 9 km version presented here and an experimental 1 km version, deliver accurate information of extreme fire activity for the Canadian fires when trained using all available drivers, offering valuable insights up to 10 days in advance (Fig. 5).

Both the spatial distribution and the total intensity of fire activity is well forecasted 10 days ahead when we compare to recorded MODIS active fires. The model correctly predicts fires even when MODIS is unable to detect them because of cloud or smoke. MODIS data used in our analyses are known to be conservative because of limitations in detecting small fires based on surface spectral changes at 500 m resolution²⁴. We showcase a typical event on 30 May 2024 where MODIS detection is underestimated, and measurements from other sensors (VIIRS) depict more intense fire activity (Fig. 5 lower panel). Since our model is trained globally, and the missing observations from MODIS are not systematic, the data-driven approach can correct for these satellite omission deficiencies.

This capability is relevant not only for early warning systems but also for informing on possible missing observations. Emission forecasting systems relying on observations of fire radiative power are known to underestimate the contribution of fire emission to air quality forecasts⁴⁵. Despite most of the Quebec fires were in remote regions, the smoke they generated blanketed several major cities in eastern North America, including New York, the latter experiencing its worst air quality in half a century. The observed daily mean PM2.5 concentration rose to 148.3 μm⁻³ on 7 June 2023, over four times the recommended daily limit⁴⁶. In total, over 50 million people were exposed to high levels of PM2.5 for several days⁴⁷. The full extent of the fires over this period were missing from MODIS observations resulting in observation-based emission estimates (e.g. The Copernicus Atmosphere Modeling System) severely underestimating the total amount of PM2.5.

Discussion

Since the 1970s, landscape fire predictions have relied on empirical models of landscape flammability tailored to specific ecosystems^48,49,50. They have become pivotal tools for fire management agencies in preemptively identifying critical areas for suppression^51,52. The ease of implementation and the availability of weather data have contributed significantly to their widespread use. Despite their utility, studies have highlighted the limited effectiveness of the fire weather index and similar metrics in fuel-limited ecosystems where fires are driven by the short-term superficial drying of intermittently available biomass⁴⁰. The availability of remote observations for fuel, either independently⁵³ or supported by modeling frameworks^38,54,55, has permitted the development of new indices that partially incorporate fuel considerations into their formulation^40,56. However, it is the emergence of data-driven technology that holds the promise of significantly enhancing our predictive capabilities³⁶ as it allows us to exploit information from diverse sources in a computationally efficient way. Using a set of ML infrastructures we have shown that a data-driven approach of fire prediction can:

learn conditions where fire is inhibited, even when only utilizing the same atmospheric variables as the FWI.
Rapidly integrate information on fuel without the complexity of formulating a process-based framework connecting fuel to fire activity.
Consider ignitions that may lack a direct physical basis, e.g, when induced by humans.
Adapt and refine its predictions over time in a simple way, allowing it to learn about changes in flammability due to climate change⁵⁷ or human practices^58,59. Updates in process-based derivation would require more targeted data acquisitions (e.g., through prescribed burnings), which are more difficult to obtain and site specific.

A data-driven model can forecast the probability of fire activity itself, unlike process-based methods. This enhances the usability of data-driven outputs as they directly relate to observable variables of fire activity. The quality of the forecast can be verified in a probabilistic framework against a measured quantity. This opens new avenues in our capability to forecast fire activity, as it becomes feasible to generate an initial estimate of active fire observations, such as those from MODIS. This could allow for the correction of missing observations, which would have a substantial impact if applied to fire emission estimations which are highly affected by availability of satellite observations^60,61,62.

A data-driven approach is likely to outperform process-based methods in most circumstances, but the effectiveness of data-driven predictions relies on the quality and relevance of the input data. Data-driven models can still encounter challenges, such as over-predicting fire activity in sparsely vegetated areas. This is especially true when only using the same input variables as traditional fire weather indices. Weather input alone does not provide all the information needed to constrain the problem. It is essential to consider all factors that contribute to fire activity (the three apexes of the fire triangle) during model training. Only when we incorporate weather conditions, fuel characteristics, and elements related to ignitions, we substantially improve the accuracy and reliability of fire activity forecasts. By fine-tuning the model with comprehensive input data it becomes better equipped to generate more precise predictions across various landscapes and conditions, resulting in up to 30% improvement.

Fuel status is the single most important predictor. Thus, the lack of direct real-time global observations of fuel has been the biggest limiting factor in developing a global prediction system for fire activity. The datasets used here rely on a physical understanding of fuel dynamics to derive a consistent picture of fuel characteristics over time. The use of a physical model for fuel could be avoided if sufficient information was available directly from the observations⁶³. For most applications direct observations are scarce, if available at all. For now, the creation of these fuel datasets is based on process-based models that are used to inform the final ML model³⁸. A physical-derived dataset might remain a prerequisite for successful machine learning applications for many applications where the observing system is limited. Our findings indicate that the acquisition of high-quality global data is paramount to successfully train a ML-based fire activity model.

Methods

ML infrastructures

We use the data driven approach developed in the ECMWF model called the Probability of Fire (PoF)³⁶. This model has been running in ECMWF since 2023 and produces daily forecasts available to ECMWF users. PoF predicts active fire (AF) observations from the MODIS MCD14ML active fire product (collection 6.1; 1 km resolution²⁶). Fires flagged as low confidence in the AF product were not used. The PoF system uses gradient-boosted decision trees from the XGBoost library on detected active fires⁶⁴. The training iteratively adds models to correct errors made by previous iterations, resulting in a computationally efficient optimization. The system training uses a classifier approach which defines a positive hit as an active fire detection within the grid cell on a given day.

To compare multiple data-driven approaches we also trained random forest and neural network models. The random forest model implemented using⁶⁵ consists of 50 trees with a maximum depth of 5, and each tree is trained on 80% of the samples using 50% of the features for each split. This configuration ensures a balance between model complexity and computational efficiency, while leveraging ensemble learning for robust performance.

The neural network model consisted of two hidden layers with 32 and 16 nodes, respectively, with sequential information flow from the input to the output layer using ReLU activation functions. The features were standardized. The model was trained using the Adam optimizer⁶⁶ with a learning rate of 0.001, categorical cross-entropy loss, and a batch size of 64 for up to 15 epochs. To prevent overfitting and optimize training efficiency early stopping was applied.

Several alternative configurations of each model was explored, including random forest models with 100 and 200 estimators and an neural network model with a four-layer architecture. These modifications yielded only marginal improvements, with the RF normalized average score increasing from 0.77 to 0.78 for both 100 and 200 estimators, and the NN from 0.83 to 0.84. Given the negligible performance gains relative to the increased computational cost, we opted to retain the simpler configurations for efficiency.

While tree-based methods like XGBoost and random forests lack temporal memory, this limitation is mitigated by including input features such as live fuel moisture content and fuel load which inherently include memory of previous conditions. For example, the deep soil moisture with a memory of several months. In contrast, neural networks have the potential to capture complex nonlinear interactions in the data but are computationally expensive, require extensive hyperparameter tuning and are less interpretable for probabilistic classification tasks. Performance-wise they are often shown to be outperform by XGBoost for tasks such as classification^67,68.

Training dataset

We included 19 predictors of active fires that are grouped into three controls: weather, fuel and ignition (Table 1).

Weather variables are from ERA5-Land (9 km resolution⁶⁹) Fuel variables are from the fuel characteristic model of McNorton and Di Giuseppe³⁸ also available at 9 km resolution. This model uses ESA-CCI above ground biomass estimates⁷⁰ and Copernicus Atmosphere Monitoring Service net ecosystem exchange estimates⁷¹ to infer fuel abundance. Abundance is then split between live leaf and wood load, and dead foliage and wood load based on the leaf area index (LAI) and vegetation type. Fuel moisture content is split between live fuel, dead foliage and dead wood. Live fuel moisture is a function of LAI, soil moisture and vegetation type whilst dead fuel is based on an extension of the Nelson model⁷².

Ignition drivers include variables known to indirectly control human capability to ignite a fire such as population density, proximity to urban areas and road density. Lightning density is also included as a source of natural ignitions from ECMWF analysis^15,45,73.

Grouping the set of 19 drivers between the three controls which forms the three sides of the fire triangle—weather, fuel and ignitions—is not straightforward, as fuel moisture and weather variables are strongly correlated and fuel load is also related to weather conditions. Hence, some variables can be associated to more than one control. The clustering stems from the historical reason of preserving the same weather drivers as the classical fire danger indices. It also takes into consideration the different models that have been used to generate the datasets.

Target dataset

We describe fire activity in terms of active fires. While a similar approach could be employed for estimating burned areas, the fire weather index (FWI) was specifically derived to characterize fire intensity, making it closer in nature to active fires, hence our choice. The longest coherent global observations come from the MODIS AQUA and TERRA satellites, launched in 1999 and 2002, respectively. The active fire product MCD14 v6.1 is the latest version of the dataset^21,74. Although the observation time-series spans over two decades, they are still short compared to the decadal to centurial fire return intervals in many ecosystems. As consistent long-term records of fire extent and properties are fundamental for the presented analysis, the training would benefit for an extension of the record and also an increase in resolution. While fire observations from other moderate-resolution datasets (such as the visible infrared imaging radiometer suite (VIIRS)) and high-resolution datasets (e.g. Landsat and Sentinel sensors) are increasingly available they do not provide the long time record of MODIS. It is important to note that the accuracy of our predictions could be significantly enhanced with access to a more robust dataset from multiple sensors. Therefore, while the results provided here are very encouraging it is important that we take into account the potential for utilizing a broader dataset to improve the accuracy of fire activity predictions for an operational use.

Validation metric

We use a set of metrics to assess the quality of the predictions. In the following we define what these are and why they were chosen.

Correlation

Correlation measures the linear relationship between the predicted and observed fire activity. Given we provide a forecast in terms of probability of occurrence and not the binary occurrence which is observed, the interpretation of the correlation is to be intended as the concomitant occurrence of high probabilities when fire events are observed. Given the sample is highly bias in favor of non-fire predictions, the correlation gives more weight to positive detection and predictions. A point to notice is that while correlation provides an overall estimations of how well high probability correspond to observed fires, in this specific application is a metric which has limitation as fires are events that occur at relatively low probabilities (1–2%) so values between prediction and fire activity tends to be very low. Most of the information here comes therefore from comparing across experiments rather than from looking at the absolute value.

Brier score

The Brier score is a measure of the accuracy of probabilistic predictions. It is calculated as the mean squared difference between the predicted probability and the actual outcome (0 or 1). The Brier score is therefore a true probabilistic skill score as it evaluates the quality of the probability estimates. It considers both the calibration and the refinement of the predictions, making it a good fit for probabilistic forecasts where the probabilities are generally low.

Logarithmic loss (Logloss)

Logloss measures the performance of a classification model by penalizing false predictions. It is calculated as the negative log of the likelihood of the true labels given the predicted probabilities. Logloss is sensitive to the confidence of the predictions. It strongly penalizes predictions that are confident and wrong, which is useful for ensuring that the model not only predicts the correct class but also assigns appropriate probabilities. In scenarios, such as fire prediction, where the observed outcomes are mostly zeros, Logloss remains informative because it considers the probability distribution of predictions. It is less biased by the imbalance, unlike some other metrics that might favor the majority class.

Receiving operator curve (ROC) and area under the curve (AUC)

The ROC curve is a plot of the true positive rate (recall) against the false positive rate at various probability thresholds. The AUC shown in Fig. 2 represents the model’s ability to distinguish between classes. The ROC–AUC is useful for evaluating unbalanced datasets, such as the datasets used here, as it considers all possible classification thresholds, providing a single value to summarize the model’s performance across all thresholds. A perfect model would have an ROC–AUC of 1, while a model with any misclassifications will have an ROC–AUC below 1.

Expected calibration error

Expected calibration error (ECE) quantifies the reliability of predictions, specifically, how well the probabilities align with observed frequencies of fire occurrence. We assess the reliability by comparing predicted probabilities with observed fires over a range of probability bins, with a lower ECE indicating better calibration. For our dataset, where fire events are uncommon, the ECE is useful as it provides a measure of skill, even in the case of low-event probabilities. By optimizing for low ECE, we ensure that the model not only identifies potential fire occurrences but does so in a manner that reflects true likelihoods, without under- or over- predicting.

Reliability

The panels in Fig. 3 are based on a reliability skill score, which we have identified as the most relevant metric in our analysis. Reliability in fire forecasting refers to the accuracy with which a predictive model’s forecasted probabilities match the actual observed frequency of fire events. Thus, it measures how well the predicted likelihood of fires corresponds to their real-world occurrence.

A reliable fire forecast model ensures that the probabilities it assigns to fire events are calibrated correctly. For example, if a model predicts a 0.3% probability of fire on a given day, then, on average, fire should occur on 0.3% of such days over a long period. We use annual estimates, as resource planning is typically performed on a yearly basis. Hence, a consistent 0.3% probability prediction is successful if it corresponds to 1 observed fire per year.

This concept is crucial because it determines the model’s trustworthiness in practical applications, such as resource allocation for fire prevention and response. Moreover, it can be used to assess the increased probability of fire occurrence due to external factors like human practices or climate change.

Reliable fire forecasting is also important for minimizing false alarms, which can lead to unnecessary resource expenditure, and for avoiding missed warnings, which can result in unpreparedness and greater harm from unexpected fires.

A potential challenge in computing a reliability score aggregated over a large spatiotemporal ___domain is the issue of compensating biases. For instance, consistent over-predictions in one region may be offset by consistent under-predictions in another, leading to an artificially high reliability score. A similar effect can occur when considering temporal biases. Therefore, analyzing regional reliability scores is crucial for accurately assessing model skill. Our analysis indicates that, for the models used in this study, compensating biases do not typically arise from seasonal variations in bias.

Fire weather index

The Canadian fire weather index, derives from the four main inputs the vegetation moisture state representative of different depths of the forest floor⁷⁵. It further expands on this by using wind and long-term precipitation deficit to derive a potential rate of spread index and a build-up indicator which then combined define a generic index of fire danger known as the fire weather index (FWI)⁴⁸. A higher FWI indicates fire weather conditions more conducive to wildfires once ignited. The index is derived assuming a specific forest type, “Pinus Banksiana," and thus an environment with a sufficient fuel load. The FWI is especially useful for predicting the likelihood and severity of extreme events in ecosystems where weather is the primary limitation to fire (i.e., those mainly limited by moisture or temperature) because of its original design for use in forest ecosystems⁷⁶. In areas with limited fuel⁴⁰ and where burning is heavily controlled by human practices²⁸, the correlation with actual fire activity is limited. FWI is extensively used in operational global information platforms such as the European Forest Fire Information System (EFFIS), the Global Wildfire Information System (GWIS), and the Canadian Wildland Fire Information System (CWFIS)^51,52,77,78.

To produce daily FWI data outlined in this study for 2023 at 9 km resolution, we used the global ECMWF Fire Forecast (GEFF) model 4.1 forced by ERA5-land data⁶⁹. The FWI dataset is generated at 9 km resolution using the same forcing as the fuel model. FWI assesses potential fire danger by integrating key meteorological factors, specifically temperature, humidity, wind speed, and precipitation, providing a quantitative measure of landscape flammability. Most global^18,51 and regional⁵² early warning systems employ this metric as a generic measure of fire danger.

We acknowledge that the FWI is not the only index for fire danger, and sub-indices of this system or other fire danger systems developed for other regions^50,79,80,81 can be used to improve the information provided to forestry agencies in biomes different from the boreal forest of Canada, for which the FWI was derived. The global performance of fire weather indices could, in absolute terms, be better than what is shown here. However, the scope of this study is to assess the enhanced ability in directly forecasting fires when a data-driven approach is applied to match observed fire activity, and how much of this ability stems from datasets that correctly describe the whole fire process.

Data availability

The input meteorological data for training the data-driven model presented here is taken from the ERA5-Land dataset openly available through the Copernicus Climate Data store (https://doi.org/10.24381/cds.e2161bac). The fuel characteristic dataset is available through weblinks at https://doi.org/10.24381/378d1497. The MODIS active fire product was downloaded from the University of Maryland SFTP (formerly FTP) server. Connect using the following information: Server: fuoco.geog.umd.edu Login name: fire Password: burnt. Spurious signals were removed using a Copernicus Atmosphere Monitoring Service mask which can be requested to the corresponding authors. The data to create Figs. 1–4 are available as pickle files on zenodo https://zenodo.org/records/11653699. The full set of data-driven fire activity forecast for 2023 in the multiple configurations processed for this study are available through the corresponding authors. Daily operational data-driven model output for the best training configuration (images and WMS layers) is available to registered users through the ECMWF Web platform, ECCharts (https://eccharts.ecmwf.int).

Code availability

The main scripts for data processing, model training, and analysis are archived in a publicly accessible repository https://doi.org/10.24433/CO.8570224.v1, with documentation to facilitate replication of the results.

References

Bi, K. et al. Pangu-weather: a 3d high-resolution model for fast and accurate global weather forecast. arXiv preprint arXiv:2211.02556 (2022).
Lam, R. et al. Learning skillful medium-range global weather forecasting. Science 382, 1416–1421 (2023).
Pathak, J. et al. FourCastNet: a global data-driven high-resolution weather model using adaptive Fourier neural operators. Preprint at arXiv https://arxiv.org/abs/2202.11214 (2022).
Lang, S. et al. AIFS—ECMWF’s data-driven forecasting system. Preprint at https://arxiv.org/abs/2406.01465 (2024).
Price, I. et al. Probabilistic weather forecasting with machine learning. Nature 637, 84–90 (2025).
Article CAS PubMed MATH Google Scholar
Ben Bouallègue, Z. et al. The rise of data-driven weather forecasting: a first statistical assessment of machine learning–based weather forecasts in an operational-like context. Bull. Am. Meteorol. Soc. 105, E864–E883 (2024).
Sathishkumar, V. E., Cho, J., Subramanian, M. & Naren, O. S. Forest fire and smoke detection using deep learning-based learning without forgetting. Fire Ecol. 19, 1–17 (2023).
Article Google Scholar
Kanervisto, A. et al. World and human action models towards gameplay ideation. Nature 638, 656–663 (2025).
Lake, B. M. & Baroni, M. Human-like systematic generalization through a meta-learning neural network. Nature 623, 115–121 (2023).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Wang, Z., Ye, X. & Tsou, M.-H. Spatial, temporal, and content analysis of twitter for wildfire hazards. Nat. Hazards 83, 523–540 (2016).
Article MATH Google Scholar
Behl, S., Rao, A., Aggarwal, S., Chadha, S. & Pannu, H. Twitter for disaster relief through sentiment analysis for covid-19 and natural hazard crises. Int. J. Disaster Risk Reduct. 55, 102101 (2021).
Article Google Scholar
Lu, Y. Artificial intelligence: a survey on evolution, models, applications and future trends. J. Manag. Anal. 6, 1–29 (2019).
MATH Google Scholar
Goudie, A. S. Human Impact on the Natural Environment: Past, Present and Future (Wiley, 2018).
Bowman, D. M. et al. Human exposure and sensitivity to globally extreme wildfire events. Nat. Ecol. Evol. 1, 0058 (2017).
Article MATH Google Scholar
Coughlan, R. et al. Using machine learning to predict fire-ignition occurrences from lightning forecasts. Meteorol. Appl. 28, 1973 (2021).
Article ADS MATH Google Scholar
Di Giuseppe, F. The value of probabilistic prediction for lightning ignited fires. Geophys. Res. Lett. 49, 2022–099669 (2022).
Article MATH Google Scholar
Anderson, K. A model to predict lightning-caused fire occurrences. Int. J. Wildland Fire 11, 163–172 (2002).
Article MATH Google Scholar
Di Giuseppe, F. et al. The potential predictability of fire danger provided by numerical weather prediction. J. Appl. Meteorol. Climatol. 55, 2469–2491 (2016).
Article ADS MATH Google Scholar
Di Giuseppe, F. et al. Fire weather index: the skill provided by the european centre for medium-range weather forecasts ensemble prediction system. Nat. Hazards Earth Syst. Sci. 20, 2365–2378 (2020).
Article ADS MATH Google Scholar
Jones, M. W. et al. State of wildfires 2023–2024. Earth Syst. Sci. Data 16, 3601–3685 (2024).
Giglio, L., Randerson, J. T. & Werf, G. R. Analysis of daily, monthly, and annual burned area using the fourth-generation global fire emissions database (gfed4). J. Geophys. Res.: Biogeosci. 118, 317–328 (2013).
Article Google Scholar
Wooster, M. J. et al. Satellite remote sensing of active fires: history and current status, applications and future requirements. Remote Sens. Environ. 267, 112694 (2021).
Article MATH Google Scholar
Giglio, L. et al. 3. Mapping and Characterizing Fire, pp. 37–51. American Geophysical Union (AGU), Geophysical Monograph 280. Hoboken, NJ: John Wiley and Sons https://doi.org/10.1002/9781119757030.ch3 (2023).
Chen, Y. et al. Multi-decadal trends and variability in burned area from the 5th version of the global fire emissions database (gfed5). Earth Syst. Sci. Data Discuss. 2023, 1–52 (2023).
CAS Google Scholar
Hantson, S., Andela, N., Goulden, M. L. & Randerson, J. T. Human-ignited fires result in more extreme fire behavior and ecosystem impacts. Nat. Commun. 13, 2717 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Giglio, L., Csiszar, I., Justice, C.O.: Global distribution and seasonality of active fires as observed with the terra and aqua moderate resolution imaging spectroradiometer (modis) sensors. Journal of Geophysical Research: Biogeosciences (2005–2012) 111(G2) (2006).
Andela, N. et al. The global fire atlas of individual fire size, duration, speed and direction. Earth Syst. Sci. Data 11, 529–552 (2019).
Article ADS MATH Google Scholar
Abatzoglou, J. T., Williams, A. P., Boschetti, L., Zubkova, M. & Kolden, C. A. Global patterns of interannual climate–fire relationships. Glob. change Biol. 24, 5164–5175 (2018).
Article ADS Google Scholar
Kelly, L. T. et al. Fire and biodiversity in the anthropocene. Science 370, 0355 (2020).
Article MATH Google Scholar
Li, Y., Zhang, T., Ding, Y., Wadhwani, R. & Huang, X. Review and perspectives of digital twin systems for wildland fire management. J. For. Res. 36, 1–24 (2025).
MATH Google Scholar
Ji, Y., Wang, D., Li, Q., Liu, T. & Bai, Y. Global wildfire danger predictions based on deep learning taking into account static and dynamic variables. Forests 15, 216 (2024).
Article MATH Google Scholar
Kondylatos, S. et al. Wildfire danger prediction and understanding with deep learning. Geophys. Res. Lett. 49, e2022GL099368 (2022).
Jain, P. et al. A review of machine learning applications in wildfire science and management. Environ. Rev. 28, 478–505 (2020).
Article MATH Google Scholar
Alkhatib, R., Sahwan, W., Alkhatieb, A. & Schütt, B. A brief review of machine learning algorithms in forest fires science. Appl. Sci. 13, 8275 (2023).
Article CAS MATH Google Scholar
Avazov, K. et al. Forest fire detection and notification method based on ai and iot approaches. Futur. Internet 15, 61 (2023).
Article Google Scholar
McNorton, J. R., Di Giuseppe, F., Pinnington, E., Chantry, M. & Barnard, C. A global probability-of-fire (pof) forecast. Geophys. Res. Lett. 51, 2023–107929 (2024).
Article Google Scholar
Moritz, M. A., Morais, M. E., Summerell, L. A., Carlson, J. & Doyle, J. Wildfires, complexity, and highly optimized tolerance. Proc. Natl Acad. Sci. USA 102, 17912–17917 (2005).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
McNorton, J. R. & Di Giuseppe, F. A global fuel characteristic model and dataset for wildfire prediction. Biogeosciences 21, 279–300 (2024).
Article ADS MATH Google Scholar
Friedlingstein, P. et al. Global carbon budget 2023. Earth Syst. Sci. Data 15, 5301–5369 (2023).
Article MATH Google Scholar
Di Giuseppe, F. Accounting for fuel in fire danger forecasts: the fire occurrence probability index (fopi). Environ. Res. Lett. 18, 064029 (2023).
Article ADS Google Scholar
Swain, D. L. et al. Hydroclimate volatility on a warming earth. Nat. Rev. Earth Environ. 6, 35–50 (2025).
Article MATH Google Scholar
Scholten, R. C., Jandt, R., Miller, E. A., Rogers, B. M. & Veraverbeke, S. Overwintering fires in boreal forests. Nature 593, 399–404 (2021).
Article ADS CAS PubMed Google Scholar
Luo, K., Wang, X., Jong, M. & Flannigan, M. Drought triggers and sustains overnight fires in north america. Nature 627, 321–327 (2024).
Article ADS CAS PubMed Google Scholar
Kolden, C. A., Abatzoglou, J. T., Jones, M. W. & Jain, P. Wildfires in 2023. Nat. Rev. Earth Environ. 5, 238–240 (2024).
Di Giuseppe, F., Benedetti, A., Coughlan, R., Vitolo, C. & Vuckovic, M. A global bottom-up approach to estimate fuel consumed by fires using above ground biomass observations. Geophys. Res. Lett. 48, 2021–095452 (2021).
Article Google Scholar
Wang, Z. et al. Severe global environmental issues caused by Canada’s record-breaking wildfires in 2023. Adv. Atmos. Sci. 41, 565–571 (2023).
Article MATH Google Scholar
Yu, M., Zhang, S., Ning, H., Li, Z. & Zhang, K. Assessing the 2023 Canadian wildfire smoke impact in Northeastern US: air quality, exposure and environmental justice. Sci. Total Environ. 926, 171853 (2024).
Article CAS PubMed MATH Google Scholar
Van Wagner, C. et al. Development and Structure of the Canadian Forest Fire Weather Index System Vol. 35 (Canadian Forestry Service, Headquarters, Ottawa, Canada, 1987).
Noble, I., Gill, A. & Bary, G. Mcarthur’s fire-danger meters expressed as equations. Aust. J. Ecol. 5, 201–203 (1980).
Article Google Scholar
Deeming, J. E., Burgan, R. E. & Cohen, J. D. The National Fire-danger Rating System—1978. USDA Forest Service General Technical Report INTUS (USA). no. 39 (USDA Forest Service, 1977).
San-Miguel-Ayanz, J. et al. Comprehensive monitoring of wildfires in europe: the european forest fire information system (effis). In: Approaches to Managing disaster-Assessing Hazards, Emergencies and Disaster Impacts (IntechOpen, Canada, 2012).
Worsnop, R. P. et al. Probabilistic fire danger forecasting: a framework for week-two forecasts using statistical postprocessing techniques and the global ECMWF fire forecast system (geff). Weather Forecast. 36, 2113–2125 (2021).
ADS MATH Google Scholar
Chaparro, D. et al. Vegetation moisture estimation in the western united states using radiometer-radar-lidar synergy. Remote Sens. Environ. 303, 113993 (2024).
Article MATH Google Scholar
Anderson, H. E. Aids to Determining Fuel Models for Estimating Fire Behavior. In The Bark Beetles, Fuels, and Fire Bibliography, Vol. 143. General Technical Report INT-122 (United States Department of Agriculture Forest Service Intermountain Forest and Range Experiment Station, 1982).
Yebra, M. et al. A global review of remote sensing of live fuel moisture content for fire danger assessment: moving towards operational products. Remote Sens. Environ. 136, 455–468 (2013).
Article ADS MATH Google Scholar
Preisler, H. K., Burgan, R. E., Eidenshink, J. C., Klaver, J. M. & Klaver, R. W. Forecasting distributions of large federal-lands fires utilizing satellite and gridded weather information. Int. J. Wildland Fire 18, 508–516 (2009).
Article MATH Google Scholar
Jones, M. W. et al. Global and regional trends and drivers of fire under climate change. Rev. Geophys. 60, 2020–000726 (2022).
Article MATH Google Scholar
Coop, J. D. et al. Wildfire-driven forest conversion in western North American landscapes. BioScience 70, 659–673 (2020).
Article PubMed PubMed Central MATH Google Scholar
Nolan, R. H. et al. Limits to post-fire vegetation recovery under climate change. Plant, Cell Environ. 44, 3471–3489 (2021).
Article CAS PubMed MATH Google Scholar
Kaiser, J. et al. Biomass burning emissions estimated with a global fire assimilation system based on observed fire radiative power. Biogeosciences 9, 527–554 (2012).
Article ADS CAS MATH Google Scholar
Di Giuseppe, F., Rémy, S., Pappenberger, F. & Wetterhall, F. Improving forecasts of biomass burning emissions with the fire weather index. J. Appl. Meteorol. Climatol. 56, 2789–2799 (2017).
Article ADS Google Scholar
Di Giuseppe, F., Rémy, S., Pappenberger, F. & Wetterhall, F. Using the fire weather index (FWI) to improve the estimation of fire emissions from fire radiative power (FRP) observations. Atmos. Chem. Phys. 18, 5359–5370 (2018).
Article ADS Google Scholar
McNally, A. et al. Data driven weather forecasts trained and initialised directly from observations. arXiv preprint arXiv:2407.15586 (2024).
Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16) 785–794 (Association for Computing Machinery, 2016).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Zhang, Z. Improved Adam optimizer for deep neural networks. In 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS) 1–2 (IEEE, 2018).
Caruana, R. & Niculescu-Mizil, A. An empirical comparison of supervised learning algorithms. In Proc. 23rd international conference on Machine learning (ICML '06) 161–168 (Association for Computing Machinery, 2006).
Grinsztajn, L., Oyallon, E. & Varoquaux, G. Why do tree-based models still outperform deep learning on typical tabular data? Adv. Neural Inf. Process. Syst. 35, 507–520 (2022).
Google Scholar
Muñoz-Sabater, J. et al. Era5-land: a state-of-the-art global reanalysis dataset for land applications. Earth Syst. Sci. Data 13, 4349–4383 (2021).
Article ADS MATH Google Scholar
Cartus, O., Santoro, M., Wegmüller, U., Labrière, N. & Chave, J. Sentinel-1 coherence for mapping above-ground biomass in semiarid forest areas. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021).
Article Google Scholar
Agustí-Panareda, A. et al. A biogenic CO₂ flux adjustment scheme for the mitigation of large-scale biases in global atmospheric CO₂ analyses and forecasts. Atmos. Chem. Phys. 16, 10399–10418 (2016).
Article ADS MATH Google Scholar
Nelson Jr, R. M. Predicting behavior of the 10-hour timelag fuel moisture model. Int. J. Wildland Fire 10, 215–222 (2000).
MATH Google Scholar
Lopez, P. A lightning parameterization for the ecmwf integrated forecasting system. Mon. Weather Rev. 144, 3057–3075 (2016).
Article ADS MATH Google Scholar
Giglio, L., Schroeder, W. & Justice, C. O. The collection 6 modis active fire detection algorithm and fire products. Remote Sens. Environ. 178, 31–41 (2016).
Article ADS PubMed PubMed Central MATH Google Scholar
Van Wagner, C. Method of computing fine fuel moisture content throughout the diurnal cycle. Inf. Rep. PS-X-69 (1977).
Groot, W. J. et al. A comparison of canadian and russian boreal forest fire regimes. For. Ecol. Manag. 294, 23–34 (2013).
Article MATH Google Scholar
Groot, W. J. et al. Calibrating the fine fuel moisture code for grass ignition potential in sumatra, indonesia. Int. J. Wildland Fire 14, 161–168 (2005).
Article MATH Google Scholar
Field, R. D. et al. A drought-based predictor of recent haze events in western indonesia. Atmos. Environ. 38, 1869–1878 (2004).
Article ADS CAS MATH Google Scholar
McArthur, A. Forest Fire Danger Meter Mk5 (Forest Research Institute, Forestry and Timber Bureau, Canberra, 1973).
keetch, J. J. & Byram, G. M. F. A Drought Index for Forest Fire Control. USDA Forest Service Research Paper SE-38 33 (Southeastern Forest Experiment Station, Asheville, NC, USA, 1968).
de Groot, W. J., Field, R. D., Brady, M. A., Roswintiarti, O. & Mohamad, M. Development of the Indonesian and Malaysian fire danger rating systems. Mitig. Adapt. Strateg. Glob. Change 12, 165–180 (2007).
Article Google Scholar
Wigneron, J.-P. et al. SMOS-IC data record of soil moisture and L-VOD: historical development, applications and perspectives. Remote Sens. Environ. 254, 112238 (2021).
Article MATH Google Scholar
Boussetta, S. et al. Ecland: the ECMWF land surface modelling system. Atmosphere 12, 723 (2021).
Article ADS CAS MATH Google Scholar
McNorton, J. et al. An urban scheme for the ecmwf integrated forecasting system: global forecasts and residential CO₂ emissions. J. Adv. Model. Earth Syst. 15, 2022–003286 (2023).
Article MATH Google Scholar
Hersbach, H. Decomposition of the continuous ranked probability score for ensemble prediction systems. Weather Forecast. 15, 559–570 (2000).
Article ADS MATH Google Scholar
Meijer, J. R., Huijbregts, M. A., Schotten, K. C. & Schipper, A. M. Global patterns of current and future road infrastructure. Environ. Res. Lett. 13, 064006 (2018).
Article ADS MATH Google Scholar
Van Wagner, C. et al. Equations and FORTRAN Program for the Canadian Forest Fire Weather Index System Vol. 33 (Canadian Forestry Service, Headquarters, Ottawa, Canada, 1985).

Download references

Acknowledgements

F.D.G. and J.M. are funded by the Copernicus Emergency Management Service contract no. 942604 between the Joint Research Centre and ECMWF. The discussion section has benefited from several informal conversations about the role of process-based and data-driven forecasts with colleagues at both ECMWF and ESA over the past years. We also acknowledge the discussion on the role of physical base fire science and the technological advancement of ML at the the GOFC-GOLD Fire Implementation meeting hosted in Canada in 2023. Credits: Graphical elements of Fig. 1 were designed by Freepik (https://www.freepik.com).

Author information

These authors contributed equally: Francesca Di Giuseppe, Joe McNorton.

Authors and Affiliations

ECMWF, European Centre for Medium-range Weather Forecast, Shinfield park, Reading, RG29AX, UK
Francesca Di Giuseppe, Joe McNorton, Anna Lombardi & Fredrik Wetterhall

Authors

Francesca Di Giuseppe
View author publications
Search author on:PubMed Google Scholar
Joe McNorton
View author publications
Search author on:PubMed Google Scholar
Anna Lombardi
View author publications
Search author on:PubMed Google Scholar
Fredrik Wetterhall
View author publications
Search author on:PubMed Google Scholar

Contributions

F.D.G. and F.W. conceived the idea. J.M. implemented the experiments. F.D.G. and J.M. performed the data analysis. A.L. curated the graphical display and generated the infographic displayed in Fig. 1. F.D.G. wrote the paper. All authors contributed to the interpretation of the results and revised the manuscript.

Corresponding authors

Correspondence to Francesca Di Giuseppe or Joe McNorton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jiafu Mao and the other, anonymous, reviewer for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Di Giuseppe, F., McNorton, J., Lombardi, A. et al. Global data-driven prediction of fire activity. Nat Commun 16, 2918 (2025). https://doi.org/10.1038/s41467-025-58097-7

Download citation

Received: 21 October 2024
Accepted: 11 March 2025
Published: 01 April 2025
DOI: https://doi.org/10.1038/s41467-025-58097-7

This article is cited by

Being proactive about anthropogenic environmental changes: augmenting students’ decision making with artificial intelligence (AI) technology
- Xi Xiang
- Michael E. Meadows
Educational technology research and development (2025)