Global Crop-Specific Fertilization Dataset from 1961–2019

Coello, Fernando; Decorte, Thomas; Janssens, Iris; Mortier, Steven; Sardans, Jordi; Peñuelas, Josep; Verdonck, Tim

doi:10.1038/s41597-024-04215-x


Data Descriptor
Open access
Published: 09 January 2025


Scientific Data volume 12, Article number: 40 (2025)


An Author Correction to this article was published on 17 February 2025

This article has been updated

Abstract

As global fertilizer application rates increase, high-quality datasets are paramount for comprehensive analyses to support informed decision-making and policy formulation in crucial areas such as food security or climate change. This study aims to fill existing data gaps by employing two machine learning algorithms, eXtreme Gradient Boosting and HistGradientBoosting, to produce precise country-level predictions of nitrogen (N), phosphorus pentoxide (P₂O₅), and potassium oxide (K₂O) application rates. Subsequently, we created a comprehensive dataset of 5-arcmin resolution maps depicting the application rates of each fertilizer for 13 major crop groups from 1961 to 2019. The predictions were validated both by comparing them with existing databases and by assessing the drivers of fertilizer application rates using the models' SHapley Additive exPlanations. This extensive dataset is poised to be a valuable resource for assessing fertilization trends, identifying the socioeconomic, agricultural, and environmental drivers of fertilizer application rates, and serving as an input for various applications, including environmental modeling, causal analysis, fertilizer price predictions, and forecasting.

Background & Summary

Inorganic fertilizers are essential for replenishing the nutrients that are removed from soils during crop harvesting. The three main nutrients provided by fertilizers, nitrogen (N), phosphorus (P) and potassium (K), play a key role in plant functions. While N and P, which are basic components of nucleotides, proteins and membrane lipids, are essential in energy metabolism^1,2, K is essential for the transportation of water, metabolites, and nutrients across plant tissues, for defense against oxidative stresses, and for the maintenance of osmotic homeostasis^3,4. Although the first commercial inorganic fertilizers were developed in 1843, they were not the main anthropogenic inputs in the N, P, and K biochemical cycles until the second half of the 20th century⁵. Today, inorganic fertilizers are the primary nutrient input in croplands, supplying more than double the second-largest human input, manure⁵, and also serve as one of the main N inputs for grasslands⁶. This substantial surge during the 20th century not only facilitated the rapid growth in human population, but also had ecological and socioeconomic ramifications, such as water eutrophication, soil degradation, climate change, and mineral resource depletion^7,8. In the remainder of this study, the term ‘fertilizer’ will refer to inorganic fertilizers, and all data and results regarding P and K will be presented in their oxide forms (P₂O₅ and K₂O, respectively), in accordance with common references in international standards and regulations.

Given their food security, socioeconomic and environmental implications, considerable research has been conducted to discern the temporal and regional trends in the use of N, P₂O₅, and K₂O^6,9,10,11,12. Nevertheless, the limited availability of temporal global spatial information regarding their application across various crops has restricted these analyses to a few global and regional studies that primarily focused on N^9,13,14. These studies initially estimated consumption at the country- and state-level using simple equations, based on a few crop-specific fertilization features and changes in crop surface area^9,13,14, or using Bayesian Markov Chain Monte Carlo modeling¹⁵. A global, crop-specific fertilization dataset is crucial for understanding crop nutrient management practices worldwide, identifying past trends and current gaps in fertilization, guiding agricultural policies to improve crop yields while minimizing environmental impacts, and providing input data for modeling⁹. Therefore, we aim to address this knowledge gap by providing insights into the application rates of P₂O₅ and K₂O while also seeking to improve estimates for N.

To accomplish this objective, we began by updating the panel datasets on cropland fertilization, enhancing the most comprehensive database, developed by Ludemann et al.¹⁶, by incorporating global datasets covering data from the 1970s and 1980s^17,18 and country-specific data for European countries from 2001 to 2014^19,20,21,22. This compilation process led to a 35% expansion of the Ludemann et al.¹⁶ database. Second, the dataset was expanded with data on various potential socioeconomic, environmental, and agricultural drivers of cropland fertilization. Third, two machine learning (ML) regression models, eXtreme Gradient Boosting (XGB)²³ and HistGradientBoosting (HGB)²⁴, both capable of handling the prevalent missing values within the dataset²⁴, were applied to predict N, P₂O₅, and K₂O fertilizer application rates for the different crop classes over 60 years. Since these models are considered black-box models, feature importance was assessed using SHapley Additive exPlanations (SHAP)²⁵ values to identify the global socioeconomic, agricultural, and environmental drivers of cropland fertilization and to validate the ML models. Fourth, the predictions were validated on national databases. However, since the ML models were trained on global data, which show a discrepancy with the national data, the model predictions were first adjusted to match the total annual country-level N, P₂O₅, and K₂O use in agricultural land, similar to previous studies^{9,10,11,12,26}. Crucial in this adjustment was the fraction of total country-level fertilizer use allocated to grasslands and fodder crops, as an important portion of total fertilizer use in some countries is devoted to these areas, and few previous estimates existed^5,26,27,28, especially for K₂O⁵. Therefore, these fractions were estimated by reviewing scientific and technical information from 75 countries. The adjusted predictions were then validated using national databases of fertilizer application rates at the crop-level. Finally, the results were spatially allocated using crop maps of the year 2000, developed by Monfreda et al.²⁹; the annual harvested area of each crop class in each country; and the spatial changes in cropland surface based on the Hyde v3.3 project³⁰.

Methods

The following section outlines the comprehensive methodology that was adopted in this study. The methodology encompasses various stages, including the collection and aggregation of different datasets and the compilation into a unified dataset, as well as all preprocessing steps that were carried out. Additionally, we introduce the ML models used in this study, as well as the respective training and evaluation procedures. Furthermore, we discuss the measures that were undertaken to explain the predictions made by the ML models. Following this, we describe how we used the predictions to create detailed maps of global fertilizer application rates. Finally, we explain how we assessed the validity and plausibility of the dataset derived from our study.

Data collection and preprocessing

Data collection

Fertilizer application rate by crops. To compile a consistent and detailed dataset of fertilizer application rates for different crops, countries, and years, 14 global datasets^{16,17,18,19,20,21,22,31,32,33,34,35,36,37} were used. We set aside national databases, such as those of the USA³⁸ and India^{39,40,41,42,43,44,45}, to construct a homogeneous database. This approach avoids multiple year-nutrient-crop-country entries from both global and national databases, and allows us to retain external databases for validating the ML model predictions. To standardize all these datasets and minimize data loss, we classified all crop types into 13 crop groups (wheat, maize, rice, other cereals, soybean, palm fruit, other oilseeds, vegetables, fruits, roots and tubers, sugar crops, fiber crops and other crops) (Table 1), in alignment with the ICC Version 1.1⁴⁶.

Table 1 Crop Classification with FAOSTAT Item Codes.


During the 1980s, the IFDC published two reports^17,18 with crop-specific FUBC data (hereinafter referred to as FUBC-IFDC). After the crop grouping, these publications included data for 459 country-crop-year combinations (kg ha⁻¹ of N, P₂O₅, and K₂O) from 83 countries for 1973–1988. During the 1990s, the FAO, in collaboration with the fertilizer industry (IFDC and IFA), published five crop-specific datasets of fertilizer application rates (hereinafter referred to as FUBC-FAO). After grouping the data, these publications included 1693 year- and crop-specific fertilizer application rates (kg ha⁻¹ of N, P₂O₅, and K₂O) from 108 countries for 1984–2002, although most of the data (98%) covered 1988–2002. The data were collected using questionnaires from governmental agencies, members of industry companies, agronomists, and economic experts. In both datasets (FUBC-IFDC and FUBC-FAO), the use of fertilizer for each combination of nutrient, crop, country, and year was provided in two ways: (a) as the average application rate of a fertilizer over total cropland area, and (b) as the percentage of fertilized cropland area and the application rate in that area. We transformed all data to the average application rate by multiplying the percentage of fertilized area by the application rate in that area. The data were either from a year (e.g., 1996) or a season (e.g., 1996/97). For seasonal data, we considered the starting year of the season as the year of the data in the analyses. When data for the same nutrient, crop, country, and year appeared in more than one report, the data were taken from the most recent report. Data for crop, country, and year that were divided into crop varieties or management practices (e.g., irrigated or rain-fed rice, or soft or durum wheat) were aggregated and weighted by the area of the crop class included in the report. Data for sweet maize, or corn, assumed to refer to Zea mays var. saccharata, and data for silage maize were excluded because FAOSTAT reports only the harvested area for maize grain. Values for the crop groups were derived from individual crops in two cases: when more than 90% of the harvested area (based on FAOSTAT data⁴⁷) was dedicated to the production of a single crop, that crop's value was assigned to the entire group; and when data for a combination of crops were available, their weighted average was assigned to the entire group.
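The two transformations described above (converting a rate on the fertilized area to an average rate, and aggregating varieties by area) can be sketched as follows. This is a minimal illustration, not the authors' code; the function names and all numeric values are hypothetical.

```python
def average_rate(rate_on_fertilized_area, percent_area_fertilized):
    """Average application rate (kg/ha) over the whole crop area.

    rate_on_fertilized_area: kg/ha applied on the fertilized part only.
    percent_area_fertilized: share of the crop area fertilized, in percent.
    """
    return rate_on_fertilized_area * percent_area_fertilized / 100.0


def area_weighted_rate(entries):
    """Area-weighted mean rate for sub-classes of one crop.

    entries: list of (rate_kg_ha, area_ha) pairs, e.g. irrigated and
    rain-fed rice reported separately in the same report.
    """
    total_area = sum(area for _, area in entries)
    if total_area <= 0:
        raise ValueError("total area must be positive")
    return sum(rate * area for rate, area in entries) / total_area


# Hypothetical values, for illustration only.
irrigated = average_rate(120.0, 80.0)   # 120 kg/ha on 80% of the area
rainfed = average_rate(60.0, 50.0)      # 60 kg/ha on 50% of the area
rice = area_weighted_rate([(irrigated, 3000.0), (rainfed, 1000.0)])
print(irrigated, rainfed, rice)
```

With these made-up inputs the irrigated and rain-fed averages are 96 and 30 kg ha⁻¹, and the area-weighted rice value is 79.5 kg ha⁻¹.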

Since the last FAO publication, IFA has released five reports detailing the total amount of N, P₂O₅, and K₂O used for various crop classes, providing yearly or seasonal data spanning from 2006 to 2018^16,36,37,48 (hereinafter referred to as FUBC-IFA). Initially covering 11 crop types, these reports expanded to 14 types in the fourth report. They encompassed information for the European Union (EU) as a whole as well as 27 other countries. In 2022, Ludemann et al. published a more comprehensive dataset covering data for 66 countries, featuring EU data at the country scale, and information for 20 crop classes¹⁶. This report also included the FUBC-FAO data for the 1990s and prior data from IFA. However, small discrepancies between the original FUBC-FAO data and the version compiled by Ludemann et al.¹⁶ prompted us to retain the original FUBC-FAO information. To estimate the average application rate for each combination of crop, country, and year, we divided the total amount of each fertilizer used by the harvested area provided by FAOSTAT⁴⁷. As in previous research, we used the harvested area as a proxy for the crop’s annual surface in each country^9,10. It is worth noting that the average application rate for maize was slightly overestimated because FUBC-IFA included the amount applied to silage maize. According to Maiz’Europ’⁴⁹, the current area of forage maize crops is 17.3 million ha (approximately 1% of the total area of maize crops in 2020) with the European Union as the most important producer of silage maize, with 6 million ha. We utilized the available raw data from Ludemann et al.¹⁶, adopting the grouping methods used for the FUBC-FAO and FUBC-IFDC datasets, and omitted certain countries where values were estimated based on the previous report and changes in crop surface. For the EU countries, Norway and the UK, four unpublished datasets from FE spanning 2001–2015 (referred to as FUBC-EFMA)^19,20,21,22 were used. These datasets offered similar information to the FUBC-FAO publications for the EU countries, the UK and Norway and allowed us to exclude the fertilizer application to silage maize, which is important in the EU⁴⁹. However, FUBC-EFMA datasets lacked individual crop classes for rice and soybeans, resulting in missing data at the country-level for these crops since 2000 in EU countries.
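For the FUBC-IFA totals, the average rate is simply the total nutrient amount divided by the harvested area. The sketch below assumes, for illustration only, that totals are expressed in tonnes and areas in hectares; the function name and the example values are hypothetical.

```python
def rate_from_total(total_tonnes, harvested_area_ha):
    """Average application rate (kg/ha) from a national total.

    total_tonnes: nutrient applied to the crop (assumed unit: tonnes).
    harvested_area_ha: harvested area used as a proxy for the crop surface.
    Returns None when no harvested area is reported.
    """
    if harvested_area_ha is None or harvested_area_ha <= 0:
        return None
    return total_tonnes * 1000.0 / harvested_area_ha


# Hypothetical example: 150,000 t of N applied to 2,000,000 ha of wheat.
wheat_n_rate = rate_from_total(150_000.0, 2_000_000.0)
print(wheat_n_rate)
```

Returning `None` for a missing or zero area keeps such records out of the panel rather than producing an infinite or undefined rate.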

The resulting dataset included data for the average fertilizer application for 3712 combinations of 13 crop classes, 114 countries, and years from 1973 to 2018. For most of the combinations of countries and crops, data were available for only a few years (on average, a country-crop combination had data for 4.1 ± 2.9 years, and 64% of the combinations had five or fewer years with available data).

In order to later validate our estimations, we compiled a series of national databases. National data were quite limited, as only a few countries conduct surveys to study fertilizer management across different crops. The two countries with the most available data were the USA³⁸ and the UK⁵⁰, which collected long time series on cropland fertilization for the three primary nutrients. The USA dataset³⁸ contains fertilization information for four crops (cotton, maize, soybean, and wheat) dating back to 1964. To compare with our predictions, we converted all data to average kg ha⁻¹. Additionally, based on the same surface threshold used for global datasets, we assumed that the application rate for cotton was equivalent to that of the entire fiber crops class. The UK dataset⁵⁰ provides data for four crop classes (roots and tubers, other oilseeds, sugar crops, and wheat) starting from 1998 for the three nutrients across all of Great Britain. We also compiled existing information from several Asian countries, including India, the Philippines, and Pakistan^{39,40,41,42,43,44,45,51,52}. The datasets from India^{39,40,41,42,43,44,45} and Pakistan⁵¹ did not require additional preprocessing, as they provided the data in average kg ha⁻¹. However, the dataset from Pakistan presented the information for all three nutrients combined⁵¹. For the dataset from the Philippines, which covers rice and maize, we converted the raw data on the regional number of 50 kg bags per hectare of different fertilizers to N and P₂O₅ using the country-specific fertilizer nutrient information⁵³. Finally, we also compiled existing data from Sweden^54,55,56,57 and New Zealand⁵⁸. The data for P and K in the Swedish dataset, initially reported in elemental form, were converted to their oxide forms (P₂O₅ and K₂O) by multiplying by the corresponding molecular-weight ratios.
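The element-to-oxide conversion mentioned for the Swedish data follows from standard atomic masses. The sketch below derives the two factors; the atomic masses are rounded standard values, and the variable and function names are illustrative rather than taken from the study.

```python
# Rounded standard atomic masses (g/mol).
M_P, M_K, M_O = 30.974, 39.098, 15.999

# One mole of P2O5 contains two moles of P; one mole of K2O two moles of K.
P_TO_P2O5 = (2 * M_P + 5 * M_O) / (2 * M_P)   # about 2.29
K_TO_K2O = (2 * M_K + M_O) / (2 * M_K)        # about 1.20


def to_oxide(rate_kg_ha, nutrient):
    """Convert an elemental P or K rate (kg/ha) to its oxide equivalent."""
    factors = {"P": P_TO_P2O5, "K": K_TO_K2O}
    return rate_kg_ha * factors[nutrient]


print(round(P_TO_P2O5, 3), round(K_TO_K2O, 3))
print(to_oxide(10.0, "P"), to_oxide(10.0, "K"))
```

For example, 10 kg ha⁻¹ of elemental P corresponds to roughly 22.9 kg ha⁻¹ of P₂O₅.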

Fertilizer use in other agricultural lands. An important step in the methods involves adjusting ML model predictions to national-level fertilizer use. For this adjustment, we used the FAOSTAT database of annual fertilizer use at the country level⁵⁹. This database includes data on all fertilizer use for agricultural lands, covering both croplands and grasslands⁵⁹. However, the crops included in the ML models, as well as in the FAOSTAT harvested area data⁴⁷, cover neither grasslands (whether permanent or temporary) nor fodder crops such as silage maize or fodder beet. Therefore, the primary goal of this section is to estimate the fraction of total fertilizer used for these types of agricultural lands.
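One simple way to perform such an adjustment is proportional scaling: remove the grassland and fodder share from the national total, then rescale all crop-level predictions by a single factor so that they sum to the remaining cropland total. The text does not specify the exact procedure, so the sketch below is an assumption; the function name, crop names, and numbers are hypothetical.

```python
def adjust_to_national_total(pred_rates, areas, national_total_kg,
                             grass_fodder_share):
    """Rescale predicted rates so they sum to the national cropland total.

    pred_rates: {crop: predicted rate in kg/ha}
    areas: {crop: harvested area in ha}
    national_total_kg: total national use of one nutrient (kg)
    grass_fodder_share: fraction (0-1) used on grasslands and fodder crops
    Assumption: one common scaling factor is applied to all crops.
    """
    cropland_total = national_total_kg * (1.0 - grass_fodder_share)
    predicted_total = sum(pred_rates[c] * areas[c] for c in pred_rates)
    if predicted_total <= 0:
        raise ValueError("predicted total must be positive")
    factor = cropland_total / predicted_total
    return {crop: rate * factor for crop, rate in pred_rates.items()}


# Hypothetical country with two crops and 20% of use on grasslands.
adjusted = adjust_to_national_total(
    pred_rates={"wheat": 100.0, "maize": 150.0},
    areas={"wheat": 1000.0, "maize": 2000.0},
    national_total_kg=600_000.0,
    grass_fodder_share=0.2,
)
print(adjusted)
```

In this made-up case the predictions sum to 400,000 kg against a cropland total of 480,000 kg, so every rate is multiplied by 1.2. The sketch shows why the grassland and fodder fraction matters: an error in that share propagates directly into every adjusted crop rate.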

Data regarding fertilizer application rates for grasslands and fodder crops are even scarcer than data for other croplands. Additionally, FAOSTAT lacks information about the surface of the majority of the fodder crops⁴⁷. Therefore, the methods used for estimation may not be as accurate as those used for other agricultural lands. Here, we reviewed technical information, such as the FUBC compiled reports^{16,17,18,19,20,21,22,31,32,33,34,35,37}, and scientific information from countries where the fertilization of grasslands was considered to be higher than 1% of the total fertilizer consumption in previous research^5,6,26,27,28. Previous research typically focused only on permanent grassland fertilization, as the goal was to distinguish agricultural fertilizer usage between arable land (croplands and temporary grasslands) and non-arable land (permanent grasslands)^5,27,28. However, we included in the estimation the proportion of fertilizer used for temporary grasslands and fodder crops for two main reasons: 1) our main goal was to distinguish agricultural fertilizer usage between all croplands included in the thirteen crop classes defined in the previous section and the rest of the agricultural land, 2) the majority of data available in the compiled global reports give information about all grasslands and fodder crops together^{16,17,18,19,20,21,22,31,32,33,34,35,36,37}. The quantity estimated was the annual proportion of each country's agricultural N, P₂O₅, and K₂O fertilizer use that was applied to grasslands and fodder crops. Depending on the available information, we made the assessment at the country or regional level. In total, we reviewed scientific and technical reports for 75 countries. As in previous research^5,26,27,28, the methods used for estimating the share of N, P₂O₅, and K₂O usage for grasslands and fodder crops varied between countries and regions depending on the available information. Therefore, for every country, we explain the decisions taken based on the available data, to make the estimations as transparent as possible. Moreover, we included a summary table (Table 2) with the sources used for estimating the range of values used for each country.

Table 2 Fraction of N, P₂O₅, and K₂O allocated for grasslands and fodder crops.


Argentina: In the 1960s, fertilizer application in Argentina was primarily directed towards sugar cane and citrus⁶⁰, with minimal application to grasslands, nearly zero in 1964⁶⁰. Throughout the 1970s and 1980s, the fertilizer application rate remained low, although there was a notable increase in P₂O₅ application to grasslands, reaching 28% of country consumption in 1979¹⁷. The substantial expansion in N and P₂O₅ fertilizer use occurred during the 1990s, leading to a slight rise in the share of N used for grasslands, and to a significant decrease in the P₂O₅ share for grasslands^{31,32,33,34,35}. To fill data gaps, we adopted a methodology similar to Lassaletta et al.²⁷, utilizing linear interpolation of national^{61,62,63,64,65,66,67,68,69} and global datasets for the years lacking data, with grasslands’ fertilizer share assumed as 0 in 1965⁶⁰. Although this reconstruction has limitations, the alternative of setting the share to 0 for the whole period, as done in FAO nutrient budgets⁵, may underestimate fertilizer application to grasslands, particularly for P₂O₅. K₂O fertilizer application in Argentina remains minimal due to soil composition, with all reports except one considering it as 0 in the use for grasslands and fodder crops^{17,18,31,32,33,34,35,61,62,63,64,65,66,67,68,69}.

Brazil: According to several sources, the use of fertilizer in Brazil’s grasslands has been very low^70,71. The most recent values reported by IFA in 2014 and 2018 indicate that less than 1% of the fertilizer used in Brazil is used in grasslands^16,37. However, Lassaletta et al.²⁷ and FAO⁵ considered higher percentages for N and K₂O based on regional averages²⁷ or previous research⁵. For P₂O₅ and K₂O, only FAO includes an estimation, considering 0 for P₂O₅, while they estimate the K₂O consumption by calculating the average between N and P₂O₅ consumption⁵. We have decided to consider 0 as the share used for grasslands and fodder crops due to the latest reported values and considering that no information is reported in previous reports^{16,17,18,31,32,33,34,35,37}.

Canada: Most of the compiled reports do not provide information about the use of fertilizers for fodder crops and grasslands^{17,18,31,32,33,34,35}. The latest report, with 2018 data, indicated that 0.5% of N, 0.9% of P₂O₅, and 0.6% of K₂O fertilizers were allocated to permanent grasslands, which increased to 12%, 14.5%, and 25% respectively when considering tame hay and silage maize as well¹⁶. Regarding N, FAO⁵ and the 2014 estimation by Lassaletta et al.²⁷ are consistent with the 2018 estimation for all forages. However, the values for P₂O₅ and K₂O for all forages in the latest report differ significantly from those used by FAO⁵ (0% for P₂O₅ and 5% for K₂O). This discrepancy in P₂O₅ may be due to FAO’s reliance on Heffer et al.³⁷ which does not consider nongrass perennial crops 0%⁷², and the discrepancy for K₂O because FAO considered the average value between N and P₂O₅⁵. We decided to use the percentages for all forages included in the last report¹⁶ for the entire period, keeping the values constant because the data were insufficient to estimate any trends. Additionally, in 1974, Beaton and Berger noted that a significant share of fertilizer used in Canada was for forages, estimating that 45% of total use in 1970 was for hay and grazing grasslands⁷³. They suggested that their figure might be an overestimate; however, it is unlikely that the fraction of fertilizers used for forages was 0 between 1960 and 1990.

Chile: Based on the estimations of the FAO and IFA reports, Lassaletta et al.²⁷ and FAO⁵ considered a significant share of fertilizer used for grasslands. For N, Lassaletta et al.²⁷ suggested an increasing percentage from 0% in 1960 to 20% in 2005, while FAO maintained a constant percentage of 20%. For P₂O₅ and K₂O, the values used by FAO were also high, at 30% and 25% respectively. However, for Chile, using a constant value for the period overestimates the early years, as the share used for grasslands for N and P₂O₅ was only 1% at the beginning of the 1960s⁷⁴. We therefore decided to make a reconstruction similar to the one demonstrated by Lassaletta et al.²⁷, by considering 1% as the starting share for each nutrient, and incorporating the reported values for all grasslands^{16,17,18,31,32,33,34,35,37}.

Dominican Republic: The values reported in global studies from the 1990s indicate that during this decade, the percentage of fertilizer applied to grasslands and fodder crops ranged between 2% and 4%^31,33,34. Considering these findings, Lassaletta et al.²⁷ allocated values ranging from 0% to 2% for N. We have chosen to utilize the average values from the three reports^31,33,34 for the period 1990–2020. This decision was influenced by the lack of available data since 1997, and by the emergence of pasture fertilization as a new and increasing practice during the 1990s⁷⁵.

Mexico: The use of fertilizer for grasslands and fodder crops appears to be nearly zero, as indicated by previous research^5,27 and reported values^16,31,35,37. The only relevant fertilizer use for grasslands and fodder crops in Mexico appears to be P₂O₅ applied in alfalfa production^31,35,76. Due to limited available information, and the longstanding presence of alfalfa production in Mexico since the Spanish colonization, we opted to consider the average percentage (2.5%) used in the two reports with data for alfalfa^31,35.

United States of America: According to global and national estimates from previous research, the share of N used for grasslands during the period ranged from 0% to 20% of the total^5,13,27. For P₂O₅ and K₂O, the most recent estimation from FAO indicated a constant share of 0% for phosphorus and 10% for potassium⁵. To estimate the total fertilizer use for permanent and non-permanent grasslands from 1959 to 2014, we used all the available data^{31,33,37,73,77}. In many sources, the information for grasslands is combined, encompassing both permanent and non-permanent grasslands. We used linear interpolation to estimate the share used for all grasslands together, replicating the method from the most recent estimation¹³. However, we included data from three additional years (1974, 1992, 1996)^31,33,73, and also extended the estimation to cover P₂O₅, and K₂O.

Uruguay: Grassland fertilization was actively promoted by the Uruguayan government during the 1960s⁷⁸. As early as 1963, one-third of the fertilizer used in the country was applied to pastures, with a focus on P₂O₅ due to the low P content of the Uruguayan soils⁷⁸. These trends are reflected in the first IFDC report, which allocated 45% of the P₂O₅ used in the country for grasslands and fodder in the year 1986¹⁸. However, this share decreased to 22% by 2018. In contrast, the percentage of N used for grasslands has shown an increasing trend, from almost 0% in 1986¹⁸ to 12% in recent years^16,35. K₂O is not used for these agricultural lands in the country^16,18,31,35. Given the significant variation in percentages between decades and nutrients, we performed linear interpolation considering 33% for P₂O₅ in 1960, and 0% for N as starting points, and all the values included in the reports^16,18,31,35.
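The linear interpolation used for several countries can be sketched as below. For illustration, the example uses only the three P₂O₅ values for Uruguay quoted in the text (33% in 1960, 45% in 1986, 22% in 2018); the actual reconstruction also uses the other report values, which are not reproduced here. The function name is hypothetical, and holding the share constant outside the anchor years is an assumption of this sketch.

```python
def interpolate_share(anchors, year):
    """Linearly interpolate a grassland/fodder share between anchor years.

    anchors: {year: share in percent} from reports or literature.
    Years outside the anchor range take the nearest anchor value
    (an assumption of this sketch).
    """
    points = sorted(anchors.items())
    if year <= points[0][0]:
        return points[0][1]
    if year >= points[-1][0]:
        return points[-1][1]
    for (y0, s0), (y1, s1) in zip(points, points[1:]):
        if y0 <= year <= y1:
            return s0 + (s1 - s0) * (year - y0) / (y1 - y0)


# P2O5 share for grasslands and fodder in Uruguay, values quoted in the text.
uruguay_p2o5 = {1960: 33.0, 1986: 45.0, 2018: 22.0}
share_1973 = interpolate_share(uruguay_p2o5, 1973)
share_2002 = interpolate_share(uruguay_p2o5, 2002)
print(share_1973, share_2002)
```

With these three anchors, 1973 (halfway between 1960 and 1986) gives 39%, and 2002 (halfway between 1986 and 2018) gives 33.5%.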

Venezuela: Information regarding grassland and fodder crop fertilization in Venezuela is limited. Due to the scarcity of data and discrepancies between reported values^33,35, FAO has considered a fertilization share of 0% for grasslands during the specified period. Conversely, Lassaletta et al.²⁷ proposed shares between 0% and 9% from 1960 to 2009 for N. Given the challenge of determining the most appropriate criteria, we opted to adhere to the FAO considerations. This decision is supported by the low optimal rates recommended by the government for grasslands compared to croplands⁷⁹, along with scientific evidence suggesting minimal fertilization for warm-climate grasslands^79,80.

Australia: According to Lassaletta et al.²⁷, the share of N used for grasslands never exceeded 8.5%²⁷, which is similar to the 10% used by FAO in their nutrient budgets assessments⁵. Despite an intensification in the use of N in Australian grasslands over the past three decades⁸¹, it is noted that these grasslands were already being fertilized in the late 1950s, primarily with K₂O⁸². For instance, in 1956, 15% of the K₂O used in South Australia was directed towards pastures, a figure that rose to 42% by 1966⁸². Therefore, we have opted to consider a constant share of 6.4% for N use since 1960 derived from the mean value of the reports^{16,18,31,32,33,34,35,37}. Regarding P₂O₅ and K₂O fertilizer, it appears that the FAO estimations⁵ may have underestimated their use, particularly for K₂O. Thus, we decided to use the average value of all reports, because even with fluctuations, the variation in the reported values since 1985 is not too high, resulting in figures of 38.4 ± 4.1% for P₂O₅ and 41.6 ± 6.9% for K₂O^{16,18,31,32,33,34,35,37}.

New Zealand: Previous global research presented contradictory estimates of fertilizer application rate for grasslands in New Zealand^5,27, with figures ranging widely from 0% to 90%. However, both global and national reports consistently support the notion that the majority of the fertilizer application rate in the country is directed towards grasslands and fodder crops^{16,33,35,37,58}. Therefore, we have adopted a constant percentage throughout the entire period as grasslands have been the primary type of agricultural land developed in the country since the British colonization, their fertilization has been relevant since the early 20th century⁸³, and the fraction used for grassland has remained constant at least in the last 30 years^16,33,35,37. The percentages selected were derived from the average of global reports^16,33,35,37: 91.1 ± 1.4% for N, 93.0 ± 3.3% for P₂O₅, and 88.8 ± 4.4% for K₂O.

Europe: Between 1980 and 2000, Europe accounted for at least half of the N fertilizer used for grasslands and fodder crops, while consuming less than one-third of the total global fertilizer consumption⁶. Consequently, the available information was broader, and the methods applied could be more comprehensive. Einarsson et al.²⁸ provided the most comprehensive estimation for N in most European countries²⁸. They compiled and estimated the surfaces of croplands, including fodder crops, as well as temporary and permanent grasslands for the EU countries spanning from 1960 to 2019. Using their compiled data and the fertilizer application rate information from our study, we employed a similar methodology to estimate the fraction of N, P₂O₅, and K₂O used in these areas.

However, we extended the analysis to include fodder crops and all types of grasslands together, while also estimating P₂O₅, and K₂O. First, we used Eq. (1) to estimate the ratio (R_f–a) between the fertilization intensity of grasslands and fodder combined, and the fertilization intensity of all agricultural land for the years with available data:

$$\frac{{Q}_{f}}{{Q}_{a}}=\frac{{R}_{f}\times {A}_{f}}{{R}_{a}\times {A}_{a}}\to \frac{{Q}_{f}\times {A}_{a}}{{Q}_{a}\times {A}_{f}}=\frac{{R}_{f}}{{R}_{a}}={R}_{f-a}$$

(1)

where Q_f is the amount of fertilizer (N, P₂O₅, or K₂O) used for grasslands and fodder crops, Q_a denotes all the fertilizer of the same nutrient used in the country, A_f represents the grassland and fodder surface, A_a represents the total agricultural land, and R_f–a is the ratio between the fertilizer application rate per area of grasslands and fodder crops (R_f) and that of all agricultural land (R_a). Therefore, R_f–a expresses how intensively grasslands and fodder crops are fertilized relative to all agricultural land.
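Eq. (1) can be evaluated directly from the four quantities defined above. The following sketch uses hypothetical numbers; only the formula comes from the text.

```python
def intensity_ratio(q_f, q_a, a_f, a_a):
    """R_f-a from Eq. (1): (Q_f * A_a) / (Q_a * A_f).

    q_f: fertilizer used on grasslands and fodder crops
    q_a: fertilizer used on all agricultural land (same nutrient, same unit)
    a_f: grassland and fodder area
    a_a: total agricultural area (same unit as a_f)
    """
    if q_a <= 0 or a_f <= 0:
        raise ValueError("q_a and a_f must be positive")
    return (q_f * a_a) / (q_a * a_f)


# Hypothetical country: 20% of the fertilizer goes to grasslands and
# fodder, which occupy one third of the agricultural land.
r_fa = intensity_ratio(q_f=20.0, q_a=100.0, a_f=500.0, a_a=1500.0)
print(r_fa)
```

A value below 1, as in this example, means that grasslands and fodder crops receive less fertilizer per hectare than agricultural land on average.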

After estimating the annual R_f–a, we used two different procedures and equations depending on the years for which R_f–a data were available. If scientific literature and the observed variation in R_f–a indicated significant differences across the years, we performed a linear interpolation of the available values and then applied Eq. (2). Otherwise, we applied Eq. (3). To assess the variation in R_f–a, we estimated the mean absolute error (MAE) of the results derived from Eq. (3) compared with all the reported values. When the variation of R_f–a occurred only in some decades within the period, we combined Eqs. (2) and (3). Detailed explanations are provided for each country individually. For non-EU countries, we applied similar procedures as those used for the other continents. In Eqs. (2) and (3) presented below, $\overline{{R}_{f-a}}$ is the average R_f–a of all reports with data, and i is the year.

$$\frac{{Q}_{{f}_{i}}}{{Q}_{{a}_{i}}}={R}_{f-{a}_{i}}\times \frac{{A}_{{f}_{i}}}{{A}_{{a}_{i}}}$$

(2)

$$\frac{{Q}_{{f}_{i}}}{{Q}_{{a}_{i}}}=\overline{{R}_{f-a}}\times \frac{{A}_{{f}_{i}}}{{A}_{{a}_{i}}}$$

(3)
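The constant-ratio approach of Eq. (3) and its MAE check can be sketched as follows. The report values are hypothetical, the helper names are illustrative, and the sketch omits the standard deviation that accompanies the MAE values reported for each country.

```python
def share_from_ratio(r_fa, area_fraction):
    """Eqs. (2)/(3): Q_f/Q_a = R_f-a * (A_f/A_a) for one year."""
    return r_fa * area_fraction


def evaluate_constant_ratio(reports):
    """Fit Eq. (3) to reported shares and return (mean R_f-a, MAE).

    reports: list of (reported_share, area_fraction) pairs, both as
    fractions (0-1), one pair per year with available data.
    """
    ratios = [share / area for share, area in reports]
    mean_ratio = sum(ratios) / len(ratios)
    errors = [abs(share_from_ratio(mean_ratio, area) - share)
              for share, area in reports]
    mae = sum(errors) / len(errors)
    return mean_ratio, mae


# Hypothetical reports: (share of fertilizer on grasslands and fodder,
# share of agricultural land under grasslands and fodder).
reports = [(0.10, 0.25), (0.15, 0.30)]
mean_ratio, mae = evaluate_constant_ratio(reports)
print(mean_ratio, mae)
```

Here the two yearly ratios are 0.4 and 0.5, so the mean ratio is 0.45 and the MAE is about 0.014 (1.4 percentage points). A small MAE supports using a single average ratio, whereas large or systematic errors point to the interpolated ratio of Eq. (2).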

Austria: The methodology used by Einarsson et al.²⁸ results in an almost constant percentage of N used for permanent grasslands of ≈10% for the 1960–2019 period. This result led FAO to consider that 10% of fertilizer used in agricultural lands was used for permanent grasslands⁵. The intensification of grassland management began in the 1970s and 1980s⁸⁴, and the share used for grasslands was higher in the late 1970s than in the 1990s^17,31. For P₂O₅ and K₂O, FAO considered a constant 10% allocation for permanent grasslands⁵, based on previous estimations for P₂O₅²⁶ and, for K₂O, on the average of the fractions used for N and P₂O₅. While historical data suggest fluctuations in the percentage of fertilizers used for grasslands and fodder crops over time^{17,18,31,32,33,34,35}, the application of Eq. (3) using constant $\overline{{R}_{f-a}}$ values of 0.33 for N, 0.46 for P₂O₅, and 0.32 for K₂O, and surface changes²⁸, provided an MAE of 2.33 ± 3.09%, 3.87 ± 3.47%, and 3.31 ± 2.29%, respectively. Only two errors higher than 10% occurred, both underestimations, namely −11.8% for N in 1977¹⁷ and −10.2% for P₂O₅ in 1990³¹, suggesting higher R_f–a during the 1970–1990 period. Based on these results, we decided to utilize the mentioned $\overline{{R}_{f-a}}$ values for the period from 1991 to 2020 as well as for the period from 1961 to 1969. For the years from 1970 to 1990, we calculated the average R_f–a from the 1977 and 1990 reports^17,31 to minimize the errors during the period.

Belgium and Luxembourg: Belgium and Luxembourg are often reported as a single entity in historical statistics. Consequently, we adopted the same estimation for both countries. According to Einarsson et al.²⁸, the share of fertilizer applied to permanent grasslands ranged from 53% in the 1980s to 40% in recent years. They deem the N fertilization of permanent grasslands significant throughout the period based on the little available information they found²⁸. The same literature confirmed the use of P₂O₅ and K₂O for grasslands as early as 1955, although with slightly lower applications²⁸, as in the recent reports. The use of constant R_f–a values of 1.03 for N, 0.91 for P₂O₅, and 0.81 for K₂O, based on the technical report values^{16,17,19,20,21,22,31,32,33,34,35,36}, resulted in MAE values of 2.18 ± 1.82% for N, 5.46 ± 4.04% for P₂O₅, and 3.62 ± 2.51% for K₂O. Only two instances of overestimation exceeding 10% were observed, both for P₂O₅ in the last two reports^16,22. This may be linked with the enforcement of limits on P₂O₅ application in the Flanders region since 2011⁸⁵. Therefore, for P₂O₅ we decided to use the average R_f–a for the 1960–2010 period and a linear interpolation of the R_f–a values since 2011.

Czech Republic, Slovakia, and Czechoslovakia: Information regarding grasslands and fodder crops before the disintegration of the Czechoslovak Republic is very limited²⁸. Following assumptions made by Einarsson et al.²⁸, we extended the average $\overline{{R}_{f-a}}$ reported for the Czech Republic and Slovakia since 1993^16,21,22,35 through the period 1960–1992, considering surfaces changes, and the agricultural land of each country²⁸. Potential overestimations could occur for the early years, as the fertilization of these areas compared to other croplands might have been lower than in the 1990s, like in neighboring countries such as Hungary or Germany^86,87. After 1993, there are four years with available data for both countries^16,21,22,35. The R_f–a values for all years are similar for each nutrient in each country, so we used Eq. (3) to estimate the 1993–2020 period. This approach resulted in low deviations from the reported values for the Czech Republic (MAE = 2.08 ± 1.58% for N, 2.57 ± 1.30% for P₂O₅, 1.69 ± 1.47% for K₂O) and Slovakia (MAE = 1.49 ± 1.47% for N, 2.02 ± 2.87% for P₂O₅, 1.79 ± 2.13% for K₂O).

Denmark: Fertilization of Danish grasslands and fodder crops with N has a long history, with average rates of 45 and 17 kg ha⁻¹ for temporary and permanent grasslands, respectively, in the early 1960s⁸⁸. Using Eq. (3) for the whole period for the three nutrients resulted in large deviations (MAE = 8.89 ± 4.40% for N, 5.36 ± 3.71% for P₂O₅, 8.42 ± 5.67% for K₂O). Therefore, as the amount of available data in the compiled technical reports was large, we used Eq. (2) and the linear interpolation of all R_f–a values for the period 1980–2020^{16,17,19,20,21,22,31,32,33,34,35}. For the 1960–1980 period, we utilized N data from 9 years within that timeframe⁸⁸. To estimate the P₂O₅ and K₂O values for the 1960–1980 timeframe, we combined the available N data with the 1980–2020 relationship between the N $\overline{{R}_{f-a}}$ and the P₂O₅ or K₂O $\overline{{R}_{f-a}}$. We regard this assumption as reasonable because the only available information on P₂O₅ and K₂O for the period 1960–1980⁷³ suggests a relationship between N and the other nutrients in the application rates for all forages similar to that in the reported values since 1980^{16,17,19,20,21,22,31,32,33,34,35}.

Finland: Einarsson et al.²⁸ did not consider significant fertilization on permanent grasslands in Finland, as they mainly use arable land for forage production⁸⁹. However, fodder crops and temporary grasslands are key parts of the agricultural production in the country⁸⁹, and they are commonly fertilized^{16,17,19,20,21,22,31,32,33,34,35}. Using Eq. (3) for the entire period across the three nutrients resulted in minimal deviations for N and P₂O₅ (MAE = 1.57 ± 2.99%, 2.10 ± 3.34% respectively), but substantial deviations for K₂O (7.51 ± 7.38%). Given the substantial deviation for K₂O, and the large bias for R_f–a in 1979¹⁷ for N and P₂O₅, the first year with available data, we opted to use Eq. (2), and the linear interpolation of the R_f–a. However, potential deviations may arise for the 1960s, as fertilizers were predominantly utilized for high-value crops during the early part of the decade⁹⁰, yet no data are available for that period.

France: Data regarding grasslands and fodder crop fertilization are less limited than in the majority of EU countries, although large differences exist between the available sources. Two recent publications estimated the share of N and P₂O₅ used for permanent grasslands since 1960^28,91 based on country surveys at the regional level^92,93,94,95. However, their results differ from the FUBC-FAO and FUBC-FE reports^{16,19,20,21,22,31,32,33,34,35}. For example, for 2006, Le Noë et al.⁹¹ report a share of P₂O₅ used for permanent grasslands of 27%, whereas the FE reports a value for all grasslands of 20%. Considering other years with comparable data, such as 1990 or 2017, Einarsson et al.²⁸ estimate shares of 16% and 7%, respectively, for N used for permanent grasslands, while FAO only reports 6% for 1990, and the national survey reports 4.7% for 2017⁹⁵. Therefore, as it is difficult to discern which of the two estimations is more accurate, we opted to use the average between the linearly interpolated R_f–a data from the global datasets^{16,19,20,21,22,31,32,33,34,35} and from the national surveys^92,93,94,95, considering for both a share of 0 in 1955⁹¹ and the single estimate for the 1970s⁷³.

Germany: The availability of data since the German reunification is substantial in global reports^{19,20,21,22,31,32,33,34,35}. These reports suggest a decline since 1990 in fertilizer use for all forages compared to the rest of croplands, with the drop being particularly notable for N and P₂O₅. As a result, we decided to use Eq. (2) and interpolate the R_f–a values, instead of using $\overline{{R}_{f-a}}$, for the 1990–2020 period. For the 1960–1989 period, data on grassland and fodder fertilization are scarce and primarily pertain to West Germany⁷³. Most of the data available for the period relate to N, except the 1982 IFDC-FUBC report. For the 1960–1989 period, we decided to use the linear interpolation assuming, similar to the case of France, zero fertilization of grasslands and fodder crops in 1955, as fertilization of these areas in West Germany, where most of this agricultural land is located, was minimal before 1960⁸⁷, and using the only report with available data for the three nutrients¹⁷. We extrapolated the data from West Germany to the entire country due to data availability^17,28,73, the prevalence of these agricultural areas in Germany²⁸, and because grassland fertilization in East Germany was similar to that in West Germany, at least in the late 1970s⁹⁶. Using these approaches, we deviate by approximately 3.9% from the N estimated data for the year 1974⁷³. Additionally, we deviate by about 10% from the N value for permanent grasslands reported by Einarsson et al.²⁸ for 1966 (based on real data)²⁸. This deviation is reasonable, considering that the average difference between the Q_f/Q_a obtained using information only for permanent grasslands and that obtained for all forages is 7.9% for N^{19,20,21,22,31,32,33,34,35}.

Greece: Fertilization has not been considered for permanent grasslands in either previous research^5,27,28 or technical reports^{16,19,20,21,22,31,32,33,34,35}. However, we also consider fodder crops, and the technical reports allocate fertilizer to them, especially to alfalfa and silage maize^{16,19,20,21,22,31,32,33,34,35}, which are currently the two main fodder crops in the country²⁸. Therefore, we used Eq. (2) and the linear interpolation of R_f–a, because the values of the 1990s are lower than the current ones, and we assumed a zero level of fodder fertilization in 1960, as it was only experimental in the country⁹⁷.

Hungary: Einarsson et al.²⁸ did not consider fertilization for permanent grasslands due to the scarcity of the data and because grassland fertilization is not a common practice nowadays²⁸. Reported values suggest that a significant fraction, approximately 5%, of the fertilizer used since 1990 in the country was allocated to grasslands and fodder crops^{16,20,21,22,31}, with an even higher proportion during the 1980s¹⁷. Scientific information confirms that the change in the political regime in 1989 was a key driver of fertilization practices in the country, reducing fertilizer use five-fold and limiting fertilization of these areas to managed grasslands⁹⁸. Furthermore, fertilization in the country commenced in the 1960s and remained stagnant during the 1980s⁸⁶. Therefore, for the period 1960–1989, we applied Eq. (2) and the linear interpolation of R_f–a from a value of 0 in 1960 to the 1980 reported value¹⁷. For the 1990–2020 period, we used Eq. (3) and the average $\overline{{R}_{f-a}}$, as there is no deviation larger than 10% from the reported values using this method.

Ireland: Ireland is likely one of the countries that use a larger proportion of fertilizers for grasslands and fodder crops^5,27,28, and also has more available information. Since 1972, six national surveys have been conducted, providing data for 22 years^{99,100,101,102,103,104}. Moreover, the global datasets also include information from ten different years since 1987^{16,19,20,21,22,31,32,33,34,35}. For the 1986–2020 period, we used the average of the linear interpolation of the R_f–a values based on national surveys^{99,100,101,102,103,104}, and surfaces data^{105,106,107,108,109}, along with the R_f–a values based on the global datasets^{16,19,20,21,22,31,32,33,34,35} and the Einarsson et al.²⁸ surface compilation²⁸. We excluded R_f–a values based on the global datasets^{16,19,20,21,22,31,32,33,34,35} and the Einarsson et al.²⁸ surface compilation²⁸ for the 2006–2010 period due to a change in the criteria for temporary grassland surface, which resulted in overestimations (Q_f/Q_a > 1). For the 1960–1985 period, we only considered the linear interpolation of the available data, all from the national surveys R_f–a^99,100,101, and surfaces^105,106. In cases where there was no available surface data¹⁰⁵ in the national databases, like 1972, we used the closest year with available data (e.g., 1970). For 2008, which has two available national surveys^103,104, we took the average of both. We considered the share of fertilizer used for grasslands and fodder crops as zero in 1955 because almost all fertilizer was used for tillage crops in that year¹¹⁰, with grassland fertilization increasing during the 1960s¹¹¹.

Italy: Einarsson et al.²⁸ used a constant $\overline{{R}_{f-a}}$ for permanent grasslands for all years, as similar values are given in various reports and scientific information²⁸. When considering grasslands and fodder crops together, the R_f–a values were also consistent for each nutrient over all years^{16,19,20,21,22,32,33,34,35,73}, even including the 1974 data⁷³. Using Eq. (3) for the entire period across the three nutrients resulted in minimal deviations from the reported values^{16,19,20,21,22,32,33,34,35,73} (MAE = 2.24 ± 1.55% for N, 2.00 ± 1.37% for P₂O₅, and 3.21 ± 1.21% for K₂O). Therefore, we used the $\overline{{R}_{f-a}}$ for the three nutrients. However, there could be potential overestimations for the 1960s because nearby countries like France or Germany did not use fertilizers for these agricultural lands before 1955⁹¹.

The Netherlands: Information regarding grassland fertilization in the country is abundant^28,112. However, before the development of global datasets, information regarding P₂O₅ and K₂O is very limited. For the period 1979–2019, we used Eq. (2) considering the linear interpolation of the eleven R_f–a data derived from the global datasets^{16,17,19,20,21,22,33,34,35} and the agricultural surfaces changes²⁸. We used the global datasets instead of the national data available because they provide information regarding the three nutrients. For the years 1960 to 1979, we used the available compilation of N application rates¹¹², and the total N fertilizer consumption⁵⁹ to estimate the Q_f/Q_a values for N. For P₂O₅ and K₂O, we used the ratio between the Q_f/Q_a used for N and these two nutrients for the most recent year with available data, 1979¹⁷, to extrapolate the results for the 1960–1979 period.
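
The ratio-based extrapolation for P₂O₅ and K₂O can be sketched as follows. This is a minimal illustration and not the authors' code: the function name and all numeric values are hypothetical, and it assumes that the nutrient-to-N ratio of Q_f/Q_a observed in the reference year (1979 in the text) is applied unchanged to every earlier year.

```python
def scale_from_nitrogen(n_share_by_year, n_share_ref, nutrient_share_ref):
    """Scale annual N shares by the nutrient-to-N share ratio of a reference year.

    Assumption: the ratio observed in the reference year holds for all other years.
    """
    ratio = nutrient_share_ref / n_share_ref
    return {year: share * ratio for year, share in n_share_by_year.items()}


# Hypothetical Q_f/Q_a values for N (illustration only, not dataset values).
n_share = {1960: 0.20, 1970: 0.30}

# Hypothetical reference-year shares: 0.40 for N and 0.20 for P2O5.
p_share = scale_from_nitrogen(n_share, n_share_ref=0.40, nutrient_share_ref=0.20)
print(p_share)
```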

Poland: The available data in reports from the period 1988–2018^{16,19,20,21,22,31,34,35} did not show a constant R_f–a for any of the three nutrients (N, P₂O₅, and K₂O). Data on fertilization before 1989, during the communist government, are sparse^28,31. However, similar to other Eastern European countries like Hungary, it appears that fertilizer intensification in the country started during the 1960s¹¹³, with a significant drop following the regime change⁵⁹. As a result, we adopted the same criteria used for other Eastern European countries, setting the 1960 value to zero and applying two distinct linear interpolations of R_f–a: one for the 1960–1989 period, and another for the 1990–2020 period. For the 1990–2020 period, there are seven years with available data, whereas for the 1960–1989 period only 1989 has data. Despite this limited data for the earlier period, survey estimates¹¹³ combined with FAOSTAT totals⁵⁹ suggest that the combined share of the three nutrients was between 14% and 15% in the late 1960s, which aligns with the individual nutrient shares calculated by the linear interpolation, which are between 10% and 13%.

Portugal: Einarsson et al.²⁸ did not consider fertilization of permanent grasslands, citing the relatively low surface area in the country²⁸. However, recent technical reports suggest that Q_f/Q_a exceeds 20% for the three major nutrients^{16,19,20,21,22,33,34,35}. We chose to apply Eq. (2) and to interpolate the 1977–2020 data^{16,17,19,20,21,22,31,32,34,35} because using Eq. (3) led to discrepancies greater than 10% in some years. For the years before 1977, we retained the 1977 R_f–a values¹⁷ (which resulted in Q_f/Q_a < 2%) as there is no information for the earlier period.

Romania: As with other Eastern European countries, there is no available information regarding grassland and fodder crop fertilization before the political regime change in 1989. However, between 1990 and 2020, data from five years suggest that about 5% of fertilizer is used for grasslands and fodder crops^{20,21,22,31,33}. For Romania, we applied Eq. (3), using the average $\overline{{R}_{f-a}}$ value and the grassland and cropland surface data²⁸. Potential overestimations occurred during the first decades, although the estimated Q_f/Q_a are less than 5% for the first decades.

Spain: Previous research has not considered the fertilization of permanent grassland because this practice is very uncommon in the country^5,28. However, when considering temporary grasslands and fodder crops, this assumption changes, as forage crops occupy about 8% of the arable land in the country and consume nearly the same percentage of fertilizers¹¹⁴. To estimate the share of fertilizer use in these areas, we created a linear interpolation of the R_f–a data from the ten years with available data, ranging from 1979 to 2014, and applied Eq. (2). Using Eq. (3) resulted in estimations that were twice the reported values for the earlier years. Given that the fraction used for these areas in 1979 was minimal (Q_f/Q_a < 2%), potential overestimations for the first years are also likely minimal.

Sweden: In the country, fertilization of forage production areas is closely linked to the transition from natural permanent grassland to temporary grassland production on arable land that occurred during the first part of the 20th century, especially during the 1940s and 1950s¹¹⁵. Moreover, based on the available data, fertilizer intensification of these areas compared to other croplands R_f–a was lower during the 1970s than at the end of the century^34,35,73. Therefore, we applied Eq. (2) and performed the linear interpolation of the R_f–a of each nutrient of the 11 years with available data since 1974^{16,18,19,20,21,22,31,32,34,35,73}. A slight overestimation might occur for the earlier years, as the intensification of these areas was increasing before the first year with available data¹¹⁵, but no data for the period was found.

United Kingdom and Northern Ireland (UK): The UK has the world’s longest and most complete dataset on the fertilization of grasslands and croplands⁵⁰. Annual time series data on fertilizer use for permanent and temporary grasslands are available for England and Wales since 1969 and for Great Britain since 1982⁵⁰. Northern Ireland is not included in these surveys. Additionally, there are surveys for the years 1957, 1962, and 1966 for England and Wales¹¹⁶. Two problems arise when estimating Q_f/Q_a from these data. The first is that the surveys only include fertilization on permanent and temporary grassland, excluding rough grazing. The second is that there is no information for Northern Ireland, which accounts for about 6% of the country’s fertilizer consumption⁵⁰, and, from 1960 to 1982, there are also no data for Scotland, which is responsible for about 14% of the country’s fertilizer consumption⁵⁰. For the period 1982–2019, we used the annual fertilizer application rates for Great Britain’s tillage crops⁵⁰ and the corresponding cropland surface area¹¹⁷ (excluding temporary grasslands) to estimate the total fertilizer use for croplands. We considered grassland fertilization to be the complement of the value obtained, assuming the same application rates for Northern Ireland. To include fodder crops in this fraction, we added the average share used for them, which is less than 3% for all nutrients^{16,19,20,21,22,31,33}. For the period 1960–1981, we applied the same methodology but used the application rates^50,116 and surfaces¹¹⁸ from England and Wales, adjusted by −2.5% for N, +2.8% for P₂O₅, and +0.9% for K₂O. These adjustments are based on the observed differences between the application rates in Great Britain and those in England and Wales during the 1980s. Moreover, for the 1960s, for which data are not available for every year, we applied the linear interpolation of the years with data.
We used the national databases instead of the global datasets because they provide annual information covering almost the entire period for the three nutrients, and the values between them were quite similar.
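
The complement approach described for the UK can be sketched as below. This is a minimal illustration under stated assumptions rather than the authors' procedure: the function name and the numbers are hypothetical, the complement is assumed to be taken against the national total fertilizer use, and the percentage adjustment for England and Wales is assumed to act multiplicatively on the cropland application rate.

```python
def forage_share_by_complement(
    crop_rate_kg_ha,
    crop_area_ha,
    total_use_kg,
    fodder_share=0.0,
    rate_adjustment=0.0,
):
    """Estimate Q_f/Q_a as the complement of cropland fertilizer use.

    crop_rate_kg_ha: average application rate on tillage crops.
    crop_area_ha: cropland area, excluding temporary grasslands.
    total_use_kg: total national fertilizer use (assumed denominator).
    fodder_share: average share used for fodder crops, added to the grassland share.
    rate_adjustment: relative adjustment of the rate, e.g. -0.025 for -2.5%
        (assumed to be multiplicative).
    """
    adjusted_rate = crop_rate_kg_ha * (1.0 + rate_adjustment)
    cropland_use = adjusted_rate * crop_area_ha
    grassland_share = 1.0 - cropland_use / total_use_kg
    return grassland_share + fodder_share


# Hypothetical values, for illustration only.
share_gb = forage_share_by_complement(100.0, 1000.0, 200000.0, fodder_share=0.02)
share_ew = forage_share_by_complement(100.0, 1000.0, 200000.0, rate_adjustment=-0.025)
print(share_gb, share_ew)
```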

Iceland: Iceland’s agriculture sector is primarily focused on livestock production, with about 90% of its agricultural land being permanent grasslands¹¹⁹. Additionally, most of the arable land is used for forage crops¹¹⁹. While grassland fertilization is a common practice in Iceland¹²⁰, there is limited information on application rates for different types of agricultural land, and no specific estimates on the proportion of fertilizer used for forage crops in the country. When we applied Eq. (3) using the average R_f–a from other Nordic countries—Denmark, Sweden, and Finland, it resulted in a Q_f/Q_a ratio greater than 100%. To address this, we allocated a mid-value between 100% and the proportion of agricultural land occupied by grasslands and fodder crops, ensuring it does not exceed 100%.
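
One possible reading of the mid-value rule for Iceland is the arithmetic midpoint between 100% and the forage share of agricultural land. The sketch below encodes that reading. The function name and the input value are hypothetical, and the midpoint interpretation is itself an assumption.

```python
def midpoint_share(forage_area_fraction):
    """Midpoint between 1.0 (100%) and the forage fraction of agricultural land.

    Assumption: 'mid-value' is read as the arithmetic mean of the two bounds.
    For any fraction in [0, 1] the result cannot exceed 1.0.
    """
    if not 0.0 <= forage_area_fraction <= 1.0:
        raise ValueError("forage_area_fraction must lie between 0 and 1")
    return (1.0 + forage_area_fraction) / 2.0


# Hypothetical forage area fraction, for illustration only.
print(midpoint_share(0.90))
```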

Switzerland: Data on fodder crop and grassland fertilization in the country from the period 1979–1999 suggest that between 30 and 50% of the fertilizer used in the country is applied to these lands^{17,31,32,33,34,35}. However, whereas the data of the first two years indicate that almost 50% of N is used for grasslands and fodder crops^17,31, only about 30% was used in 1999³⁵. Since 2000, the areas of artificial grasslands and silage maize (the two main forages that receive fertilizers³¹) have remained almost constant¹²¹. As there is no information available regarding grassland fertilization before 1979 or after 2000, we used the 1979 data for the period 1960–1979 and the 2000 data for the period 2000–2020. For the period from 1979 to 2000, we applied linear interpolation to the six years with available data^{17,31,32,33,34,35}.

Norway: Fodder crops and grasslands (both temporary and permanent) play a key role in the agricultural sector of the country^122,123. Technical reports and scientific studies data indicate a nearly constant share of Q_f/Q_a for N, P₂O₅, and K₂O^{16,19,20,21,22,32,33,34,35,73}. Therefore, we used the average of all the available Q_f/Q_a data^{16,19,20,21,22,32,33,34,35,73}, covering the period 1974–2018 for N, and from the period 1990–2018 for P₂O₅ and K₂O. The resulting values, with a share of 64.02% ± 1.76% for N, 50.02% ± 2.25% for P₂O₅, and 65.59% ± 6.07% for K₂O, were comparable to those estimated for other Scandinavian countries.

Yugoslav Socialist Federal Republic (Yugoslav SFR) and its successor countries: Fodder crops and grasslands played a significant role in the agricultural production of the Yugoslav SFR¹²⁴. Pastures and meadows occupied 33% of the country’s land, while fodder crops took up 20% of the arable land¹²⁴. However, to the best of our knowledge, no information is available regarding fertilization for different agricultural lands before the dissolution of the country. After the dissolution, information became available in global reports for Croatia and Slovenia, but not for the other countries^{16,19,20,21,22,32,35}. To estimate the Q_f/Q_a values for the Yugoslav SFR during the period 1961–1991, we used the average, weighted by agricultural land surface²⁸, of the earliest R_f–a values from Croatia and Slovenia^19,28,32, given that their R_f–a values have changed significantly in recent years^{16,19,20,21,22,32,35}. We also considered the cropland, grassland, and fodder crop surfaces of the Yugoslav SFR from the 1990s¹²⁴ to estimate the Q_f/Q_a used for the 1961–1991 period. For the period 1990–2019, for the successor countries that are now EU members, we performed the linear interpolation of the R_f–a values^{16,19,20,21,22,32,35} to estimate Q_f/Q_a considering the annual surface values²⁸. In Serbia, the largest country, forage production is a crucial component of the agricultural sector, with about two-fifths of the agricultural land dedicated to this purpose¹²⁵. However, as no specific information on fertilization rates has been found, we considered the weighted average R_f–a ratio of Croatia and Slovenia along with the 2004–2008 surfaces of agricultural lands, grasslands, and fodder crops¹²⁵. For smaller countries like Montenegro or North Macedonia, we assumed the average annual Q_f/Q_a values of Serbia and Croatia.

Union of Soviet Socialist Republics (USSR) and Former USSR Countries: Quantitative and qualitative information about fertilization of grassland and fodder crops before the collapse of the USSR is quite scarce^31,126,127. Some publications suggest that the use of fertilizers in these areas was minimal before 1975^126,127. However, data from 1990–1991, just before the collapse, from certain republics (Russia, Latvia, Estonia, or Belarus) indicate that a significant share of fertilizers was used for fodder crops and grasslands³¹ (e.g., 40% for N in the Russian Federation³¹). For the period 1960–1991, we estimated the R_f–a for the entire USSR in 1990, weighting the R_f–a value of each republic^31,128 in 1990–1991 by the total fertilizer use of that republic¹²⁸. The four republics with available data for this year (Russian Federation, Belarus, Latvia, and Estonia) account for 40% of the agricultural land of the country and 62% of its fertilizer consumption¹²⁸. After estimating R_f–a for each nutrient in 1990, we used linear interpolation to estimate the annual R_f–a values, considering the value in 1975 as zero^126,127. Finally, similar to the EU countries, we considered the annual cropland, grassland, and fodder crop surfaces¹²⁸, along with the calculated R_f–a, to estimate the annual Q_f/Q_a. For the period from 1992 to 2020, we considered individual country information where some data were available. However, for the following present-day countries, there is no information in the global reports^{16,19,20,21,22,31,32,33,34,35,36}: Armenia, Georgia, Kazakhstan, Kyrgyzstan, Tajikistan, and Turkmenistan. For all these countries, we considered a constant Q_f/Q_a ratio during the 1992–2020 period due to the limited information. For Armenia and Georgia, we assumed the 1998 Q_f/Q_a value for Azerbaijan, the other Caucasian country³¹.
For the Central Asian countries, we used the ratio for grasslands derived from Uzbekistan’s 2014 data³⁷, which is significantly lower than the USSR’s share in 1990. This reduction seems reasonable given the significant decrease in fertilizer use, temporary grasslands, and fodder crop surfaces in the region since the USSR collapse¹²⁹.
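
The USSR-wide estimate for 1960–1991 described above combines a fertilizer-weighted average of the republic-level R_f–a values with a linear ramp from zero in 1975. A minimal sketch of those two steps follows. The function names, republic labels, and numbers are hypothetical, and holding the 1990 value for later years is an assumption of this example.

```python
def weighted_average(values, weights):
    """Average of `values`, weighted by `weights` (both keyed by republic)."""
    total_weight = sum(weights[key] for key in values)
    return sum(values[key] * weights[key] for key in values) / total_weight


def ramp_from_zero(end_value, year, start_year=1975, end_year=1990):
    """Linear interpolation from 0 in `start_year` to `end_value` in `end_year`.

    Assumption: 0 is used up to the start year and `end_value` after the end year.
    """
    if year <= start_year:
        return 0.0
    if year >= end_year:
        return end_value
    return end_value * (year - start_year) / (end_year - start_year)


# Hypothetical republic-level inputs, for illustration only.
r_by_republic = {"republic_a": 0.40, "republic_b": 0.20}
fertilizer_use = {"republic_a": 3.0, "republic_b": 1.0}

r_1990 = weighted_average(r_by_republic, fertilizer_use)
print(r_1990)
print(ramp_from_zero(r_1990, 1980))
```

The annual R_f–a from the ramp would then be multiplied by the annual area ratio, as in Eq. (2), to obtain Q_f/Q_a.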

Estonia, Latvia, Lithuania: The Baltic countries are the three former USSR countries with the most available data in global datasets^{16,19,20,21,22,35}. Fertilizer intensification in these areas has changed significantly over the last three decades due to the abandonment of intensively managed areas²⁸. This trend is reflected in the changing R_f–a values. Therefore, we used Eq. (2) and the linear interpolation with the six years with available data R_f–a from the 1991–2018 period^{16,19,20,21,22,35} to estimate the Q_f/Q_a values since the collapse of the USSR.

Belarus, Moldova, and Ukraine: For these three countries, limited data is available regarding fodder crops and grasslands, but some information can be found in global reports^16,33,37. Thus, for each country, we used the average of the Q_f/Q_a values from the 1992–2020 period. In the case of Belarus, where two sets of data were available for grasslands and one for fodder crops^16,37, we took the average for grasslands from both reports and the ratio that accounts for the share of grasslands and the share including fodder crops.

Russian Federation: There are three years with available data between 1992 and 2020^32,33,37. In the first two years, the data showed that an average of approximately 25% of the country’s fertilizer was used on grasslands and fodder crops^32,33. However, in the latest report from 2014, only about 4% was attributed to these areas (excluding fodder crops not used for hay or silage)³⁷. Therefore, we decided to use the linear interpolation of the Q_f/Q_a values for the years with available data. For the later years, we likely underestimated the value because some fertilizer is used for fodder crops, like fodder beet, that are not intended for silage or hay. However, these fodder crops only accounted for about 8% of the total fertilizer used for fodder crops and grasslands in 1990³¹.

China: Fertilization of China’s grasslands remains low at present³⁷. Among the compiled reports, only the latest one allocates a proportion of the total fertilizer application in China to grasslands: 2% for N, 4% for P₂O₅, and 3% for K₂O. Other information on grassland fertilization in China is scarce, with the few authors that provided some information describing it as sparse¹³⁰. FAO⁵ considers this proportion to be 0% for all three nutrients throughout the entire period, which differs from Lassaleta et al.²⁷, who, based on regional averages, estimated a percentage ranging between 0 and 4.7% from 1960 to 2014. However, no other global report or more detailed national source considers any fertilization. We have decided to adopt the same criteria as FAO⁵, albeit potentially underestimating values for the last decades.

Iran: Fertilization of Iran’s grasslands and fodder crops appears to be minimal, with few reports providing data, and only since 1990, indicating values between 2% and 6% for all three nutrients^31,32,37. Other information is scarce and focused on experimental trials rather than broader country-wide applications. Considering that the first fertilization trials were developed during the 1970s, and that the first report with data is for 1990³¹, which reported 2% for N and 6% for P₂O₅, we set the share to 0% for the period 1960–1990 and used the average of the reports for the period 1990–2020.

Japan: Since the first report with data, in 1979, almost all reports have underscored the importance of grassland and fodder fertilization in Japan. FAO attributed a constant share of 20% for N, 0% for P₂O₅, and 10% for K₂O for the 1960–2020 period⁵. Conversely, Lassaleta et al.²⁷ suggested a growing percentage for N, starting from 0% in 1960 and increasing to 20% in 2009. Although data before 1979 are unavailable, the reported N use in 1979 was 15.7%, higher than the 5.2% estimated by Lassaleta et al.²⁷. Additionally, due to the lack of data, it is challenging to determine the inception of grassland fertilization in Japan, though it appears to coincide with the transition from semi-natural grasslands to more intensive pasture during the 1960s¹³¹. Therefore, we opted to adhere to FAO’s criteria, maintaining the same percentage throughout the period, despite the potential overestimation for the initial years. Because the FAO criteria appear to underestimate the P₂O₅ and K₂O used for grasslands, we considered the average of all available reports with data^{16,17,31,32,33,34,35,37}, resulting in percentages of 17.3% for N, 16.9% for P₂O₅, and 15.6% for K₂O.

Korea Republic: Grassland fertilization appears to be a common practice in the country nowadays¹³². However, there is no available data on the fertilization of these areas in global reports^17,31,33,35, nor scientific publications. We used the same assumption as Lassaleta et al.²⁷, which is to consider the same proportion as in Japan, the geographically and socioeconomically closest country²⁷. This assumption also aligns with the observation that the sum of this percentage, and the fertilizer used for the main crops^17,31,33,35 is less than the total for the country⁵⁹.

Turkey: Information about fertilization of grasslands and fodder crops in Turkey is scarce, suggesting that it is not a common practice. Lassaleta et al.²⁷ considered percentages as high as 4.8% for N in 2009, whereas FAO considered 0% for all nutrients. All the available data since 1990 except for 2014 considered some amount of fertilizer used for grasslands, and forages^{16,17,31,32,33,34,35,37}. Therefore, we used the average percentage of all reports for the period 1990–2020^{16,17,31,32,33,34,35,37}.

Other Asian Countries: Cambodia, Indonesia, Malaysia, The Philippines, Thailand, Vietnam, India, and Pakistan: In Southeast Asian countries, only Lassaleta et al.²⁷ considered that some fertilizer is used on grasslands, based on regional averages used for grasslands and other crops (including fruits, tea, vegetables, and forage and grasslands)²⁷. However, no global report^{16,17,18,31,32,33,34,35,37} or country-level source¹³³ mentioned fertilizer application to grasslands as significant in these countries. Therefore, we have chosen to align with FAO’s criteria, which assume no fertilizer application for grasslands in this region⁵. We applied the same criteria for India and Pakistan, despite previous research considering a certain percentage used for grasslands^6,27. The data reports^{16,17,18,31,32,33,34,35,37}, the scientific literature^134,135, and FAO⁵ support the idea of non-fertilization of grassland in these two countries.

Egypt: Data regarding grassland and fodder crop fertilization in Egypt are scarce^18,34. As is common for many African countries, there is no fertilization of grasslands¹³⁶. However, the few available data about the fertilization of Egyptian clover^18,34, the main fodder crop in the country⁷⁶, suggest that a significant portion of N and P₂O₅ is utilized for fodder production, aligning with country recommendations¹³⁷. Previous research, focused solely on grasslands, has either considered 0% allocation for the three nutrients⁵ or a range between 0% and 4% for N²⁷. Here, we opted to use the average of the two reports with data (1986 and 1997)^18,34 for the entire period, as Egyptian clover production has been significant since the beginning of the period¹³⁸ and the available data are not sufficient to discern any trend.

Morocco: Previous research has indicated various fractions of N fertilizer used for grasslands in the country, ranging from 0% to 11%^5,27,136. With the available information, it is impossible to discern whether any fertilizer was applied to permanent grasslands in the country, whereas application to forages such as alfalfa, Egyptian clover, or vetch is documented^139,140. Additionally, due to the scarce available data in the reports, discerning any trend is challenging^31,34,140, although the presence of improved pastures, usually linked to fertilizer application, doubled during the 1980s¹³⁹. Here, we have opted to use the same percentage for the whole period, the average of all reports, to estimate the percentage of N, P₂O₅, and K₂O, despite the potential overestimations in the first decades.

South Africa: Fertilization of grasslands and fodder crops such as alfalfa appeared to be significant throughout the study period in South Africa. Both previous scientific research^5,27 and various technical reports^{16,31,32,33,34,35,37} indicated percentages ranging between 0% and 22.3% for N. For all three nutrients, the share used for grasslands and fodder crops during the 1990s was higher than in the last decades^{16,31,32,33,34,35,37}. This percentage appears to be higher due to larger fertilizer application rates to croplands compared to grasslands and fodder crops^16,31, and not due to the relationship between cropland and grassland surface¹⁴¹. While information regarding grassland fertilization prior to 1990 is limited, several factors support the hypothesis of early fertilizer application for grassland and fodder production. These include the fraction used for grasslands and fodder in 1990³¹, substantial research conducted on improved grasslands since the 1920s¹⁴², and the early introduction of alfalfa in 1858⁷⁶, which is a significant consumer of P₂O₅ and K₂O in the country. Given the challenge of identifying any discernible trend and the likelihood of significant consumption at the beginning of the period, we have chosen to adopt the same percentage for the entire duration, aligning with FAO assumptions⁵, despite potential slight over- and underestimations throughout the period. The average of all reports^{16,31,32,33,34,35,37} resulted in percentages of 12.4% for N, 13.3% for P₂O₅, and 9.2% for K₂O.

Potential drivers. To develop our ML models, we compiled a series of datasets that contain information on features that were identified in previous research as drivers or correlates of cropland fertilization. In this section and the next two, we clarify our rationale for the variable selection, the data sources and the methods used for estimating some of these variables. The list of all considered features can be found in Table 3 and further details about their estimation are provided below.

Table 3 Environmental, agrological and socioeconomic features used in the prediction of the fertilizer application rates, accompanied by their description, unit and data source.

Environmental data. Environmental variables related to climate and soil characteristics have been identified as factors that influence fertilization management in farm-level studies¹⁴³ and regional panel data^144,145. Therefore, we selected several potential factors, some of which have previously been shown to correlate with fertilization, such as MAP¹⁴⁴, or SOC¹⁴⁵, as well as newer potential factors such as the aridity index. Data for these factors were sourced from two main databases: the CRU v.4. databases¹⁴⁶, for climatic factors, and the SoilGrids v.2. database¹⁴⁷, for soil characteristics. Obtaining values at the country-level while considering variations in climatic and soil conditions within a country can be imprecise. However, our fundamental unit of analysis is the country-level, as the FUBC values are measured on this scale. To mitigate this limitation, we used spatial information for climatic and soil characteristics along with information about the location of crops²⁹. All environmental variables were estimated using Eq. (4), but preprocessing differed across variables.

$$En{v}_{jic}=\frac{\sum _{g\in G}(En{v}_{ig}\times HArea\_M{2000}_{gcj})}{HArea\_M{2000}_{cj}}$$

(4)

Here, $En{v}_{jic}$ represents the mean value of the environmental variable for country j, in year i, over the area where crop c is located in the country; $En{v}_{ig}$ is the value of the environmental variable in year i, for grid cell g; $HArea\_M{2000}_{gcj}$ denotes the area of crop c in grid cell g of country j; $HArea\_M{2000}_{cj}$ is the total surface of crop c in country j based on Monfreda et al.²⁹ crop maps; and G denotes the set of cells where the crop is located based on Monfreda et al.²⁹ crop maps. For the MAP, the $En{v}_{ig}$ values of Eq. (4) are calculated by summing the precipitation from all months in the CRU v.4. dataset¹⁴⁶ for each grid cell g, and year i. For the MAT, the $En{v}_{ig}$ values are calculated as the average of the monthly temperatures from the CRU v.4. dataset¹⁴⁶, weighted by the number of days of each month. The PET values are derived by multiplying the daily month average from CRU v.4¹⁴⁶ by the number of days in each month and summing the results. For the aridity index, we used the United Nations (UN) definition¹⁴⁸ of the ratio between MAP and total PET for each grid cell. As soil variables do not have temporal resolution, we simplified Eq. (4) by removing the temporal factor. Additionally, for some soil variables like the soil CEC, we aggregated the information by calculating the average for the first three depth layers from SoilGrids v.2. (0–5, 5–15 and 15–30 cm)¹⁴⁷.
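
As an illustration of Eq. (4) and of the annual aggregation of the monthly climate data, the following minimal Python sketch may help. It is not the code used in the study: the function names, the dictionary-based grid representation, and the fixed non-leap-year month lengths are assumptions made for this example.

```python
# Days per month in a non-leap year; leap years are ignored in this sketch.
DAYS = [31, 28, 31, 30, 31, 30, 31, 31, 30, 31, 30, 31]


def annual_climate(precip_mm, temp_c, pet_mm_per_day):
    """Collapse 12 monthly values of one grid cell into MAP, MAT, PET and aridity."""
    map_mm = sum(precip_mm)  # MAP: sum of monthly precipitation
    # MAT: monthly means weighted by the number of days in each month
    mat_c = sum(t * d for t, d in zip(temp_c, DAYS)) / sum(DAYS)
    # PET: daily monthly average times days in the month, summed over the year
    pet_mm = sum(p * d for p, d in zip(pet_mm_per_day, DAYS))
    return map_mm, mat_c, pet_mm, map_mm / pet_mm  # aridity index = MAP / PET


def crop_weighted_mean(env_by_cell, crop_area_by_cell):
    """Eq. (4): crop-area-weighted mean of a gridded variable within one country."""
    total_area = sum(crop_area_by_cell.values())
    if total_area == 0:
        return None  # the crop is absent from this country
    weighted = sum(env_by_cell[g] * area for g, area in crop_area_by_cell.items())
    return weighted / total_area


# Toy example: three cells, the crop grows in two of them.
env = {"g1": 10.0, "g2": 20.0, "g3": 99.0}
area = {"g1": 1.0, "g2": 3.0}
print(crop_weighted_mean(env, area))  # 17.5
```

Cells without the crop (here `g3`) do not contribute, which is how the weighting restricts the country mean to the places where the crop is actually grown.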

Agrological data. We selected agrological features that were previously identified as factors potentially related to or driving fertilizer intensification, such as holding size¹⁴⁹, crop area¹⁴⁵, or irrigation implementation¹⁴⁴, as well as features that should be connected to crop fertilization at the country-level, such as country fertilizer use per cropland area⁵⁹. Most of the agrological variables used are taken directly from the sources indicated in Table 3. However, some required preprocessing. For holding size, we applied the methodology used by Zou et al.²⁶, which involves standardizing the information based on the average holding size according to the total agricultural area. We used holding size data from the FAOSTAT agricultural censuses¹⁵⁰ and previous research¹⁵¹. To estimate the annual nutrient removal for each crop class based on annual production, we used the recent compilation by FAO⁵ on nutrient removal in kilograms per tonne of crop produced, along with the annual country production data from FAOSTAT⁴⁷. Additionally, we used this compilation alone as a proxy for fertilizer recommendations, since these recommendations are generally based on the nutrient requirements of each crop¹⁵².

Socioeconomic data. Economic factors, particularly those related to the profitability of fertilizer use, have been widely studied to understand fertilizer adoption at the farming-level^153,154. Both input prices (fertilizers) and output prices (crops) determine profitability and can be key factors influencing fertilization decisions. However, assessing inputs at the country-level is challenging, primarily due to a lack of standardized data. The only available dataset, FAOSTAT¹⁵⁵, does not cover all periods and lacks standardization. To address this, we used two variables as proxies of fertilizer prices: a) global real prices for urea, phosphate rock, and muriate of potash, as compiled by the World Bank¹⁵⁶; and b) the distance from the production sites or mines, following the methodology proposed by McArthur et al.¹⁵⁷. This methodology uses gravity models of trade, based on the premise that fertilizers are produced in a few specific countries¹⁵⁷. The underlying hypothesis is that countries closer to fertilizer plants or mines are more sensitive to price variations because transport costs are a significant factor for farmers¹⁵⁷. We applied a similar approach, estimating the minimum cost-adjusted distance by using the costDist function from the terra package¹⁵⁸, global friction maps¹⁵⁹, the locations and operational years of potash¹⁶⁰ and phosphate mines, the locations of ammonia plants^157,161 and the centroid of the cropland area of each country based on the Monfreda et al.²⁹ crop maps. Assessing the output prices for crops faces a similar problem: there is no standardized dataset with national-level data for the entire period. To resolve this, we used two proxies for crop prices: a) global real prices for specific commodities like wheat, maize, rice, palm oil, soybeans, sugar, and cotton, compiled by the World Bank¹⁵⁶, and b) standardized data from two FAOSTAT datasets^162,163 that provide prices paid to producers at the country-level.
The first dataset¹⁶² contains information from 1990 onwards in USD and LCU, while the second dataset¹⁶³ covers 1966 to 1990, only in LCU. To standardize both datasets, we converted the older dataset into USD using annual currency exchange rates¹⁶⁴. We then removed outliers independently for each crop by considering only values within 1.5 times the interquartile range. Before applying this method to the 1966–1990 dataset, we tested it on the LCU data for maize, wheat, and rice from the 1990 onwards dataset. We compared the original USD values with those obtained after converting the LCU data using exchange rates. The outlier detection method retained more than 99% of equivalent values (defined by a ratio between the original and calculated USD values of 0.99 to 1.01), while removing over 90% of non-equivalent values. Finally, the data were converted to real prices by applying the Consumer Price Index¹⁶⁵.
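
The price standardization steps can be sketched as follows. This is a simplified illustration, not the procedure actually implemented: it assumes that exchange rates are expressed as LCU per USD and that "within 1.5 times the interquartile range" refers to the usual Tukey fences (Q1 − 1.5·IQR, Q3 + 1.5·IQR); all function names are hypothetical.

```python
import statistics


def lcu_to_usd(price_lcu, lcu_per_usd):
    """Convert a local-currency price to USD (assumes the rate is LCU per USD)."""
    return price_lcu / lcu_per_usd


def iqr_filter(values, k=1.5):
    """Keep values inside the Tukey fences [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def is_equivalent(original_usd, converted_usd, tol=0.01):
    """Equivalence test from the text: ratio of the two prices within 0.99 to 1.01."""
    return abs(original_usd / converted_usd - 1.0) <= tol


# Toy price series for one crop with one implausible value.
prices = [100, 102, 98, 101, 99, 103, 97, 5000]
print(iqr_filter(prices))  # [100, 102, 98, 101, 99, 103, 97]
```

Applying the filter independently per crop, as described above, keeps an expensive commodity from being flagged simply because it is priced differently from cheaper ones.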

Other socioeconomic factors, that are not directly related to the profitability of fertilizer use, have also been linked to country-level fertilizer use. These factors include the income level, reflected in the GDP per capita¹⁶⁶; the population pressure, defined as the country’s population divided by its agricultural land area¹⁶⁷; and the farmers’ knowledge about fertilizer use, as well as general education levels¹⁵³, which we measured by the percentage of total GDP spent on education. We used the sources listed in Table 3 to obtain data for these variables.

Data preprocessing

Several preprocessing steps were performed to prepare the raw data for the ML models. First, drawing from both expert domain knowledge and exploratory data analysis, the features relevant to N, P₂O₅ and K₂O fertilizer application rate were selected (Table 3). Since not every feature was relevant for each of the three targets, we narrowed down the dataset to data points where the average fertilizer application rate is known for all three fertilizers. This restriction ensured that the dataset comprised only labeled data points, which is crucial for supervised ML techniques. Subsequently, anomalies in the data where the fertilizer application rate was unrealistically large, i.e., greater than 5000 kg ha⁻¹, were removed. Finally, categorical features were one-hot encoded (OHE).
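
The three filtering and encoding steps can be illustrated with a short sketch. The record layout (a list of dictionaries with the keys `crop`, `N`, `P2O5` and `K2O`) is an assumption for this example and does not reflect the actual data structure used in the study.

```python
TARGETS = ("N", "P2O5", "K2O")
MAX_RATE = 5000.0  # kg/ha; rates above this threshold are treated as anomalies


def preprocess(rows, categorical="crop"):
    """Keep fully labeled, plausible rows and one-hot encode one categorical column."""
    # 1) keep rows where all three application rates are known
    labeled = [r for r in rows if all(r.get(t) is not None for t in TARGETS)]
    # 2) drop rows with an unrealistically large rate for any nutrient
    plausible = [r for r in labeled if all(r[t] <= MAX_RATE for t in TARGETS)]
    # 3) one-hot encode the categorical feature
    categories = sorted({r[categorical] for r in plausible})
    encoded = []
    for r in plausible:
        out = {k: v for k, v in r.items() if k != categorical}
        for c in categories:
            out[f"{categorical}_{c}"] = 1 if r[categorical] == c else 0
        encoded.append(out)
    return encoded


rows = [
    {"crop": "wheat", "N": 80.0, "P2O5": 30.0, "K2O": 20.0},
    {"crop": "maize", "N": None, "P2O5": 25.0, "K2O": 10.0},   # unlabeled for N
    {"crop": "rice", "N": 6000.0, "P2O5": 30.0, "K2O": 20.0},  # anomaly
    {"crop": "maize", "N": 100.0, "P2O5": 40.0, "K2O": 30.0},
]
clean = preprocess(rows)
print(len(clean))  # 2
```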

Machine learning

Previous studies within this domain typically propose linear equations to estimate the fertilizer application rate, and only consider a limited set of agricultural factors^9,10. However, it is well-established that natural phenomena frequently exhibit nonlinear relationships¹⁶⁸, rendering them unsuitable for modeling with linear methodologies. Similar studies have also employed Bayesian methods¹⁵, with certain modeling assumptions that are not present in our study. ML has the potential to overcome these limitations. The field of ML has seen major increases in research¹⁶⁹ and industry¹⁷⁰, and, more specifically, ML has shown promising results in the field of ecology^171,172, including agricultural research^173,174, fertilizer consumption^175,176 and fertilizer management¹⁷⁷. For this reason, ML was used in this study to estimate the annual fertilizer application rate at the crop- and country-level. The benefit of using ML is threefold. First, ML allows us to include a larger range of variables, for example also including socioeconomic factors. Second, nonlinear ML techniques enable us to model nonlinear relationships between the variables in our dataset. Lastly, the model output can provide insights into the drivers associated with crop fertilization on a global scale, through the use of SHAP values²⁵ outlining the feature importance. The employed ML methods to estimate fertilizer application rate for crops differ from previous research, which typically relied solely on changes in crop area, overall fertilizer consumption, and limited data regarding fertilizer application rate at the individual crop-level^9,10. An advantage of our method is that it enables us to estimate values for countries where specific data is lacking by relying on other related variables. For example, the projected data for the USSR aligns closely with national totals, even in the absence of country-specific data and without adjustments based on total consumption, as conventionally done^9,10.

Models

In this study, two ML models based on gradient boosted regression trees were selected to predict the average annual fertilizer application at the crop- and country-level. In gradient boosting¹⁷⁸, an ensemble of weak learners (in our case regression trees) is trained sequentially. First, a weak learner is fitted to the original data. In the next iteration, another weak learner is fitted to the residuals, i.e., the differences between the ground truth target values and the current predictions made by the ensemble. When fitting a new weak learner to the residuals, gradient boosting adjusts its parameters in the negative gradient direction, aiming to reduce the residual error of the ensemble. This sequential learning process enables gradient boosting models to create a strong learner by combining multiple weak learners. The specific gradient boosting models employed in this study are XGB²³ and HGB^24,179. XGB has been shown to be a powerful tool for predictive modeling in a wide range of applications in both industry and research, including agricultural research¹⁷⁴ and fertilizer research¹⁷⁵. It offers an optimized and scalable implementation of gradient boosting, and includes regularization techniques to prevent overfitting²³. The HGB model is primarily based on LightGBM¹⁷⁹, which addresses one of the major bottlenecks in gradient boosting model training, namely the requirement to sort all samples at each node²⁴. Indeed, in a traditional gradient boosting model, samples must be sorted at each node to determine the best split. This sorting process can become computationally expensive, especially when dealing with large datasets or deep trees. In HGB, the samples are first collected into a histogram, which removes the need for sorting as samples in a histogram are implicitly ordered. This optimization results in a model that is much faster to train than traditional gradient boosting models, while still achieving similar or better performance²⁴. 
The choice for these two methods over other conventional ML approaches, such as neural networks, was primarily driven by the fact that both methods natively handle missing values. This constitutes a significant advantage, given that global fertilizer application rate data, along with the socioeconomic and agricultural variables used to predict the annual fertilizer application, are often incomplete. This also demonstrates another advantage of applying ML to this problem over the conventional approach using linear equations. Indeed, the absence of just one variable in the equation renders it impossible to compute.
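
The sequential residual-fitting idea behind gradient boosting can be made concrete with a deliberately small sketch. It is neither XGB nor HGB: it uses one-split regression trees (stumps) on a single feature with a squared-error loss, has no histogram binning, regularization or missing-value handling, and all names and default settings are chosen for illustration only.

```python
def fit_stump(x, residuals):
    """Best single split on a 1-D feature (needs at least two distinct x values)."""
    best = None
    for t in sorted(set(x))[:-1]:  # candidate thresholds; both sides stay non-empty
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    return best[1:]  # (threshold, left mean, right mean)


def predict_stump(stump, xi):
    t, lm, rm = stump
    return lm if xi <= t else rm


def fit_boosting(x, y, n_rounds=50, learning_rate=0.1):
    """Sequentially fit stumps to the residuals of the current ensemble."""
    base = sum(y) / len(y)  # initial prediction: the mean target
    pred = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        # For squared error, the negative gradient equals the residual.
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(x, residuals)
        stumps.append(stump)
        pred = [pi + learning_rate * predict_stump(stump, xi)
                for pi, xi in zip(pred, x)]
    return base, stumps, learning_rate


def predict(model, xi):
    base, stumps, lr = model
    return base + lr * sum(predict_stump(s, xi) for s in stumps)


x = [1, 2, 3, 4, 5, 6]
y = [xi ** 2 for xi in x]  # a nonlinear target
model = fit_boosting(x, y)
```

Each round only corrects part of the remaining error (scaled by the learning rate), which is why many weak learners are combined into one strong learner.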

Model training and evaluation

The selection of the optimal set of model hyperparameters is usually done using CV, after which the CV error is reported as the performance of a model¹⁸⁰. However, based on Stone (1974)¹⁸¹, model selection and model assessment require different CV approaches. For this reason, we used nested CV, as it allowed us to find the optimal set of hyperparameters for a model and provide an unbiased estimate of the model’s performance¹⁸². In nested CV, two levels of CV loops are used: an outer loop and an inner loop. In the outer loop, the dataset is split into training and testing sets, typically using k-fold CV. Each fold of the outer loop trains the model on the training set and evaluates the model on the testing set. Within each fold of the outer loop, the training data is provided to an inner CV loop, in which the training data is further split into training and validation sets, also typically using k-fold CV. The inner loop is responsible for selecting the set of hyperparameters that performs best on the validation set. Finally, the performance of the selected set of hyperparameters is evaluated on the corresponding test set in the outer loop. In our study, we used a 2 × 5 nested CV, i.e., we had two outer folds and five inner folds. We employed a grid search that iteratively went over all possible combinations of hyperparameters, based on the explored hyperparameters as shown in Table 4 for both the HGB and XGB models. The performance of the models was evaluated by averaging the test performances of the two models fitted in the outer CV loop. The considered performance metrics included the determination coefficient (R²), MAE, MSE, and RMSE, all computed between the predicted and reported data points.
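
The structure of a 2 × 5 nested CV with a grid search can be sketched in a few lines. To keep the example self-contained, the gradient boosting models are replaced by a one-parameter ridge regression through the origin, and the folds are formed by interleaving indices rather than by random shuffling; both choices are assumptions made purely for illustration.

```python
def k_folds(indices, k):
    """Yield (train, test) index lists for k interleaved folds."""
    for f in range(k):
        test = [idx for pos, idx in enumerate(indices) if pos % k == f]
        train = [idx for pos, idx in enumerate(indices) if pos % k != f]
        yield train, test


def fit_ridge(x, y, idx, lam):
    """Slope of a ridge regression through the origin; lam is the hyperparameter."""
    sxy = sum(x[i] * y[i] for i in idx)
    sxx = sum(x[i] * x[i] for i in idx)
    return sxy / (sxx + lam)


def mse(x, y, idx, slope):
    return sum((y[i] - slope * x[i]) ** 2 for i in idx) / len(idx)


def nested_cv(x, y, grid, outer_k=2, inner_k=5):
    outer_scores = []
    for outer_train, outer_test in k_folds(list(range(len(x))), outer_k):
        # Inner loop: mean validation error of each candidate on the outer training set.
        def inner_score(lam):
            scores = [mse(x, y, val, fit_ridge(x, y, tr, lam))
                      for tr, val in k_folds(outer_train, inner_k)]
            return sum(scores) / len(scores)

        best_lam = min(grid, key=inner_score)  # grid search over the candidates
        # Refit on the whole outer training set, evaluate once on the held-out fold.
        slope = fit_ridge(x, y, outer_train, best_lam)
        outer_scores.append((best_lam, mse(x, y, outer_test, slope)))
    mean_score = sum(s for _, s in outer_scores) / len(outer_scores)
    return outer_scores, mean_score


x = list(range(1, 21))
y = [2 * xi for xi in x]  # exactly linear toy data
outer_scores, mean_score = nested_cv(x, y, grid=[0.0, 1.0, 10.0])
```

The held-out outer fold is never seen during hyperparameter selection, which is what keeps the averaged outer score from being optimistically biased.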

Table 4 Overview of the explored hyperparameters for the Histogram-based Gradient Boosting (HGB) and eXtreme Gradient Boosting (XGB) regression models.

Model interpretability through SHAP value analysis

Unfortunately, gradient boosting methods are so-called black-box models, i.e., it is not immediately clear how certain predictions are made. However, assessing the impact of the features on the predicted fertilizer application rate in the learned models could provide us with valuable insights into the drivers of fertilizer application rate. Therefore, we resorted to xAI methods to understand the predictions made by our models. More specifically, we used SHAP values²⁵ as they are model-agnostic, can account for interactions between features and have an intuitive interpretation. Indeed, summing the SHAP values for all features in one sample, together with the base value (the average model output), results in the prediction of the model. Additionally, like XGB and HGB, SHAP values are robust with respect to missing data by design²⁵. Special attention was given to categorical values, as retrieving one SHAP value for a categorical feature that is divided into OHE features is non-trivial. However, as the SHAP values are calculated using the preprocessed input data (i.e., containing the OHE categorical features), the SHAP values for one categorical variable were obtained by adding together all SHAP values for its respective OHE features.
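
The aggregation of SHAP values over OHE columns relies on the additivity of SHAP values: the prediction equals the base value plus the sum of all SHAP values, so summing a subset of them leaves the total unchanged. A minimal sketch, with made-up feature names and values and no dependency on the SHAP library itself:

```python
def aggregate_ohe_shap(shap_values, groups):
    """Sum the SHAP values of OHE columns into one value per categorical feature.

    shap_values: {column name: SHAP value} for a single sample.
    groups: {categorical feature name: list of its OHE column names}.
    """
    grouped = dict(shap_values)
    for name, columns in groups.items():
        grouped[name] = sum(grouped.pop(c) for c in columns)
    return grouped


def reconstruct_prediction(base_value, shap_values):
    """Additivity: prediction = base value + sum of SHAP values."""
    return base_value + sum(shap_values.values())


# Hypothetical SHAP values for one sample.
shap_sample = {"gdp": 1.5, "crop_wheat": 0.5, "crop_maize": -0.25}
grouped = aggregate_ohe_shap(shap_sample, {"crop": ["crop_wheat", "crop_maize"]})
print(grouped)  # {'gdp': 1.5, 'crop': 0.25}
```

Because the grouped values still sum to the same total, the reconstructed prediction is identical before and after aggregation.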

Adjustment to country totals

Previous research has always started with the same premise of allocating total fertilizer consumption at the country-level for estimating crop-level use^9,10. However, here we adopt a different strategy, initiating the estimation of the fertilizer consumption at the crop-level directly. Despite this shift in strategy, we still consider country-level data to be more reliable than datasets compiled from various FUBC sources. To reconcile our approach with the more dependable country-level data, we adjusted the ML predictions to align with FAOSTAT’s total fertilizer consumption at the country-level⁵⁹. As shown in Eq. (5), we removed the differences between the predicted total fertilizer consumption and the FAOSTAT totals by rescaling the predictions of all crops by the same factor, after excluding the fraction used for grasslands and fodder crops from FAOSTAT totals.

$$Fert{\rm{\_}}Pre{d}_{icj}=FertML{\rm{\_}}Pre{d}_{icj}\times \frac{FAOSTAT{\rm{\_}}FERTn{g}_{ij}}{\sum _{d\in C}(FertML{\rm{\_}}Pre{d}_{idj}\times HArea{\rm{\_}}FA{O}_{idj})}$$

(5)

Here, $Fert\_Pre{d}_{icj}$ represents the fertilizer application rate predictions after the adjustment for year i, crop c, and country j. $FertML\_Pre{d}_{icj}$ denotes the ML model predictions, C is the set of all crop classes included in the models, $HArea\_FA{O}_{idj}$ is the FAOSTAT harvested area⁴⁷ of each crop class d, and $FAOSTAT\_FERTn{g}_{ij}$ is the total FAOSTAT fertilizer consumption for the country, after removing the fraction used for grasslands and fodder crops.
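
Eq. (5) amounts to multiplying all crop-level rates of a country and year by one common factor. A small sketch, assuming rates in kg ha⁻¹, areas in ha, totals in kg, and the grassland share given as a fraction between 0 and 1 (units and names are assumptions for this example):

```python
def non_grassland_total(faostat_total, grassland_share):
    """FAOSTAT country total after removing the grassland and fodder fraction."""
    return faostat_total * (1.0 - grassland_share)


def adjust_to_country_total(pred_rate, harvested_area, faostat_total_ng):
    """Eq. (5): rescale ML-predicted rates so they reproduce the country total.

    pred_rate: {crop: predicted rate}, harvested_area: {crop: area},
    faostat_total_ng: country total excluding grasslands and fodder crops.
    """
    predicted_total = sum(pred_rate[c] * harvested_area[c] for c in pred_rate)
    factor = faostat_total_ng / predicted_total
    return {c: rate * factor for c, rate in pred_rate.items()}


rates = {"wheat": 100.0, "maize": 50.0}
areas = {"wheat": 10.0, "maize": 20.0}
adjusted = adjust_to_country_total(rates, areas, faostat_total_ng=1000.0)
print(adjusted)  # {'wheat': 50.0, 'maize': 25.0}
```

After the adjustment, the rates multiplied by the harvested areas sum to the FAOSTAT total, while the ratios between crops predicted by the ML model are preserved.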

Validation

To validate the model predictions, we compared them with national databases containing information about the average use per hectare for different fertilizers and crops. This validation is quantified using the MAE and MAPE, as well as with comparative plots if enough data was obtainable from the various national databases. The MAE gives an idea about the actual deviation, whilst the MAPE makes the comparison between prediction errors easier. The evaluated national databases include data for the USA³⁸, UK⁵⁰, India^{39,40,41,42,43,44,45}, Sweden^54,55,56,57, Philippines⁵², and New Zealand⁵⁸. For Pakistan⁵¹, only data for the sum of fertilizer application rate is available, hence the sum of N, P₂O₅, and K₂O was used, expressed as NPK. This approach is restricted by available data in national databases for average fertilizer application rate across various crops and nutrients.
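
For reference, the two validation metrics can be written as follows. The sketch uses the standard definitions and assumes that MAPE is expressed in percent and that all reported values are non-zero; the numbers are invented.

```python
def mae(predicted, reported):
    """Mean absolute error, in the units of the data (e.g., kg/ha)."""
    return sum(abs(p - r) for p, r in zip(predicted, reported)) / len(reported)


def mape(predicted, reported):
    """Mean absolute percentage error in percent; reported values must be non-zero."""
    return 100.0 * sum(abs(p - r) / abs(r)
                       for p, r in zip(predicted, reported)) / len(reported)


predicted = [110.0, 45.0]
reported = [100.0, 50.0]
print(mae(predicted, reported))  # 7.5
```

In this toy case both crops deviate by 10% although the absolute errors differ (10 and 5 kg ha⁻¹), which shows why the relative metric is easier to compare across crops with very different application rates.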

Gridded crop-specific application rate per fertilizer

Building on the generated comprehensive dataset of global fertilizer application rate, we constructed detailed 5-arcmin resolution gridded maps for each fertilizer (N, P₂O₅, and K₂O), crop class and year from 1961 to 2019. The final gridded map dataset was compiled in a three-step process, as highlighted in Fig. 1. First, data of the gridded harvested area spanning from 1961 to 2019 for the 13 distinct crop classes (see Table 1) were acquired by combining data from the EARTHSTAT project of the year 2000 ($HArea\_M2000$)²⁹, supplemented with historical arable land and permanent crop areas per year ($CArea\_Hyde$) from the History Database of the Global Environment (HYDE version 3.3)³⁰. The EARTHSTAT maps were created by combining national-, state-, and county-level census statistics with an up-to-date global dataset of croplands, organized on a 5-arcminute by 5-arcminute latitude-longitude grid. These datasets, reflecting land use around the year 2000, detail both the area harvested and the yield of 175 diverse crops worldwide²⁹. Maps outlining major crop groups were generated by consolidating these individual crop maps. The HYDE 3.3 project provides long time series estimates and maps for land use, including the cropland areas, based on an allocation algorithm with time-dependent weighting³⁰. The detailed information from the crop-specific EARTHSTAT maps for the year 2000, in combination with the yearly changes in gridded cropland from HYDE 3.3, allowed us to produce gridded 5-arcmin resolution crop-specific harvested areas for each of the evaluated years and crops using Eqs. (6) to (9):

For $CArea\_Hyd{e}_{gi}\, > \,0$ and crop is rice:

$$HArea\_{M}_{gic}=CArea\_Hyde\_{R}_{gi}$$

(6)

For $CArea\_Hyd{e}_{gi}\, > \,0$ and crop is not rice:

$$HArea\_{M}_{gic}=CArea\_Hyde\_N{R}_{gi}\times \frac{HArea\_M{2000}_{gc}}{CArea\_Hyde\_N{R}_{g2000}}$$

(7)

For $CArea\_Hyd{e}_{gi}\, > \,0\,\wedge \,{\sum }_{c\in C}HArea\_M{2000}_{gc}\,=\,0$ and crop is rice:

$$HArea\_{M}_{gic}=CArea\_Hyde\_{R}_{gi}$$

(8)

For $CArea\_Hyd{e}_{gi}\, > \,0\,\wedge \,{\sum }_{c\in C}HArea\_M{2000}_{gc}\,=\,0$ and crop is not rice:

$$HArea\_{M}_{gic}=CArea\_Hyde\_N{R}_{gi}\times \frac{\sum _{k\in K}HArea\_M{2000}_{gc}/K}{CArea\_Hyde\_N{R}_{g2000}}$$

(9)

Here, the indices denote the grid cell (g), the year (i), and the crop (c). The harvested area ($HArea\_{M}_{gic}$) was generated through a series of conditional operations. If the value of the HYDE3.3 cropland area map ($CArea\_Hyd{e}_{gi}$) for year i and grid cell g is larger than 0 and the crop is not rice, the value of that grid cell for that specific crop and year starts from the HYDE3.3 cropland area ($CArea\_Hyde\_N{R}_{gi}$) for that grid cell and year. This value is then adjusted by the ratio of the EARTHSTAT map of the year 2000 to the HYDE3.3 map of the year 2000 for the corresponding grid cell and crop ($\frac{HArea\_M{2000}_{gc}}{CArea\_Hyde\_N{R}_{g2000}}$). In the case of rice, the specific HYDE3.3 map for cropland area of rice was selected and not altered, as this is readily available. Additionally, in instances where $CArea\_Hyd{e}_{gi}$ was larger than 0 and the sum of all crops across the EARTHSTAT maps of the year 2000 was equal to 0 (e.g., when new lands are cultivated), a progressively expanding area K was evaluated to find an appropriate ratio based on the average of the k neighboring cells. The evaluated values for k were 5, 10, 25, 50, 100, 150, 200 and 250, stopping as soon as a non-zero ratio was found. If no non-zero ratio was found, the ratio was set equal to 1. This last step assumes that the crop distribution in neighboring cells adequately represents the distribution in the newly cultivated area, allowing for the calculation of adjusted harvested areas. Furthermore, as $HArea\_M{2000}_{gc}$ is used for every year, we assumed that the relative crop distribution within each cell remains constant over time. To ensure the accuracy of the generated maps, we capped the harvested area at the maximum feasible value in each cell.
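
The conditional logic of Eqs. (6) to (9) can be summarized for a single grid cell, year and crop as in the sketch below. It is one possible reading of the rules, not the original implementation: the neighborhood averages are assumed to be precomputed and passed in as a sequence (one mean ratio per window size k), cells without cropland are assumed to receive zero harvested area, and a zero HYDE denominator for the year 2000 is assumed to trigger the neighborhood fallback. All argument names are hypothetical.

```python
def neighbour_ratio(ratios_by_window):
    """Eq. (9) fallback: first non-zero mean ratio from growing windows, else 1."""
    for ratio in ratios_by_window:
        if ratio != 0:
            return ratio
    return 1.0


def harvested_area(crop, hyde_total, hyde_rice, hyde_nonrice,
                   earthstat_crop_2000, earthstat_all_2000, hyde_nonrice_2000,
                   ratios_by_window=(), max_area=None):
    """Harvested area of one crop in one grid cell and year."""
    if hyde_total <= 0:
        return 0.0  # assumption: no cropland in the cell means no harvested area
    if crop == "rice":
        area = hyde_rice  # Eqs. (6) and (8): the HYDE rice map is used unaltered
    elif earthstat_all_2000 > 0 and hyde_nonrice_2000 > 0:
        # Eq. (7): scale HYDE cropland by the EARTHSTAT/HYDE ratio of the year 2000
        area = hyde_nonrice * earthstat_crop_2000 / hyde_nonrice_2000
    else:
        # Eq. (9): newly cultivated cell, borrow the ratio from neighboring cells
        area = hyde_nonrice * neighbour_ratio(ratios_by_window)
    if max_area is not None:
        area = min(area, max_area)  # cap at the maximum feasible value of the cell
    return area


# A wheat cell with 6 units of non-rice cropland this year; in 2000 EARTHSTAT
# reported 2 units of wheat against 4 units of HYDE non-rice cropland.
print(harvested_area("wheat", 10.0, 4.0, 6.0, 2.0, 5.0, 4.0))  # 3.0
```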

To ensure consistency with FAOSTAT data used in the model predictions, the gridded harvested area ($HArea\_M\mathrm{1961\_2019}$) was aligned with the country-specific harvested area reported by FAOSTAT ($HArea\_FAO2000$). Additionally, due to this alignment, some cells may initially have harvested area values that exceed the maximum possible for that cell. To correct this, we cap the harvested area at the maximum feasible value per cell and then redistribute any excess proportionally across other cells with harvested area values, ensuring overall consistency with FAOSTAT data. These adjustments, applied through Eq. (10), provided a corrected gridded harvested area for the 13 crop classes over the 60-year period ($HArea\mathrm{\_1961\_2019}$):

$$HAre{a}_{gic}=HArea\_{M}_{gic}\times \frac{\sum _{j\in J}HArea\_FA{O}_{icj}}{\sum _{j\in J}HArea\_{M}_{icj}}$$

(10)

In this equation, $HArea\_FA{O}_{icj}$ represents the harvested area for year i, crop c, and country j as reported by FAOSTAT⁴⁷, summed over all countries (J) in grid cell g (to accommodate grid cells with multiple countries). Similarly, $HArea\_{M}_{icj}$ represents the estimated harvested area for the same combinations, also summed over all countries in grid cell g. The ratio of these sums adjusts the model gridded harvested area ($HArea\_{M}_{gic}$) to match FAOSTAT data, ensuring the resulting gridded harvested area on a country level is consistent with official statistics across the 60-year period.
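
The alignment of Eq. (10) and the subsequent capping can be sketched for the simple case of one crop, one year and cells that belong to a single country (the multi-country summation over J is omitted). The iterative redistribution shown here is only one plausible interpretation of "redistribute any excess proportionally"; names and the stopping rule are assumptions.

```python
def scale_to_faostat(cell_area, faostat_area):
    """Eq. (10), single-country case: rescale cell areas to the FAOSTAT total."""
    model_total = sum(cell_area.values())
    factor = faostat_area / model_total
    return {g: a * factor for g, a in cell_area.items()}


def cap_and_redistribute(cell_area, cell_max, max_iter=100):
    """Cap each cell at its maximum and spread the excess over uncapped cells."""
    area = dict(cell_area)
    for _ in range(max_iter):
        excess = sum(max(0.0, a - cell_max[g]) for g, a in area.items())
        if excess <= 1e-12:
            break
        area = {g: min(a, cell_max[g]) for g, a in area.items()}
        # Only cells that already hold some harvested area and are below their cap.
        free = {g: a for g, a in area.items() if 0 < a < cell_max[g]}
        free_total = sum(free.values())
        if free_total == 0:
            break  # no room left; the remaining excess cannot be placed
        for g, a in free.items():
            area[g] += excess * a / free_total  # proportional to current area
    return area


scaled = scale_to_faostat({"a": 4.0, "b": 1.0}, faostat_area=10.0)
print(scaled)  # {'a': 8.0, 'b': 2.0}
capped = cap_and_redistribute({"a": 8.0, "b": 2.0, "c": 0.0},
                              {"a": 5.0, "b": 5.0, "c": 5.0})
print(capped)  # {'a': 5.0, 'b': 5.0, 'c': 0.0}
```

In the toy example the country total of 10 is preserved after capping, which is the consistency property the text asks for.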

Finally, the gridded harvested area ($HAre{a}_{\mathrm{1961\_2019}}$) was augmented with the average application rate of each predicted fertilizer (N, P₂O₅, K₂O) as per Eq. (11):

$$FertCro{p}_{gic}=HAre{a}_{gic}\times \sum _{j\in J}(Fert\_Pre{d}_{icj}\times PercCountr{y}_{g})$$

(11)

where $Fert\_Pre{d}_{icj}$ is the country-level prediction resulting from the HGB model after applying the adjustment, and $PercCountr{y}_{g}$ refers to the percentage of grid cell g that is occupied by the country j. This process was then applied to each fertilizer separately to obtain gridded maps for each fertilizer, year, and crop combination.
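
Eq. (11) for one grid cell can be written as a short function. In this sketch the share of the cell occupied by each country is given as a fraction between 0 and 1 rather than a percentage, which is an assumption, as are the names and numbers.

```python
def fert_crop_cell(harvested_area, country_share, fert_pred):
    """Eq. (11): harvested area times the country-share-weighted predicted rate.

    country_share: {country: fraction of the cell occupied by that country}.
    fert_pred: {country: adjusted application rate for this crop and year}.
    """
    weighted_rate = sum(fert_pred[j] * share for j, share in country_share.items())
    return harvested_area * weighted_rate


# A border cell split 75/25 between two hypothetical countries A and B.
value = fert_crop_cell(100.0, {"A": 0.75, "B": 0.25}, {"A": 80.0, "B": 40.0})
print(value)  # 7000.0
```

For cells lying entirely within one country the weighted rate reduces to that country's prediction.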

Data Records

The gridded fertilizer application data for N, P₂O₅, and K₂O by crops from 1961 to 2019 are available in a Figshare repository¹⁸³. The dataset spans from 180°E to 180°W and 90°S to 90°N at a resolution of 5 arc-min in WGS84 (EPSG: 4326). It is provided in .tiff format, which can be read by many tools, such as R and Python. The gridded application data by crops and fertilizers are stored in several files named “Crop_NameFertilizerYear.tiff”. Here, “Crop_Name” represents each crop class listed in Table 1, “Fertilizer” refers to N, P₂O₅, or K₂O, and “Year” indicates any year from 1961 to 2019.

Crop-specific N application

On a global scale, the N application has grown for all crops (Fig. 2). For example, the average N use of the three main cereals has risen from 17.1 ± 6.1 kg ha⁻¹, 26.6 ± 7.2 kg ha⁻¹, 12.1 ± 3.9 kg ha⁻¹ for wheat, maize and rice, respectively, in the 1960s, to 97.8 ± 4.2 kg ha⁻¹, 118.8 ± 4.2 kg ha⁻¹, 113.8 ± 1.9 kg ha⁻¹ in the 2010s. Moreover, the largest increases in N application occurred in vegetable crops, with a global growth of more than 120 kg ha⁻¹ between these two decades (Fig. 2). Conversely, the lowest increases occurred in soybean, where N application rates grew by less than 20 kg ha⁻¹. At the regional scale, the intensification of N fertilizer use has shifted from higher use at the beginning of the period in the USA and Europe to being currently dominated by Asian countries such as China and India (Fig. 2). This trend is particularly evident for some crops like vegetables and fruits, where China now has the areas with the highest N use worldwide, whereas in the 1960s, these areas were primarily in Southern Europe and California.

Crop-specific P₂O₅ application

The application of P₂O₅ also experienced global increases across all crops (Fig. 3), but to a lesser extent than N. The average P₂O₅ used for the three main cereals and soybean rose from 13.8 ± 3.3 kg ha⁻¹, 13.1 ± 2.4 kg ha⁻¹, 6.3 ± 1.9 kg ha⁻¹, and 12.6 ± 2.4 kg ha⁻¹ for wheat, maize, rice and soybean, respectively, in the 1960s to 35.5 ± 4.9 kg ha⁻¹, 43.0 ± 5.7 kg ha⁻¹, 39.9 ± 5.0 kg ha⁻¹, and 39.1 ± 6.6 kg ha⁻¹ in the 2010s. Similar to N, the largest increases occurred in vegetable crops, where P₂O₅ application rates increased by more than 50 kg ha⁻¹. Conversely, the smallest increases were observed in the other cereal crop class, where the average P₂O₅ application rate increased by only about 2.5 kg ha⁻¹ between the two decades. Regionally, a similar pattern occurred with P₂O₅ use, following the trend previously seen with N, where the hotspot shifted from Europe to Asia. This shift is particularly notable for wheat, where the hotspot of P₂O₅ intensification moved from Western Europe to northern India and northeastern China (Fig. 3).

Crop-specific K₂O application

Globally, the use of K₂O has also increased across almost all crop classes (Fig. 4). For wheat, maize, rice, and soybean, the average K₂O application rates have risen from 7.2 ± 1.6, 9.8 ± 2.0, 3.4 ± 0.5, and 11.6 ± 2.6 kg ha⁻¹, respectively, to 15.4 ± 4.1, 33.1 ± 4.8, 27.3 ± 3.9, and 9.8 ± 3.2 kg ha⁻¹. Unlike N and P₂O₅, the largest difference in average K₂O application occurred for the oil palm crop, which increased from 3.7 ± 1.4 kg ha⁻¹ during the 1960s to 87.6 ± 8.3 kg ha⁻¹ during the 2010s. Similar to P₂O₅, the other cereal class experienced the smallest change in K₂O use. In this case, the average K₂O application rate decreased from 11.7 ± 1.9 kg ha⁻¹ during the 1960s to 9.8 ± 3.2 kg ha⁻¹ during the 2010s. Regionally, a similar pattern emerged with K₂O, following the trend observed with N and P₂O₅, with the hotspot of K₂O fertilization shifting from Europe and the USA to Asia. However, this change was more pronounced in different crops, such as oil crops, where the use of K₂O has increased significantly in countries like Malaysia and Indonesia (Fig. 4).

Technical Validation

This section provides a detailed discussion of the validation efforts made to confirm the validity, consistency, and plausibility of our compiled dataset and predictions. First, the performance of the ML models is evaluated. Subsequently, we use SHAP values to confirm that our models used sensible features to make their predictions, based on literature. Finally, the predictions are validated by comparing them with reported data in both national and global databases.

ML Model performance

The performance of the ML models predicting the fertilizer application rates for the three fertilizers is shown in Table 5. Both XGB and HGB significantly outperformed the naive prediction, which uses the mean fertilizer application as its prediction. HGB consistently outperformed (or matched) XGB for all three fertilizers and performance metrics. For this reason, we will use the HGB model in the remainder of this technical validation, as well as any subsequent analyses.
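
The comparison against the naive prediction can be sketched as follows. This is a minimal illustration and not the authors' code: the application rates and the model predictions are made-up numbers, the function names are hypothetical, and the use of the training-set mean as the naive prediction and of MAE as the metric are assumptions of this sketch.

```python
from statistics import mean


def mae(y_true, y_pred):
    """Mean absolute error between two equally long sequences."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def naive_mean_predictions(y_train, n_test):
    """Naive baseline: predict the training-set mean for every test sample."""
    baseline = mean(y_train)
    return [baseline] * n_test


# Hypothetical application rates (kg/ha); not values from the dataset.
y_train = [20.0, 40.0, 60.0, 80.0]
y_test = [30.0, 70.0]
model_pred = [35.0, 62.0]  # stand-in for the output of a fitted model

naive_pred = naive_mean_predictions(y_train, len(y_test))
naive_mae = mae(y_test, naive_pred)
model_mae = mae(y_test, model_pred)
print(f"naive MAE = {naive_mae:.1f}, model MAE = {model_mae:.1f}")
```

A model is only informative if its test error is clearly below that of this baseline, which is the sense in which XGB and HGB are compared with the naive prediction in Table 5.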

Table 5 Performances of the eXtreme Gradient Boosting (XGB) and HistGradientBoosting (HGB) models on the test sets in a 2 × 5-fold nested cross validation grid search.

SHAP value analysis

To examine the impact of the features on the prediction of the N, P₂O₅ and K₂O application rates, the SHAP values of the ten most important features for the three corresponding HGB models are illustrated in Fig. 5. Agrological drivers dominated the predictions, comprising six, seven, and eight of the ten highest ranked features, respectively. The direction of the feature impacts remained consistent across all fertilizers, albeit with varying magnitudes (Fig. 5d,e,f). In particular, the predicted fertilizer application rates were consistently positively impacted by the country fertilizer per ha and the crop nutrient removal per ha (red dots, i.e. high values of country fertilizer per ha and high nutrient removal per ha, corresponded with positive SHAP values), while they were negatively impacted by the crop nutrient content (red dots corresponding with negative SHAP values; Fig. 5d,e,f). These relationships align with the expected influence of these features on fertilization at the crop-level¹⁸⁴. Across the different fertilizers, the most important socioeconomic features varied. For instance, the GDP per capita was the most important socioeconomic feature in the prediction of the P₂O₅ and K₂O application rates, while in the N prediction, the global crop price was more important. Fertilization at the country-level is usually associated with the economic development of the country, measured by GDP^{166,185}. However, at the crop-level, this relationship only held true for the most expensive fertilizers, P₂O₅ and K₂O. For N, the most affordable nutrient¹⁵⁶, factors such as global crop price and N cost from production appeared to be more significant (Fig. 5). Few environmental features seemed to be relevant for the predictions (Fig. 5); only the soil pH, soil OCS, and aridity index appeared in the top ten for some nutrients. Although the influence of these variables appeared to be low, the direction of these relationships confirmed the findings of other authors at the farm- or regional-level for soil organic carbon content and soil pH^{143,144,145}.
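
To make the sign convention of the SHAP values concrete, the sketch below computes exact Shapley values for a toy linear model by enumerating all feature orderings. It is an illustration only: the toy model, its coefficients, the feature values, and the single reference point are all assumptions, and the brute-force enumeration stands in for the tree-based SHAP computation applied to the HGB models.

```python
from itertools import permutations


def shapley_values(predict, x, background):
    """Exact Shapley values of predict at x relative to one background point.

    Features are switched from their background value to their value in x
    one at a time, and the resulting change in the prediction is averaged
    over all possible orderings.
    """
    names = list(x)
    contrib = {name: 0.0 for name in names}
    orderings = list(permutations(names))
    for order in orderings:
        current = dict(background)
        prev = predict(current)
        for name in order:
            current[name] = x[name]
            value = predict(current)
            contrib[name] += value - prev
            prev = value
    return {name: total / len(orderings) for name, total in contrib.items()}


def toy_rate(f):
    """Hypothetical model: NOT the fitted HGB model, coefficients are made up."""
    return (0.5 * f["country_fertilizer_per_ha"]
            + 2.0 * f["nutrient_removal_per_ha"]
            - 10.0 * f["nutrient_content"])


# Hypothetical sample with above-reference values for all three features.
x = {"country_fertilizer_per_ha": 120.0,
     "nutrient_removal_per_ha": 30.0,
     "nutrient_content": 2.0}
background = {"country_fertilizer_per_ha": 80.0,
              "nutrient_removal_per_ha": 20.0,
              "nutrient_content": 1.5}

phi = shapley_values(toy_rate, x, background)
for name, value in phi.items():
    print(f"{name}: {value:+.1f}")
```

In this toy setting, a high country fertilizer per ha and a high nutrient removal per ha receive positive contributions, whereas a high nutrient content receives a negative one, mirroring the pattern of red dots described for Fig. 5d,e,f. The contributions sum to the difference between the prediction for the sample and the prediction for the reference point.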

Validation

To evaluate the validity of our results, we compared the compiled dataset based on the predictions against several national databases^{38,39,40,41,42,43,44,45,50,51,52,54,55,56,57,58}, using the MAE and MAPE between the two averaged over the available years (Table 6). For most country/crop combinations, the differences are within reasonable ranges, with MAE values of 5–40 kg ha⁻¹ and MAPE values of 10–50%. However, for some countries the deviations are larger, suggesting that our models may not capture all the underlying intricacies in the data for each country or crop. This can be seen for Sweden, where most results deviate by 20–100%, and for New Zealand, where similar results are found. It should be noted, however, that these larger differences between our compiled dataset and the national databases cover only a limited number of years, as data were not always available for certain countries, as was the case for Sweden and New Zealand. Still, these discrepancies are slightly smaller than in earlier research⁹. Additionally, more detailed plots evaluating the results per year for the USA and UK, based on USDA and DEFRA data respectively, are included in Figs. 6 and 7. For the USDA and DEFRA crop nutrient data, the MAPE values are less than 50% and usually less than 25%, except for USDA soybean N (Fig. 6). Figures 6 and 7 show that our predictions follow the observed trend and thus form a reliable source with only minimal differences. The discrepancies between the national databases and our compiled dataset can be attributed to occasional disparities between the application rates in the training data (data provided by the global dataset compilation) and the data in the national databases; e.g., the USA data for soybean N in 1998 differed by 400% between the two sources. These differences should be taken into account when comparing our results to the national databases, as our predictions are based on the global compiled dataset. As can be seen in Table 7, where the data from the global databases and the national databases are compared based on MAE and MAPE, most country/crop combinations show MAPE values of 10–50%, which is similar to our resulting error in Table 6. In addition, the lack of training samples for some country/crop combinations resulted in larger errors for these cases.
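
The MAE and MAPE used in this comparison can be sketched for a single country/crop/fertilizer combination as follows. This is a minimal sketch under stated assumptions: the function name and the numbers are hypothetical, the national database value is taken as the reference in the MAPE denominator, and only years present in both sources are compared.

```python
def mae_mape(predicted, reported):
    """MAE (kg/ha) and MAPE (%) over the years present in both sources.

    predicted and reported map year -> application rate in kg/ha.
    The reported (national database) value is assumed to be the reference,
    so it must be non-zero for every shared year.
    """
    years = sorted(set(predicted) & set(reported))
    if not years:
        raise ValueError("no overlapping years between the two sources")
    abs_err = [abs(predicted[y] - reported[y]) for y in years]
    mae = sum(abs_err) / len(years)
    mape = 100 * sum(e / abs(reported[y]) for e, y in zip(abs_err, years)) / len(years)
    return mae, mape, years


# Hypothetical rates (kg/ha) for one country/crop; not values from Table 6.
predicted = {2010: 110.0, 2011: 90.0, 2012: 100.0}
reported = {2010: 100.0, 2011: 100.0, 2013: 120.0}

mae, mape, years = mae_mape(predicted, reported)
print(f"years compared: {years}, MAE = {mae:.1f} kg/ha, MAPE = {mape:.1f}%")
```

Because the average runs only over the overlapping years, a combination with few reported years (as for Sweden and New Zealand) rests on very few terms, which is one reason its error estimate is less stable.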

Table 6 Validation of our model predictions of the average application rate per ha against national database information for certain countries and crops per fertilizer.

Table 7 Comparison of the data reported by global datasets^{16,17,18,19,20,21,22,32,33,34,35,36,37} of the average application rate per fertilizer per ha against national database information for certain countries and crops per fertilizer.

To conclude, the model performances and logical feature importances, derived from the SHAP values, in conjunction with the relatively minor differences between this study and regional statistics, as well as earlier literature⁹, indicate that our crop-specific fertilizer application rate dataset is comparatively reasonable across regions and years.

Usage Notes

In this study, we provide detailed estimates of global N, P₂O₅, and K₂O fertilizer application rates based on the HGB model output and compile a comprehensive dataset of these estimates by major crop groups for 1961–2019. Tabular data of the country- and crop-level predictions are made available, as well as the 5-arcmin resolution gridded maps, yielding a complete and easy-to-use dataset. Subsequent analyses can be performed on both the tabular data and the maps, such as a trend analysis of fertilizer application rates or causal discovery to identify drivers of fertilizer application rates. Furthermore, our dataset can be leveraged as an input to other models in which, for example, yield, ecological impact, or fertilizer pricing is the output rather than fertilizer use.

Our results represent an advance in efforts to evaluate historical fertilizer consumption for different crop groups and fertilizers. However, as demonstrated during the validation process, this approach still has limitations of which users of the dataset should be aware. The limited amount of available data for some crops, nutrients, and regions can lead to biases, particularly in regions such as Africa, during certain years, especially in the 1960s, and for certain nutrients, mainly K₂O. Hence, the ML approach can be sensitive to outlying data points or noise, and the limited data can make it prone to overfitting, which was mitigated as much as possible in the CV setup. In addition, our model is trained on data provided by global datasets^{16,17,18,19,20,21,22,31,32,33,34,35,36,37}, which means that while our predictions may align closely with them, they might diverge from national data, mainly due to the difference between the two data sources, as highlighted by the validation. This discrepancy between global and national databases such as the USDA³⁸ or DEFRA⁵⁰ databases highlights the complexity of accurately capturing historical fertilizer consumption trends across different regions and crop types. Moreover, the gridded cropland data provided by the HYDE 3.3 project³⁰ are inconsistent with those from satellite-derived land use (e.g., China and India^{186,187}) or data derived from a national census at regional scale (e.g., USA¹⁸⁸), as stated by Adalibieke et al.⁹. Furthermore, utilizing neighboring cells to allocate harvested areas across different crops, as well as leveraging the EARTHSTAT map²⁹, implies some assumptions (see Eqs. (6) to (9)). The main assumption is that the distribution pattern of a specific cell mirrors that of its neighboring cells, which constrains potential changes in cropland over cells. The consistent use of the EARTHSTAT map²⁹ of the year 2000 also assumes that the crop group distribution of harvested area remains constant over 1961–2019. Finally, it is important to recognize that there are additional uncertainties stemming from the utilization of various data sources and methodological decisions within each data source, but these lie beyond the scope of our study.

Nevertheless, our study extends the current literature by providing a more detailed historical geospatial distribution of fertilizer application rates by crop and by using ML to obtain detailed predictions with high precision. The detailed description and open-source code, in combination with the limited data sources used and the ability to forecast, also make the method reproducible and easy to extend to forecasting fertilizer application rates. In addition, our approach does not entail any assumptions, making it more flexible and robust than previous studies. Future research can build upon our study by providing more detailed, specific fertilizer application rates. Considering the frequency and timing of fertilizer application can be valuable for future research on the evaluation of fertilizer effectiveness and use. Our study focuses on broad fertilizer applications; however, more detailed maps can be made for the specific types of each fertilizer considered in our study (e.g., N fertilizer types). Furthermore, the time granularity of our maps can be improved. In addition, satellite data can be used to obtain even more fine-grained predictions, both across regions and over more detailed time periods. Finally, a deeper investigation into the drivers of fertilizer application rates could enrich our understanding. While our focus has primarily been on the explainability of our model, exploring methodologies such as causal discovery or causal ML within a temporal setting could unveil the drivers of fertilizer application rates over time, potentially providing valuable insights and facilitating more detailed predictions.

Table 8 Overview of the used open source packages and respective programming language in the code for model training, SHAP value computation, validation and map building.

Code availability

Our Python (3.10.3) code, encompassing model training, prediction generation, SHAP value computation, model validation, and plot creation, as well as the R (4.2.2) scripts for map generation, are made available alongside the provided data map resources¹⁸⁹. The open-source packages used in the code are tabulated with their respective versions in Table 8. Access to these resources is available at the designated location^{183,189}.

Change history

17 February 2025
A Correction to this paper has been published: https://doi.org/10.1038/s41597-025-04591-y

References

Xu, G., Fan, X. & Miller, A. J. Plant nitrogen assimilation and use efficiency. Annual Review of Plant Biology 63, 153–182, https://doi.org/10.1146/annurev-arplant-042811-105532 (2012).
Shen, J. et al. Phosphorus dynamics: From soil to plant. Plant Physiology 156, 997–1005, https://doi.org/10.1104/pp.111.175232 (2011).
Sardans, J. & Peñuelas, J. Potassium control of plant functions: Ecological and agricultural implications. Plants 10 https://doi.org/10.3390/plants10020419 (2021).
Sardans, J. & Peñuelas, J. Potassium: A neglected nutrient in global change. Global Ecology and Biogeography 24, 261–275, https://doi.org/10.1111/geb.12259 (2015).
Ludemann, C. I. et al. A global FAOSTAT reference database of cropland nutrient budgets and nutrient use efficiency (1961–2020): nitrogen, phosphorus and potassium. Earth System Science Data 16, 525–541, https://doi.org/10.5194/essd-16-525-2024 (2024).
Xu, R. et al. Increased nitrogen enrichment and shifted patterns in the world’s grassland: 1860-2016. Earth System Science Data 11, 175–187, https://doi.org/10.5194/essd-11-175-2019 (2019).
Sutton, M. et al. Our Nutrient World: The challenge to produce more food and energy with less pollution (Centre for Ecology and Hydrology (CEH), Edinburgh UK on behalf of the Global Partnership on Nutrient Management and International Nitrogen Initiative., 2013).
Penuelas, J., Janssens, I. A., Ciais, P., Obersteiner, M. & Sardans, J. Anthropogenic global shifts in biospheric n and p concentrations and ratios and their impacts on biodiversity, ecosystem productivity, food security, and human health. Global Change Biology 26, 1962–1985, https://doi.org/10.1111/gcb.14981 (2020).
Adalibieke, W., Cui, X., Cai, H., You, L. & Zhou, F. Global crop-specific nitrogen fertilization dataset in 1961–2020. Scientific data 10, 617, https://doi.org/10.1038/S41597-023-02526-Z (2023).
Lu, C. & Tian, H. Global nitrogen and phosphorus fertilizer use for agriculture production in the past half century: Shifted hot spots and nutrient imbalance. Earth System Science Data 9, 181–192, https://doi.org/10.5194/essd-9-181-2017 (2017).
Nishina, K., Ito, A., Hanasaki, N. & Hayashi, S. Reconstruction of spatially detailed global map of NH₄⁺ and NO₃⁻ application in synthetic nitrogen fertilizer. Earth System Science Data 9, 149–162, https://doi.org/10.5194/essd-9-149-2017 (2017).
MacDonald, G. K., Bennett, E. M., Potter, P. A. & Ramankutty, N. Agronomic phosphorus imbalances across the world’s croplands. Proceedings of the National Academy of Sciences of the United States of America 108, 3086–3091, https://doi.org/10.1073/pnas.1010808108 (2011).
Cao, P., Lu, C. & Yu, Z. Historical nitrogen fertilizer use in agricultural ecosystems of the contiguous united states during 1850-2015: Application rate, timing, and fertilizer types. Earth System Science Data 10 https://doi.org/10.5194/essd-10-969-2018 (2018).
Yu, Z., Liu, J. & Kattel, G. Historical nitrogen fertilizer use in china from 1952 to 2018. Earth System Science Data 14, 5179–5194, https://doi.org/10.5194/essd-14-5179-2022 (2022).
Conant, R. T., Berdanier, A. B. & Grace, P. R. Patterns and trends in nitrogen use and nitrogen recovery efficiency in world agriculture. Global Biogeochemical Cycles 27, 558–566, https://doi.org/10.1002/gbc.20053 (2013).
Ludemann, C. I., Gruere, A., Heffer, P. & Dobermann, A. Global data on fertilizer use by crop and by country. Scientific Data 9, 502, https://doi.org/10.1038/s41597-022-01592-z (2022).
Martinez, A. & Diamond, R. B. Fertilizer Use Statistics In Crop Production. Tech. Rep. T-24, International Fertilizer Development Center (1982).
Martinez, A. Fertilizer use statistics and crop yields. Tech. Rep. T-37, International Fertilizer Development Center, Muscle Shoals, Alabama (1990).
EFMA. Fertilizer application by crop in EU countries 2001/02. Tech. Rep., EFMA (unpublished).
EFMA. Fertilizer application by crop in EU countries 2006/07. Tech. Rep., EFMA (unpublished).
Fertilizer Europe. Fertilizer application by crop in EU countries 2011/12. Tech. Rep., Fertilizer Europe (unpublished).
Fertilizer Europe. Fertilizer application by crop in EU countries 2014/15. Tech. Rep., Fertilizer Europe (unpublished).
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 13-17-August-2016, 785–794, https://doi.org/10.1145/2939672.2939785 (2016).
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12, 2825–2830 (2011).
Lundberg, S. M., Allen, P. G. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. Advances in Neural Information Processing Systems 30 (2017).
Zou, T., Zhang, X. & Davidson, E. A. Global trends of cropland phosphorus use and sustainability challenges. Nature 611, 81–87, https://doi.org/10.1038/s41586-022-05220-z (2022).
Lassaletta, L., Billen, G., Grizzetti, B., Anglade, J. & Garnier, J. 50 year trends in nitrogen use efficiency of world cropping systems: The relationship between yield and nitrogen input to cropland. Environmental Research Letters 9, 105011, https://doi.org/10.1088/1748-9326/9/10/105011 (2014).
Einarsson, R. et al. Crop production and nitrogen use in european cropland and grassland 1961–2019. Scientific Data 8, 288, https://doi.org/10.1038/s41597-021-01061-z (2021).
Monfreda, C., Ramankutty, N. & Foley, J. A. Farming the planet: 2. geographic distribution of crop areas, yields, physiological types, and net primary production in the year 2000. Global biogeochemical cycles 22 https://doi.org/10.1029/2007GB002947 (2008).
Klein Goldewijk, K., Beusen, A., Doelman, J. & Stehfest, E. Anthropogenic land use estimates for the holocene–hyde 3.2. Earth System Science Data 9, 927–953, https://doi.org/10.5194/essd-9-927-2017 (2017).
FAO, IFA & IFDC. Fertilizer use by crop 1. Tech. Rep. ESS/MISC/1992/3, FAO, Viale delle Terme di Caracalla, 00100, Rome (1992).
FAO, IFA & IFDC. Fertilizer use by crop 2. Tech. Rep. ESS/MISC/1994/4, FAO, Viale delle Terme di Caracalla, 00100, Rome (1994).
FAO, IFA & IFDC. Fertilizer use by crop 3. Tech. Rep. ESS/MISC/1996/1, FAO, Viale delle Terme di Caracalla, 00100, Rome (1996).
FAO, IFA & IFDC. Fertilizer use by crop 4. Tech. Rep., FAO, Viale delle Terme di Caracalla, 00100, Rome (1999).
FAO, IFA & IFDC. Fertilizer use by crop 5. Tech. Rep., FAO, Viale delle Terme di Caracalla, 00100, Rome (2002).
Heffer, P. Assessment of Fertilizer Use by Crop at the Global Level 2006/07 - 2007/08. Tech. Rep. A/09/55, International Fertilizer Industry Association, rue Marbeuf - 75008, Paris (2009).
Heffer, P., Gruère, A. & Roberts, T. Assessment of fertilizer use by crop at the global level 2014-2014/15. Tech. Rep. A/17/134, International Fertilizer Industry Association & International Plant Nutrition Institute (2017).
USDA. Fertilizer use and price https://www.ers.usda.gov/data-products/fertilizer-use-and-price/ (2019).
Agricultural Census Division. All india bulletin on input survey 1986-87. Tech. Rep., Government of India (1992).
Agricultural Census Division. All india report on input survey 1991-1992. Tech. Rep. AGRI/2000-1 A, Government of India (2000).
Agricultural Census Division. All india report on input survey 1996-1997. Tech. Rep., Government of India (2007).
Agriculture Census Division. All india report on input survey 2001-02. Tech. Rep., Ministry of Agriculture (2008).
Agriculture Census Division. All india report on input survey 2006-07. Tech. Rep., Ministry of Agriculture (2012).
Agriculture Census Division. All india report on input survey 2011-12. Tech. Rep., Minister of Agriculture & Farmers Welfare (2016).
Agriculture Census Division. All india report on input survey 2016-17. Tech. Rep., Minister of Agriculture, Cooperation & Farmers Welfare (2021).
FAO. World programme for the census of agriculture 2020: Programme, concepts and definitions (2015).
FAOSTAT. Crops and livestock products https://www.fao.org/faostat/en/#data/QCL (2023).
Heffer, P. Assessment of Fertilizer Use by Crop at the Global Level 2010 - 2010/11. Tech. Rep. A/13/111, International Fertilizer Industry Association, rue Marbeuf - 75008, Paris (2013).
Maiz’Europ’. Figures https://www.maizeurop.com/en/structure/cepm/figures/ (2024).
DEFRA. British survey of fertiliser practice dataset https://www.gov.uk/government/statistical-data-sets/british-survey-of-fertiliser-practice-dataset (2023).
NFDC. Statistics national fertilizer development centre http://www.nfdc.gov.pk/stat.html (2023).
Philippine Statistics Authority. Estimated inorganic fertilizer use by geolocation, grade and year, by area harvested and year https://openstat.psa.gov.ph/ (2023).
Briones, R. M. The fertilizer industry and philippine agriculture: Policies, problems, and priorities. Philippine Journal of Development 43 (2017).
Official Statistics of Sweden. Use of fertilisers and animal manure in agriculture in 2010/11. Tech. Rep. SM 1203, Official Statistics of Sweden (2014).
Official Statistics of Sweden. Use of fertilisers and animal manure and cultivation measures in agriculture 2012/13. Tech. Rep. SM 1402, Official Statistics of Sweden (2014).
Official Statistics of Sweden. Use of fertilisers and animal manure and cultivation measures in agriculture 2015/16. Tech. Rep. SM 1702, Official Statistics of Sweden (2017).
Official Statistics of Sweden. Use of fertilisers and animal manure and cultivation measures in agriculture 2018/19. Tech. Rep. SM 2002, Official Statistics of Sweden (2020).
NZ, S. Agricultural production survey https://www.stats.govt.nz/indicators/fertilisers-nitrogen-and-phosphorus (2021).
FAOSTAT. Fertilizers by nutrient https://www.fao.org/faostat/en/#data/RFN (2023).
CEPAL, FAO & BID. El uso de fertilizantes en Argentina. Tech. Rep. E/CN.12/741, CEPAL (1966).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2006 https://fertilizar.org.ar/estadisticas/ (2006).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2011 https://fertilizar.org.ar/estadisticas/ (2011).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2012 https://fertilizar.org.ar/estadisticas/ (2012).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2013 https://fertilizar.org.ar/estadisticas/ (2013).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2014 https://fertilizar.org.ar/estadisticas/ (2014).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2015 https://fertilizar.org.ar/estadisticas/ (2015).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2016 https://fertilizar.org.ar/estadisticas/ (2016).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2017 https://fertilizar.org.ar/estadisticas/ (2017).
Fertilizar Asociación Civil. Consumo de fertilizantes por cultivo 2018 https://fertilizar.org.ar/estadisticas/ (2018).
FAO. Fertilizer use by crop in Brazil. Tech. Rep. TC/D/Y5376E/1/05.04/300, FAO, Viale delle Terme di Caracalla 00100, Rome (2004).
Boddey, R. M., Xavier, D. F., Alves, B. J. & Urquiaga, S. Brazilian agriculture: The transition to sustainability. Journal of Crop Production 9, 593–621, https://doi.org/10.1300/J144v09n01_10 (2003).
Pogue, S. J. et al. Regionalized life cycle inventory data collection and calculation for perennial forage production in canada: methodological best practices and limitations. International Journal of Life Cycle Assessment https://doi.org/10.1007/s11367-023-02199-1 (2023).
Beaton, J. D. & Berger, J. Present and Potential Use of Fertilizer for Forage Production in Temperate Zones, chap. 2, 17–37 (American Society of Agronomy, 1974).
CEPAL, FAO & BID. El uso de fertilizantes en Chile. Tech. Rep. E/CN.12/757, CEPAL (1966).
Nuñez, R. D. Estudio sobre el mercado de fertilizantes en la República Dominicana. Tech. Rep. IICA/E72/9, Instituto Interamericano de Cooperación para la Agricultura, United States Agency for International Development, Secretariado Técnico de la Presidencia Oficina Nacional de Planificación ONAPLAN (1999).
Michaud, R., Lehman, W. F. & Rumbaugh, M. D. World distribution and historical development, chap. 2. Agronomy Monographs (American Society of Agronomy, 1988).
Nelson, L. B. et al. Changing patterns in fertilizer use (Soil Science Society of America, inc, 1968).
Russel, D. A., Joe Free, W. & McCune, D. L. Potential for Fertilizer Use on Tropical Forages, chap. 3, 39–65 (American Society of Agronomy, 1974).
Casanova, E. Problemática de los fertilizantes en Venezuela. Venesuelos 12, 5–16, http://saber.ucv.ve/ojs/index.php/rev_venes/article/view/969 (2004).
Dubeux, J. C., Sollenberger, L. E., Mathews, B. W., Scholberg, J. M. & Santos, H. Q. Nutrient cycling in warm-climate grasslands. Crop Science 47, 915–928, https://doi.org/10.2135/cropsci2006.09.0581 (2007).
Rawnsley, R. P., Smith, A. P., Christie, K. M., Harrison, M. T. & Eckard, R. J. Current and future direction of nitrogen fertiliser use in australian grazing systems. Crop and Pasture Science 70, 1034–1043, https://doi.org/10.1071/CP18566 (2019).
Barrow, P. M. Potassium fertilizer use in south australia. Agronomy branch report 1, 1–11 (1968).
Williams, P. H. & Haynes, R. J. Influence of improved pastures and grazing animals on nutrient cycling within new zealand soils. New Zealand Journal of Ecology 14, 49–57 (1990).
Buchgraber, K., Schaumberger, A. & Pötsch, E. M. Grassland farming in Austria - status quo and future prospective. Grassland Science in Europe 16, 13–24 (2011).
Vandermoere, S. Improving agricultural phosphorus use efficiency and reducing soil phosphorus losses at the field scale. Ph.D. thesis, Ghent University (2020).
Németh, T. Past, Present and Future Status of N-Fertilization Policies in Hungary, 243–252 (Springer Netherlands, 2000).
Raup, P. M. Postwar recovery of Western German agriculture. Journal of Farm Economics 32, 1–14, https://doi.org/10.2307/1233160 (1950).
Petersen, H. J. S. Forecasting Danish nitrogen fertilizer consumption. Industrial Marketing Management 6, 211–221, https://doi.org/10.1016/0019-8501(77)90020-7 (1977).
Virkajärvi, P. et al. Dairy production systems in Finland. 18th Symposium of the European Grassland Federation, Grassland and forages in high output dairy farming systems 20, 51–66 (2015).
Edwards, G. R. & Abbott, G. W. The Agricultural Economy of Finland, vol. 169 of ERS foreign (US Department of Agriculture, Economic Research Service, 1966).
Noë, J. L., Billen, G., Esculier, F. & Garnier, J. Long-term socioecological trajectories of agro-food systems revealed by n and p flows in french regions from 1852 to 2014. Agriculture, Ecosystems and Environment 265, 132–143, https://doi.org/10.1016/j.agee.2018.06.006 (2018).
Ministère de l’Agriculture. Les Prairies en 1982, vol. 233 of Collections de statistique agricole (Ministère de l’Agriculture, 1984).
Rabaud, V. & Cesses, M. Enquête sur les pratiques culturales - 2001. Tech. Rep. 159, Ministère de l’Agriculture (SSP) (2004).
Ministère de l’Agriculture (SSP). Pratiques culturales - 2006. https://agreste.agriculture.gouv.fr/agreste-web/disaron/Dos8/detail/, https://doi.org/10.34724/CASD.63.325.V2 (2006).
Ministère de l’Agriculture (SSP). Pratiques culturales sur les grandes cultures - 2017. https://agreste.agriculture.gouv.fr/agreste-web/disaron/Chd2009/detail/, https://doi.org/10.34724/CASD.56.3033.V1 (2020).
Pearson, H. A., Herbel, C. H. & Pendleton, D. T. A tour of East German agriculture. Rangelands 1, 9–11 (1979).
Panos, D., Sotiriadis, S. & Fikas, E. Grassland’s progress in Greece. Der Züchter 31, 37–47, https://doi.org/10.1007/BF00709827 (1961).
Hou, Y., Ma, L., Sárdi, K., Sisák, I. & Ma, W. Nitrogen flows in the food production chain of hungary over the period 1961–2010. Nutrient Cycling in Agroecosystems 102, 335–346, https://doi.org/10.1007/s10705-015-9703-8 (2015).
Murphy, W. E., O’Keeffe, W. F. & Taluntais, F. Fertiliser use surveys, 1972, 1974 & 1975. Tech. Rep. 14, Fertiliser Association of Ireland (1978).
Murphy, W. E. & O’Keefe, W. F. Fertiliser use survey 1981-82. Tech. Rep. 24, The Fertiliser Association of Ireland (1983).
Murphy, W. E. & O’Keeffe, W. F. Fertiliser use survey. Tech. Rep. 27, The Fertiliser Association of Ireland (1987).
Coulter, B. S., Murphy, W. E., Culleton, N., Quinlan, G. & Connolly, L. A survey of fertilizer use from 2001-2003 for grassland and arable crops. Tech. Rep., Teagasc (2005).
Lalor, S., Coulter, B. S., Quinlan, G. & Connolly, L. A survey of fertilizer use in Ireland from 2004-2008 for grasslands and arable crops. Tech. Rep., Teagasc (2010).
Dillon, E., Buckley, C., Moran, B., Lennon, J. & Wall, D. Teagasc national farm survey, fertiliser use survey 2005-2015. Tech. Rep., Teagasc (2018).
Ireland CSO. Table ACEN2: Area used by rural district, farm land utilisation and year (1926–1980) https://data.cso.ie/ (2007).
Ireland CSO. Table AQA02: Farm land utilisation in june by type of land use, year and region (1980–1999) https://data.cso.ie/ (2020).
Ireland CSO. Table AQA01: Area farmed in june by type of land use, year and region (1991–2007) https://data.cso.ie/ (2021).
Ireland CSO. Table AQA05: Area farmed in june by type of land use, year and region (2008–2012) https://data.cso.ie/ (2020).
Ireland CSO. Table AQA06: Area farmed in june by type of land use, year and region (2013–2023) https://data.cso.ie/ (2024).
Walsh, T., Ryan, P. F. & Kilroy, J. A half century of fertiliser and lime use in Ireland. Journal of the Statistical and Social Inquiry Society of Ireland 19, 104–136 (1957).
Heavy, J. F. The economic optimum use of fertilisers in Ireland. Tech. Rep. 2, Fertiliser Association of Ireland (1969).
Prins, W. H. Limits to nitrogen fertilizer on grassland. Ph.D. thesis, Wageningen University & Research (1983).
Kurek, E. The fertilizing of main crops in peasant farming and its effectiveness. Zagadnienia Ekonomiki Rolnej 3, 59–71 (1971).
Jiménez, P. G.-S., Marotta, J. J. L., Criado, S. R. & García, M. N. Guía Práctica de la Fertilización Racional de los Cultivos en España. Parte 1. (Ministerio de Medio Ambiente y Medio Rural y Marino, 2010).
Åberg, E. Recent changes in swedish crop production. Advances in Agronomy 7, 39–74 (1955).
Church, B. M. & Webber, J. Fertiliser practice in england and wales: A new series of surveys. Journal of the Science of Food and Agriculture 22, 1–7, https://doi.org/10.1002/jsfa.2740220102 (1971).
DEFRA. United Kingdom land areas, livestock numbers and agricultural workforce on agricultural holdings on 1 June https://www.gov.uk/government/statistical-data-sets/structure-of-the-agricultural-industry-in-england-and-the-uk-at-june (2024).
DEFRA. Crops areas and livestock numbers in England from the June Census of Agriculture: 1900-2010 https://www.gov.uk/government/statistical-data-sets/structure-of-the-agricultural-industry-in-england-and-the-uk-at-june (2024).
Jóhannesson, T. Agriculture in Iceland Conditions and Characteristics (The Agricultural University of Iceland, 2010).
Helgadóttir, A., Eythórsdóttir, E. & Jóhannesson, T. Agriculture in iceland - a grassland based production. Grassland Science in Europe 18, 30–43 (2013).
Office fédéral de la statistique. Surface agricole utile sans les alpages https://www.bfs.admin.ch/bfs/fr/home/statistiques/agriculture-sylviculture/agriculture.assetdetail.30245951.html (2024).
Nordgård, A. Orientation and intensity of Norwegian agriculture. Norsk Geografisk Tidsskrift 29, 169–220, https://doi.org/10.1080/00291957508551985 (1975).
Steinshamn, H., Nesheim, L. & Bakken, A. K. Grassland production in Norway. Grassland Science in Europe 21, 15–25 (2016).
The Federal Administration for Plant Protection and Veterinarian Medicine. Yugoslavia: Country report to the FAO International Technical Conference on Plant Genetic Resources (Leipzig, 1996). Tech. Rep., FAO (1995).
Lugić, Z., Lazarević, D., Erić, P., Mihajlović, V. & Vučković, S. The state of forage crops production in Serbia. Biotechnology in Animal Husbandry 26, 29–47 (2010).
Loza, G. & Kurtsev, I. The growth of productive forces in agriculture in the tenth five-year plan. Problems in Economics 19, 3–22, https://doi.org/10.2753/pet1061-199119103 (1977).
Klatt, W. Reflections on the 1975 Soviet harvest. Soviet Studies 28, 485–498, https://doi.org/10.1080/09668137608411087 (1976).
Shend, J. Y. Agricultural statistics of the former USSR republics and the Baltic States. No. 863 in Statistical bulletin (United States Department of Agriculture) (U.S. Department of Agriculture, 1993).
Suleimenov, M. & Oram, P. Trends in feed, livestock production, and rangelands during the transition period in three Central Asian countries. Food Policy 25, 681–700, https://doi.org/10.1016/S0306-9192(00)00037-3 (2000).
Zhang, F., Qi, J., Li, F. M., Li, C. S. & Li, C. B. Quantifying nitrous oxide emissions from Chinese grasslands with a process-based model. Biogeosciences 7, 2039–2050, https://doi.org/10.5194/bg-7-2039-2010 (2010).
Ushimaru, A., Uchida, K. & Suka, T. Grassland Biodiversity in Japan: Threats, Management and Conservation, chap. 9, 22 (Taylor & Francis Group, 2017).
Lee, B. H., Kim, J. Y., Sung, K. I. & Kim, B. W. Investigation on the actual state of grassland in Republic of Korea. Journal of The Korean Society of Grassland and Forage Science 39, https://doi.org/10.5333/kgfs.2019.39.2.89 (2019).
FAO. Fertilizer use by crop in Indonesia. Tech. Rep. TC/D/Y7063E/1/05.05/300, FAO, Viale delle Terme di Caracalla 00100 Rome (2005).
Ghosh, P. K., et al. (eds.) The Indian Nitrogen Assessment: Sources of Reactive Nitrogen, Environmental and Climate Effects, Management Options, and Policies, 187–205, https://doi.org/10.1016/B978-0-12-811836-8.00013-6 (Elsevier, 2017).
Irfan, M. & Hasnain, N. Nitrogen emissions from agriculture sector in Pakistan: context, pathways, impacts and future projections. In Aziz, T. et al. (eds.) Nitrogen Assessment: Pakistan as a Case-Study, chap. 6, 99–125, https://doi.org/10.1016/B978-0-12-824417-3.00008-3 (Academic Press, 2022).
Elrys, A. S., Abdel-Fattah, M. K., Raza, S., Chen, Z. & Zhou, J. Spatial trends in the nitrogen budget of the African agro-food system over the past five decades. Environmental Research Letters 14, https://doi.org/10.1088/1748-9326/ab5d9e (2019).
FAO. Fertilizer use by crop in Egypt. Tech. Rep. TC/D/Y5863E/1/01.05/300, FAO, Viale delle Terme di Caracalla, 00100 Rome, Italy (2005).
Esfahani, H. S. Aggregate trends in four main agricultural regions in Egypt, 1964–1979. International Journal of Middle East Studies 20, 135–164, https://doi.org/10.1017/S0020743800033900 (1988).
Bounejmate, M. The Role of Legumes in the Farming Systems of the Mediterranean Areas: proceedings of a workshop on the role of legumes in the farming systems of the Mediterranean areas, UNDPI ICARDA, vol. 38 of Developments in plant and soil science, chap. The role of legumes in the farming systems of Morocco, 85–93 (1 edn, Kluwer Academic Publishers, 1990).
FAO. Utilisation des engrais par culture au Maroc. Tech. Rep. TC/D/A710F/1/10.06/300, FAO, Viale delle Terme di Caracalla, 00100 Rome, Italy (2006).
Niedertscheider, M., Gingrich, S. & Erb, K. H. Changes in land use in South Africa between 1961 and 2006: An integrated socio-ecological analysis based on the human appropriation of net primary production framework. Regional Environmental Change 12, 715–727, https://doi.org/10.1007/s10113-012-0285-6 (2012).
Smith, A. & Rhind, J. M. Eight decades of pasture plant improvement in South Africa. Journal of the Grassland Society of Southern Africa 1, 25–28, https://doi.org/10.1080/02566702.1984.9647962 (1984).
Marenya, P. P. & Barrett, C. B. Soil quality and fertilizer use rates among smallholder farmers in western Kenya. Agricultural Economics 40, 561–572, https://doi.org/10.1111/j.1574-0862.2009.00398.x (2009).
Bora, K. Rainfall shocks and fertilizer use: A district level study of India. Environment and Development Economics 27, 556–577, https://doi.org/10.1017/S1355770X21000413 (2022).
Levers, C., Butsic, V., Verburg, P. H., Müller, D. & Kuemmerle, T. Drivers of changes in agricultural intensity in Europe. Land Use Policy 58, 380–393, https://doi.org/10.1016/j.landusepol.2016.08.013 (2016).
Harris, I., Osborn, T. J., Jones, P. & Lister, D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Scientific Data 7, 190, https://doi.org/10.1038/s41597-020-0453-3 (2020).
Poggio, L. et al. SoilGrids 2.0: Producing soil information for the globe with quantified spatial uncertainty. SOIL 7, 217–240, https://doi.org/10.5194/soil-7-217-2021 (2021).
Middleton, N. & Thomas, D. S. World atlas of desertification, https://doi.org/10.2307/3060449 (1992).
Ju, X., Gu, B., Wu, Y. & Galloway, J. N. Reducing China’s fertilizer use by increasing farm size. Global Environmental Change 41, 26–32, https://doi.org/10.1016/j.gloenvcha.2016.08.005 (2016).
FAOSTAT. Structural data from agricultural censuses https://www.fao.org/faostat/en/#data (2024).
Lowder, S. K., Skoet, J. & Raney, T. The number, size, and distribution of farms, smallholder farms, and family farms worldwide. World Development 87, 16–29, https://doi.org/10.1016/j.worlddev.2015.10.041 (2016).
Jordan-Meille, L. et al. An overview of fertilizer-P recommendations in Europe: Soil testing, calibration and fertilizer recommendations. Soil Use and Management 28, 419–435, https://doi.org/10.1111/j.1475-2743.2012.00453.x (2012).
Feder, G., Just, R. E. & Zilberman, D. Adoption of agricultural innovations in developing countries: a survey. Economic Development & Cultural Change 33, 255–298, https://doi.org/10.1086/451461 (1985).
Hossain, M. & Singh, V. P. Fertilizer use in Asian agriculture: Implications for sustaining food security and the environment. Nutrient Cycling in Agroecosystems 57, 155–169, https://doi.org/10.1023/A:1009865819925 (2000).
FAOSTAT. Fertilizers Archive https://www.fao.org/faostat/en/#data/RA (2020).
World Bank. World Bank Commodity Price Data (Pink Sheet). Tech. Rep., World Bank Development Group https://www.worldbank.org/en/research/commodity-markets (2024).
McArthur, J. W. & McCord, G. C. Fertilizing growth: Agricultural inputs and their effects in economic development. Journal of Development Economics 127, 133–152, https://doi.org/10.1016/j.jdeveco.2017.02.007 (2017).
Hijmans, R. J. et al. Package ‘terra’. Maintainer: Vienna, Austria (2022).
Weiss, D. J. et al. Global maps of travel time to healthcare facilities. Nature Medicine 26, 1835–1838, https://doi.org/10.1038/s41591-020-1059-1 (2020).
Kleine-Kleffmann, U. The discovery of the first potash mine and the development of the potash industry since 1861. Journal of Plant Nutrition and Soil Science 186, 615–622, https://doi.org/10.1002/jpln.202300382 (2023).
Clarisse, L., Damme, M. V., Clerbaux, C. & Coheur, P. F. Tracking down global NH3 point sources with wind-adjusted superresolution. Atmospheric Measurement Techniques 12, 5457–5473, https://doi.org/10.5194/amt-12-5457-2019 (2019).
FAOSTAT. Producer prices https://www.fao.org/faostat/en/#data/PP (2023).
FAOSTAT. Producer prices (old series) https://www.fao.org/faostat/en/#data/PA (2023).
World Bank. Official exchange rate (LCU per US $, period average) https://data.worldbank.org/indicator/PA.NUS.FCRF (2023).
Ha, J., Kose, M. A. & Ohnsorge, F. One-stop source: A global database of inflation. Journal of International Money and Finance 137 https://doi.org/10.1016/j.jimonfin.2023.102896 (2023).
Tilman, D. et al. Forecasting agriculturally driven global environmental change. Science 292, 281–284, https://doi.org/10.1126/science.1057544 (2001).
Xiang, T., Malik, T. H. & Nielsen, K. The impact of population pressure on global fertiliser use intensity, 1970–2011: An analysis of policy-induced mediation. Technological Forecasting and Social Change 152, 1–12, https://doi.org/10.1016/j.techfore.2019.119895 (2020).
Stenseth, N. C. & Mysterud, A. Climate, changing phenology, and other life history traits: Nonlinearity and match-mismatch to the environment. Proceedings of the National Academy of Sciences of the United States of America 99, 13379–13381 (2002).
Thiyagalingam, J., Shankar, M., Fox, G. & Hey, T. Scientific machine learning benchmarks. Nature Reviews Physics 4, 413–420, https://doi.org/10.1038/s42254-022-00441-7 (2022).
Bertolini, M., Mezzogori, D., Neroni, M. & Zammori, F. Machine Learning for industrial applications: A comprehensive literature review. Expert Systems with Applications 175, 114820, https://doi.org/10.1016/J.ESWA.2021.114820 (2021).
Thessen, A. E. Adoption of Machine Learning Techniques in Ecology and Earth Science. One Ecosystem 1, e8621, https://doi.org/10.3897/ONEECO.1.E8621 (2016).
Christin, S., Hervet, E. & Lecomte, N. Applications for deep learning in ecology. Methods in Ecology and Evolution 10, 1632–1644, https://doi.org/10.1111/2041-210X.13256 (2019).
Bondre, D. A. & Mahagaonkar, S. Prediction of crop yield and fertilizer recommendation using machine learning algorithms. International Journal of Engineering Applied Sciences and Technology 4, 371–376, https://doi.org/10.33564/IJEAST.2019.v04i05.055 (2019).
Xiao, L. et al. Spatiotemporal co-optimization of agricultural management practices towards climate-smart crop production. Nature Food 5, 59–71, https://doi.org/10.1038/s43016-023-00891-x (2024).
Grell, M. et al. Point-of-use sensors and machine learning enable low-cost determination of soil nitrogen. Nature Food 2, 981–989, https://doi.org/10.1038/s43016-021-00416-4 (2021).
Pacheco, C. et al. Exploring Data Preprocessing and Machine Learning Methods for Forecasting Worldwide Fertilizers Consumption. Proceedings of the International Joint Conference on Neural Networks 2022-July, https://doi.org/10.1109/IJCNN55064.2022.9892325 (2022).
Xu, P. et al. Fertilizer management for global ammonia emission reduction. Nature 626, 792–798, https://doi.org/10.1038/s41586-024-07020-z (2024).
Friedman, J., Hastie, T. & Tibshirani, R. Additive logistic regression: a statistical view of boosting. The Annals of Statistics 28, 337–407, https://doi.org/10.1214/AOS/1016218223 (2000).
Ke, G. et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Advances in Neural Information Processing Systems 30 (2017).
Krstajic, D., Buturovic, L. J., Leahy, D. E. & Thomas, S. Cross-validation pitfalls when selecting and assessing regression and classification models. Journal of Cheminformatics 6, 1–15, https://doi.org/10.1186/1758-2946-6-10 (2014).
Stone, M. Cross-Validatory Choice and Assessment of Statistical Predictions. Journal of the Royal Statistical Society: Series B (Methodological) 36, 111–133, https://doi.org/10.1111/J.2517-6161.1974.TB00994.X (1974).
Varma, S. & Simon, R. Bias in error estimation when using cross-validation for model selection. BMC Bioinformatics 7, 1–8, https://doi.org/10.1186/1471-2105-7-91 (2006).
Coello, F. et al. Fertilizer application rate maps per crop and year. figshare https://doi.org/10.6084/m9.figshare.25435432 (2024).
Liu, Y., Pan, X. & Li, J. A 1961–2010 record of fertilizer use, pesticide application and cereal yields: a review. Agronomy for Sustainable Development 35, 83–93, https://doi.org/10.1007/s13593-014-0259-9 (2015).
Longo, S. & York, R. Agricultural exports and the environment: A cross-national study of fertilizer and pesticide consumption. Rural Sociology 73, 82–104, https://doi.org/10.1526/003601108783575853 (2008).
Liu, M. & Tian, H. China’s land cover and land use change from 1700 to 2005: Estimations from high-resolution satellite data and historical archives. Global Biogeochemical Cycles 24, https://doi.org/10.1029/2009GB003687 (2010).
Tian, H., Banger, K., Bo, T. & Dadhwal, V. K. History of land use in India during 1880–2010: Large-scale land transformations reconstructed from satellite data and historical archives. Global and Planetary Change 121, 78–88, https://doi.org/10.1016/j.gloplacha.2014.07.005 (2014).
Yu, Z. & Lu, C. Historical cropland expansion and abandonment in the continental US during 1850 to 2016. Global Ecology and Biogeography 27, 322–333, https://doi.org/10.1111/geb.12697 (2018).
Janssens, I. et al. Code for “machine learning-driven global crop-specific fertilization dataset from 1960–2020”. figshare https://doi.org/10.6084/m9.figshare.25435594 (2024).
United Nations. Land area https://data.un.org/Data.aspx?d=FAO&f=itemCode:6601&c=2,4,5,6,7&s=countryName:asc,elementCode:asc,year:desc&v=1 (2024).
FAOSTAT. Definitions and standards. country group 2022 https://data.apps.fao.org/catalog/iso/457196b9-1d93-410d-8ad3-a57aadf09a1a (2022).
World Bank. Agricultural irrigated land (% of total agricultural land) https://data.worldbank.org/indicator/AG.LND.IRIG.AG.ZS (2023).
World Bank. Agricultural machinery, tractors https://data.worldbank.org/indicator/AG.AGR.TRAC.NO (2023).
FAOSTAT. Land use https://www.fao.org/faostat/en/#data/RL (2022).
World Bank. Government expenditure on education, total (% of GDP) https://data.worldbank.org/indicator/SE.XPD.TOTL.GD.ZS (2023).
United Nations. Per capita GDP at current prices - US dollars https://data.un.org/Data.aspx?d=SNAAMA&f=grID:101;currID:USD;pcFlag:1&c=2,3,5,6&s=_crEngNameOrderBy:asc,yr:desc&v=1 (2023).
FAOSTAT. Annual population https://www.fao.org/faostat/en/#data/OA (2023).
Van Rossum, G. & Drake, F. L. Python 3 Reference Manual (CreateSpace, Scotts Valley, CA, 2009).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362, https://doi.org/10.1038/s41586-020-2649-2 (2020).
McKinney, W. Data structures for statistical computing in Python. In van der Walt, S. & Millman, J. (eds.) Proceedings of the 9th Python in Science Conference, 56–61, https://doi.org/10.25080/Majora-92bf1922-00a (2010).
Gillies, S. et al. Rasterio: geospatial raster i/o for Python programmers https://github.com/mapbox/rasterio (2013–).
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria https://www.R-project.org/ (2021).
Pebesma, E. & Bivand, R. Spatial Data Science: With applications in R https://doi.org/10.1201/9780429459016 (Chapman and Hall/CRC, 2023).
Pebesma, E. Simple Features for R: Standardized Support for Spatial Vector Data. The R Journal 10, 439–446, https://doi.org/10.32614/RJ-2018-009 (2018).
Pierce, D. & Pierce, M. D. Package ‘ncdf4’ https://www.vps.fmvz.usp.br/CRAN/web/packages/ncdf4/ncdf4.pdf (2019).
Daniel B. exactextractr: Fast Extraction from Raster Datasets using Polygons https://CRAN.R-project.org/package=exactextractr (2020).
Wickham, H. & Bryan, J. readxl: Read Excel Files https://readxl.tidyverse.org, https://github.com/tidyverse/readxl (2023).
Wickham, H. stringr: Simple, Consistent Wrappers for Common String Operations https://github.com/tidyverse/stringr (2023).
Wickham, H., François, R., Henry, L., Müller, K. & Vaughan, D. dplyr: A Grammar of Data Manipulation https://github.com/tidyverse/dplyr (2023).
Wickham, H., Hester, J. & Bryan, J. readr: Read Rectangular Text Data R package version 2.1.5, https://github.com/tidyverse/readr (2024).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis https://ggplot2.tidyverse.org (Springer-Verlag New York, 2016).
Wickham, H. et al. Welcome to the tidyverse. Journal of Open Source Software 4, 1686, https://doi.org/10.21105/joss.01686 (2019).
Weidmann, N. B., Kuse, D. & Gleditsch, K. S. The geography of the international system: The CShapes dataset. International Interactions 36, 86–106, https://doi.org/10.1080/03050620903554614 (2010).


Acknowledgements

I.J. was supported by the European Commission: Horizon 2020 framework program for research and innovation under grant agreement No 964545, Bio-Accelerated Mineral weathering (BAM!). F.C., J.S. and J.P. were supported by the Spanish Government grants PID2020115770RB-I, PID2022-140808NB-I00, and TED2021-132627B-I00 funded by MCIN, AEI/10.13039/501100011033 European Union Next Generation EU/PRTR, the Fundación Ramón Areces grant CIVP20A6621, and the Catalan Government grants SGR 2021–1333 and AGAUR2023 CLIMA 00118.

Author information

These authors contributed equally: Fernando Coello, Thomas Decorte.

Authors and Affiliations

Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
Fernando Coello
CREAF - Centro de Investigación Ecológica y Aplicaciones Forestales, Barcelona, 08193, Spain
Fernando Coello, Jordi Sardans & Josep Peñuelas
Global Ecology Unit, CSIC-CREAF-UAB, Barcelona, 08193, Spain
Fernando Coello, Jordi Sardans & Josep Peñuelas
University of Antwerp - imec - IDLab, Department of Mathematics, Antwerp, 2000, Belgium
Thomas Decorte & Tim Verdonck
University of Antwerp - imec - IDLab, Department of Computer Science, Antwerp, 2000, Belgium
Iris Janssens & Steven Mortier


Contributions

J.P., J.S., and T.V. designed the study. F.C. constructed the data. All authors analysed the data. I.J. and S.M. constructed the models and generated the SHAP values. T.D. and F.C. created the spatial maps and model validation. I.J., F.C., S.M. and T.D. drafted the paper. All co-authors discussed the methods and results and reviewed and commented on the manuscript.

Corresponding authors

Correspondence to Fernando Coello or Thomas Decorte.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.


About this article

Cite this article

Coello, F., Decorte, T., Janssens, I. et al. Global Crop-Specific Fertilization Dataset from 1961–2019. Sci Data 12, 40 (2025). https://doi.org/10.1038/s41597-024-04215-x


Received: 13 June 2024
Accepted: 02 December 2024
Published: 09 January 2025
DOI: https://doi.org/10.1038/s41597-024-04215-x

