An improved spatially downscaled solar-induced chlorophyll fluorescence dataset from the TROPOMI product

Chen, Siyuan; Liu, Liangyun; Sui, Lichun; Liu, Xinjie; Ma, Yan

doi:10.1038/s41597-024-04325-6

Data Descriptor
Open access
Published: 22 January 2025

Scientific Data volume 12, Article number: 135 (2025)

Abstract

Solar-induced chlorophyll fluorescence (SIF) is an indicator of vegetation photosynthesis, and multiple satellite SIF products have been generated in recent years. However, current SIF products are of limited use for monitoring vegetation photosynthesis because of their low spatial resolution or spatial discontinuity. This study uses a spatial downscaling method to redistribute the original TROPOspheric Monitoring Instrument (TROPOMI) SIF (OSIF), generating a downscaled SIF dataset (TroDSIF) with fine spatio-temporal resolution (500 m, 16 days). Compared with a machine learning (ML) SIF product, TroDSIF better reproduces the OSIF signals, with higher R², lower root mean square error (RMSE), and nearly zero residuals at different latitudes. Direct validation of TroDSIF against tower-based SIF measurements demonstrated good consistency between them. However, TroDSIF depends on the assumed linear relationship between OSIF and the ML-predicted SIF used in the redistribution process. Nonetheless, we anticipate that TroDSIF will benefit global vegetation photosynthesis and climate change studies at fine scales.

Background & Summary

Solar-induced chlorophyll fluorescence (SIF) has been demonstrated to be closely linked to vegetation photosynthesis in terrestrial ecosystems and serves as a superior indicator for estimating gross primary productivity (GPP)^1,2,3,4,5,6,7. Several global SIF datasets have been generated from available satellite observations^8,9,10 and successfully used for global GPP estimation^11,12.

For vegetation photosynthesis monitoring, satellite SIF products with km-scale spatial resolution are essential for characterizing photosynthesis status at landscape scales. Currently, satellite-based SIF datasets can be categorized into two groups based on their spatial resolution: products with higher spatial resolutions (<2 km) and products with lower spatial resolutions (>10 km). The higher-resolution group comprises the Orbiting Carbon Observatory-2 (OCO-2) SIF product¹³ and the OCO-3 SIF product¹⁴, with spatial resolutions of 1.3 × 2.25 km and 1.6 × 2.2 km, respectively. In addition, the Chinese Carbon Dioxide Observation Satellite Mission (TanSat) SIF¹⁵ and the Terrestrial Ecosystem Carbon Inventory Satellite (TECIS-1) SIF¹⁶ products have a 2 × 2 km spatial resolution. However, these fine-resolution SIF products are spatially discontinuous.

The Scanning Imaging Absorption spectrometer for Atmospheric Chartography (SCIAMACHY) SIF¹⁷ and the Global Ozone Monitoring Experiment (GOME) and GOME-2 SIF products^18,19,20 provide global coverage but have a spatial resolution of 0.5° or coarser. Therefore, they can only monitor vegetation photosynthesis at a coarse spatial resolution²¹ and are difficult to integrate with ground GPP measurements from flux towers²².

Because of these limitations of current satellite SIF products, research on deriving spatially continuous SIF datasets at finer spatial resolution is constantly emerging. Among such studies, machine learning (ML) methods are commonly used to obtain global SIF products with relatively fine resolution and high continuity, including the contiguous SIF dataset (CSIF), a new OCO-2 SIF dataset (GOSIF), a reconstructed TROPOMI SIF (RTSIF), and a continuous TanSat SIF product, with a 0.05° spatial resolution^23,24,25,26. These products use ML methods to derive statistical relationships between SIF (the target variable) and explanatory variables from a subset of the original SIF record (OCO-2 or TROPOMI mission) and then extrapolate the derived relationship in space and time²⁷. In this way, ML-based algorithms link multiple explanatory variables to SIF without an explicit physical or radiative transfer relationship²⁴, yielding a predicted SIF (PSIF) with higher spatial resolution at km scales²⁸. However, ML-based SIF products are simulated results derived from ML models and their explanatory variables rather than direct retrievals.

Downscaling methods^29,30 differ from the ML-based approach in that they use PSIF only as an intermediate variable: the PSIF serves as a weight coefficient for redistributing the original coarser SIF retrievals. In contrast, ML methods take the PSIF itself as the final SIF product. Downscaling methods can therefore better preserve information from the original SIF retrieval while enhancing the spatial resolution. In other words, downscaled SIF remains tied to the measured signal, which provides a good way to obtain SIF datasets with higher spatial resolution from the original SIF retrieval. With its relatively fine spatiotemporal resolution of 3.5 × 7.5 km (3.5 × 5.5 km in the nadir mode) and a 16-day revisit time, the TROPOspheric Monitoring Instrument (TROPOMI) SIF^10,21,31,32 is well suited to producing an improved SIF dataset at a finer spatial resolution of 500 m with the downscaling method.

The objectives of this study are (1) to generate a new SIF product with a spatial resolution of 500 m from the original 0.05° TROPOMI SIF using a spatial downscaling method; (2) to comprehensively assess the downscaled SIF product using tower-based SIF observations, the original TROPOMI SIF (OSIF), and an ML-based SIF product (RTSIF); and (3) to investigate the enhanced performance of the downscaled SIF (TroDSIF) in estimating GPP.

This study produced a downscaled SIF dataset (TroDSIF) with a spatial resolution of 500 m that addresses the above issues. TroDSIF agrees better with OSIF than RTSIF does, with higher R² and lower RMSE values across the main vegetation types. Furthermore, TroDSIF shows an improved relationship with tower-based GPP and SIF measurements. TroDSIF will serve as a new data source for vegetation photosynthesis monitoring, carbon cycling, climate change and other terrestrial ecosystem-related studies.

Methods

Datasets

The Caltech TROPOMI SIF data (https://doi.org/10.1029/2018GL079031) produced by Köhler et al.^21,31 between March 2018 and July 2021 are derived from the Sentinel-5 Precursor satellite, which has a swath of 2600 km, a 16-day revisit time, and an overpass time of 13:30, providing nearly daily global coverage. The instantaneous SIF retrievals are derived with a data-driven approach based on a linear forward model fitted to top-of-atmosphere (TOA) radiance within two spectral regions (743–758 and 735–758 nm). The gridded, daily corrected far-red SIF at 740 nm (SIF_dc) is then obtained by accounting for the overpass time, length of the day, and solar zenith angle (SZA) following Frankenberg’s method². We selected SIF_dc at a 0.05° spatial resolution and a 16-day temporal resolution as the input original SIF (OSIF) for downscaling. To match the explanatory variables from the MODIS products, which use a sinusoidal projection, the original SIF_dc was first reprojected to the same projection and resampled to a spatial resolution of 5 km using linear interpolation before the redistribution.

As RTSIF²³ is a machine-learning SIF product derived from TROPOMI SIF, we selected it for comparison with the downscaled SIF to explore the difference between ML-based and downscaled SIF. Surface reflectance, photosynthetically active radiation (PAR), land surface temperature (LST), land cover, and C3/C4 fraction were used as explanatory variables for RTSIF modeling based on the extreme gradient boosting (XGBoost) approach. RTSIF has a spatial and temporal resolution of 0.05° and 8 days over the 2001–2020 period under clear-sky conditions.

MCD43C4 V006³³ is the MODIS nadir bidirectional reflectance distribution function (BRDF)-adjusted reflectance (NBAR) product and was used to obtain blue, green, red, and near-infrared (NIR) band reflectance as SIF explanatory variables characterizing vegetation structure. NBAR from MCD43C4 has a moderate spatial resolution of 0.05° and a 16-day revisit, obtained at a nadir-viewing angle. We also calculated the normalized difference vegetation index (NDVI) from the red and NIR NBAR data. Additionally, the MCD43A4 (V061) NBAR product was used to obtain 500 m reflectance and NDVI.

MCD12Q1³⁴ is the MODIS land cover type dataset, which provides annual global land cover classification maps at a spatial resolution of 500 m. It offers multiple classification schemes, including the International Geosphere–Biosphere Programme (IGBP), FAO Land Cover Classification System (LCCS1), University of Maryland (UMD), and Plant Functional Types (PFT) schemes. The MCD12Q1 product was generated from MODIS reflectance data^35,36 using a decision tree method and a boosting approach³⁷. The IGBP scheme was used for temporal pattern analysis of SIF across different vegetation types in this study.

Air temperature (AT) was considered a physiological vegetation signal and therefore served as an explanatory variable of SIF; it was obtained from the ERA5 dataset³⁸. AT data from ERA5 have a spatial resolution of 0.1° at hourly time steps³⁹. For training the random forest (RF) model, the original AT was resampled to 0.05° to match the other explanatory variables. To obtain the weight coefficient at 500 m, AT was reprojected to a sinusoidal projection and resampled to a spatial resolution of 500 m.

ChinaSpec is a network of tower-based continuous SIF measurements across mainland China⁴⁰. Continuous spectral measurements are collected with an automated SIF system using a QE65 Pro spectrometer covering the 680–840 nm wavelength range, synchronously with flux observations. Six sites (XTS, DM, AR, HL, GC, and PYH) were selected to match the downscaled TROPOMI SIF for direct validation. Site information can be found in Supplementary Table S1.

AmeriFlux measures ecosystem CO₂, water, and energy fluxes across most regions of North, Central, and South America, providing flux and meteorological observations at hourly scales⁴¹. The network was established to provide critical flux measurements across different ecosystems and climate zones. Daily GPP estimates were calculated from hourly GPP observations, which are based on partitioning net ecosystem exchange (NEE) measurements, and were then aggregated to a 16-day scale. Overall, 67 AmeriFlux sites with measurements synchronous with TROPOMI SIF in 2019 were selected for this analysis (see Supplementary Table S1 for site information). Ten IGBP PFTs⁴² are included among these sites: CRO: croplands, DBF: deciduous broadleaf forests, ENF: evergreen needleleaf forests, GRA: grasslands, MF: mixed forests, OSH: open shrublands, SAV: savannas, WET: wetlands, WSA: woody savannas, and CSH: closed shrublands.
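The temporal aggregation of tower GPP described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the helper names `daily_mean` and `composite_16day` are hypothetical, daily values are taken as simple means of the available hourly values, and the start day of the 16-day periods (`first_doy`) is a free parameter because the text does not state how the periods are aligned.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def daily_mean(hourly):
    """Average (timestamp, value) pairs into one value per calendar day, skipping gaps."""
    buckets = defaultdict(list)
    for timestamp, value in hourly:
        if value is not None:  # None marks a missing observation
            buckets[timestamp.date()].append(value)
    return {day: sum(vals) / len(vals) for day, vals in buckets.items()}


def composite_16day(daily, first_doy=1):
    """Average daily values into consecutive 16-day periods keyed by (year, start DOY).

    first_doy is an assumption: the day of year on which the first period starts.
    """
    buckets = defaultdict(list)
    for day, value in daily.items():
        doy = day.timetuple().tm_yday
        if doy < first_doy:
            continue  # falls before the first period of that year
        start = first_doy + ((doy - first_doy) // 16) * 16
        buckets[(day.year, start)].append(value)
    return {key: sum(vals) / len(vals) for key, vals in sorted(buckets.items())}
```

For example, 32 days of hourly values fed through `daily_mean` and then `composite_16day` yield two composites, keyed `(2019, 1)` and `(2019, 17)` when the series starts on 1 January 2019.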

Downscaling approach to reproduce TROPOMI SIF at 500 m

Figure 1 depicts the flowchart of the scheme used to produce the spatially downscaled TROPOMI SIF (TroDSIF) with a spatial resolution of 500 m. The three main steps are (A) the establishment of the RF model, (B) the generation of the weight coefficient, and (C) the redistribution of the OSIF.

The RF model to predict TROPOMI SIF

Recently, ML algorithms have been widely used in remote sensing studies, especially for carbon and water flux research^43,44,45. For example, neural networks^24,27,46,47 and tree-based methods^25,26,30 have been successfully used to produce temporally continuous SIF datasets at finer spatial resolution. Among them, the random forest (RF) model was first proposed by Breiman⁴⁸ and has been widely used in remote sensing applications such as land cover classification and biomass estimation^49,50. Unlike many other ML models, the RF model is not sensitive to unbalanced distributions or missing values in the input samples. In addition, because of the randomness in how tree nodes are split, the RF model is relatively resistant to overfitting^48,49. It also performs well on large, high-dimensional, and multicollinear datasets^51,52 and is robust to noise and in feature selection^50,53. Therefore, we selected the RF model to establish the relationship between the driving variables and the TROPOMI daily corrected SIF (SIF_dc, with a coarser spatial resolution of 0.05°).

The explanatory variables for SIF were selected mainly following Ma et al.^26,30. Based on the basic equation of SIF (Gu et al.⁵⁴), SIF can be expressed as:

$${\rm{SIF}}={\rm{PAR}}\times {\rm{FAPAR}}\times \varepsilon \times {\Phi }_{{\rm{SIF}}}$$

(1)

where ε is the escape ratio of SIF photons from the canopy, FAPAR is the fraction of PAR absorbed by green leaves, and Φ_SIF is the fluorescence quantum yield, i.e. the number of SIF photons emitted per absorbed photon.

Two types of information are included in Eq. (1): canopy structure-related (ε, FAPAR) and physiology-related (Φ_SIF) information. As ε and FAPAR are both structure-related factors, canopy bidirectional reflectance and its combinations (i.e. vegetation indices) can be used to estimate them⁴¹. Specifically, NIR reflectance is closely related to the SIF escape fraction ε, while the red band is associated with the absorption process^55,56. Blue band reflectance was selected because of its tight relationship with chlorophyll and carotenoid absorption⁵⁶. Green band reflectance may fluctuate strongly under higher absorption conditions. Therefore, we used MODIS reflectance at the NIR, red, blue and green bands as SIF driving variables. The physiological component of SIF, Φ_SIF, depends on non-photochemical quenching (NPQ, i.e. heat dissipation) and the fraction of open PSII reaction centers (q_L)⁵⁴. Since both NPQ and q_L are related to the illumination conditions under clear skies, the cosine of the solar zenith angle (cos(SZA)) was selected to characterize the illumination conditions^2,54,57. In addition, AT was included to provide auxiliary information on vegetation physiology.

To sum up, seven explanatory variables were selected: four BRDF-corrected reflectance datasets at the red, NIR, blue and green bands, NDVI, cos(SZA), and AT (denoted TA in Eq. (2)). All samples for each year were divided into two datasets, 70% for training and 30% for validation, to balance a large training sample against a reliable accuracy assessment⁵⁸.

$$\text{SIF}\,=\,\text{RF}\left(\text{R}1,\text{R}2,\text{R}3,\text{R}4,\text{NDVI},\cos \left(\text{SZA}\right),\text{TA}\right)$$

(2)

where R1–R4 are the four NBAR values at the red, NIR, blue, and green bands derived from the MCD43C4 product, NDVI is derived from the MCD43C4 red and NIR reflectance (NDVI = (NIR − Red)/(NIR + Red)), and TA is the air temperature.
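As a hedged illustration of how the inputs to Eq. (2) could be assembled, the Python sketch below computes NDVI, builds the seven-element feature vector, and performs a 70/30 split. The function names are hypothetical, the split is assumed to be a simple random shuffle (the text does not state how samples were assigned), and fitting the RF model itself is left to whichever random forest library is used.

```python
import math
import random


def ndvi(red, nir):
    """NDVI = (NIR - Red) / (NIR + Red); returns None when the denominator is zero."""
    denom = nir + red
    return (nir - red) / denom if denom != 0 else None


def feature_vector(red, nir, blue, green, sza_deg, air_temp):
    """Seven explanatory variables in the order of Eq. (2): R1-R4, NDVI, cos(SZA), TA."""
    return [red, nir, blue, green, ndvi(red, nir),
            math.cos(math.radians(sza_deg)), air_temp]


def split_70_30(samples, seed=0):
    """Randomly split samples into 70% training and 30% validation (assumed random split)."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(0.7 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]
```

Fixing the `seed` keeps the split reproducible between runs, which matters when training and validation statistics are reported side by side.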

Redistribution of the original TROPOMI SIF

Explanatory variables with a 500 m spatial resolution were prepared for SIF prediction using the RF model established in Procedure A. The predicted 500 m SIF (PSIF_500m) was subsequently used as a weight coefficient in the redistribution process (Procedure C), which assumes a robust linear relationship between OSIF and PSIF. Specifically, the downscaled 500 m TROPOMI SIF (TroDSIF) was redistributed from the original 5 km reprojected TROPOMI SIF (OSIF_5km). The redistribution principle is illustrated in Fig. 2. Because some 500 m explanatory variables are missing, PSIF_500m obtained from the RF model may also contain gaps. Therefore, we first smoothed PSIF using a two-dimensional (2D) Gaussian function with a standard deviation (SD) of 25 km before it was used as a weight coefficient for OSIF redistribution. Each 500 m TroDSIF pixel within the central 5 km OSIF pixel was calculated from a 5 × 5 window of 5 km TROPOMI SIF (OSIF_5km) pixels, a 50 × 50 window of 500 m predicted SIF (PSIF_500m) pixels, and 2D Gaussian weights at the 5 km and 500 m scales (Weights_5km and Weights_500m). The formulas are as follows:

$${\rm{If}}\,{\rm{ratio}}=\frac{\overbrace{{\sum }_{{\rm{i}}=1}^{{{\rm{n}}}_{1}\le 5\times 5}{{\rm{OSIF}}}_{{\rm{5km}}}\times {{\rm{Weights}}}_{{\rm{5km}}}}^{{{\rm{OSIF}}}_{{\rm{25km}}}}}{\underbrace{{\sum }_{{\rm{j}}=1}^{{{\rm{n}}}_{2}\le 50\times 50}{{\rm{PSIF}}}_{{\rm{500m}}}\times {{\rm{Weights}}}_{{\rm{500m}}}}_{{{\rm{PSIF}}}_{{\rm{25km}}}}} > 0\,{\rm{(Condition\; 1)}}$$

(3)

$${{\rm{DSIF}}}_{{\rm{500m}}}={{\rm{PSIF}}}_{{\rm{500m}}}\times \frac{{\sum }_{{\rm{i}}=1}^{{{\rm{n}}}_{1}\le 5\times 5}({{\rm{OSIF}}}_{{\rm{5km}}}\times {{\rm{Weights}}}_{{\rm{5km}}})/{{\rm{n}}}_{1}}{{\sum }_{{\rm{j}}=1}^{{{\rm{n}}}_{2}\le 50\times 50}({{\rm{PSIF}}}_{{\rm{500m}}}\times {{\rm{Weights}}}_{{\rm{500m}}})/{{\rm{n}}}_{2}}$$

(4)

Else (Condition 2)

$${{\rm{DSIF}}}_{{\rm{500m}}}={\sum }_{{\rm{i}}=1}^{{{\rm{n}}}_{1}\le 5\times 5}({{\rm{OSIF}}}_{{\rm{5km}}}\times {{\rm{Weights}}}_{{\rm{5km}}})/{{\rm{n}}}_{1}-{\sum }_{{\rm{j}}=1}^{{{\rm{n}}}_{2}\le 50\times 50}({{\rm{PSIF}}}_{{\rm{500m}}}\times {{\rm{Weights}}}_{{\rm{500m}}})/{{\rm{n}}}_{2}+{{\rm{PSIF}}}_{{\rm{500m}}}$$

(5)

The ratio of OSIF_25km (aggregated from 5 × 5 OSIF pixels) to PSIF_25km (aggregated from 50 × 50 PSIF pixels) served as the condition for redistributing OSIF. When the ratio is greater than 0 (Condition 1), TroDSIF (DSIF_500m) is calculated using Eq. (4). However, because OSIF can take negative values, the ratio may be negative, in which case Eq. (4) would invert the information in OSIF_5km. Under this condition (Condition 2), we therefore obtained TroDSIF from Eq. (5). In addition, PSIF may not exist everywhere, so PSIF_25km is unavailable in some areas; in that case, we used OSIF_25km as a replacement. Likewise, if OSIF_25km is absent, PSIF is used. A 2D Gaussian function with an SD of 15 km was used for gap filling in a 3 × 3 coarse window²² when no PSIF or OSIF pixels are available. To distinguish the different ways of obtaining TroDSIF, a label was assigned to each TroDSIF pixel (Table 1).
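The two-condition redistribution can be illustrated for a single 500 m pixel with the following Python sketch. It is a simplified reading of the procedure, not the production code: the function names are hypothetical, the Gaussian weights are assumed to use the 25 km SD mentioned for smoothing, the weighted sums are normalized by the sum of weights rather than by the pixel counts n₁ and n₂, and the gap-filling and labeling steps are omitted.

```python
import math


def gaussian_weights(size, pixel_km, sd_km):
    """2D Gaussian weights for a size x size window, centred on the window centre."""
    centre = (size - 1) / 2.0
    return [
        [
            math.exp(-(((r - centre) * pixel_km) ** 2 + ((c - centre) * pixel_km) ** 2)
                     / (2.0 * sd_km ** 2))
            for c in range(size)
        ]
        for r in range(size)
    ]


def weighted_mean(window, weights):
    """Gaussian-weighted mean over valid (non-None) pixels; None if no pixel is valid."""
    num = den = 0.0
    for row, wrow in zip(window, weights):
        for value, w in zip(row, wrow):
            if value is not None:
                num += value * w
                den += w
    return num / den if den > 0 else None


def redistribute_pixel(psif_500m, osif_window_5km, psif_window_500m, sd_km=25.0):
    """Downscaled SIF for one 500 m pixel from a 5 x 5 OSIF and a 50 x 50 PSIF window."""
    osif_25km = weighted_mean(
        osif_window_5km, gaussian_weights(len(osif_window_5km), 5.0, sd_km))
    psif_25km = weighted_mean(
        psif_window_500m, gaussian_weights(len(psif_window_500m), 0.5, sd_km))
    if psif_500m is None or osif_25km is None or psif_25km is None:
        return None  # gap handling is omitted in this sketch
    if psif_25km != 0 and osif_25km / psif_25km > 0:
        return psif_500m * osif_25km / psif_25km  # Condition 1: ratio form
    return osif_25km - psif_25km + psif_500m      # Condition 2: offset form
```

With a uniform OSIF window of 0.4 and a uniform PSIF window of 0.2, a pixel with PSIF 0.3 is scaled by the ratio 2 to about 0.6. With a negative OSIF window of −0.1 the offset form applies instead, so the sign of the coarse signal is not inverted.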

Table 1 Label of the TroDSIF dataset.

Data Records

Our improved spatially downscaled solar-induced chlorophyll fluorescence dataset (TroDSIF) is available at Zenodo (https://doi.org/10.5281/zenodo.10060550)⁵⁹. The data record contains TroDSIF data from April 2018 to July 2021 at a 500 m, 16-day spatio-temporal resolution. Approximately h5 format files per year. The unit is mW/m²/nm/sr. The file name SIF500m_corr_SIF_predict_<YYYYDDD>_<h**v**>.h5 encodes the year, day of year, and location index of each file (e.g., SIF500m_corr_SIF_predict_2018091_h00v08).
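A file name following this convention can be decoded with a short Python helper. This is a minimal sketch: the function name `parse_trodsif_name` is hypothetical, and the location index is assumed to be a two-digit horizontal and two-digit vertical tile number, as in the example name above.

```python
import re
from datetime import datetime

# Pattern for names such as SIF500m_corr_SIF_predict_2018091_h00v08(.h5).
# The two-digit h/v tile layout is an assumption based on the example name.
NAME_PATTERN = re.compile(
    r"^SIF500m_corr_SIF_predict_(\d{4})(\d{3})_(h\d{2}v\d{2})(?:\.h5)?$"
)


def parse_trodsif_name(name):
    """Return year, day of year, calendar date, and tile index encoded in a file name."""
    match = NAME_PATTERN.match(name)
    if match is None:
        raise ValueError("unrecognized TroDSIF file name: " + name)
    year, doy, tile = int(match.group(1)), int(match.group(2)), match.group(3)
    # %j parses the day of year (001-366) into a calendar date.
    date = datetime.strptime(f"{year}{doy:03d}", "%Y%j").date()
    return {"year": year, "doy": doy, "date": date, "tile": tile}
```

For the example name, the helper returns year 2018, day of year 91 (1 April 2018), and tile `h00v08`.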

Technical Validation

Performance of the RF model

The trained RF model was first evaluated on both the 70% training dataset and the 30% validation dataset. The coefficient of determination (R²), slope, and root mean square error (RMSE) were used as accuracy metrics. Figure 3 displays the performance of the RF model in 2019 for both the training and validation datasets. The PSIF reproduced the OSIF well, with an R² of 0.908 and an RMSE of 0.059 mW/m²/nm/sr for training, and an R² of 0.893 and an RMSE of 0.064 mW/m²/nm/sr for validation. We also evaluated the importance of the explanatory variables used in the RF model (Fig. 4). The results show that cos(SZA) and NIR reflectance are the two most important variables, followed by the NDVI and green band reflectance.
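For reference, the three accuracy metrics can be computed as in the Python sketch below. It assumes that R² is the squared Pearson correlation of a least-squares line and that RMSE is taken directly between the paired values; the text does not specify these details, and the function name `fit_metrics` is hypothetical.

```python
import math


def fit_metrics(reference, estimate):
    """Slope, intercept, R² (squared Pearson r) and RMSE of estimate against reference."""
    n = len(reference)
    mean_x = sum(reference) / n
    mean_y = sum(estimate) / n
    sxx = sum((x - mean_x) ** 2 for x in reference)
    syy = sum((y - mean_y) ** 2 for y in estimate)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(reference, estimate))
    slope = sxy / sxx                      # least-squares slope of estimate on reference
    intercept = mean_y - slope * mean_x
    r2 = (sxy * sxy) / (sxx * syy)         # squared Pearson correlation
    rmse = math.sqrt(sum((y - x) ** 2 for x, y in zip(reference, estimate)) / n)
    return {"slope": slope, "intercept": intercept, "r2": r2, "rmse": rmse}
```

Note that an estimate offset from the reference by a constant has a slope and R² of one but a non-zero RMSE, which is why the metrics are reported together.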

The spatiotemporal patterns of the TroDSIF dataset

As global land is mainly located in the Northern Hemisphere, vegetation activity generally peaks during the boreal summer. Therefore, day of year (DOY) 206 in 2019 was selected to display the global patterns of the downscaled 500 m TroDSIF. Figure 5 shows the global pattern of TroDSIF on 2019206. Overall, TroDSIF successfully reproduced the spatial patterns of OSIF and is spatially continuous at the global scale. Moreover, higher values are concentrated in the Northern Hemisphere, as expected for this season. Enlarged maps of TroDSIF and OSIF over the northern Guinea-Sierra Leone region are also shown to highlight the enhanced spatial details after downscaling (Fig. 6). Obvious patch effects are visible in OSIF, whereas TroDSIF is smoother.

Globally averaged SIF values of TroDSIF and OSIF are displayed in Fig. 7. The two datasets follow similar patterns and show a clear seasonality over the data coverage period.

Furthermore, SIF values of TroDSIF and OSIF were extracted for 10 major vegetation types (Fig. 8) based on MCD12Q1 land cover maps, including ENF, EBF, DNF, DBF, MF, SAV, GRA, CRO, OSH, and WSA. TroDSIF varies similarly to OSIF throughout each year in the selected vegetation types.

Comparison of TroDSIF with RTSIF and OSIF

The spatially downscaled 500 m TroDSIF dataset was re-aggregated to 0.05° and compared with OSIF and RTSIF at the global scale. Scatterplots of the re-aggregated TroDSIF and RTSIF against the original 0.05° TROPOMI SIF (OSIF) for two DOYs in 2019 (14 and 206) are shown in Fig. 9. TroDSIF is highly consistent with OSIF, with R² values of 0.948 and 0.934 and RMSE values of 0.057 and 0.067 mW/m²/nm/sr on 2019014 and 2019206, respectively, and this agreement is independent of latitude. RTSIF is more weakly correlated with OSIF, with smaller R² values of 0.886 and 0.857 and larger RMSE values of 0.086 and 0.109 mW/m²/nm/sr on the same DOYs. In addition, RTSIF stays near zero where OSIF varies over a wide range, and this discrepancy appears to depend on latitude. The higher consistency between TroDSIF and OSIF at the global scale indicates better robustness than RTSIF and supports the soundness and accuracy of the spatial downscaling method, which uses the RF-predicted SIF values only as coefficients to reallocate OSIF spatially. In contrast, ML-based SIF products take the model-predicted values as the final SIF, which leads to larger discrepancies between RTSIF and OSIF across the globe.
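The idea behind the re-aggregation step can be sketched as a block average in Python. This is a deliberately simplified illustration: it averages fixed blocks of fine pixels on a regular grid and ignores the reprojection between the sinusoidal 500 m grid and the 0.05° grid, which the actual comparison would require. The function name `block_mean` is hypothetical.

```python
def block_mean(grid, factor):
    """Average non-overlapping factor x factor blocks of a 2D list, ignoring None gaps.

    Rows and columns that do not fill a complete block are dropped.
    """
    rows, cols = len(grid), len(grid[0])
    coarse = []
    for r0 in range(0, rows - rows % factor, factor):
        coarse_row = []
        for c0 in range(0, cols - cols % factor, factor):
            values = [
                grid[r][c]
                for r in range(r0, r0 + factor)
                for c in range(c0, c0 + factor)
                if grid[r][c] is not None
            ]
            # A block with no valid fine pixel stays a gap in the coarse grid.
            coarse_row.append(sum(values) / len(values) if values else None)
        coarse.append(coarse_row)
    return coarse
```

With `factor=10`, each coarse value would summarize a 10 × 10 block of 500 m pixels, i.e. roughly one 5 km cell.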

We also conducted biome-level comparisons of TroDSIF and RTSIF with OSIF, using the land cover classes in the MCD12Q1 product, on the same DOYs (14 and 206) (Fig. 10). Overall, TroDSIF showed a stronger relationship with OSIF in most selected biomes, including the ENF, EBF, MF, SAV, GRA, CRO, CSH, OSH, and WSA biomes, with higher R² and lower RMSE values. RTSIF has lower R² and higher RMSE values against OSIF than TroDSIF.

In addition, to further assess the consistency of the downscaled dataset (TroDSIF) and the ML-based dataset (RTSIF) with OSIF, we calculated their residuals, i.e. the difference between TroDSIF or RTSIF and OSIF. Global residuals between TroDSIF, RTSIF, and OSIF in 2019 are shown in Fig. 11. Overall, TroDSIF has nearly zero residuals relative to OSIF at different latitudes. In contrast, the ML-based RTSIF product has larger and spatially inhomogeneous residuals across the globe, within the range of −0.2 to 0.2 mW/m²/nm/sr. The superior performance of TroDSIF is attributed to the principle of the spatial downscaling method, which uses the ML-based SIF values as weight coefficients to redistribute OSIF for each pixel and thereby avoids the latitude-dependent discrepancies seen in RTSIF.

Validation of TroDSIF with tower-based SIF

Tower-based SIF measurements from six ChinaSpec sites were used to validate the TroDSIF dataset. Half-hourly SIF estimates were first aggregated to a 16-day scale to match the TroDSIF pixels. Direct validation of TroDSIF and OSIF against the tower-based SIF measurements is shown in Fig. 12. Overall, TroDSIF is more consistent with the tower-based SIF than OSIF. The RMSE values varied from 0.104 to 0.223, and the mean absolute error (MAE) values ranged from 0.077 to 0.163. Specifically, TroDSIF has the closest relationship with tower-based SIF at XTS, with lower RMSE and MAE values than at the other sites. The finer spatial resolution of TroDSIF improves the agreement with tower-based SIF measurements by reducing the effect of spatial heterogeneity within the coarse satellite pixel.

Improved performance of TroDSIF with tower-based GPP

To further test the performance of TroDSIF in improving GPP estimation, tower-based GPP estimates from 67 AmeriFlux sites were used as references, and OSIF was selected for comparison. The results show a good relationship between the original SIF and the tower-based GPP, with an overall R² of 0.483 across the selected biomes (Fig. 13). The downscaled SIF performs better than the original TROPOMI retrievals, with an R² of 0.542 across all selected biomes (Fig. 13(k)), mainly because the finer-resolution pixels are closer in extent to the footprint of the tower-based measurements. For individual biomes, TroDSIF improves the SIF-GPP relationships, with R² values between 0.193 and 0.894, whereas OSIF has lower R² values with eddy covariance (EC) GPP, ranging from 0.125 to 0.866. The improved relationship of TroDSIF with tower-based GPP indicates an enhanced ability of SIF to capture vegetation GPP; TroDSIF can therefore serve as a better proxy for vegetation photosynthesis dynamics than a coarser SIF product.

Code availability

No custom code has been used in this paper.

References

Damm, A. et al. Remote sensing of sun‐induced fluorescence to improve modeling of diurnal courses of gross primary production (GPP). Global Change Biology. 16(1), 171–186 (2010).
Frankenberg, C. et al. New global observations of the terrestrial carbon cycle from GOSAT: Patterns of plant fluorescence with gross primary productivity. Geophysical Research Letters. 38(17) (2011).
Joiner, J. et al. First observations of global and seasonal terrestrial chlorophyll fluorescence from space. Biogeosciences. 8(3), 637–651 (2011).
Guanter, L. et al. Global and time-resolved monitoring of crop photosynthesis with chlorophyll fluorescence. Proceedings of the National Academy of Sciences. 111(14), E1327–E1333 (2014).
Porcar-Castell, A. et al. Linking chlorophyll a fluorescence to photosynthesis for remote sensing applications: mechanisms and challenges. Journal of experimental botany. 65(15), 4065–4095 (2014).
Gupana, R. S. et al. Remote sensing of sun-induced chlorophyll-a fluorescence in inland and coastal waters: Current state and future prospects. Remote Sensing of Environment. 262, 112482 (2021).
Porcar-Castell, A. A new approach to remote sensing of plant physiological status and GPP. in and Graduate School in “Physics, Chemistry, Biology and Meteorology of Atmospheric Composition and Climate Change” Annual Workshop 27.–29.4. 2009. (2009).
Damm, A. et al. Far-red sun-induced chlorophyll fluorescence shows ecosystem-specific relationships to gross primary production: An assessment based on observational and modeling approaches. Remote Sensing of Environment. 166, 91–105 (2015).
Guanter, L. et al. Retrieval and global assessment of terrestrial chlorophyll fluorescence from GOSAT space measurements. Remote Sensing of Environment. 121, 236–251 (2012).
Guanter, L. et al. The TROPOSIF global sun-induced fluorescence dataset from the Sentinel-5P TROPOMI mission. Earth System Science Data. 13(11), 5423–5440 (2021).
Li, X. & Xiao, J. TROPOMI observations allow for robust exploration of the relationship between solar-induced chlorophyll fluorescence and terrestrial gross primary production. Remote Sensing of Environment. 268, 112748 (2022).
Sun, Y. et al. Overview of Solar-Induced chlorophyll Fluorescence (SIF) from the Orbiting Carbon Observatory-2: Retrieval, cross-mission comparison, and global monitoring for GPP. Remote Sensing of Environment. 209, 808–823 (2018).
Frankenberg, C. et al. Prospects for chlorophyll fluorescence remote sensing from the Orbiting Carbon Observatory-2. Remote Sensing of Environment. 147, 1–12 (2014).
Eldering, A. et al. The OCO-3 mission: measurement objectives and expected performance based on 1 year of simulated data. Atmospheric Measurement Techniques. 12(4), 2341–2370 (2019).
Du, S. et al. Retrieval of global terrestrial solar-induced chlorophyll fluorescence from TanSat satellite. Science Bulletin. 63(22), 1502–1512 (2018).
Du, S. et al. Prospects for solar-induced chlorophyll fluorescence remote sensing from the SIFIS payload onboard the TECIS-1 satellite. Journal of Remote Sensing, (2022).
Joiner, J. et al. Filling-in of near-infrared solar lines by terrestrial fluorescence and other geophysical effects: simulations and space-based observations from SCIAMACHY and GOSAT. Atmospheric Measurement Techniques. 5(GSFC-E-DAA-TN9416) (2012).
Joiner, J. et al. Global monitoring of terrestrial chlorophyll fluorescence from moderate spectral resolution near-infrared satellite measurements: Methodology, simulations, and application to GOME-2. Atmospheric Measurement Techniques Discussions. 6(2), 3883–3930 (2013).
Joiner, J. et al. New methods for the retrieval of chlorophyll red fluorescence from hyperspectral satellite instruments: simulations and application to GOME-2 and SCIAMACHY. Atmospheric Measurement Techniques. 9(8), 3939–3967 (2016).
Joiner, J. et al. The seasonal cycle of satellite chlorophyll fluorescence observations and its relationship to vegetation phenology and ecosystem atmosphere carbon exchange. Remote Sensing of Environment. 152, 375–391 (2014).
Köhler, P. et al. Global retrievals of solar‐induced chlorophyll fluorescence with TROPOMI: First results and intersensor comparison to OCO‐2. Geophysical Research Letters. 45(19), 10,456–10,463 (2018).
Duveiller, G. & Cescatti, A. Spatially downscaling sun-induced chlorophyll fluorescence leads to an improved temporal correlation with gross primary productivity. Remote Sensing of Environment. 182, 72–89 (2016).
Chen, X. et al. A long-term reconstructed TROPOMI solar-induced fluorescence dataset using machine learning algorithms. Scientific Data. 9(1), 427, https://doi.org/10.1038/s41597-022-01520-1 (2022).
Zhang, Y. et al. A global spatially contiguous solar-induced fluorescence (CSIF) dataset using neural networks. Biogeosciences. 15(19), 5779–5800 (2018).
Li, X. & Xiao, J. A global, 0.05-degree product of solar-induced chlorophyll fluorescence derived from OCO-2, MODIS, and reanalysis data. Remote Sensing. 11(5), 517 (2019).
Ma, Y. et al. Generation of a global spatially continuous TanSat solar-induced chlorophyll fluorescence product by considering the impact of the solar radiation intensity. Remote Sensing. 12(13), 2167 (2020).
Wen, J. et al. A framework for harmonizing multiple satellite instruments to generate a long-term global high spatial-resolution solar-induced chlorophyll fluorescence (SIF). Remote Sensing of Environment. 239, 111644 (2020).
Shen, Q. et al. Exploring the potential of spatially downscaled Solar-induced chlorophyll fluorescence to monitor drought effects on gross primary production in winter wheat. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 15, 2012–2022 (2022).
Duveiller, G. et al. A spatially downscaled sun-induced fluorescence global product for enhanced monitoring of vegetation productivity. Earth System Science Data. 12(2), 1101–1116 (2020).
Ma, Y. et al. An improved downscaled sun-induced chlorophyll fluorescence (DSIF) product of GOME-2 dataset. European Journal of Remote Sensing. 55(1), 168–180 (2022).
Köhler, P. et al. Global retrievals of solar‐induced chlorophyll fluorescence at red wavelengths with TROPOMI. Geophysical Research Letters. 47(15), e2020GL087541 (2020).
Guanter, L. et al. Potential of the TROPOspheric Monitoring Instrument (TROPOMI) onboard the Sentinel-5 Precursor for the monitoring of terrestrial chlorophyll fluorescence. Atmospheric Measurement Techniques. 8(3), 1337–1352 (2015).
Schaaf, C. & Wang, Z. MCD43C4 MODIS/Terra+Aqua BRDF/Albedo Nadir BRDF-Adjusted Ref Daily L3 Global 0.05Deg CMG V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MCD43C4.006 (2015).
Friedl, M. & Sulla-Menashe, D. MCD12Q1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC https://doi.org/10.5067/MODIS/MCD12Q1.006 (2019).
Friedl, M. A. et al. Global land cover mapping from MODIS: algorithms and early results. Remote Sensing of Environment. 83(1-2), 287–302 (2002).
Friedl, M. A. et al. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment. 114(1), 168–182 (2010).
Myneni, R. B. et al. Global products of vegetation leaf area and fraction absorbed PAR from year one of MODIS data. Remote Sensing of Environment. 83(1-2), 214–231 (2002).
Muñoz-Sabater, J. et al. ERA5-Land: A state-of-the-art global reanalysis dataset for land applications. Earth System Science Data. 13(9), 4349–4383, https://doi.org/10.5194/essd-13-4349-2021 (2021).
Hersbach, H. et al. ERA5 hourly data on single levels from 1979 to present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS). 10(10.24381) (2018).
Zhang, Y. et al. ChinaSpec: A Network for Long-Term Ground-Based Measurements of Solar-Induced Fluorescence in China. Journal of Geophysical Research: Biogeosciences. 126(3), e2020JG006042, https://doi.org/10.1029/2020JG006042 (2021).
Agarwal, D. A. et al. A data-centered collaboration portal to support global carbon-flux analysis. Concurrency and Computation: Practice and Experience. 22(17), 2323–2334, https://doi.org/10.1002/cpe.1600 (2010).
Belward, A. S., Estes, J. E. & Kline, K. D. The IGBP-DIS global 1-km land-cover data set DISCover: A project overview. Photogrammetric Engineering and Remote Sensing. 65(9), 1013–1020 (1999).
Alemohammad, S. H. et al. Water, Energy, and Carbon with Artificial Neural Networks (WECANN): a statistically based estimate of global surface turbulent fluxes and gross primary productivity using solar-induced fluorescence. Biogeosciences. 14(18), 4101–4124 (2017).
Jung, M. et al. Global patterns of land‐atmosphere fluxes of carbon dioxide, latent heat, and sensible heat derived from eddy covariance, satellite, and meteorological observations. Journal of Geophysical Research: Biogeosciences. 116(G3) (2011).
Tramontana, G. et al. Predicting carbon dioxide and energy fluxes across global FLUXNET sites with regression algorithms. Biogeosciences. 13(14), 4291–4313 (2016).
Gentine, P. & Alemohammad, S. Reconstructed solar-induced fluorescence: A machine learning vegetation product based on MODIS surface reflectance to reproduce GOME-2 solar-induced fluorescence. Geophysical Research Letters. 45(7), 3136–3146 (2018).
Yu, L. et al. High-resolution global contiguous SIF of OCO-2. Geophysical Research Letters. 46(3), 1449–1458 (2019).
Breiman, L. Random forests. Machine Learning. 45(1), 5–32 (2001).
Belgiu, M. & Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing. 114, 24–31 (2016).
Vincenzi, S. et al. Application of a Random Forest algorithm to predict spatial distribution of the potential yield of Ruditapes philippinarum in the Venice lagoon, Italy. Ecological Modelling (2011).
Gómez-Ramírez, J., Ávila-Villanueva, M. & Fernández-Blázquez, M. Á. Selecting the most important self-assessed features for predicting conversion to mild cognitive impairment with random forest and permutation-based methods. Scientific Reports. 10(1), 1–15 (2020).
Matsuki, K., Kuperman, V. & Van Dyke, J. A. The Random Forests statistical technique: An examination of its value for the study of reading. Scientific Studies of Reading. 20(1), 20–33 (2016).
Rossini, M. Red and far red Sun-induced chlorophyll fluorescence as a measure of plant photosynthesis. Geophysical Research Letters (2015).
Gu, L. et al. Sun‐induced Chl fluorescence and its importance for biophysical modeling of photosynthesis based on light reactions. New Phytologist. 223(3), 1179–1191 (2019).
Vilfan, N. et al. Extending Fluspect to simulate xanthophyll driven leaf reflectance dynamics. Remote Sensing of Environment. 211, 345–356 (2018).
Woodgate, W. et al. tri-PRI: A three band reflectance index tracking dynamic photoprotective mechanisms in a mature eucalypt forest. Agricultural and Forest Meteorology. 272, 187–201 (2019).
Yoshida, Y. et al. The 2010 Russian drought impact on satellite measurements of solar-induced chlorophyll fluorescence: Insights from modeling and comparisons with parameters derived from satellite reflectances. Remote Sensing of Environment. 166, 163–177 (2015).
Rodriguez-Galiano, V. F. et al. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS Journal of Photogrammetry and Remote Sensing. 67, 93–104 (2012).
Chen, S. & Liu, L. TroDSIF: an improved spatially downscaled solar-induced chlorophyll fluorescence product of TROPOMI dataset [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10060550 (2023).

Acknowledgements

This study is financially supported by the National Natural Science Foundation of China (No. 42425001).

Author information

Authors and Affiliations

PowerChina Northwest Engineering Corporation Limited, Xi’an, 710065, China
Siyuan Chen
International Research Center of Big Data for Sustainable Development Goals, Beijing, 100094, China
Siyuan Chen, Liangyun Liu & Xinjie Liu
Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, 100094, China
Siyuan Chen, Liangyun Liu & Xinjie Liu
College of Geological Engineering and Geomatics, Chang’an University, Xi’an, 710054, China
Lichun Sui
Zhejiang Academy of Surveying and Mapping, Hangzhou, 310001, China
Yan Ma

Authors

Siyuan Chen
Liangyun Liu
Lichun Sui
Xinjie Liu
Yan Ma

Contributions

Siyuan Chen and Liangyun Liu designed the research and wrote the paper. Lichun Sui, Liangyun Liu and Xinjie Liu revised the paper. Siyuan Chen and Yan Ma processed the data and validated the results.

Corresponding author

Correspondence to Liangyun Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Chen, S., Liu, L., Sui, L. et al. An improved spatially downscaled solar-induced chlorophyll fluorescence dataset from the TROPOMI product. Sci Data 12, 135 (2025). https://doi.org/10.1038/s41597-024-04325-6

Received: 20 November 2023
Accepted: 18 December 2024
Published: 22 January 2025
DOI: https://doi.org/10.1038/s41597-024-04325-6