The three major axes of terrestrial ecosystem function

Migliavacca, Mirco; Musavi, Talie; Mahecha, Miguel D.; Nelson, Jacob A.; Knauer, Jürgen; Baldocchi, Dennis D.; Perez-Priego, Oscar; Christiansen, Rune; Peters, Jonas; Anderson, Karen; Bahn, Michael; Black, T. Andrew; Blanken, Peter D.; Bonal, Damien; Buchmann, Nina; Caldararu, Silvia; Carrara, Arnaud; Carvalhais, Nuno; Cescatti, Alessandro; Chen, Jiquan; Cleverly, Jamie; Cremonese, Edoardo; Desai, Ankur R.; El-Madany, Tarek S.; Farella, Martha M.; Fernández-Martínez, Marcos; Filippa, Gianluca; Forkel, Matthias; Galvagno, Marta; Gomarasca, Ulisse; Gough, Christopher M.; Göckede, Mathias; Ibrom, Andreas; Ikawa, Hiroki; Janssens, Ivan A.; Jung, Martin; Kattge, Jens; Keenan, Trevor F.; Knohl, Alexander; Kobayashi, Hideki; Kraemer, Guido; Law, Beverly E.; Liddell, Michael J.; Ma, Xuanlong; Mammarella, Ivan; Martini, David; Macfarlane, Craig; Matteucci, Giorgio; Montagnani, Leonardo; Pabon-Moreno, Daniel E.; Panigada, Cinzia; Papale, Dario; Pendall, Elise; Penuelas, Josep; Phillips, Richard P.; Reich, Peter B.; Rossini, Micol; Rotenberg, Eyal; Scott, Russell L.; Stahl, Clement; Weber, Ulrich; Wohlfahrt, Georg; Wolf, Sebastian; Wright, Ian J.; Yakir, Dan; Zaehle, Sönke; Reichstein, Markus

doi:10.1038/s41586-021-03939-9

Download PDF

Article
Open access
Published: 22 September 2021

The three major axes of terrestrial ecosystem function

Nature volume 598, pages 468–472 (2021)Cite this article

61k Accesses
133 Citations
223 Altmetric
Metrics details

Subjects

Abstract

The leaf economics spectrum^1,2 and the global spectrum of plant forms and functions³ revealed fundamental axes of variation in plant traits, which represent different ecological strategies that are shaped by the evolutionary development of plant species². Ecosystem functions depend on environmental conditions and the traits of species that comprise the ecological communities⁴. However, the axes of variation of ecosystem functions are largely unknown, which limits our understanding of how ecosystems respond as a whole to anthropogenic drivers, climate and environmental variability^4,5. Here we derive a set of ecosystem functions⁶ from a dataset of surface gas exchange measurements across major terrestrial biomes. We find that most of the variability within ecosystem functions (71.8%) is captured by three key axes. The first axis reflects maximum ecosystem productivity and is mostly explained by vegetation structure. The second axis reflects ecosystem water-use strategies and is jointly explained by variation in vegetation height and climate. The third axis, which represents ecosystem carbon-use efficiency, features a gradient related to aridity, and is explained primarily by variation in vegetation structure. We show that two state-of-the-art land surface models reproduce the first and most important axis of ecosystem functions. However, the models tend to simulate more strongly correlated functions than those observed, which limits their ability to accurately predict the full range of responses to environmental changes in carbon, water and energy cycling in terrestrial ecosystems^7,8.

Global patterns of plant functional traits and their relationships to climate

Article Open access 13 September 2024

Climatic and soil factors explain the two-dimensional spectrum of global plant trait variation

Article Open access 23 December 2021

Plant traits alone are poor predictors of ecosystem properties and long-term ecosystem functioning

Article 05 October 2020

Main

Terrestrial ecosystems provide multiple functions (for example, resource use and potential uptake of carbon dioxide, among others) and ecosystem services on which society depends⁵. To understand and predict the response mechanisms of ecosystems as a whole to climatic and other environmental changes, it is crucial to establish how many and which functions need to be measured to obtain a good representation of overall ecosystem functioning. So far, the key functional axes that control the behaviour of terrestrial ecosystems have not yet been quantified⁵. This can be achieved by identifying associations between a comprehensive set of ecosystem functions measured consistently across major terrestrial biomes and a range of climatic conditions.

Here, we identify and quantity the major axes of terrestrial ecosystem functions and sources of variation along these axes. First, we characterize multiple ecosystem functions across major terrestrial biomes. Second, we identify the most important axes of variation of ecosystem functions using an exploratory analysis similar to that used for the global spectrum of plant forms and functions³. Third, we analyse which variables drive the variation along these axes, from a suite of climatic variables, and the structural and chemical properties of the vegetation. Fourth, we analyse the extent to which two state-of-the-art land surface models (models that simulate the states and exchange of matter and energy between the Earth’s surface and the atmosphere) reproduce the key axes of ecosystem functions. Understanding and quantifying the main axes of variation of the multi-dimensional space of ecosystem functions, their drivers and the degree to which land surface models are able to correctly represent the axes is a crucial prerequisite for predicting which terrestrial functions are the most vulnerable to climate and environmental changes.

We use carbon dioxide (CO₂), water vapour (H₂O), and energy flux data from 203 sites (1,484 site years) from FLUXNET datasets^9,10. These sites cover a wide variety of climate zones and vegetation types (Extended Data Figs. 1–3, Supplementary Table 1). A previous report⁶ suggested a series of core ecosystem functional properties that can be derived from carbon, water and energy flux observations related to efficiencies or potential rates of key physiological and ecohydrological processes (for example, evapotranspiration, photosynthesis energy partitioning and so on) that control land surface–atmosphere interactions. For each site, we calculated a single set of functional properties (see ‘Calculation of ecosystem functions from FLUXNET’ in Methods for details on the calculation and definition of abbreviations): maximum gross CO₂ uptake at light saturation (GPP_s_at), maximum net ecosystem productivity (NEP_max), maximum evapotranspiration (ET_max), evaporative fraction (EF) (that is, the ratio between latent heat flux and available energy, indicative of energy partitioning), EF amplitude (EF_ampl), maximum dry canopy surface conductance (G_smax), maximum and mean basal ecosystem respiration (Rb_max and Rb, respectively), and apparent carbon-use efficiency (aCUE) (that is, the remaining fraction of carbon entering the ecosystem). We also computed several metrics of growing season water-use efficiency (WUE) that account in different ways for physical evaporation and stomatal regulation effects: underlying WUE (uWUE), stomatal slope at ecosystem scale (G1), and WUE_t, a second variant of WUE, but based on transpiration estimates¹¹ (see Methods). We calculated average climate and soil water availability variables for each site, encompassing the following: cumulative soil water availability index (CSWI), mean annual precipitation (P), mean shortwave incoming radiation (SW_in), mean air temperature (T_air), and mean vapour pressure deficit during the growing season (VPD). In addition, we compiled information on canopy-scale structural variables such as foliar nitrogen concentration (N%), maximum leaf area index (LAI_max), maximum canopy height (H_c), and above-ground biomass (AGB), when available (Methods, Supplementary Table 1).

The key axes of the multi-dimensional space of terrestrial ecosystem functions were identified using principal component analysis (PCA; see Methods). We find that the first three axes of variation (the principal components; PCs) explain 71.8% of the multi-dimensional functional space variation (Fig. 1a, b, Supplementary Information 2). The first axis (PC1) explains 39.3% of the variance and is dominated by maximum ecosystem productivity properties, as indicated by the loadings of GPP_sat and NEP_max, and maximum evapotranspiration (ET_max) (Fig. 1c, d). Also, Rb contributes with positive loadings to PC1 (Fig. 1d), indicating the coupling between productivity and ecosystem respiration (both autotrophic and heterotrophic)¹². The first axis runs from sites with low productivity and evapotranspiration to sites with high photosynthesis, high net productivity, and high maximum evapotranspiration; that is, from cold and arid shrublands and wetlands, to forests in continental, tropical and temperate climates (Fig. 2a, b). The second axis (PC2) explains 21.4% of the variance and refers to water-use strategies as shown by the loadings of water-use efficiency metrics (uWUE, WUE_t, and G1), evaporative fraction and maximum surface conductance (Fig. 1c, d). Plant functional types do not explain clearly the variability of the second axis, with the exception of the evergreen and mixed forest, and the wetlands that are at the opposite extremes of the range (Fig. 2c). This axis runs (Fig. 2c,d) from temperate forests, dry and subtropical sites with a low average evaporative fraction (that is, available energy is mainly dissipated by sensible heat) but higher water-use efficiency (Fig. 2d), to sites in cold or tropical climates, as well as wetlands with a high evaporative fraction (that is, available energy is used for evapotranspiration), high surface conductance and low water-use efficiency (Fig. 2c, d). The third axis (PC3) explains 11.1% of the variance and includes key attributes that reflect the carbon-use efficiency of ecosystems. PC3 is dominated by apparent carbon-use efficiency (aCUE), basal ecosystem respiration (Rb and Rb_max) and the amplitude of EF (EF_ampl) (Fig. 1c, d). Rb and aCUE contribute to PC3 with opposite loadings, indicating that the PC3 ranges from sites with high aCUE and low Rb to sites with low aCUE and high Rb. The third axis runs from Arctic and boreal sites with low PC values to hot and dry climates (Fig. 2f), potentially indicating the imprint of aridity and temperature over the efficiency of ecosystems to use the assimilated carbon. We find no clear relation to plant functional types, with the exception of deciduous and evergreen forests that are at the extremes of the PC3 range (Fig. 2e).

**Fig. 1: Key dimensions of multivariate space of terrestrial ecosystem functions.**

**Fig. 2: Distribution of plant functional types and climate types along the principal components (PC1–PC3).**

We analyse the predictive relative importance of five climatic variables (T_air, VPD, CSWI, P, and SW_in) and four vegetation structural characteristics (LAI_max, AGB, H_c and N%) on the predictability of the principal components using random forests (see ‘Predictive variable importance’ in Methods). We find that the maximum productivity axis (PC1) is largely explained by vegetation structure (LAI_max, AGB, H_c and N%) and VPD (Fig. 3a, Extended Data Fig. 4a–e). The water-use strategies axis (PC2) is mostly explained by maximum canopy height (H_c), followed by climate variables (Fig. 3b, Extended Data Fig. 4i–l). Structural and climate variables jointly explain the variability of the carbon-use efficiency axis (PC3). The most important structural predictors of PC3 are AGB and N%, whereas VPD, T_air and SW_in are the most important climate drivers (Fig. 3c, Extended Data Fig. 4m–q).

**Fig. 3: Importance of climate and vegetation properties.**

The dependencies described above can only be interpreted causally if the regression models are in fact causal regression models (see Supplementary Information 3 for a formal definition). In many situations, this fails to be the case owing to the existence of hidden confounders; that is, unmeasured variables that influence both the principal components and the covariates (here climate and structural variables)¹³. Using an invariance-based analysis (see ‘Invariant causal regression models and causal variable importance’ in Methods), we find evidence that the full regression model including all the selected structural and climatic variables might be causal (Supplementary Information 3.2.1, Supplementary Fig. 3.3). If this is indeed the case, we can make the following statements. When considering groupwise causal variable importance, we can conclude that vegetation structure is a stronger causal driver than climate of the spatial (that is, across sites) variability of the maximum realized productivity axis (PC1) (Supplementary Fig. 3.7), and both are significant (Supplementary Table 3.2). Consider two contiguous plots of forest experiencing the same climate conditions, one disturbed and the other not. The undisturbed forest, which is likely to be taller, with higher LAI and carbon stocks, would probably have higher maximum photosynthetic rates and net ecosystem production, which are the most important variables loading on the first axis. Although, in time, the variability of climate controls the variability of gross and net CO₂ uptake and productivity^14,15, which are variables related to the maximum productivity axis (PC1), in space (that is, across sites) we find only a marginal control in very cold and radiation-limited sites (Extended Data Fig. 5a for a PC1 map), or for very warm and high atmospheric aridity (high VPD) conditions (Extended Data Fig. 4d based on predictive variable importance). Both vegetation structure and climate variables seem to have a joint direct causal effect on PC2 (Supplementary Fig 3.7). Although vegetation canopy height is constrained by resource availability¹⁶, particularly water, our results suggest that it acts itself as a control on the water-use strategies axis (PC2) and that it has a stronger causal effect on PC2 than each of the climate variables (Supplementary Fig. 3.6). The importance of vegetation height for ecosystem water-use strategies is manifold. First, vegetation height controls the coupling between stomata and atmosphere by influencing surface roughness and then aerodynamic resistance¹⁷, which modulates leaf-to-air VPD and water use efficiency. Second, vegetation height reflects variation in water-use efficiency that decreases as a consequence of progressive hydraulic constraints on stomatal conductance to water vapour and growth in taller vegetation¹⁶. Third, canopy height might reflect stand age and it is influenced by disturbances. Studies on forest chronosequence show a more conservative use of water in younger forests, which results in higher water-use efficiency¹⁸. We cannot exclude that our results are indirectly affected by the gradient from grass to forests, but postulate that these effects are likely to be minimal (Extended Data Fig. 6). Vegetation structure has a direct causal effect on the carbon-use efficiency axis (PC3; Supplementary Fig 3.7). Previous studies show that vegetation structure reflects climatic constraints but also the successional stage of an ecosystem after disturbance¹⁹. Increasing stand age—which is typically associated with higher above-ground biomass—is also associated with reduced forest production efficiency²⁰. The negative partial dependence of PC3 on above-ground biomass (Extended Data Fig. 4n, based on predictive variable importance) is likely to be related to higher autotrophic and heterotrophic respiration rates per unit of CO₂ taken up by photosynthesis as biomass increases²¹. The positive dependence of PC3 on N% (Extended Data Fig. 4q, based on predictive variable importance) supports previous findings that carbon-use efficiency might be controlled by the nutrient status of the vegetation²².

The two representative—yet complementary—land surface models examined here (OCN and JSBACH) partially reproduce the main axes of terrestrial ecosystem functions (Extended Data Fig. 7). This is shown when comparing the PCA calculated from FLUXNET data with simulated ecosystem functional properties from 48 site-level runs, mostly in temperate and boreal sites (Extended Data Fig. 7). The models are broadly consistent with the FLUXNET observations in the description of the potential productivity axis (PC1), but diverge in the description of the water-use strategies (PC2) and the carbon-use efficiency (PC3) axes. Despite the overall good agreement between observed and modelled fluxes at a half-hourly timescale (Supplementary Table 4), we show that, first, models are limited in simulating the relationships between ecosystem functions (Extended Data Fig. 8); and, second, models tend to overstate observed correlation strengths among ecosystem functions, as shown by the larger variance explained by the PC1 in models compared to observations (Extended Data Fig 7h, i). As a result, the ecosystem functional space that can be simulated by the models, represented by the area shown in Extended Data Fig. 9, is smaller than that expected from observations, particularly in the plane spanned by the PC2 and PC3 (Extended Data Fig. 9d–f). The limited variability of the model output points to an insufficient representation of the actual variability of the vegetation properties by the average parameterization of plant functional types. Uncertain implementation of plant hydraulics and water acquisition or conservation strategies in land surface models is a key limitation²³ that explains the observed discrepancy in PC2. With regard to PC3, one limitation is that models lack flexibility in representing the response of respiration rates and carbon-use efficiency to climate, nutrients, disturbances and substrate availability (including biomass and stand age)^20,24.

The identification of the key axes of terrestrial ecosystem function and their relationships with climate and vegetation structure will help to support the development of the next generation of land surface models and complement their benchmarking²⁵. By comparing the contributions of the functions and their loadings to the principal components, we can assess whether the representations of ecosystem functions in the models and in the ‘real world’ are coherent, and if not, which key processes or model formulations need improvement. For example, we show that vegetation height controls the water-use strategies axis (PC2), which is not well reproduced by the land surface models²³. This suggests that future land surface models need to include a representation of water-use strategies that explicitly accounts for hydraulic limitations to growth, vegetation stature, vertical and horizontal structures and microenvironments of the canopy, and a refined parameterization of stomatal control. Likewise, the inclusion of a flexible representation of carbon-use efficiency would enable models to reproduce the third axis of ecosystem functions²⁴. The comparison of the variances explained by functional axes and the loadings of the functions in simulated and observed data will indicate whether simulated ecosystem functions are appropriately coordinated. The overly tight coupling of ecosystem functions by models indicates a lack of flexibility in ecosystem responses to environmental drivers, such as adaptive carbon and water couplings.

In summary, by analysing a consistent set of ecosystem functions across major terrestrial biomes and climate zones, we show that three key axes capture the terrestrial ecosystem functions. The first and most important axis represents maximum productivity and is driven primarily by vegetation structure, followed by mean climate. The second axis is related to water-use strategies, and is driven by vegetation height. The third axis is related to ecosystem carbon-use efficiency; it is controlled by vegetation structure, but shows a gradient related to aridity. We find that the plant functional type concept does not necessarily capture the variability of ecosystem functions, because the majority of plant functional types are evenly distributed along the water-use strategies (PC2) and carbon-use efficiency (PC3) axes. Our approach allows the overall functioning of terrestrial ecosystems to be summarized and offers a way towards the development of metrics of ecosystem multifunctionality⁵—a measure of ecosystem functions as a whole, which is crucial to achieving a comprehensive assessment of the responses of ecosystems to climate and environmental variability, as well as biodiversity losses⁵. The analysis focuses on relatively few critical functions related to carbon, water and energy cycling of ecosystems. To attain a fully comprehensive characterization of the key axes of terrestrial ecosystem functions, more parameters related to nutrient cycling, seed dispersal and chemical defences—among others—should be included. The concept of the key axes of ecosystem functions could be used as a backdrop for the development of land surface models, which might help to improve the predictability of the terrestrial carbon and water cycle in response to future changing climatic and environmental conditions.

Methods

FLUXNET data

The data used in this study belong to the FLUXNET LaThuile⁹ and FLUXNET2015 Tier 1 and Tier 2 datasets¹⁰, which make up the global network of CO₂, water vapour and energy flux measurements. We merged the two FLUXNET releases and retained the FLUXNET2015 (the most recent and with a robust quality check) version of the data when the site was present in both datasets. Croplands were removed to avoid the inclusion of sites that are heavily managed in the analysis (for example, fertilization and irrigation).

The sites used cover a wide variety of climate zones (from tropical to Mediterranean to Arctic) and vegetation types (wetlands, shrublands, grasslands, savanna, evergreen and deciduous forests). It should be noted though that tropical forests are underrepresented in the FLUXNET database (Extended Data Figs. 1, 3).

Sites were excluded in cases in which: (i) data on precipitation or radiation were not available or completely gap-filled; (ii) the calculation of functional properties failed because of low availability of measured data (see ‘Calculation of ecosystem functions from FLUXNET’); and (iii) fluxes showed clear discontinuities in time series indicating a change of instrumentation set-up (for example, changes in the height of the ultrasonic anemometer or gas analyser).

The final number of sites selected was 203 (1,484 site years). The geographical distribution is shown in Extended Data Fig. 1, the distribution in the climate space is shown in Extended Data Fig. 2 and the fraction of sites for each climate classes is reported in Extended Data Fig. 3.

For each site, we downloaded the following variables at half-hourly temporal resolution: (i) gross primary productivity (GPP, μmol CO₂ m^–² s^–¹) derived from the night-time flux partitioning²⁶ (GPP_NT_VUT_50 in FLUXNET 2015 and GPP_f in LaThuile), (ii) net ecosystem exchange (NEE, μmol CO₂ m^–² s^–¹) measurements filtered using annual friction velocity (u*, m s⁻¹) threshold (NEE_VUT_50 in FLUXNET 2015; NEE in LaThuile); (iii) latent heat (LE, W m⁻²) fluxes, which were converted to evapotranspiration (ET, mm); (iv) sensible heat (H, W m⁻²) fluxes; (v) air temperature (T_air, °C); (vi) vapour pressure deficit (VPD, hPa); (vii) global shortwave incoming radiation (SW_in, W m⁻²); viii) net radiation (R_n, W m⁻²); (ix) ground heat flux (G, W m⁻²); (x) friction velocity u* (m s⁻¹); and (xi) wind speed (u, m s⁻¹). For the energy fluxes (H, LE) we selected the fluxes not corrected for the energy balance closure to guarantee consistency between the two FLUXNET datasets (in the LaThuile dataset energy fluxes were not corrected).

The cumulative soil water index (CSWI, mm) was computed as a measure of water availability according to a previous report²⁷. Half-hourly values of transpiration estimates (T, mm) were calculated with the transpiration estimation algorithm (TEA)²⁸. The TEA has been shown to perform well against both model simulations and independent sap flow data²⁸.

For 101 sites, ecosystem scale foliar N content (N%, gN 100 g⁻¹) was computed as the community weighted average of foliar N% of the major species at the site sampled at the peak of the growing season or gathered from the literature^29,30,31,32. Foliar N% for additional sites was derived from the FLUXNET Biological Ancillary Data Management (BADM) product and/or provided by site principal investigators (Supplementary Table 1, Extended Data Fig. 1). It should be noted that this compilation of N% data might suffer from uncertainties resulting from the scaling from leaves to the eddy covariance footprint, the sampling strategy (including the position along the vertical canopy profile), the species selection and the timing of sampling. About 30% of the data comes from a coordinated effort that minimized these uncertainties^29,30, and for the others we collected N% data that were representative for the eddy covariance footprint^31,32.

Maximum leaf area index (LAI_max, m² m⁻²) and maximum canopy height (H_c, m) were also collected for 153 and 199 sites, respectively, from the literature^32,33, the BADM product, and/or site principal investigators.

Earth observation retrievals of above-ground biomass (AGB, tons of dry matter per hectare (t DM ha⁻¹)) were extracted from the GlobBiomass dataset³⁴ at its original resolution (grid cell 100 × 100 m) for each site location. All the grid cells in a 300 × 300 m and 500 × 500 m window around each location were selected to estimate the median and 95th percentiles of AGB for each site. The median of AGB was selected to avoid the contribution of potential outliers to the expected value of AGB. The analysis further explored the contribution of higher percentiles in the local variation of AGB as previous studies have highlighted the contribution of older and larger trees in uneven stand age plots to ecosystem functioning³⁵. According to the evaluation against AGB measured at 71 FLUXNET sites (Extended Data Fig. 10), we decided to use the product with median AGB values extracted from the 500 × 500 m window.

A total of 94 sites have all the data on vegetation structure (N%, LAI_max, H_c, and AGB).

The list of sites is reported in Supplementary Table 1 along with the plant functional type (PFT), Köppen-Geiger classification, coordinates, and when available N%, LAI_max, H_c and AGB.

In this study we did not make use of satellite information, with the exception of the AGB data product. Future studies will benefit from new missions such as the ECOsystem Spaceborne Thermal Radiometer Experiment on Space Station (ECOSTRESS), the fluorescence explorer (FLEX), hyperspectral, and radar and laser detection and ranging (LiDAR) missions (for example, Global Ecosystem Dynamics Investigation (GEDI)), to characterize a multivariate space of structural and functional properties.

Calculation of ecosystem functions from FLUXNET

Starting from half-hourly data, we calculated at each site a single value for each of the ecosystem functions listed below. For the calculations of functional properties we used, unless otherwise indicated, good-quality data: quality flag 0 (measured data) and 1 (good-quality gap-filled data) in the FLUXNET dataset.

Gross primary productivity at light saturation (GPP_sat)

GPP at light saturation using photosynthetically active radiation as driving radiation and 2,000 μmol m⁻² s⁻¹ as saturating light. GPP_sat represents the ecosystem-scale maximum photosynthetic CO₂ uptake^15,30,36. The GPP_sat was estimated from half-hourly data by fitting the hyperbolic light response curves with a moving window of 5 days and assigned at the centre of the moving window^30,37. For each site the 90th percentile from the GPP_sat estimates was then extracted.

Maximum net ecosystem productivity (NEP_max)

This was computed as the 90th percentile of the half-hourly net ecosystem production (NEP = −NEE) in the growing season (that is, when daily GPP is higher than 30% of the GPP amplitude). This metric represents the maximum net CO₂ uptake of the ecosystem.

Basal ecosystem respiration (Rb and Rb_max)

Basal ecosystem respiration at reference temperature of 15 °C was derived from night-time NEE measurements²⁶. Daily basal ecosystem respiration (Rb_d) was derived by fitting an Arrhenius type equation over a five-day moving window and by keeping the sensitivity to temperature parameter (E₀) fixed as in the night-time partitioning algorithms^26,38. Rb_d varies across seasons because it is affected by short-term variations in productivity^33,39, phenology⁴⁰ and water stress⁴¹. For each site, the mean of the Rb_d (Rb) and the 95^th percentile (Rb_max) were computed. The calculations were conducted with the REddyProc R package v.1.2.2 (ref. ³⁸).

Apparent carbon-use efficiency (aCUE)

The aCUE as defined in this study is the efficiency of an ecosystem to sequester the carbon assimilated with photosynthesis³⁹. aCUE is an indication of the proportion of respired carbon with respect to assimilated carbon within one season. A previous report⁶ showed that little of the variability in aCUE can be explained by climate or conventional site characteristics, and suggested an underlying control by plant, faunal and microbial traits, in addition to site disturbance history. Daily aCUE (aCUE_d) is defined as aCUE_d = 1 − (Rb_d/GPP_d), where GPP_d is daily mean GPP and Rb_d is derived as described above. For each site, aCUE was computed as the median of aCUE_d.

Metrics of water-use efficiency (WUE)

Various metrics of WUE are described below: stomatal slope or slope coefficient (G1), underlying water-use efficiency (uWUE), and water-use efficiency based on transpiration (WUE_t). The three metrics were used because they are complementary, as shown in previous studies^11,42.

Stomatal slope or slope coefficient (G1)

This is the marginal carbon cost of water to the plant carbon uptake. G1 is the key parameter of the optimal stomatal model derived previously⁴³. G1 is inversely related to leaf-level WUE. At leaf level, G1 is calculated using nonlinear regression and can be interpreted as the slope between stomatal conductance and net CO₂ assimilation, normalized for VPD and CO₂ concentration⁴³. A previous report⁴² showed the potential of the use of G1 at ecosystem scale, where stomatal conductance is replaced by surface conductance (G_s), and net assimilation by GPP. The methodology is implemented in the bigleaf R package⁴⁴. The metric was computed in the following situations: (i) incoming shortwave radiation (SW_in) greater than 200 W m⁻²; (ii) no precipitation event for the last 24 h⁴⁵, when precipitation data are available; and (iii) during the growing season: daily GPP > 30% of its seasonal amplitude⁴⁴.

Underlying water-use efficiency (uWUE)

The underlying WUE was computed following a previous method⁴⁶. uWUE is a metric of water-use efficiency that is negatively correlated to G1 at canopy scale⁴⁴:

$${\rm{uWUE}}=\frac{{\rm{GPP}}\sqrt{{\rm{VPD}}}}{{\rm{ET}}}.$$

uWUE was calculated using the same filtering that was applied for the calculation of G1. The median of the half-hourly retained uWUE values was computed for each site and used as a functional property.

Water-use efficiency based on transpiration (WUE_t)

The WUE based on transpiration (T) was computed to reduce the confounding effect resulting from soil evaporation^11,28:

$${{\rm{WUE}}}_{{\rm{t}}}=\frac{{\rm{GPP}}}{T},$$

where T is the mean annual transpiration calculated with the transpiration estimation algorithm (TEA) developed by in a previous study²⁸ and GPP is the mean annual GPP.

Maximum surface conductance (G _smax)

Surface conductance (G_s) was computed by inverting the Penman–Monteith equation after calculating the aerodynamic conductance (G_a).

Among the different formulations of G_a (m s^–¹) in the literature, we chose to use here the calculation of the canopy (quasi-laminar) boundary layer conductance to heat transfer, which ranges from empirical to physically based (for example, ref. ⁴⁷). Other studies^48,49 suggested an empirical relationship between G_a, the horizontal wind speed (u) and the friction velocity, u*:

$${G}_{{\rm{a}}}=\frac{1}{(\frac{u}{{u}^{* 2}}+6.2u{* }^{-0.67})}$$

G_s (m s⁻¹) is computed by inverting the Penman–Monteith equation:

$${G}_{{\rm{s}}}=\frac{{{\rm{LEG}}}_{{\rm{a}}}\gamma }{\Delta ({R}_{{\rm{n}}}-G-S)+\rho {C}_{{\rm{p}}}{G}_{{\rm{a}}}{\rm{VPD}}-{\rm{LE}}(\Delta +\gamma )}$$

where Δ is the slope of the saturation vapour pressure curve (kPa K⁻¹), ρ is the air density (kg m⁻³), C_p is the specific heat of the air (J K⁻¹ kg⁻¹), γ is the psychrometric constant (kPa K⁻¹), VPD (kPa), R_n (W m⁻²), G (W m⁻²) and S is the sum of all energy storage fluxes (W m⁻²) and set to 0 as not available in the dataset. When not available, G also was set to 0.

G_s represents the combined conductance of the vegetation and the soil to water vapour transfer. To retain the values with a clear physiological interpretation, we filtered the data as we did for the calculation of G1.

For each site, the 90th percentile of the half-hourly G_s was calculated and retained as the maximum surface conductance of each site (G_smax). G_s was computed using the bigleaf R package⁴⁴.

Maximum evapotranspiration in the growing season (ET_max)

This metric represents the maximum evapotranspiration computed as the 95th percentile of ET in the growing season and using the data retained after the same filtering applied for the G1 calculation.

Evaporative fraction (EF)

EF is the ratio between LE and the available energy, here calculated as the sum of H + LE (ref. ⁵⁰). For the calculation of EF, we used the same filtering strategy as for G1. We first calculated mean daytime EF. We then computed the EF per site as the growing season average of daytime EF. We also computed the amplitude of the EF in the growing season by calculating the interquartile distance of the distribution of mean daytime EF (EF_ampl).

Principal component analysis

A PCA was conducted on the multivariate space of the ecosystem functions. Each variable (ecosystem functional property, EFP) was standardized using z-transformation (that is, by subtracting its mean value and then dividing by its standard deviation). From the PCA results we extracted the explained variance of each component and the loadings of the EFPs, indicating the contribution of each variable to the component. We performed the PCA using the function PCA() implemented in the R package FactoMineR⁵¹.

We justify using PCA over nonlinear methods because it is an exploratory technique that is highly suited to the analysis of the data volume used in this study, whereas other nonlinear methods applied to such data would be over-parameterized. For the same reason, PCA was used in previous work concerning the global spectrum of leaf and plant traits, and fluxes^1,3,52.

To test the significance of dimensionality of the PCA, we used a previously described methodology⁵³. We used the R package ade4 (ref. ⁵⁴) and evaluated the number of significant components of the PCA to be retained to minimize both redundancy and loss of information (Supplementary Information 2). We tested the significance of the PCA loadings using a combination of the bootstrapped eigenvector method⁵⁵ and a threshold selected using the number of dimensions⁵⁶ (Supplementary Information 2).

Predictive variable importance

A random forests (RF) analysis was used to identify the vegetation structure and climate variables that contribute the most to the variability of the significant principal components, which were identified with the PCA analysis (see ‘Principal component analysis’). In the main text we refer to the results of this analysis as ‘predictive variable importance’ to distinguish this to the ‘causal variable importance’ described below.

The analysis was conducted using the following predictor variables: as structural variables, N% (gN 100 g⁻¹), LAI_max (m₂ m⁻²), AGB (t DM ha⁻¹) and H_c (m); as climatic variables, mean annual precipitation (P, mm), mean VPD during the growing season (VPD, hPa), mean shortwave radiation (SW_in, W m⁻²), mean air temperature (T_air, °C); and the cumulative soil water index (CSWI, −), as indicator of site water availability.

We used partial dependencies of variables to assess the relationship between individual predictors and the response variable (that is, PC1, PC2 and PC3).

The results from the partial dependency analysis can be used to determine the effects of individual variables on the response, without the influence of the other variables. The partial dependence function was calculated using the pdp R package⁵⁷.

The partial dependencies were calculated restricted to the values that lie within the convex hull of their training values to reduce the risk of interpreting the partial dependence plot outside the range of the data (extrapolation).

Invariant causal regression models and causal variable importance

We have quantified the dependence of the principal components on the different structural and climatic variables using nonlinear regression. Such dependencies can only be interpreted causally if the regression models are in fact causal regression models (see Supplementary Information 3 for a formal definition), which may not be the case if there are hidden confounders. To see whether the regression models allow for a causal interpretation, we use invariant causal prediction⁵⁸. This method investigates whether the regression models are stable with respect to different patterns of heterogeneity in the data, encoded by different environments (that is, subsets of the original dataset). The rationale is that a causal model, describing the full causal mechanism for the response variable, should be invariant with respect to changes in the environment if the latter does not directly influence the response variable^13,59. Other non-causal models may be invariant, too, but a non-invariant model cannot be considered causal.

How to choose the environments is a modelling choice that must satisfy the following criteria. First, it should be possible to assign each data point to exactly one environment. Second, the environments should induce heterogeneity in the data, so that, for example, the predictor variables have different distributions across environments. Third, the environments must not directly affect the response variable, only via predictors, although the distribution of the response may still change between environments. The third criterion can be verified by expert knowledge and is assumed to hold for our analysis. In addition, if it is violated, then, usually, no set is invariant⁵⁸, which can be detected from data.

In our analysis, we assigned each data point (that is, each site) to one of two environments (two subsets of the original dataset): the first includes forest sites in North America, Europe or Asia; and the second includes non-forest and forest ecosystems from South America, Africa or Oceania, and non-forest ecosystems from North America, Europe or Asia (see Supplementary Information 3.1.3.1 for details). Our choice satisfies the method’s assumption that the distribution of the predictors is different between the two environments (that is, they induce heterogeneity in the data; see Supplementary Fig. 3.1). Environments that are too small or too homogeneous do not provide any evidence against the full set of covariates being a candidate for the set of causal predictors. Other choices of environments than the one presented here yield consistent results (Supplementary Information 3.2.1, Supplementary Fig. 3.4).

For each subset of predictors, we test whether the corresponding regression model is invariant (yielding the same model fit in each environment). Although many models were rejected and considered non-invariant, the full model (with all the nine predictors and used in the predictive variable importance analysis) was accepted as invariant, establishing the full set of covariates as a reasonable candidate for the set of direct causal predictors. We used both RF (randomForest package in R⁶⁰) and generalized additive models, GAM⁶¹ (mgcv package⁶² in R) to fit the models. Both methods lead to comparable results but with a better average performance of the RF: GAM led to slightly better results than RF for PC1, whereas for PC2 and PC3 RF showed a much better model performance (Supplementary Table 3.1, Supplementary Information 3.2.2). Therefore, in the main text we showed only the results from the RF (except for PC1).

If, indeed, the considered regression models are causal, this allows us to make several statements. First, we can test for the existence of causal effects by testing for statistical significance of the respective predictors in the fitted models. Second, we can use the response curves of the fitted model to define a variable importance measure with a causal interpretation. In the main text we refer to this variable importance as ‘causal variable importance’. For details, see Supplementary Information 3.1.2. More formally, we considered the expected value of the predicted variables (the principal components) under joint interventions on all covariates (AGB, H_c, LAI_max, N%, T_air, VPD, SW_in, CSWI and P) at once, and then, to define the importance, we quantified how this expected value depends on the different covariates. We applied the same analysis to groups of vegetation structural and climate covariates (see ‘Groupwise variable importance’ in Supplementary Information 3.1.2.3, 3.2.3).

The details of the methodology and the results are described in Supplementary Information 3, in which we also provide further details on the choice of environment variable and on the statistical tests that we use to test for invariance. An overview of the invariance-based methodology is shown in Supplementary Fig. 3.1.

Land surface model runs

We run two widely used land surface models: Orchidee-CN (OCN) and Jena Scheme for Biosphere Atmosphere Coupling in Hamburg (JSBACH):

OCN

The dynamic global vegetation model OCN is a model of the coupled terrestrial carbon and nitrogen cycles^63,64, derived from the ORCHIDEE land surface model. It operates at a half-hourly timescale and simulates diurnal net carbon, heat and water exchanges, as well as nitrogen trace gas emissions, which jointly affect the daily changes in leaf area index, foliar nitrogen, and vegetation structure and growth. The main purpose of the model is to analyse the longer-term (interannual to decadal) implication of nutrient cycling for the modelling of land–climate interactions^64,65. The model can run offline, driven by observed meteorological parameters, or coupled to the global circulation model.

JSBACH

JSBACH v.3 is the land surface model of the MPI Earth System Model^66,67. The model operates at a half-hourly time step and simulates the diurnal net exchange of momentum, heat, water and carbon with the atmosphere. Daily changes in leaf area index and leaf photosynthetic capacity are derived from a prognostic scheme assuming a PFT-specific set maximum leaf area index and a set of climate responses modulating the seasonal course of leaf area index. Carbon pools are prognostic allowing for simulating the seasonal course of net land–atmosphere carbon exchanges.

We selected OCN and JSBACH because they are widely used land surface models with different structures. JSBACH is a parsimonious representation of the terrestrial energy, water and carbon exchanges used to study the coupling of land and atmosphere processes in an Earth system model⁶⁷. OCN has also been derived from the land surface model ORCHIDEE⁶⁸, but it includes a more comprehensive representation of plant physiology, including a detailed representation of the tight coupling of the C and N cycling⁶³. Both models contribute to the annual global carbon budget of the Global Carbon Project⁶⁹ and have shown good performance compared to a number of global benchmarks. OCN was further used in several model syntheses focused on the interaction between changing N deposition and CO₂ fertilization^70,71,72. Both OCN and JSBACH can operate at a half-hourly timescale and simulate net and gross carbon exchanges, water and energy fluxes, and therefore are ideal for the extraction of ecosystem functional properties, as done with the eddy covariance data.

The models were driven by half-hourly meteorological variables (shortwave and longwave downward flux, air temperature and humidity, precipitation, wind speed and atmospheric CO₂ concentrations) observed at the eddy covariance sites. OCN was furthermore driven by N deposition fields⁷³. Vegetation type, soil texture and plant available water were prescribed on the basis of site observations, but no additional site-specific parameterization was used. Both models were brought into equilibrium with respect to their ecosystem water storage and biogeochemical pools by repeatedly looping over the available site years. We added random noise (mean equal to 0 and standard deviation of 5% of the flux value) to the fluxes simulated by the models to mimic the random noise of the eddy covariance flux observations. An additional test conducted without noise addition showed only a marginal effect on the calculations of the functional properties and the ecosystem functional space.

We used runs of the JSBACH and OCN model for 48 FLUXNET sites (Supplementary Table 1). The simulated fluxes were evaluated against the observation to assess the performance of the models at the selected sites. From the model outputs and from each site we derived the ecosystem functions using the same methodology described above. Then the PCA analysis was performed on the three datasets (FLUXNET, OCN and JSBACH) and restricted to the 48 sites used to run the models. We ran the models only on the subset of sites for which the information for the parameterization and high-quality forcing was available. However, the different ecosystem functions emerge from the model structure and climatological conditions. Therefore, even with a smaller set of site we can evaluate whether models reproduce the key dimensions of terrestrial ecosystem function by comparing the PCA results from FLUXNET and the model runs.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Data availability

Data used for this study are the FLUXNET dataset LaThuile (https://fluxnet.fluxdata.org/data/la-thuile-dataset/) and FLUXNET2015 (https://fluxnet.fluxdata.org/data/fluxnet2015-dataset/). Biological, ancillary, disturbance and metadata information for the sites were collected from databases and the literature and are available at the following address together with the reproducible workflow (https://doi.org/10.5281/zenodo.5153538). OCN and JSBACH model runs are available in the reproducible workflow (https://doi.org/10.5281/zenodo.5153538).

Code availability

The R codes used for this analysis are available at: https://doi.org/10.5281/zenodo.5153538. The R codes for the causality analysis are available at: https://doi.org/10.5281/zenodo.5153534. The TEA algorithm is available at https://doi.org/10.5281/zenodo.3921923.

References

Wright, I. J. et al. The worldwide leaf economics spectrum. Nature 428, 821–827 (2004).
Article ADS CAS PubMed Google Scholar
Reich, P. B., Walters, M. B. & Ellsworth, D. S. From tropics to tundra: global convergence in plant functioning. Proc. Natl Acad. Sci. USA 94, 13730–13734 (1997).
Article ADS CAS PubMed PubMed Central Google Scholar
Díaz, S. et al. The global spectrum of plant form and function. Nature 529, 167–171 (2016).
Article ADS PubMed CAS Google Scholar
Bruelheide, H. et al. Global trait–environment relationships of plant communities. Nat. Ecol. Evol. 2, 1906–1917 (2018).
Article PubMed Google Scholar
Manning, P. et al. Redefining ecosystem multifunctionality. Nat. Ecol. Evol. 2, 427–436 (2018).
Article PubMed Google Scholar
Reichstein, M., Bahn, M., Mahecha, M. D., Kattge, J. & Baldocchi, D. D. Linking plant and ecosystem functional biogeography. Proc. Natl Acad. Sci. USA 111, 13697–13702 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Anderegg, W. R. L. et al. Climate-driven risks to the climate mitigation potential of forests. Science 368, eaaz7005 (2020).
Article CAS PubMed Google Scholar
Bonan, G. B. Forests and climate change: forcings, feedbacks, and the climate benefits of forests. Science 320, 1444–1449 (2008).
Article ADS CAS PubMed Google Scholar
Baldocchi, D. ‘Breathing’ of the terrestrial biosphere: lessons learned from a global network of carbon dioxide flux measurement systems. Aust. J. Bot. 56, 1–26 (2008).
Article CAS Google Scholar
Pastorello, G. et al. The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data. Sci. Data 7, 225 (2020).
Article PubMed PubMed Central Google Scholar
Nelson, J. A. et al. Ecosystem transpiration and evaporation: insights from three water flux partitioning methods across FLUXNET sites. Global Change Biol. 26, 6916–6930 (2020).
Article ADS Google Scholar
Janssens, I. A. et al. Productivity overshadows temperature in determining soil and ecosystem respiration across European forests. Global Change Biol. 7, 269–278 (2001).
Article ADS Google Scholar
Pearl, J. Causality (Cambridge University Press, 2009).
Krich, C. et al. Functional convergence of biosphere–atmosphere interactions in response to meteorological conditions. Biogeosciences 18, 2379–2404 (2021).
Article ADS CAS Google Scholar
Musavi, T. et al. Stand age and species richness dampen interannual variation of ecosystem-level photosynthetic capacity. Nat. Ecol. Evol. 1, 0048 (2017).
Article Google Scholar
Ryan, M. G., Phillips, N. & Bond, B. J. The hydraulic limitation hypothesis revisited. Plant Cell Environ. 29, 367–381 (2006).
Article PubMed Google Scholar
De Kauwe, M. G., Medlyn, B. E., Knauer, J. & Williams, C. A. Ideas and perspectives: how coupled is the vegetation to the boundary layer? Biogeosciences 14, 4435–4453 (2017).
Article ADS CAS Google Scholar
Skubel, R. et al. Age effects on the water-use efficiency and water-use dynamics of temperate pine plantation forests. Hydrol. Processes 29, 4100–4113 (2015).
Article ADS Google Scholar
Law, B. E., Thornton, P. E., Irvine, J., Anthoni, P. M. & Van Tuyl, S. Carbon storage and fluxes in ponderosa pine forests at different developmental stages. Global Change Biol. 7, 755–777 (2001).
Article ADS Google Scholar
Collalti, A. et al. Forest production efficiency increases with growth temperature. Nat. Commun. 11, 5322 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
DeLucia, E. H., Drake, J. E., Thomas, R. B. & Gonzalez-Meler, M. Forest carbon use efficiency: is respiration a constant fraction of gross primary production? Global Change Biol. 13, 1157–1167 (2007).
Article ADS Google Scholar
Fernández-Martínez, M. et al. Nutrient availability as the key regulator of global forest carbon balance. Nat. Clim. Change 4, 471–476 (2014).
Article ADS CAS Google Scholar
Kennedy, D. et al. Implementing plant hydraulics in the community land model, version 5. J. Adv. Model. Earth Syst. 11, 485–513 (2019).
Article ADS Google Scholar
Manzoni, S. et al. Reviews and syntheses: carbon use efficiency from organisms to ecosystems – definitions, theories, and empirical evidence. Biogeosciences 15, 5929–5949 (2018).
Article ADS CAS Google Scholar
Eyring, V. et al. Earth System Model Evaluation Tool (ESMValTool) v2.0 – an extended set of large-scale diagnostics for quasi-operational and comprehensive evaluation of Earth system models in CMIP. Geosci. Model Dev. 13, 3383–3438 (2020).
Article ADS CAS Google Scholar
Reichstein, M. et al. On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm. Global Change Biol. 11, 1424–1439 (2005).
Article ADS Google Scholar
Nelson, J. A., Carvalhais, N., Migliavacca, M., Reichstein, M. & Jung, M. Water-stress-induced breakdown of carbon–water relations: indicators from diurnal FLUXNET patterns. Biogeosciences 15, 2433–2447 (2018).
Article ADS CAS Google Scholar
Nelson, J. et al. Coupling water and carbon fluxes to constrain estimates of transpiration: the TEA algorithm. J. Geophys. Res. Biogeosci. 123, 3617–3632 (2018).
Article CAS Google Scholar
Musavi, T. et al. The imprint of plants on ecosystem functioning: a data-driven approach. Int. J. Appl. Earth Obs. Geoinf. 43, 119–131 (2015).
ADS Google Scholar
Musavi, T. et al. Potential and limitations of inferring ecosystem photosynthetic capacity from leaf functional traits. Ecol. Evol. 6, 7352–7366 (2016).
Article PubMed PubMed Central Google Scholar
Fleischer, K. et al. Low historical nitrogen deposition effect on carbon sequestration in the boreal zone. J. Geophys. Res. Biogeosci.120, 2542–2561 (2015).
Article CAS Google Scholar
Flechard, C. R. et al. Carbon–nitrogen interactions in European forests and semi-natural vegetation. Part I: Fluxes and budgets of carbon, nitrogen and greenhouse gases from ecosystem monitoring and modelling. Biogeosciences 17, 1583–1620 (2020).
Article ADS CAS Google Scholar
Migliavacca, M. et al. Semiempirical modeling of abiotic and biotic factors controlling ecosystem respiration across eddy covariance sites. Global Change Biol. 17, 390–409 (2011).
Article ADS Google Scholar
Santoro, M. et al. The global forest above-ground biomass pool for 2010 estimated from high-resolution satellite observations. Earth Syst. Sci. Data 13, 3927–3950 (2021).
Article ADS Google Scholar
Besnard, S. et al. Quantifying the effect of forest age in annual net forest carbon balance. Environ. Res. Lett. 13, 124018 (2018).
Article ADS Google Scholar
Migliavacca, M. et al. Seasonal and interannual patterns of carbon and water fluxes of a poplar plantation under peculiar eco-climatic conditions. Agric. For. Meteorol. 149, 1460–1476 (2009).
Article ADS Google Scholar
Gilmanov, T. G. et al. Productivity, respiration, and light-response parameters of world grassland and agroecosystems derived from flux-tower measurements. Rangel. Ecol. Manag. 63, 16–39 (2010).
Article Google Scholar
Wutzler, T. et al. Basic and extensible post-processing of eddy covariance flux data with REddyProc. Biogeosciences 15, 5015–5030 (2018).
Article ADS CAS Google Scholar
Mahecha, M. D. et al. Global convergence in the temperature sensitivity of respiration at ecosystem level. Science 329, 838–840 (2010).
Article ADS CAS PubMed Google Scholar
Migliavacca, M. et al. Influence of physiological phenology on the seasonal pattern of ecosystem respiration in deciduous forests. Global Change Biol. 21, 363–376 (2015).
Article ADS Google Scholar
Reichstein, M. et al. Modeling temporal and large-scale spatial variability of soil respiration from soil water availability, temperature and vegetation productivity indices. Global Biogeochem. Cycles 17, 1104 (2003).
Article ADS CAS Google Scholar
Knauer, J. et al. Towards physiologically meaningful water-use efficiency estimates from eddy covariance data. Global Change Biol. 24, 694–710 (2018).
Article ADS Google Scholar
Medlyn, B. E. et al. Reconciling the optimal and empirical approaches to modelling stomatal conductance. Global Change Biol. 17, 2134–2144 (2011).
Article ADS Google Scholar
Knauer, J., El-Madany, T. S., Zaehle, S. & Migliavacca, M. bigleaf—an R package for the calculation of physical and physiological ecosystem properties from eddy covariance data. PloS ONE 13, e0201114 (2018).
Article PubMed PubMed Central CAS Google Scholar
Knohl, A. & Buchmann, N. Partitioning the net CO₂ flux of a deciduous forest into respiration and assimilation using stable carbon isotopes. Global Biogeochem. Cycles 19, GB4008 (2005).
Article ADS CAS Google Scholar
Zhou, S., Yu, B., Huang, Y. & Wang, G. The effect of vapor pressure deficit on water use efficiency at the subdaily time scale. Geophys. Res. Lett. 41, 5005–5013 (2014).
Article ADS Google Scholar
Verhoef, A., De Bruin, H. A. R. & Van Den Hurk, B. J. J. M. Some practical notes on the parameter kB⁻¹ for sparse vegetation. J. Appl. Meteorol. 36, 560–572 (1997).
Article ADS Google Scholar
Thom, A. S. in Vegetation and the Atmosphere (ed. Monteith, J. L.) 57–109 (Academic Press, 1975).
Thom, A. S. Momentum, mass and heat exchange of vegetation. Q. J. R. Meteorolog. Soc. 98, 124–134 (1972).
Article ADS Google Scholar
Gentine, P., Entekhabi, D., Chehbouni, A., Boulet, G. & Duchemin, B. Analysis of evaporative fraction diurnal behaviour. Agric. For. Meteorol. 143, 13–29 (2007).
Article ADS Google Scholar
Husson, F., Le, S. & Pages, J. Exploratory Multivariate Analysis by Example Using R (CRC Press, 2010).
Kraemer, G., Camps-Valls, G., Reichstein, M. & Mahecha, M. D. Summarizing the state of the terrestrial biosphere in few dimensions. Biogeosciences 17, 2397–2424 (2020).
Article ADS Google Scholar
Dray, S. On the number of principal components: a test of dimensionality based on measurements of similarity between matrices. Comput. Stat. Data Anal. 52, 2228–2237 (2008).
Article MathSciNet MATH Google Scholar
Dray, S. & Dufour, A.-B. The ade4 package: implementing the duality diagram for ecologists. J. Stat. Softw. 22, 20 (2007).
Article Google Scholar
Peres-Neto, P. R., Jackson, D. A. & Somers, K. M. Giving meaningful interpretation to ordination axes: assessing loading significance in principal component analysis. Ecology 84, 2347–2363 (2003).
Article Google Scholar
Richman, M. B. A cautionary note concerning a commonly applied eigenanalysis procedure. Tellus B 40B, 50–58 (1988).
Article ADS CAS Google Scholar
Friedman, J. H. Greedy function approximation: a gradient boosting machine. Ann. Statist. 29, 1189–1232 (2001).
Article MathSciNet MATH Google Scholar
Peters, J., Bühlmann, P. & Meinshausen, N. Causal inference by using invariant prediction: identification and confidence intervals. J. R. Stat. Soc. B 78, 947–1012 (2016).
Article MathSciNet MATH Google Scholar
Haavelmo, T. The probability approach in econometrics. Econometrica 12, 1–115 (1944).
Article MathSciNet MATH Google Scholar
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Hastie, T. J. & Tibshirani, R. J. Generalized Additive Models Vol. 43 (CRC Press, 1990).
Wood, S. N. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J. R. Stat. Soc. B 73, 3–36 (2011).
Article MathSciNet MATH Google Scholar
Zaehle, S. & Friend, A. D. Carbon and nitrogen cycle dynamics in the O-CN land surface model: 1. Model description, site-scale evaluation, and sensitivity to parameter estimates. Global Biogeochem. Cycles 24, GB1005 (2010).
Zaehle, S. et al. Carbon and nitrogen cycle dynamics in the O-CN land surface model: 2. Role of the nitrogen cycle in the historical terrestrial carbon balance. Global Biogeochem. Cycles 24, GB1006 (2010).
Article ADS Google Scholar
Zaehle, S., Friedlingstein, P. & Friend, A. D. Terrestrial nitrogen feedbacks may accelerate future climate change. Geophys. Res. Lett. 37, L01401 (2010).
Article ADS CAS Google Scholar
Raddatz, T. J. et al. Will the tropical land biosphere dominate the climate–carbon cycle feedback during the twenty-first century? Clim. Dyn. 29, 565–574 (2007).
Article Google Scholar
Mauritsen, T. et al. Developments in the MPI-M Earth system model version 1.2 (MPI-ESM1.2) and its response to increasing CO₂. J. Adv. Model. Earth Syst. 11, 998–1038 (2019).
Article ADS PubMed PubMed Central Google Scholar
Krinner, G. et al. A dynamic global vegetation model for studies of the coupled atmosphere-biosphere system. Global Biogeochem. Cycles 19, GB1015 (2005).
Article ADS CAS Google Scholar
Friedlingstein, P. et al. Global carbon budget 2019. Earth Syst. Sci. Data 11, 1783–1838 (2019).
Article ADS Google Scholar
Fleischer, K. et al. Amazon forest response to CO₂ fertilization dependent on plant phosphorus acquisition. Nat. Geosci. 12, 736–741 (2019).
Article ADS CAS Google Scholar
Meyerholt, J. & Zaehle, S. Controls of terrestrial ecosystem nitrogen loss on simulated productivity responses to elevated CO₂. Biogeosciences 15, 5677–5698 (2018).
Article ADS CAS Google Scholar
Zaehle, S. et al. Evaluation of 11 terrestrial carbon-nitrogen cycle models against observations from two temperate free-air CO₂ enrichment studies. New Phytol. 202, 803–822 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zaehle, S., Ciais, P., Friend, A. D. & Prieur, V. Carbon benefits of anthropogenic reactive nitrogen offset by nitrous oxide emissions. Nat. Geosci. 4, 601–605 (2011).
Article ADS CAS Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer-Verlag, 2016).
Whittaker, R. H. Communities and Ecosystems 2nd edn (MacMillan Publishing Co., 1975).
Ricklefs, R. E. The Economy of Nature 6th ed. Ch. 5 (W. H. Freeman, 2008).
Liu, Y., Schwalm, C. R., Samuels-Crow, K. E. & Ogle, K. Ecological memory of daily carbon exchange across the globe and its importance in drylands. Ecol. Lett. 22, 1806–1816 (2019).
Article PubMed Google Scholar

Download references

Acknowledgements

This work has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement no. 721995. M.M. and M. Reichstein acknowledge the Alexander Von Humboldt Foundation for funding with the Max Planck Research Prize 2013 to M. Reichstein. This work used eddy covariance data acquired and shared by the FLUXNET community, including these networks: AmeriFlux, AfriFlux, AsiaFlux, CarboAfrica, CarboEuropeIP, CarboItaly, CarboMont, ChinaFlux, Fluxnet-Canada, GreenGrass, ICOS, KoFlux, LBA, NECC, OzFlux-TERN, Swiss FluxNet, TCOS-Siberia and USCCC. The ERA-Interim reanalysis data were provided by ECMWF and processed by LSCE. The FLUXNET eddy covariance data processing and harmonization was carried out by the European Fluxes Database Cluster, the AmeriFlux Management Project and the Fluxdata project of FLUXNET, with the support of the CDIAC and the ICOS Ecosystem Thematic Center, and the OzFlux, ChinaFlux and AsiaFlux offices. R.C. and J. Peters were supported by the VILLUM FONDEN (18968) and J. Peters in addition by the Carlsberg Foundation; P.R. acknowledges funding support from the US National Science Foundation (NSF) Long-Term Ecological Research (DEB-1831944) and Biological Integration Institutes (NSF-DBI-2021898); N.B. acknowledges funding from various SNF projects, including ICOS-CH (20FI21_148992, 20FI20_173691), the ETH Board and ETH Zurich (TH-1006-02); and T.F.K. acknowledges support from the Reducing Uncertainties in Biogeochemical Interactions through Synthesis and Computation Scientific Focus Area (RUBISCO SFA), which is sponsored by the Regional and Global Model Analysis (RGMA) Program of the Office of Biological and Environmental Research (BER) in the US Department of Energy Office of Science. OzFlux is supported by the Australian Government’s Terrestrial Ecosystem Research Network (TERN, www.tern.org.au). We thank K. Morris and S. Paulus for comments on the draft, K. Blakeslee for English editing and G. Bohrer for sharing nitrogen data for his site.

Funding

Open access funding provided by Max Planck Society.

Author information

Mirco Migliavacca
Present address: European Commission, Joint Research Centre (JRC), Ispra, Italy
Jürgen Knauer
Present address: Hawkesbury Institute for the Environment, Western Sydney University, Penrith, New South Wales, Australia

Authors and Affiliations

Max Planck Institute for Biogeochemistry, Jena, Germany
Mirco Migliavacca, Talie Musavi, Miguel D. Mahecha, Jacob A. Nelson, Silvia Caldararu, Nuno Carvalhais, Tarek S. El-Madany, Ulisse Gomarasca, Mathias Göckede, Martin Jung, Jens Kattge, David Martini, Daniel E. Pabon-Moreno, Ulrich Weber, Sönke Zaehle & Markus Reichstein
German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany
Mirco Migliavacca, Miguel D. Mahecha, Jens Kattge & Markus Reichstein
Remote Sensing Center for Earth System Research, Leipzig University, Leipzig, Germany
Miguel D. Mahecha & Guido Kraemer
Helmholtz Centre for Environmental Research – UFZ, Leipzig, Germany
Miguel D. Mahecha
CSIRO Oceans and Atmosphere, Canberra, Australian Capital Territory, Australia
Jürgen Knauer
Department of Environmental Science, Policy and Management, University of California, Berkeley, Berkeley, CA, USA
Dennis D. Baldocchi & Trevor F. Keenan
Department of Forest Engineering, ERSAF Research Group, University of Cordoba, Cordoba, Spain
Oscar Perez-Priego
Department of Mathematical Sciences, University of Copenhagen, Copenhagen, Denmark
Rune Christiansen & Jonas Peters
Environment and Sustainability Institute, University of Exeter, Penryn, UK
Karen Anderson
Department of Ecology, University of Innsbruck, Innsbruck, Austria
Michael Bahn & Georg Wohlfahrt
Faculty of Land and Food Systems, Vancouver, British Columbia, Canada
T. Andrew Black
Department of Geography, University of Colorado, Boulder, CO, USA
Peter D. Blanken
Université de Lorraine, AgroParisTech, INRAE, UMR Silva, Nancy, France
Damien Bonal
Department of Environmental Systems Science, ETH Zurich, Zurich, Switzerland
Nina Buchmann & Sebastian Wolf
Fundación Centro de Estudios Ambientales del Mediterráneo (CEAM), Paterna, Spain
Arnaud Carrara
Departamento de Ciências e Engenharia do Ambiente, Universidade Nova de Lisboa, Caparica, Portugal
Nuno Carvalhais
European Commission, Joint Research Centre (JRC), Ispra, Italy
Alessandro Cescatti
Landscape Ecology & Ecosystem Science (LEES) Lab, Center for Global Change and Earth Observations, and Department of Geography, Environmental and Spatial Science, Michigan State University, East Lansing, MI, USA
Jiquan Chen
School of Life Sciences, University of Technology Sydney, Ultimo, New South Wales, Australia
Jamie Cleverly
Terrestrial Ecosystem Research Network, College of Science and Engineering, James Cook University, Cairns, Queensland, Australia
Jamie Cleverly
Climate Change Unit, Environmental Protection Agency of Aosta Valley, Aosta, Italy
Edoardo Cremonese, Gianluca Filippa & Marta Galvagno
Department of Atmospheric and Oceanic Sciences, University of Wisconsin-Madison, Madison, WI, USA
Ankur R. Desai
O’Neill School of Public and Environmental Affairs, Indiana University, Bloomington, IN, USA
Martha M. Farella
Research Group Plant and Ecosystems (PLECO), Department of Biology, University of Antwerp, Wilrijk, Belgium
Marcos Fernández-Martínez & Ivan A. Janssens
Institute of Photogrammetry and Remote Sensing, TU Dresden, Dresden, Germany
Matthias Forkel
Department of Biology, Virginia Commonwealth University, Richmond, VA, USA
Christopher M. Gough
Department of Environmental Engineering, Technical University of Denmark (DTU), Kongens Lyngby, Denmark
Andreas Ibrom
Institute for Agro-Environmental Sciences, National Agriculture and Food Research Organization, Tsukuba, Japan
Hiroki Ikawa
Earth and Environmental Science Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Trevor F. Keenan
Bioclimatology, Faculty of Forest Sciences and Forest Ecology, University of Goettingen, Goettingen, Germany
Alexander Knohl
Centre of Biodiversity and Sustainable Land Use (CBL), University of Goettingen, Goettingen, Germany
Alexander Knohl
Research Institute for Global Change, Institute of Arctic Climate and Environment Research, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Yokohama, Japan
Hideki Kobayashi
Image Processing Laboratory (IPL), Universitat de València, València, Spain
Guido Kraemer
Department of Forest Ecosystems and Society, Oregon State University, Corvallis, OR, USA
Beverly E. Law
Centre for Tropical, Environmental, and Sustainability Sciences, James Cook University, Cairns, Queensland, Australia
Michael J. Liddell
College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, China
Xuanlong Ma
Institute for Atmospheric and Earth System Research/Physics, Faculty of Science, University of Helsinki, Helsinki, Finland
Ivan Mammarella
CSIRO Land and Water, Floreat, Western Australia, Australia
Craig Macfarlane
Consiglio Nazionale delle Ricerche, Istituto per la BioEconomia (CNR – IBE), Sesto Fiorentino, Italy
Giorgio Matteucci
Facoltà di Scienze e Tecnologie, Libera Universita’ di Bolzano, Bolzano, Italy
Leonardo Montagnani
Forest Services of the Autonomous Province of Bozen-Bolzano, Bolzano, Italy
Leonardo Montagnani
Department of Earth and Environmental Sciences (DISAT), University of Milano-Bicocca, Milan, Italy
Cinzia Panigada & Micol Rossini
Department for Innovation in Biological, Agro-Food and Forest Systems (DIBAF), University of Tuscia, Viterbo, Italy
Dario Papale
Hawkesbury Institute for the Environment, Western Sydney University, Penrith, New South Wales, Australia
Elise Pendall, Peter B. Reich & Ian J. Wright
CSIC, Global Ecology Unit CREAF-CSIC-UAB, Barcelona, Spain
Josep Penuelas
CREAF, Barcelona, Spain
Josep Penuelas
Department of Biology, Indiana University, Bloomington, IN, USA
Richard P. Phillips
Department of Forest Resources, University of Minnesota, Saint Paul, MN, USA
Peter B. Reich
Institute for Global Change Biology and School for Environment and Sustainability, University of Michigan, Ann Arbor, MI, USA
Peter B. Reich
Department of Earth and Planetary Sciences, Weizmann Institute of Science, Rehovot, Israel
Eyal Rotenberg & Dan Yakir
Southwest Watershed Research Center, USDA Agricultural Research Service, Tucson, AZ, USA
Russell L. Scott
INRAE, UMR EcoFoG, CNRS, Cirad, AgroParisTech, Université des Antilles, Université de Guyane, Kourou, France
Clement Stahl
Department of Biological Sciences, Macquarie University, Sydney, New South Wales, Australia
Ian J. Wright
Michael-Stifel-Center Jena for Data-driven and Simulation Science, Friedrich-Schiller-Universität Jena, Jena, Germany
Markus Reichstein

Authors

Mirco Migliavacca
View author publications
You can also search for this author in PubMed Google Scholar
Talie Musavi
View author publications
You can also search for this author in PubMed Google Scholar
Miguel D. Mahecha
View author publications
You can also search for this author in PubMed Google Scholar
Jacob A. Nelson
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Knauer
View author publications
You can also search for this author in PubMed Google Scholar
Dennis D. Baldocchi
View author publications
You can also search for this author in PubMed Google Scholar
Oscar Perez-Priego
View author publications
You can also search for this author in PubMed Google Scholar
Rune Christiansen
View author publications
You can also search for this author in PubMed Google Scholar
Jonas Peters
View author publications
You can also search for this author in PubMed Google Scholar
Karen Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Michael Bahn
View author publications
You can also search for this author in PubMed Google Scholar
T. Andrew Black
View author publications
You can also search for this author in PubMed Google Scholar
Peter D. Blanken
View author publications
You can also search for this author in PubMed Google Scholar
Damien Bonal
View author publications
You can also search for this author in PubMed Google Scholar
Nina Buchmann
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Caldararu
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Carrara
View author publications
You can also search for this author in PubMed Google Scholar
Nuno Carvalhais
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Cescatti
View author publications
You can also search for this author in PubMed Google Scholar
Jiquan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jamie Cleverly
View author publications
You can also search for this author in PubMed Google Scholar
Edoardo Cremonese
View author publications
You can also search for this author in PubMed Google Scholar
Ankur R. Desai
View author publications
You can also search for this author in PubMed Google Scholar
Tarek S. El-Madany
View author publications
You can also search for this author in PubMed Google Scholar
Martha M. Farella
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Fernández-Martínez
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Filippa
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Forkel
View author publications
You can also search for this author in PubMed Google Scholar
Marta Galvagno
View author publications
You can also search for this author in PubMed Google Scholar
Ulisse Gomarasca
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Gough
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Göckede
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Ibrom
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Ikawa
View author publications
You can also search for this author in PubMed Google Scholar
Ivan A. Janssens
View author publications
You can also search for this author in PubMed Google Scholar
Martin Jung
View author publications
You can also search for this author in PubMed Google Scholar
Jens Kattge
View author publications
You can also search for this author in PubMed Google Scholar
Trevor F. Keenan
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Knohl
View author publications
You can also search for this author in PubMed Google Scholar
Hideki Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Guido Kraemer
View author publications
You can also search for this author in PubMed Google Scholar
Beverly E. Law
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Liddell
View author publications
You can also search for this author in PubMed Google Scholar
Xuanlong Ma
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Mammarella
View author publications
You can also search for this author in PubMed Google Scholar
David Martini
View author publications
You can also search for this author in PubMed Google Scholar
Craig Macfarlane
View author publications
You can also search for this author in PubMed Google Scholar
Giorgio Matteucci
View author publications
You can also search for this author in PubMed Google Scholar
Leonardo Montagnani
View author publications
You can also search for this author in PubMed Google Scholar
Daniel E. Pabon-Moreno
View author publications
You can also search for this author in PubMed Google Scholar
Cinzia Panigada
View author publications
You can also search for this author in PubMed Google Scholar
Dario Papale
View author publications
You can also search for this author in PubMed Google Scholar
Elise Pendall
View author publications
You can also search for this author in PubMed Google Scholar
Josep Penuelas
View author publications
You can also search for this author in PubMed Google Scholar
Richard P. Phillips
View author publications
You can also search for this author in PubMed Google Scholar
Peter B. Reich
View author publications
You can also search for this author in PubMed Google Scholar
Micol Rossini
View author publications
You can also search for this author in PubMed Google Scholar
Eyal Rotenberg
View author publications
You can also search for this author in PubMed Google Scholar
Russell L. Scott
View author publications
You can also search for this author in PubMed Google Scholar
Clement Stahl
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Weber
View author publications
You can also search for this author in PubMed Google Scholar
Georg Wohlfahrt
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Ian J. Wright
View author publications
You can also search for this author in PubMed Google Scholar
Dan Yakir
View author publications
You can also search for this author in PubMed Google Scholar
Sönke Zaehle
View author publications
You can also search for this author in PubMed Google Scholar
Markus Reichstein
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M., M. Reichstein, M.D.M. and T.M. conceived the study. M.M. and T.M. performed the majority of the analysis. R.C. and J. Peters designed and coded the causality analysis. J.A.N. provided the transpiration partitioning data. J. Knauer and S.Z. performed the land surface model runs. N.C. and U.W. processed the above-ground biomass data. O.P.-P. provided support with data analysis and discussions. M.M. wrote the first draft. All of the authors participated in intensive discussions on the manuscript and the revision phase, and contributed to writing the final manuscript. In addition, many site principal investigators contributed with additional data for their site.

Corresponding authors

Correspondence to Mirco Migliavacca or Markus Reichstein.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature thanks J. Hans C. Cornelissen, Diego Miralles and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Map of the 203 FLUXNET sites used in this analysis.

Colours represent different plant functional types according to the IGBP classification. IGBP classes are: CSH (close shrublands); DBF (deciduous broadleaved forest), DNF (deciduous needleleaf forests), EBF (evergreen broadleaved forest), ENF (evergreen needleleaf forest), GRA (grasslands), MF (mixed forest), OSH (open shrublands), SAV (savannah), and WET (wetlands). The map was generated with the ggplot2 R package⁷⁴. The shape files used to create the maps were downloaded from https://github.com/ngageoint/geopackage-js.

Extended Data Fig. 2 FLUXNET sites used in the analysis plotted in the precipitation–temperature space.

The background represent climate space of the major biomes according to Whittaker⁷⁵ and further modifications⁷⁶. Biomes are defined as function of the mean annual temperature and mean annual precipitation (MAP). The figure is modified from Liu et al.,⁷⁷ using the code available in git (https://github.com/kunstler/BIOMEplot).

Extended Data Fig. 3 Distribution of the selected FLUXNET sites within the climate types.

Climate types were defined according to Köppen-Geiger classification as follow: Tropical (Aw, Af, Am), Dry (BSh, BSk, BWh), Temperate (Cfb), Sub-Tropical (Cfa, Csa, Csb, Cwa), Temperate/Continental Hot (Dfa, Dfb, Dwa, Dwb, Dwc), Arctic (ET)], and Boreal (Dfc, Dsc).

Extended Data Fig. 4 Results of the relative importance analysis conducted with the Random Forest and partial dependence.

See ‘Predictive variable importance’ in Methods. The slopes of the partial dependence plot indicate the sensitivity of the response (PCs) to the specific predictor. The out-of-bag cross-validation leads to predictive explained variance of 56.76% for PC1, 30.24% for PC2, and 20.41% for PC3. The portion of unexplained variance might be related to missing leaf traits predictor such as leaf mass per area or phenological traits. The partial dependence plots of all variables are shown: top panels for PC1 (a–e), middle panels for PC2 (f–l), and bottom panels for PC3 (m–q). The blue lines represent the locally estimated scatterplot (LOESS) smoothing of the partial dependence. Tick marks in the x axis represent the minimum, maximum and deciles of the variable distribution.

Extended Data Fig. 5 Map of FLUXNET sites colour-coded for the value of PC1 and PC2.

a, PC1. b, PC2. The map of the PC1 shows the areas of the globe with high productivity (positive values of PC1 in the temperate areas, Eastern North America, Eastern Asia, and Tropics), and areas characterized by lower productivity (Semi-arid regions, high latitude and Mediterranean ecosystems). The map of the PC2 shows the gradient of evaporative fraction and the spatial patterns of water use efficiency. This PC2 runs from sites with a high evaporative fraction (i.e. available energy is dissipated preferentially to evaporated or transpired water), high surface conductance, and low water use efficiency (positive PC2 values), to water limited sites (i.e. low evaporative fraction where available energy is mainly dissipated by sensible heat) that also show higher water-use efficiency (negative PC2 values). The maps were generated with the ggplot2 R package⁷⁴. The shape files used to create the maps were downloaded from https://github.com/ngageoint/geopackage-js.

Extended Data Fig. 6 Biplot resulting from the principal component analysis.

Plot as in Fig. 1. In panel a, points are colour-coded by grass vs. non-grass classes. In panel b, the points are colour-coded according to the logarithm of vegetation height. From these results we conclude that there is not a clear cluster in the biplot for grass and non-grass vegetation. In fact, in Extended Data Fig. 6a, the sites do not cluster according to the designation to grasslands or not, but there is a clear gradient as a function of the vegetation height (Extended Data Fig. 6b).

Extended Data Fig. 7 Comparing observed and modelled global ecosystem functional trade-offs.

PCA for a subset of 48 FLUXNET sites mainly distributed in temperate and boreal regions and 2 different land surface models (Supplementary Table 1). The left column is FLUXNET, the centre column is OCN, and the right column is JSBACH. Panels a, b, c: the biplot resulting from the PCA. Panels d, e, f, bar plot of the loading of each ecosystem functional property to each principal component. Orange bars represent the loadings that are selected as significant and with high contribution (Supplementary Information 2). Panels g, h, i report the variance explained by each principal component. EFP acronym list: apparent carbon-use efficiency (aCUE), evaporative fraction (EF), amplitude of EF (EF_ampl), maximum evapotranspiration (ET_max), gross primary productivity at light saturation (GPP_sat), maximum surface conductance (G_smax), maximum net ecosystem productivity (NEP_max), maximum and mean basal ecosystem respiration (Rb_max and Rb, respectively), and growing season underlying water-use efficiency (uWUE). Note that the PCA results for FLUXNET (panels a, d, g) are different from Fig. 1 because here we use the subset of 48 sites used for the modelling analysis.

Extended Data Fig. 8 Pairwise relationship between some key ecosystem functional properties derived from FLUXNET, and modelled with JSBACH and OCN.

n = 48 sites; see Supplementary Table 1. The grey areas represent the 95% confidence interval of the linear and nonlinear regression. Overall the correlation between modelled functions is larger than in the observations. Acronym list: evaporative fraction (EF), amplitude of EF (EF_ampl), gross primary productivity at light saturation (GPP_sat), maximum surface conductance (G_s), maximum net ecosystem productivity (NEP_max), basal ecosystem respiration (Rb), and growing season underlying water-use efficiency (uWUE).

Extended Data Fig. 9 Representation of the 2D ecosystem functional properties space derived from FLUXNET observations and land surface model runs (OCN, JSBACH).

The points represent the principal component (PC) value calculate for each site. The contour lines are computed using a 2D kernel density estimates. The contour lines show the area occupied by ecosystem functional properties and its boundary that, according to the results of the analysis, are set by vegetation characteristics (PC1), water availability, abiotic limitations, and vegetation height (PC2), and above-ground biomass, foliar nitrogen and atmospheric aridity (PC3). The areas computed for FLUXNET are wider than for the models, particularly for PC2 and PC3. This means that ecosystem functional properties as simulated by models are more constrained than for the observations.

Extended Data Fig. 10 Evaluation of above-ground biomass satellite products against FLUXNET observation.

n = 71. We evaluated the three above-ground biomass (AGB, t DM ha⁻¹) products derived from the GlobBiomass dataset as reported in the Method section. From the product at its original resolution (100 x 100 m) we extracted the 95th percentile of the estimated AGB in 5 by 5 grid cell windows (AGB5x5, panel a with all sites, and panel b with the grasslands excluded) centered around the location of the FLUXNET sites used for the evaluation. Further, we extracted the median in 3 by 3 and 5 by 5 grid cells centered around the location of the FLUXNET site (panels c and d). Total above-ground biomass observations were gathered from the BADM dataset downloaded from the AMERIFLUX network and from the FLUXNET LaThuile release. Only data with the clear indication of the unit of AGB expressed in in dry matter (t DM ha⁻¹) were retained for the analysis. Results show that the median of the 5 by 5 grid cell window (panel c) is the best extraction method to characterize AGB at the FLUXNET sites, and therefore retained for further analysis. Adjusted determination coefficient (R²), linear regression function, and p-value calculated with the F-test are also reported.

Supplementary information

Supplementary Information 2

Significance test of the PCA and information redundancy: We report the number of significant axes to be retained in the PCA analysis and summarize the results of the statistical analysis in Table S2.

Reporting Summary

Supplementary Information 3

Invariant causal regression models and causal variable importance. This section contains theoretical concepts and a detailed description of the methods used in the causality analysis, and additional results.

Supplementary Table 1

List of FLUXNET sites used in the analysis. Coordinates (latitude and longitude), plant functional type (IGBP class), Köppen Geiger class, nitrogen content (N%), maximum leaf area index (LAI_max), maximum vegetation height (H_c), and above-ground biomass from the GlobBiomass dataset (AGB) are reported.

Supplementary Table 4

Evaluation of land surface model performances. We report an additional evaluation of the land surface model outputs.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Migliavacca, M., Musavi, T., Mahecha, M.D. et al. The three major axes of terrestrial ecosystem function. Nature 598, 468–472 (2021). https://doi.org/10.1038/s41586-021-03939-9

Download citation

Received: 30 October 2019
Accepted: 20 August 2021
Published: 22 September 2021
Issue Date: 21 October 2021
DOI: https://doi.org/10.1038/s41586-021-03939-9

This article is cited by

Effects of biotic and abiotic factors on ecosystem multifunctionality of plantations
- Jiaxin Tian
- Tian Ni
- Fengri Li
Ecological Processes (2024)
Comprehensive physiological, transcriptomic, and metabolomic analyses revealed the regulation mechanism of evergreen and cold resistance of Pinus koraiensis needles
- Yan Li
- Xin Li
- Xiyang Zhao
BMC Plant Biology (2024)
Characterizing the structural complexity of the Earth’s forests with spaceborne lidar
- Tiago de Conto
- John Armston
- Ralph Dubayah
Nature Communications (2024)
Resistance of ecosystem services to global change weakened by increasing number of environmental stressors
- Guiyao Zhou
- Nico Eisenhauer
- Manuel Delgado-Baquerizo
Nature Geoscience (2024)
A shift in transitional forests of the North American boreal will persist through 2100
- Paul M. Montesano
- Melanie Frost
- Gerald V. Frost
Communications Earth & Environment (2024)