Abstract
We present a self-consistent, large ensemble, high-resolution global dataset of long‐term future climate, which accounts for the uncertainty in climate system response to anthropogenic emissions of greenhouse gases and in geographical patterns of climate change. The dataset is developed by applying an integrated spatial disaggregation (SD) − bias-correction (BC) method to climate projections from the MIT Integrated Global System Model (IGSM). Four emission scenarios are considered that represent energy and environmental policies and commitments of potential future pathways, namely, Reference, Paris Forever, Paris 2 °C and Paris 1.5 °C. The dataset contains nine key meteorological variables on a monthly scale from 2021 to 2100 at a spatial resolution of 0.5°x 0.5°, including precipitation, air temperature (mean, minimum and maximum), near-surface wind speed, shortwave and longwave radiation, specific humidity, and relative humidity. We demonstrate the dataset’s ability to represent climate-change responses across various regions of the globe. This dataset can be used to support regional-scale climate-related impact assessments of risk across different applications that include hydropower, water resources, ecosystem, agriculture, and sustainable development.
Similar content being viewed by others
Background & Summary
Global climate model simulations of the past, current, and future climate have become readily available through different phases of the Coupled Model Intercomparison Project (CMIP). Over the past several decades, the CMIP data archive has served as a foundational element of climate science and for supporting decision and policy-makers communities. The recent sixth phase of CMIP (CMIP6) has expanded the breadth of the coordinated climate model experiments considerably with more than 70 participating models of a wide variety from more than 30 participating model groups worldwide1.
The Scenario Model Intercomparison Project (ScenarioMIP) in CMIP6 provides the main set of future climate projections based on alternative scenarios directly relevant to societal concerns for climate change mitigation, adaptation, or impacts2. These climate projections are driven by a set of emissions and land use scenarios produced with integrated assessment models (IAMs) based on future pathways of societal development, the Shared Socioeconomic Pathways (SSPs), and related to the forcing levels of the Representative Concentration Pathways (RCPs)3. Compared to CMIP5, CMIP6 RCPs fill critical gaps for intermediate forcing levels (for example, short-lived species and land use) and the full set of SSPs-RCPs combinations spans a larger range of outcomes. However, these multi-model climate projections under a matrix of possible integrated scenarios may not adequately represent the desired risk-based assessment of climate change impacts and climate policy benefits due to several issues described below. First, there is a lack of consistency in socioeconomic and environmental factors and such consistency is highly relevant for assessing climate impacts and climate policy benefits. The CMIP model projections are exogenously driven via greenhouse gas concentrations, and therefore do not account for the direct responses and linkages between natural processes and human emissions and other climate-relevant activities. Further inconsistencies are attributed to the disconnected users and developers that develop the shared socioeconomic scenarios, and thus lead to difficulties in interpreting different model results, despite a common implemented scenario target (e.g., land use). Further, the collection of CMIP6 SSP-RCP scenarios was not produced by one IAM, but rather from a selection of distinctly different IAMs developed at institutes around the world. The emission and socioeconomic projections generated from these IAM scenarios are often not consistent with one another and could thus compromise any cross-scenario comparative analysis of climate policy benefits, climate-related impacts, transition risks, and sustainable development consequences. Second, global climate model outputs are prone to errors/biases that can be caused by a range of factors, including limited spatial resolution (large grid sizes), simplified thermodynamic processes and physics or incomplete understanding of the global climate system. The use of biased climate model outputs in impact models (i.e., crop models for agriculture, water resource models for hydrology) or local-scale climate impact assessment can often lead to unrealistic and distorted results. Third, the ensemble size of projections from the participating climate/Earth-system models are limited in providing quantitative insights on ‘risk’ or the probability of variables of interest.
We address these issues by developing a self-consistent, large ensemble, high-resolution, bias-corrected global dataset of future climates for a range of global emission pathways and/or climate targets. The dataset can be viewed as an update and commensurate to similar products based on earlier versions of CMIP4 and meets the data needs of studies on regional, risk-based climate change impacts across different applications, such as ecosystem5,6, agriculture7,8, water resource9,10,11, hydropower12, environment13, and sustainable development14. Our method combines plausible patterns of human-forced regional climate change with a comprehensive characterization of the global climate change response as determined by the MIT Integrated Global System Model (IGSM)15. IGSM consists of the linked MIT Earth System Model (MESM) of intermediate complexity and the Economic Projections and Policy Analysis model (EPPA) for analyzing human-Earth systems interactions15. The EPPA characterizes detailed economic activities to track inter-sectoral and inter-regional links, while the MESM represents key physical, chemical, and biological components of the Earth system that are impacted by human activity. Such integrated framework ensures consistent treatment of interactions among population growth, economic development, energy and land system changes and physical climate responses, which can provide improved assessments of climate impacts across multiple sectors16. The MESM contains a two-dimensional (zonally averaged, explicit representation of latitude and altitude) atmospheric model with interactive chemistry coupled to the zonally averaged version of Global Land System model and an anomaly-diffusing ocean model. This architecture allows for conducting a large ensemble of climate simulations for robust uncertainty analyses at significantly less computational cost than state-of-the-art climate models. In particular, the parameters that characterize climate sensitivity, the rate of ocean heat uptake, and the strength of aerosol forcing can be varied in MESM, a powerful feature to explore a range of uncertainty in the climate response. We first carry out a 50-member ensemble of MESM simulations (zonal) across a range of global emission pathways produced by EPPA. We then apply a combined spatial disaggregation (SD) – bias correction (BC) method. SD uses a pattern-scaling approach to achieve longitude representation at the desirable spatial resolution, while BC corrects the inherent biases in the MESM future climate projections.
Methods
Data acquisition
Various datasets used in this study and their characteristics are summarized in Table 1 and detailed as below.
Historical meteorology
We utilize the historical meteorological dataset from the third phase of the Global Soil Wetness Project (GSWP3, http://hydro.iis.u-tokyo.ac.jp/GSWP3/) as the baseline climate. It is a 3-hourly 0.5° global forcing product (1901–2014) based on the 20th Century Reanalysis version 217. The reanalysis was dynamically downscaled to 0.5° resolution based on the Global Spectral Model using a spectral nudging technique18. Bias corrections based on observations were made for temperature, precipitation, longwave radiation, and shortwave radiation using CRU TS v3.21 (Climate Research Unit)19, GPCCv7 (Global Precipitation Climatology Centre)20, and Surface Radiation Budget data sets, respectively. A wind-induced undercatch correction was also applied.
The meteorological variables considered in this study include precipitation, air temperature (mean, minimum and maximum), near-surface wind speed, shortwave and longwave radiation, specific humidity, and relative humidity. Minimum and maximum air temperature as well as relative humidity are not provided in the GSWP3 data, but derived as follows. Minimum and maximum air temperature are calculated as the monthly means of the minimum and maximum daily air temperature across the 8 3-hourly time steps, respectively. Relatively humidity is derived from surface air temperature (Tair), surface air pressure (Psf), and specific humidity (Qair) at a 3-hourly time step as in Eq. (1) and averaged into the monthly values.
Where e and es are vapor pressure and saturated vapor pressure, respectively. rh is the relative humidity in percent.
CMIP6 model simulations
The CMIP has become one of the foundational elements of climate science by distributing global climate model simulations of the past, current, and future climate. The CMIP6 includes more than 70 participating models of a wide variety developed at research institutes across the international scientific community. For the SD pattern-scaling procedure used in the construction of our large-ensemble future projections at the 0.5° resolution, we draw the simulations of the participating models from the 1pctCO2 experiment in which the concentration of atmospheric CO2 increases gradually at a rate of 1% per year from the global annual mean 1850 value until quadrupling. It serves as a consistent and useful benchmark for analyzing model transient climate response (TCR) to cumulative carbon emissions (TCRE). Our prior published work21 showed that similar response patterns are obtained for a particular climate model, regardless of the CMIP scenarios (i.e., 2xCO2 and RCPs), suggesting that the patterns are robust and represent a consistent forced response due to the human-induced radiative forcing. Multiple variations of model outputs may be available from a model family with different configurations (e.g., different spatial resolutions or components). In that case, only one sibling model is selected to represent the family. The precedence for selection is (highest to lowest): (1) all the key meteorological variables of our interest are archived, including precipitation, air temperature (mean, minimum and maximum), near-surface wind speed, shortwave and longwave radiation, specific humidity, and relative humidity; (2) an Earth System Model (ESM) is chosen whenever available; and (3) higher spatial resolution is desired. This resulted in 18 climate models that participated in the CMIP6 1pctCO2 experiment (Table 2). Their corresponding monthly outputs are then obtained for the pattern-scaling method (Section 3.2).
MESM future climate projections
We focus on four policy scenarios that were developed to span a range of possible global actions to abate greenhouse gas emissions over the coming century, specifically Reference (REF), Paris Forever (PF), Paris 2 °C (P2C), and Paris 1.5 °C (P1p5C)11. These policy scenarios are more aligned with the global emission pathways and/or climate targets established in the recent Paris Agreement. The REF scenario implements no explicit climate mitigation policy (no specific greenhouse gas emissions target) anywhere in the world for the sake of abating climate change, except for some energy policies such as fuel economy standards, renewable electricity requirements, the gradual phase-out of old coal power plants, reduced use of exhaustible resources, and conventional pollutants reduction. The REF serves as a baseline scenario and represents the upper bound of climate change risks. The PF scenario assumes that all Paris Agreement Nationally Determined Contributions (NDCs) are implemented through the year 2030 and maintained in perpetuity after that. While our PF scenario represents an unprecedented global commitment to limit greenhouse gas emissions, it neither stabilizes climate nor limits climate change. P2C and P1p5C scenarios extend from the Paris Agreement’s NDCs and align with its long-term goals. These two scenarios aim to limit and stabilize human-induced global climate warming to 2 °C and 1.5 °C, respectively, by the end of this century. Variations in mitigation policies lead to the uncertainty in patterns of resource and energy use, technology choices, and drag on overall economic growth, with additional uncertainty arising from the global climate response represented in the MESM4. These co-evolving uncertainties result in an overall probability of achieving the target at 50% for both P2C and P1p5C scenarios.
The most recent version of MESM has been extensively evaluated15. Its performance against observations at the global and zonal scales is comparable to global climate models from CMIP5. As such, MESM is an efficient model that can faithfully produce the global- and zonal-scale plausible climate responses to anthropogenic drivers. For each of four policy scenarios, we carry out a 50-member ensemble of MESM simulations (zonal) to account for the uncertainty in the climate system response to anthropogenic emissions of greenhouse gases. These ensemble simulations are conducted using different values of model parameters that characterize climate sensitivity, rate of heat and carbon uptake by the ocean, and strength of aerosol forcing. 50 sets of these parameters are generated by Latin Hypercube sampling from 3-dimensional parameter distribution, which is obtained by comparing with observations the changes in surface temperature and deep ocean heat content simulated by MESM forced by historical forcing22. We find the simulated surface air temperature distribution based on this 50-member ensemble replicates well that based on the 400-member ensemble used in the 2021 Global Change Outlook11. These four policy scenarios depict distinct pathways and distributions of global averaged changes in human forcing and climate variables (Figs. 1, 2). In comparison with the CMIP5 RCP scenarios, the MESM REF scenario presents weaker radiative forcing than its RCP counterpart (RCP8.5), while the more aggressive climate-based targets (P2C and P1p5C) fall between RCP4.5 and RCP2.6 scenarios. The PF scenario characterizes stronger radiative forcing than the RCP6.0 scenario. The impact of the aggressive mitigation policies on alleviating or eliminating the warming risk is immediately evident, with a majority of the distributions of global averaged annual surface-air temperature (SAT) trends in P2C and P1p5C scenarios falling outside that of the REF scenario in the latter half of the 21st century (Fig. 2). Despite a notable shift toward lower warming risk, considerable overlap persists till the end of the century between the SAT trends distributions of the PF scenario and those of the REF scenario.
SD-BC method
We employ an integrated SD-BC method to develop a long-term (2021–2100), large ensemble, high-resolution global dataset of future climate projections. SD uses a pattern-scaling approach to transform the MESM zonal climate to longitude representation at the spatial resolution consistent with that of observations. BC methods rely on the use of different statistical techniques to make climate model outputs more realistic and also achieve finer spatial resolution if high-resolution observations are available23,24. BC can be implemented in various ways, including delta change, quantile mapping, linear regression, variance scaling, etc. The delta change method is the simplest approach and involves applying a changing factor to the historical observation for constructing a new time series of the future climate4,24. A changing factor can be additive (difference) or multiplicative (ratio). Different changing factors can be employed for different climate variables. This method does not take into account changes in climate variability such as increasing extreme rainfall or longer dry spells. Linear regression performs a regression analysis using historical observations and climate model outputs during the same period and applies the regression parameters to construct bias-adjusted future climate time series. The regression can be made as simple or complex25. Quantile mapping generally uses a Gaussian or gamma distribution function to correct the distribution function of a climate variable and improve its fitting to observations26,27. Quantile mapping has become widely used because of its ability to correct bias at the extreme tails and its desired accuracy and robustness. However, the results may be sensitive to the choice of calibration period. All these BC methods are univariate which correct one variable at a time. Some multivariate BC (MBC) methods have been recently proposed to account for inter-variable, spatial or temporal dependencies of the observations28.
Here we use the delta change method as our BC method. Despite its simpleness, the delta change method is still one of the popular methods for climate model bias correction4. Recent studies29,30,31 have shown that the delta change method performs as well as or occasionally better than complex bias correction techniques (i.e., generalized additive models (GAMs), quantile mapping). The low computational demand of the delta method facilitates an efficient bias-correction of MESM future climate across a large ensemble and diverse emission scenarios over centuries. The basic principle behind the BC delta method involves adding changes (or deltas) in projected monthly climate to the high-resolution present-day climate (baseline). In our study, baseline climate is constructed to be free of any significant monotonic trends caused by human forcing, while changes (deltas) represent only the human-forced monotonic trends from MESM’s global climate response (without internal climate variability). The projected monthly climate from the MESM is at the coarse resolution (zonal) and need be spatially disaggregated through a pattern-scaling technique (described below) to match the high-resolution baseline climate before the addition is applied. The delta method assumes that changes in climates are only relevant at coarse scales but temporal variability of each variable from the baseline climate is maintained towards the future. This assumption may hold true in many cases, except for the highly heterogeneous landscapes where considerable variations may be induced within a relatively short distance. The bias-corrected climate projections resulting from the delta change method could be negative if future trends of meteorological variables (changes or deltas) are negative and larger than the corresponding detrended baseline values. In our procedure, we simply mask out any negative value (reassigned as zero) to ensure data validity.
The diagram for the SD-BC method is shown in Fig. 3 and comprises the following steps: (1) detrending monthly historical climate time series (GSWP3, 1931–2010) for each month (if trend is significant at 5%) at each grid-point; (2) adjusting detrended monthly values to those of year 2020 and mosaicking all the monthly values as a new 80-year baseline climate (free of any significant monotonic trend caused by human forcing) at 0.5° × 0.5°; (3) deriving pattern-scaling kernels from GSWP3 and pattern-change kernels from CMIP6 1pctCO2 simulations at 0.5° × 0.5° for each climatological month and each CMIP6 model; (4) applying pattern-scaling to a 50-member ensemble of MESM zonal historical and future monthly climate across 18 climate models (Table 2) following Eq. (2)21 (described in pattern-scaling method) to derive a 900-member ensemble of monthly changes (deltas) at 0.5° × 0.5° from 2021 to 2100; and (5) adding monthly changes (deltas) from 2021 to 2100 in step 4) to the 80-year monthly baseline climate in step 2 to obtain a 900-member ensemble of global monthly climate at 0.5° × 0.5° from 2021–2100. The procedure is implemented for each meteorological variable and each emission scenario. Details for each step are elaborated below.
Detrended baseline climate
The essence of our SD-BC method is to obtain a baseline climate that is free of any significant monotonic trends (caused by human forcing) while preserving the variance/variability around the trend. We perform significance test for the linear trend of 80-year (1931–2010) GSWP3 monthly time series for each month at each grid-point. If a linear trend is found statistically significant at the 95% confidence level, the new monthly time series are derived by first detrending the original monthly time series and then adjusting (adding) the detrended monthly values to the corresponding monthly values of year 2020 to emulate an 80-year (2021–2100) climate without climate change. The monthly values of year 2020 are obtained based on the linear trend. If a linear trend is found not statistically significant, the new monthly time series will simply take the original monthly time series. The derived new monthly time series of all the months are then merged, which is referred to as “detrended baseline climate” in the following steps. For precipitation, we apply the trend significance test to the logarithm of monthly precipitation to account for the log-normal nature of precipitation32 with the underlying assumption of normality in the trend significance test. A linear relationship is examined only if at least half of the total years (80) have the pre-transformed monthly precipitation larger than 1 mm/day. This condition will prevent any artificial linear trend as a result of very low precipitation amount in very dry areas (e.g., Sahara Desert), which could lead to unrealistically high values in the detrended precipitation time series. Detrending precipitation (if a linear trend is significant) is only applied to the years that involve in constructing the linear relationship. Other years will simply take the original monthly values. Selective grid-by-grid detrending may introduce some level of spatial inconsistencies. We have tested the uniform grid-by-grid detrending (regardless of whether trends are determined to be significant or not) and found that it could lead to spurious, unrealistic extreme values (high and low). These artifacts arise mainly from imposing a linear relationship on an insignificant-trend time series and we thus must apply arbitrary thresholds to ensure that a reasonable range is maintained for each variable. The mosaicked detrended baseline is shown to not present salient spatial inconsistency but capture the expected large-scale climate features across various regions of the globe.
Pattern-scaling method
In climate-change scenario assessments and impact studies, the use of various pattern-scaling methods is extensive33,34,35,36,37,38. For the 50-member ensemble of the MESM zonal climate projections, we adopt a similar method to expand across longitudes using pattern-scaling specifically tailored to the MESM zonal configuration to provide regional representation21. The inception of this method for use with MESM and IGSM is described in21. The procedure is analytically expressed by the following equations:
Where the overbar represents the average value during reference periods (i.e., t0, t1, and t2) or a moving average period. \({\bar{C}}_{x,y}\) is climatological pattern-scaling coefficient and calculated as the ratio of monthly climatology at each grid-point to monthly climatology of the corresponding latitude mean \(\left({\bar{V}}_{y}\right)\). \({\bar{C}}_{x,y}^{{t}_{0}}\) and \({\bar{V}}_{y}^{{t}_{0}}\) are derived from the 20-year (1991–2010, t0) GSWP3 data. \({\bar{V}}_{x,y}^{{t}_{0}}\) is the downscaled monthly climatology at a grid point (x,y) during the reference period t0. The choice of t1 and t2 is somewhat arbitrary but should span a sufficient amount of time such that a climate response (if any) has evolved as a result of the trends in the trace gas forcing. For the scenario considered herein, we chose the interval between t1 and t2 to span the number of years at which a doubling of CO2 at a transient rate of 1% yr−1 has been achieved, equivalent to 70 years. \({\bar{C}}_{x,y}^{{t}_{1}}\) \(\left({\bar{T}}_{Global}^{{t}_{1}}\right)\) and \({\bar{C}}_{x,y}^{{t}_{2}}\) \(\left({\bar{T}}_{Global}^{{t}_{2}}\right)\) are derived from the first 10 years and 71th~80th years 1pctCO2 simulations, respectively. \(d{\bar{C}}_{x,y}/d{\bar{T}}_{G}\), namely “pattern-change kernels” (PCKs), describes the shifts in \({\bar{C}}_{x,y}\) induced by human-forced climate warming. The implicit assumption here is that PCKs remain constant over the 70-year period interval. PCKs are constructed from the list of the 18 CMIP6 climate models (Table 2), with each climate model regridded to a 0.5° × 0.5° grid via area averaging (conservative regridding procedure) prior to PCKs construction. \(\Delta {\bar{T}}_{G}\) is calculated as the difference between 10-year backward moving average of monthly global mean temperature from 2021 to 2100 and its corresponding 2012–2021 mean. \({\bar{V}}_{y}\) represents the monthly MESM zonal future climate from 2021 to 2100 and is calculated as the 20-year backward moving average of a particular month. For example, January 2021 is calculated as the average of January from 2002 to 2021. The 20-year moving average is intended to capture only the human-forced monotonic trend from MESM’s global climate response without introducing the internal climate variability to the baseline climate. \({V}_{x,y}\) represents the downscaled monthly future climate at the grid point (x,y) from 2021 to 2100. The monthly change (or “delta”) is the difference between \({V}_{x,y}\) and \({\bar{V}}_{x,y}^{{t}_{0}}\), which is further smoothed using a 21-year running average centered on each year for that month (the first and last 10 years have fewer samples for smoothing). We combine each of the 18 PCKs \(\left(\frac{d{\bar{C}}_{x,y}}{d{\bar{T}}_{G}}\right)\) with the 50-member MESM zonal climate projections \(\left({\bar{V}}_{y}\right)\) via the Eq. (2) to develop a 900-member ensemble per policy scenario. This meta-ensemble per scenario represents a comprehensive range of plausible outcomes resulting from the uncertainty in climate system response to anthropogenic emissions of greenhouse gases (50-member ensemble) and uncertainty in geographical patterns of climate change (PCKs).
Data Records
Our global dataset comprises a 900-member ensemble of 9 climate variables (precipitation, mean, minimum and maximum air temperature, near-surface wind speed, shortwave and longwave radiation, specific humidity, and relative humidity) at 0.5° on a monthly time scale from 2021 to 2100 for four policy scenarios. Each ensemble member is packed in a single file of the self-describing netCDF format, a broadly accepted good-practice standard utilized in the weather and climate science research communities. The complete dataset is ~60 TB in size. All data are freely available at the repository of World Data Center for Climate (WDCC) at DKRZ39.
Technical Validation
Below we provide an evaluation of this dataset that demonstrates the features of the detrended baseline climate, the emerging patterns extracted from the CMIP6 models, and the resultant regional distributions of climate changes across various scenarios. We adopt 21 regions40 to evaluate the dataset’s regional-scale performances. The detailed information of 21 regions is listed in Table 3. These regions represent different climatic regimes and physiographic settings, and are defined to approximately cover all land areas in the World with the size of each region varying in the range of a few thousand to several thousand km in each direction40. The evaluation is conducted on the regional averages of seasonal means across the entire period for each variable. The relevant ensemble statistics (minimum, median, and maximum) are derived based on the regional averages of seasonal means of each ensemble member. The evaluation is performed for all the meteorological variables and all the seasons [December-February (DJF), March-May (MAM), June-August (JJA), and September-November (NOV)] over the 21 regions from 1931 to 2010 (the detrended baseline) and from 2021 to 2100 (future climate) for four emission scenarios, respectively. Only precipitation and surface air temperature in DJF and JJA are included for demonstration purposes. Only land grid points are used, except for the global maps shown in Fig. 20.
Seasonal PCKs
Seasonal PCKs of temperature and precipitation show a wide range of climate model response patterns as a result of forced climate warming (Figs. 4–7). The differences are reflected in the sign, magnitude, location, and extent of the changes in response. For temperature (Figs. 4, 5), the most striking feature is a colder ocean and warmer land (COWL) global pattern41 seen in both seasons across all the models, with the exceptions to this characterization found in different regions across various models. The model mean PCKs show that the relative cooling signal in land areas lies mostly over the northwesternmost regions of North America, western Europe, northern Siberia and parts of the southern Russia in DJF (Fig. 5). The maritime fetch of the relatively cooler ocean conditions may play a large role in the afore-mentioned relative cooling over the coastal regions. Also evident is the stronger warming over the high Northern Hemisphere latitudes in DJF than in JJA. High inter-model scatter in PCKs is ubiquitously observed in the boreal region of the Northern Hemisphere, particularly in DJF. In JJA, South Africa and interior portions of South America also present moderate inter-model scatter. The lowest model scatter generally occurs over much of the world’s oceans, particularly in the subtropics and with no evident seasonality. For precipitation (Figs. 6, 7), the model mean shows a wide extent of drier conditions attributed to climate warming, including a swath extending from central subtropical North Pacific to the North Atlantic and a path extending from the South Pacific basin to the South Atlantic and the Indian Ocean. The drying feature in these regions persists in both seasons, with the areas of prominent drying varying between two seasons. The drying is particularly enhanced over central subtropical North Pacific and the North Atlantic in DJF, but it is more prominent over the central America, Gulf of Mexico, Caribbean Sea and islands, east Indian Ocean and Indonesia islands in JJA. Over land areas, the most prominent drying occurs over Mexico, northern South America, northern African, and parts of the Middle East in DJF, but is largely confined to the Continental United States, Amazon, Southern Africa, Australia, the European continent, and Western Russia in JJA. The wetter conditions in both seasons over land areas are largely associated with the monsoon regions, including South and Southeast Asia, Africa, Australia, South and North America. High model scatter for precipitation is largely confined to the world’s oceans, particularly in DJF, including the subtropical Pacific, the western boundary of the North Atlantic, the tropical Atlantic, and the Indian Ocean. Over land, the Indian and Southeast Asian monsoon regions present the largest model scatter. Other notable areas with moderate model scatter include the eastern half of North America, most of Central America, South America, parts of Africa, Australia, Europe, and Western Russia. The lowest inter-model deviations occur in a large portion of Eurasia, Central North America, and Northern Africa in DJF, but are confined mostly in Northeast Brazil, Africa, and the Middle East in JJA.
Detrended baseline climate
Figures 8–11 show the comparisons of precipitation and surface air temperature between GSWP3 and detrended baseline from 1931 to 2010 over the 21 regions. Overall, both JJA and DJF GSWP3 precipitation does not show evident trends over all the regions, except for the JJA precipitation in WAF and EAF (Figs. 8, 9). We see that the corresponding detrended baseline precipitation eliminates or reduces the trends in these two regions. Detrending is more clearly observed in surface air temperature (Figs. 10, 11). As expected, GSWP3 surface air temperature exhibits strong upward trends over a majority of regions for both seasons. Some exceptions are observed for SSA, North America (WNA, CNA, ENA), GRL, and NEU in both seasons, for Asia (except for CAS) and TIB in JJA, as well as for WAF, SAH, and NAS in DJF. GSWP3 surface air temperature in DJF presents overall weaker upward trends than in JJA, except for EAS, SAS, and TIB. Regions in the same continent may display very different patterns. For example, CAS is only one out of four regions in Asia showing a strong upward trend in JJA temperature. Overall, the detrended baseline climate is seen to eliminate or reduce the trends inherent in the GSWP dataset to varying degrees, regardless of the meteorological variables, seasons, and regions.
Future climate
Figures 12–15 show the ensemble statistics (minimum, median, and maximum) of precipitation and surface air temperature in both seasons from 2021 to 2100 for the PF scenario, along with their detrended baseline counterparts. Ensemble medians of precipitation are largely aligned with the detrended baseline over most regions (Figs. 12, 13), with some exceptions in the high latitudes (ALA, GRL, NEU, and NAS) and dry regions (EAF, SAH, and SAS in JJA as well as WNA, CNA, and ENA, EAS, SAS, and TIB in DJF). Ensemble uncertainty (spread) over all the regions is small at the beginning of the period and becomes increasingly larger by the end of the century. The high latitude and dry regions are typically characterized by the small ensemble uncertainty in both seasons. The large ensemble uncertainty is generally found around the tropics, such as CAM (3.66 mm/day) and SEA (2.87 mm/day) in JJA as well as SEA (1.54 mm/day) and AMZ (1.18 mm/day) in DJF, consistent with what was found in previous studies42,43,44,45. All the ensemble members of surface air temperature exhibit consistently higher values than the detrended baseline throughout the entire period in both seasons over all the regions (Figs. 14, 15). The tropical regions present an overall small ensemble uncertainty in both seasons. The ensemble spread of JJA air temperature varies from 1.13 °C in the SEA to 3.16 °C in the CNA by 2100. TIB is the only other region with the ensemble uncertainty of JJA air temperature exceeding 3 °C (3.02 °C). DJF air temperature features large ensemble uncertainty in the high latitudes, particularly in the GRL (4.7 °C) and ALA (4.9 °C) regions, and captures the influence of the heterogeneity in the representations of snow and cold land processes by various Earth-system models46,47. These patterns of temperature uncertainty agree with what was shown in the CMIP6 models45.
Figures 16–19 show the ensemble medians of precipitation and surface air temperature in both seasons from 2021 to 2100 for all the four scenarios, along with their detrended baseline counterparts. The differences in precipitation of both seasons (Figs. 16, 17) attributed to the scenarios are much smaller than those attributed to the ensembles of the specific scenario (e.g., PF in Figs. 12, 13). By 2100, the spread of JJA precipitation across four scenarios (the difference between REF and P1p5C) is less than 0.2 mm/day over most of the regions (Fig. 16) and the spread in DJF precipitation is even smaller (Fig. 17). The dominance of model uncertainty over scenario uncertainty in regional precipitation projection uncertainty was also reported in previous studies45,48. Regardless of the scenario, precipitation of both seasons does not exhibit apparent drying or wetting trend over all the regions. In contrast, climate policy (P1p5C) effectively reduces surface air temperature, with the reduction ranging from 1.1 °C to 1.9 °C in JJA (Fig. 18) and from 1.0 °C to 2.9 °C in DJF (Fig. 19). The strongest reduction in JJA air temperature occurs mostly in the mid latitudes (e.g., CNA, CAS, MED, TIB) and SAH desert, while that in DJF is found mostly in high latitudes (2.9 °C in ALA, 2.5 °C in GRL, 2.4 °C in NAS, 1.9 °C in ENA). Many studies documented “Arctic amplification”, a phenomenon that the warming in the Arctic far outpaces the global average49,50, particularly in winter51. Climate mitigation efforts could thus have significant implications on land ice loss, wildlife and human livelihoods, and methane emissions in these high latitude regions.
Figure 20 presents the ensemble medians of precipitation and surface air temperature in both seasons in 2090 for the PF scenario, along with the differences from its corresponding detrended baseline counterparts. The ensemble median precipitation in both seasons indicate that maxima occur in the Tropics, specifically the intertropical convergence zone (ITCZ) in the Atlantic, Pacific, and Indian Oceans, the South Pacific convergence zone (SPCZ), as well as over tropical Africa and South America (Fig. 20a,b). This band of heavy rain moves north and south of the Equator seasonally. Major precipitation peaks in tropical regions are below the equator in DJF but are located further north in JJA. JJA is the season that is strongly affected by the Asian monsoon which brings heavy rain to China, southeast Asia, and India. In midlatitudes, the storm tracks in the Northern Hemisphere (NH) oceans are much stronger in DJF than in JJA, while the circumpolar storm tracks in the Southern Hemisphere (SH) are weaker in DJF than in JJA. DJF is also characterized by a secondary precipitation maximum along the northwest coast of North America from Alaska to California at the eastern end of the Pacific Ocean storm track. Except for the Maritime Continent, a majority of land areas are characterized by dry condition in both seasons with moderate precipitation found in Eastern North America, northern Europe, northeastern and southeastern China in JJA. These characteristics confirm the well-established behaviors from climate models in these regions.
Many regional characteristics of wetting and drying in the ensemble median precipitation changes are shared by both seasons and consistent with theoretical expectations and/or findings revealed in previous studies52 (Fig. 20c,d). The strongest positive changes are found in the equatorial Pacific with many regions being associated with Monsoon precipitation and they shift slightly between north and south seasonally. Extensive but weaker positive changes are also observed in the high latitudes (lands and oceans) of both hemispheres. A strong drying signal is projected over the central America, the Amazon, the Mediterranean, and Indonesia islands together with the Atlantic, Indian, and South Pacific Oceans. The most part of the United States (US) is projected to become wetter in DJF, particularly in the Eastern US and the west coast, while a large portion of the US will experience precipitation decreases in JJA, including the northern part of the country and the South-Central US. In-between lies a moderate wetting zone extending from the west to the east.
The ensemble medians of JJA and DJF surface air temperature (Fig. 20e,f) demonstrate the well-established temperature gradient from the equator to the high latitudes of both hemispheres. The JJA temperature maxima are located in the Sahara Desert, the Middle East, the Mojave Desert, and the Sonoran Desert in northwestern Mexico, while the temperature minima are confined to the Greenland and Antarctic. DJF temperature maxima are found in the Australian Deserts, while the minima are concentrated in the high latitude lands (North of 40°N) and Antarctic. The ensemble median air temperature is projected to increase over most of the globe relative to the detrended baseline, but the magnitude of increase varies regionally and seasonally (Fig. 20g,h). The immediately evident are the strongest JJA warming in the Antarctic Peninsula and the strongest DJF warming in the northern high latitudes (in particular the Arctic regions). Previous studies reported that the Arctic regions experience warming at twice the pace of the global average49,50 and unprecedented warming over the Antarctic Peninsula53. The general pattern of temperature increase indicates that warming is stronger over land than over oceans. The limited cooling signals are found over the northern Atlantic in both seasons as well as over the Arctic Ocean in JJA. Previous studies linked the cooling in the northern Atlantic to the slowing of the Atlantic Meridional Overturning circulation and the resulting slower northward surface-heat transport54,55. These pattern characteristics are consistent with what was shown in the CMIP6 models52.
In general, the evaluation suggests that the dataset is able to capture broadly the well-observed climate features across various regions of the globe and the results are consistent with what previous studies reported. This meta-ensemble, high-resolution, bias-corrected global dataset of long-term future climate can be used for meeting various needs associated with climate impact assessments, including uncertainty analyses, risk quantification, climate policy mitigation analyses, and driving climate impact models which require monthly data inputs, on both global and regional scales. Some caveats do exist, for example, the dataset cannot adequately represent the frequency and intensity of various extreme events (i.e., flooding, heatwave, or droughts) and users should thus take caution to use the data in this regard.
Code availability
All the relevant codes to produce the global dataset of future climates are publicly available at the GitHub repository (https://github.com/mit-jp/SD-BC). The pattern-change kernels” (PCKs) for all the CMIP6 models are created using Grid Analysis and Display System (GrADS). The SD-BC procedure is carried out using FORTRAN 90 programming language.
References
Eyring, V. et al. Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev. 9, 1937–1958 (2016).
O’Neill, B. C. et al. The Scenario Model Intercomparison Project (ScenarioMIP) for CMIP6. Geosci. Model Dev. 9, 3461–3482 (2016).
Riahi, K. et al. The Shared Socioeconomic Pathways and their energy, land use, and greenhouse gas emissions implications: An overview. Global Environ. Chang. 42, 153–168 (2017).
Navarro-Racines, C., Tarapues, J., Thornton, P., Jarvis, A. & Ramirez-Villegas, J. High-resolution and bias-corrected CMIP5 projections for climate change impact assessments. Sci. Data 7, 7 (2020).
Guo, D., Arnolds, J. L., Midgley, G. F. & Foden, W. B. Conservation of quiver trees in Namibia and South Africa under a changing climate. J. Geosci. Environ. Prot. 04, 1–8 (2016).
Jennings, M. D. & Harris, G. M. Climate change and ecosystem composition across large landscapes. Landsc. Ecol. 32, 195–207 (2017).
Ramirez-Villegas, J. & Thornton, P. K. Climate change impacts on African crop production. CCAFS Working Paper 119 (2015).
Thornton, P. K., Boone, R. B. & Ramirez-Villegas, J. Climate Change Impacts on Livestock. CCAFS Working Paper 120 (2015).
Boehlert, B. et al. Climate change impacts and greenhouse gas mitigation effects on U.S. water quality. J. Adv. Model. Earth Syst. 7, 1326–1338 (2015).
Gao, X., Schlosser, C. A., Fant, C. & Strzepek, K. The impact of climate change policy on the risk of water stress in southern and eastern Asia. Environ. Res. Lett. 13, 064039 (2018).
Paltsev, S. et al. 2021 Global Change Outlook: charting the Earth’s future energy, managed resources, climate, and policy prospects. MIT Joint Program on the Science and Policy of Global Change https://globalchange.mit.edu/publications/signature/2021-global-change-outlook (2021).
Boehlert, B. et al. Climate change impacts and greenhouse gas mitigation effects on U.S. hydropower generation. Appl. Energy 183, 1511–1519 (2016).
Messina, J. P. et al. Mapping global environmental suitability for Zika virus. Elife 5, e15272 (2016).
Arndt, C. et al. Climate change and developing country growth: the cases of Malawi, Mozambique and Zambia. Clim. Change 154, 335–349 (2019).
Sokolov, A. et al. Description and Evaluation of the MIT Earth System Model (MESM). Adv. Model. Earth Syst. 10, 1759–1789 (2018).
Monier, E. et al. Toward a consistent modeling framework to assess multi-sectoral climate impacts. Nat. Commun. 9, 660 (2018).
Compo, G. P. et al. The Twentieth Century Reanalysis project. Q. J. R. Meteorol. Soc. 137, 1–28 (2011).
Yoshimura, K. & Kanamitsu, M. Dynamical global downscaling of global reanalysis. Mon. Weather Rev. 136, 2983–2998 (2008).
Jones, P. & Harris, I. CRU TS3. 21: Climatic Research Unit (CRU) Time-Series (TS) version 3.21 of high resolution gridded data of month-by-month variation in climate (Jan. 1901–Dec. 2012). (NCAS British Atmospheric Data Centre, 2013).
Schneider, U. et al. GPCC’s new land surface precipitation climatology based on quality-controlled in situ data and its role in quantifying the global water cycle. Theor. Appl. Climatol. 115, 15–40 (2014).
Schlosser, C. A. et al. Quantifying the likelihood of regional climate change: a hybridized approach. J. Clim. 26, 3394–3414 (2013).
Libardoni, A. G., Forest, C. E., Sokolov, A. P. & Monier, E. Estimates of climate system properties incorporating recent climate change. Adv. Stat. Climatol 4, 19–36 (2018).
Ehret, U., Zehe, E., Wulfmeyer, V., Warrach Sagi, K. & Liebert, J. Should we apply bias correction to global and regional climate model data? Hydrol. Earth Syst. Sci. Discuss. 9, 5355–5387 (2012).
Hawkins, E., Osborne, T. M., Ho, C. K. & Challinor, A. J. Calibration and bias correction of climate projections for crop modelling: an idealised case study over. Europe. Agric. For. Meteorol. 170, 19–31 (2013).
Belitz, K. & Stackelberg, P. E. Evaluation of six methods for correcting bias in estimates from ensemble tree machine learning regression models. Environ. Model Softw. 139, 105006 (2021).
Teutschbein, C. & Seibert, J. Bias correction of regional climate model simulations for hydrological climate-change impact studies: review and evaluation of different methods. J. Hydrol. 456, 12–29 (2012).
Themeßl, J. M., Gobiet, A. & Leuprecht, A. Empirical-statistical downscaling and error correction of daily precipitation from regional climate models. Int. J. Climatol. 31, 1530–1544 (2011).
François, B., Vrac, M., Cannon, A. J., Robin, Y. & Allard, D. Multivariate bias corrections of climate simulations: which benefits for which losses? Earth Syst. Dynam. 11, 537–562 (2020).
Beyer, R., Krapp, M. & Manica, A. An empirical evaluation of bias correction methods for palaeoclimate simulations. Clim. Past 16, 1493–1508 (2020).
Miralha, L. et al. Bias correction of climate model outputs influences watershed model nutrient load predictions. Sci. Total Environ. 759, 143039 (2021).
Sundaram, G. & Radhakrishnan, S. Performance evaluation of bias correction methods and projection of future precipitation changes using regional climate model over Thanjavur, Tamil Nadu, India. KSCE J. Civ. Eng. 27, 878–889 (2023).
Kedem, B. & Chiu, L. S. On the lognormality of rain rate. Proc. Natl. Acad. Sci. 84, 901–905 (1987).
Santer, B. D., Wigley, T. M. L., Schlesinger, M. E. & Mitchell, J. F. B. Developing climate scenarios from equilibrium GCM results. Tech. Rep. Max–Planck-Institut für Meteorologie, Hamburg, Germany (1990).
Wigley, T. M. L., Raper, S. C. B., Smith, S. & Hulme, M. The MAGICC/SCENGEN Climate Scenario Generator: Version 2.4. Technical Manual CRU, UEA, Norwich, U.K (2000).
Mitchell, T. D. Pattern scaling - an examination of the accuracy of the technique for describing future climates. Clim. Change 60(3), 217–242 (2003).
Frieler, K., Meinshausen, M., Mengel, M., Braun, N. & Hare, W. A scaling approach to probabilistic assessment of regional climate change. J. Clim. 25(9), 3117–3144 (2012).
Lopez, A., Suckling, E. B. & Smith, L. A. Robustness of pattern scaled climate change scenarios for adaptation decision support. Clim. Change 122(4), 555–566 (2013).
Herger, N., Sanderson, B. M. & Knutti, R. Improved pattern scaling approaches for the use in climate impact studies. Geophys. Res. Lett. 42, 3486–3494 (2015).
Gao, X., Sokolov, A. & Schlosser, A. MITJP-MSD: Large-ensemble 21st century monthly hydro-climatological forcing dataset. World Data Center for Climate (WDCC) at DKRZ. https://doi.org/10.26050/WDCC/MITJP-MSD_LE (2023).
Giorgi, F. & Francisco, R. Uncertainties in regional climate change prediction: a regional analysis of ensemble simulations with the HADCM2 coupled AOGCM. Clim. Dyn. 16, 169–182 (2000).
Broccoli, A. J., Lau, N.-C. & Nath, M. J. The cold ocean– warm land pattern: model simulation and relevance to climate change detection. J. Clim. 11, 2743–2763 (1998).
Kharin, V. V., Zwiers, F. W., Zhang, X. & Wehner, M. Changes in temperature and precipitation extremes in the CMIP5 ensemble. Clim. Change 119, 345–357 (2013).
O’Gorman, P. A. Sensitivity of tropical precipitation extremes to climate change. Nat. Geosci. 5, 697–700 (2012).
Sillmann, J., Kharin, V. V., Zwiers, F. W., Zhang, X. & Bronaugh, D. Climate extremes indices in the CMIP5 multimodel ensemble: part 2. future climate projections. J. Geophys. Res. Atmos. 118, 2473–2493 (2013).
Wu, Y. et al. Quantifying the uncertainty sources of future climate projections and narrowing uncertainties with bias correction techniques. Earth’s Future 10, e2022EF002963 (2022).
Slater, A. J. et al. The representation of snow in land-surface schemes: results from PILPS 2(d). J. Hydromet. 2, 7–25 (2001).
Luo, L. et al. Effects of frozen soil on soil temperature, spring infiltration, and runoff: results from the PILPS 2(d) experiment at Valdai, Russia. J. Hydromet. 5, 334–351 (2004).
Hawkins, E. & Sutton, R. The potential to narrow uncertainty in projections of regional precipitation change. Clim Dyn 37, 407–418 (2011).
Post, E. et al. The polar regions in a 2 °C warmer world. Sci. Adv. 5, eaaw988 (2019).
Rantanen, M. et al. The Arctic has warmed nearly four times faster than the globe since 1979. Commun. Earth Environ. 3, 168 (2022).
Bintanja, R. & Krikken, F. Magnitude and pattern of Arctic warming governed by the seasonality of radiative forcing. Sci Rep 6, 38287 (2016).
Tebaldi, C. et al. Climate model projections from the Scenario Model Intercomparison Project (ScenarioMIP) of CMIP6. Earth Syst. Dynam. 12, 253–293 (2021).
González-Herrero, S., Barriopedro, D., Trigo, R. M., López-Bustins, J. A. & Oliva, M. Climate warming amplified the 2020 record-breaking heatwave in the Antarctic Peninsula. Commun. Earth Environ. 3, 122 (2022).
Caesar, L., Rahmstorf, S., Robinson, A., Feulner, G. & Saba, V. Observed fingerprint of a weakening Atlantic Ocean overturning circulation. Nature 556, 191–196 (2018).
Keil, P., Mauritsen, T., Jungclaus, J. & Hedemann, C. Olon- scheck, D. & Gosh, R. Multiple drivers of the North Atlantic warming hole. Nat. Clim. Change 10, 667–671 (2020).
Acknowledgements
This work was supported by the U.S. Department of Energy, Office of Science, under Award Number DE-FOA-0001968 and other government, industry, and foundation sponsors of the MIT Joint Program on the Science and Policy of Global Change. For a complete list of sponsors, see http://globalchange.mit.edu/sponsors/. We acknowledge the modeling groups, the Program for Climate Model Diagnosis and Intercomparison (PCMDI), and the WCRP’s Working Group on Coupled Modeling (WGCM) to make available the WCRP CMIP6 model dataset. We also thank the provider for the Global Soil Wetness Project Phase 3 (GSWP3). We thank Dr. Eileen Hertwig and Heinke Hoeck for their help to host the data at WDCC and thank the WDCC for hosting our dataset.
Author information
Authors and Affiliations
Contributions
X. Gao drafted the manuscript, produced the datasets and performed the technical validation. A. Sokolov produced a 50-ensemble of MESM zonal climate from 1861 to 2100 for four policy scenarios and performed the relevant analyses of different scenarios as compared to their CMIP5 counterparts. A. Schlosser aided with the data production method and performed the quality check of the produced dataset. All authors have read and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gao, X., Sokolov, A. & Schlosser, C.A. A Large Ensemble Global Dataset for Climate Impact Assessments. Sci Data 10, 801 (2023). https://doi.org/10.1038/s41597-023-02708-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02708-9