A 30-year dataset of CO2 in flowing freshwaters in the United States

Toavs, Timothy R.; Hasler, Caleb T.; Suski, Cory D.; Midway, Stephen R.

doi:10.1038/s41597-022-01915-0

Data Descriptor
Open access
Published: 11 January 2023

Timothy R. Toavs¹,
Caleb T. Hasler²,
Cory D. Suski³ &
…
Stephen R. Midway ORCID: orcid.org/0000-0003-0162-1995¹

Scientific Data volume 10, Article number: 20 (2023)

Abstract

Increasing atmospheric carbon dioxide (CO₂) concentrations have been linked to effects in a wide range of ecosystems and organisms, with negative effects of elevated CO₂ documented for marine organisms. Less is known about the dynamics of CO₂ in freshwaters, but the potential exists for freshwater organisms to be challenged by elevated CO₂. In flowing freshwaters CO₂ exhibits more variability than in lakes or the ocean, yet spatiotemporally extensive direct measures of CO₂ in freshwater are rare. However, CO₂ can be estimated from pH, temperature, and alkalinity, which are commonly collected water quality metrics. We used data from the National Water Quality Monitoring Council along with the program PHREEQC to estimate CO₂ in flowing freshwaters across more than 35,000 sites spanning the lower 48 US states from 1990 through 2020. Site data for water chemistry measurements were spatially joined with the National Hydrography Dataset. Our resulting dataset, CDFLOW, presents an opportunity for researchers to add CO₂ to their datasets for further investigation.

Background & Summary

Climate change caused by anthropogenically produced carbon dioxide (CO₂) is an issue that poses challenges around the world, including within marine and freshwater ecosystems. CO₂ concentrations in the atmosphere have been steadily increasing since the mid-nineteenth century with a total increase of around 40%¹ in that time. While CO₂ concentrations in the atmosphere have fluctuated throughout time, the rate of increase recorded since the 1850s is greater than any rate of increase that has occurred in the last million years². As CO₂ in the atmosphere rises, dissolution of CO₂ into the ocean increases, thus interacting with the ocean carbonate system and ultimately leading to a decrease in ocean pH and a decrease in surface calcium carbonate (CaCO₃) concentrations, a process known as ocean acidification³. Dissolved CO₂ in marine and freshwater environments is measured as the partial pressure of CO₂ (pCO₂)⁴. This rise in pCO₂ has been shown to affect a wide range of ecosystems and organisms, with negative effects of elevated pCO₂ documented for marine and freshwater organisms. More specifically, ocean acidification caused by increasing atmospheric CO₂ has been shown to alter fish behaviour and physiology⁵ and affect planktonic primary producers^6,7. Outcomes of the effects are difficult to predict due to the variability across taxa. However, possible outcomes include reduced fish populations^5,8 and declines in ocean primary productivity⁹. While the effects of elevated pCO₂ in marine environments are well documented, less is known about the dynamics of pCO₂ in freshwaters, but the potential exists for freshwater organisms to be challenged¹⁰.

While less is known about pCO₂ dynamics in freshwater, some general characteristics and processes have been documented. Flowing freshwaters have many different potential sources of pCO₂ and show high variability from one water body to another. Cole et al.¹¹ showed that pCO₂ in North American lakes was rarely at equilibrium with CO₂ in the atmosphere and found a range of concentration differences from 175 times lower pCO₂ than atmospheric CO₂ to 57 times greater. Flowing freshwaters show more variability, are typically supersaturated compared to the atmosphere, and have even been identified as sources of atmospheric CO₂¹². Butman and Raymond¹³ verified supersaturation in US flowing freshwaters and found a relationship between pCO₂ and stream order, suggesting a proportional relationship between stream size and pCO₂. Typically, pCO₂ in flowing freshwater is influenced by the water source of the system coupled with characteristics of that system, including surrounding geologic conditions, pCO₂ residence time, and the gas transfer velocity¹⁴. Other contributing factors to pCO₂ include (but are not limited to) the balance between photosynthetic and respiration rates¹⁵ and terrestrial respiration¹². No matter the source, flowing freshwaters display high variability in pCO₂. Although not much is known about the potential impacts on freshwater organisms and ecosystems, understanding spatiotemporal trends in pCO₂ is a necessary step toward identifying those impacts.

Considering the high variability displayed in flowing freshwaters, a large spatiotemporal dataset is needed for understanding patterns and trends. Direct measures of pCO₂ in flowing freshwaters are extremely limited, making it challenging to define spatial or temporal pCO₂ trends. However, pCO₂ can be estimated from a combination of commonly collected water quality metrics (pH, temperature, and alkalinity), and this has been done numerous times throughout the literature^13,16,17,18. Our dataset (referred to as CDFLOW)¹⁹ fills the need for a large spatiotemporal dataset using pH, temperature, and alkalinity measurements from across the lower 48 United States (CONUS) from 1990 through 2020. To our knowledge, CDFLOW¹⁹ is the largest publicly available pCO₂ database, with over 750,000 pCO₂ estimates coming from over 35,000 sites. CDFLOW¹⁹ is also integrated with the National Hydrography Dataset (NHD)²⁰, allowing for the addition of other environmental and geospatial variables²¹ and easing incorporation with other databases related to the NHD. CDFLOW¹⁹ provides an opportunity for spatiotemporal analysis of pCO₂ across the CONUS and the possibility of adding pCO₂ data to other researchers’ data.

Methods

Data query

Water quality measurements and their respective site data (see below for site definition) were queried separately for each of the 48 CONUS states from the Water Quality Data Portal²² using the following filters:

Country = “United States of America”
Site Type = “Stream”
Date Range from = “01-01-1990”
Date Range to = “12-31-2020”
Sample media = “Water”
Characteristics = “Alkalinity, total”, “Alkalinity”, “pH”, and “Temperature, water”

The “Alkalinity, total” and “Alkalinity” characteristic parameters are equivalent measurements; the two labels reflect the different names used by the respective reporting agencies. For each state, the separately queried measurement and site data were merged using a shared variable called “MonitoringLocationIdentifier”. The data queries and subsequent data merges resulted in 48 water quality measurement datasets with matching site data, one for each state within the CONUS.
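The query and merge steps can be sketched in Python as follows. This is an illustration only, not the authors' R code: the endpoint path and the query parameter names (`statecode`, `siteType`, `sampleMedia`, `characteristicName`, `startDateLo`, `startDateHi`, `mimeType`) are assumptions modeled on the portal's web services and should be checked against its documentation, and both helper functions are hypothetical.

```python
from urllib.parse import urlencode

# Assumed web-service endpoint for result (measurement) records.
BASE_URL = "https://www.waterqualitydata.us/data/Result/search"

# Characteristic names taken from the filter list above.
CHARACTERISTICS = ["Alkalinity, total", "Alkalinity", "pH", "Temperature, water"]


def build_query_url(state_fips: str) -> str:
    """Build one per-state query URL (parameter names are assumptions)."""
    params = [
        ("countrycode", "US"),
        ("statecode", f"US:{state_fips}"),
        ("siteType", "Stream"),
        ("sampleMedia", "Water"),
        ("startDateLo", "01-01-1990"),
        ("startDateHi", "12-31-2020"),
        ("mimeType", "csv"),
    ]
    # Repeat the characteristicName parameter once per requested metric.
    params += [("characteristicName", name) for name in CHARACTERISTICS]
    return f"{BASE_URL}?{urlencode(params)}"


def merge_on_site(results, sites, key="MonitoringLocationIdentifier"):
    """Attach site fields to each measurement row that has a matching site."""
    site_by_id = {site[key]: site for site in sites}
    return [
        {**site_by_id[row[key]], **row}  # measurement fields win on name clashes
        for row in results
        if row[key] in site_by_id
    ]
```

Calling `build_query_url` once per state code and merging each result with its site table mirrors the 48 per-state datasets described above.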

pCO₂ estimation

The 48 datasets were processed and formatted separately then combined into one dataset for estimating pCO₂. The first step was to subset the datasets for quality and consistency among measurements. The following filters were applied:

Removing non-numeric measurement values; e.g., “alkalinity <1 mg/l”
Removing measurement values represented as statistical summaries and not observations; e.g., “average temp = 21 °C”
Removing measurements not taken at the surface of the respective waterbody.
Removing extreme water temperature measurements; e.g., temperature ≤0 °C and temperature ≥40 °C
Removing impossible pH values; e.g., pH >14
Removing pH values below 5.4

Hunt et al.²³ found that when pH is under 5.4 there is an increased risk of overestimating pCO₂ because non-carbonate anions may contribute to the total alkalinity; we therefore removed pH values below 5.4. pH values over 14 were excluded because the standard pH scale runs from 0 to 14. No filters were applied to alkalinity measurements beyond the removal of non-numeric values.
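A minimal Python sketch of the value-based filters is shown below. It covers only the checks that can be made from the reported value itself (numeric, temperature range, pH range). The filters for statistical summaries and for non-surface samples depend on metadata fields not described here, so they are omitted. The function names are hypothetical.

```python
import math


def parse_numeric(value):
    """Return the value as a finite float, or None for entries such as '<1'."""
    try:
        number = float(value)
    except (TypeError, ValueError):
        return None
    return number if math.isfinite(number) else None


def passes_filters(characteristic, value):
    """Apply the numeric, temperature, and pH filters to one measurement."""
    number = parse_numeric(value)
    if number is None:
        return False  # non-numeric measurement values are removed
    if characteristic == "Temperature, water":
        return 0.0 < number < 40.0  # drops <= 0 degC and >= 40 degC
    if characteristic == "pH":
        return 5.4 <= number <= 14.0  # drops pH below 5.4 and above 14
    return True  # alkalinity: no value filter applied
```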

Next, we grouped temperature, pH, and alkalinity measurements by location, date, and time. Grouping was done by creating an identification key that concatenates the following columns: “MonitoringLocationIdentifier”, “ActivityStartDate”, and “ActivityStartTime”. Water quality measurements without time data were still included, but they were grouped only with other measurements that also lacked time data. Measurements are therefore grouped at the highest time/date resolution available, with day being the coarsest acceptable resolution. CDFLOW¹⁹ requires all three of the queried water quality metrics to be present in each group to estimate pCO₂.
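The grouping logic can be sketched as below, assuming each record is a dictionary carrying the three key columns plus a `CharacteristicName` column. The separator character and the helper names are assumptions made for illustration.

```python
from collections import defaultdict

KEY_COLUMNS = (
    "MonitoringLocationIdentifier",
    "ActivityStartDate",
    "ActivityStartTime",
)

# Map reported characteristic labels onto the three required metrics.
METRIC = {
    "Temperature, water": "temperature",
    "pH": "pH",
    "Alkalinity": "alkalinity",
    "Alkalinity, total": "alkalinity",
}
REQUIRED = {"temperature", "pH", "alkalinity"}


def group_key(record):
    """Concatenate location, date, and time; a missing time becomes ''."""
    return "_".join(str(record.get(col) or "") for col in KEY_COLUMNS)


def complete_groups(records):
    """Group records by key and keep groups holding all three metrics."""
    groups = defaultdict(list)
    for record in records:
        groups[group_key(record)].append(record)
    return {
        key: members
        for key, members in groups.items()
        if REQUIRED <= {METRIC.get(m["CharacteristicName"]) for m in members}
    }
```

Because a missing time maps to an empty string, records without time data share a key only with other untimed records from the same location and date.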

Finally, if a group had records of temperature, pH, and alkalinity, a single pCO₂ value was estimated using the United States Geological Survey’s program PHREEQC v3²⁴. PHREEQC quantitatively accounts for the chemical composition of a solution by relying on mole-balance equations, and in solving these equations it derives the most likely pCO₂ estimate²⁵. It should be noted that PHREEQC calculates pCO₂ under the assumption that alkalinity and pH in a system are determined by the current state of the carbonate system. PHREEQC can detect when this assumption cannot be safely made, in which case that group of observations was discarded. When a group contained multiple measurements of the same water quality metric, one of those measurements was chosen at random for the pCO₂ estimate. All measurements not grouped were then discarded. We also excluded extreme outliers, defined as pCO₂ estimates more than 2 standard deviations from the mean. Combining the 48 processed datasets resulted in a single dataset representing all our pCO₂ estimates across the CONUS.
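To illustrate the idea of estimating pCO₂ from pH, temperature, and alkalinity, the sketch below uses a simplified carbonate-equilibrium calculation. It is not PHREEQC and does not reproduce the CDFLOW values. It assumes that alkalinity is entirely carbonate alkalinity, that activities equal concentrations, and that the equilibrium constants follow widely used empirical temperature fits for dilute freshwater; all three are assumptions of this example.

```python
import math


def log10_constants(temp_c):
    """Assumed empirical fits for log10 K1, K2 (carbonic acid) and KH (Henry)."""
    t = temp_c + 273.15  # kelvin
    lt = math.log10(t)
    log_k1 = -356.3094 - 0.06091964 * t + 21834.37 / t + 126.8339 * lt - 1684915.0 / t**2
    log_k2 = -107.8871 - 0.03252849 * t + 5151.79 / t + 38.92561 * lt - 563713.9 / t**2
    log_kh = 108.3865 + 0.01985076 * t - 6919.53 / t - 40.45154 * lt + 669365.0 / t**2
    return log_k1, log_k2, log_kh


def estimate_pco2_uatm(ph, temp_c, alkalinity_ueq_per_l):
    """Simplified pCO2 estimate in microatmospheres (not the PHREEQC result)."""
    log_k1, log_k2, log_kh = log10_constants(temp_c)
    k1, k2, kh = 10.0**log_k1, 10.0**log_k2, 10.0**log_kh
    h = 10.0 ** (-ph)  # hydrogen ion concentration, mol/L
    alk = alkalinity_ueq_per_l * 1e-6  # eq/L
    # Carbonate alkalinity: alk = [HCO3-] + 2[CO3 2-], with [CO3 2-] = K2[HCO3-]/[H+]
    hco3 = alk / (1.0 + 2.0 * k2 / h)
    co2_aq = h * hco3 / k1  # dissolved CO2, mol/L
    return co2_aq / kh * 1e6  # atm converted to microatmospheres
```

With pH and temperature held fixed the estimate scales almost linearly with alkalinity, and it falls roughly tenfold for each unit increase in pH, which is why errors in pH dominate the uncertainty of this kind of calculation.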

Defining sites

The site data that were merged with water quality measurements included latitude and longitude coordinates. These coordinates corresponded with the location identifier for each water quality measurement (now a pCO₂ estimate), labeled “MonitoringLocationIdentifier” (referred to as MID). From our dataset of pCO₂ estimates across the CONUS, we created a separate dataset containing each unique MID along with its latitude and longitude coordinates. We then spatially joined each unique MID with the Environmental Protection Agency’s National Hydrography Dataset Plus V2^20,26 (NHD) based on the closest stream catchment feature within NHD. Stream catchment features are labeled with a unique code called a COMID²⁷. The spatial join associated each unique MID with a COMID, and the result was merged with our dataset of pCO₂ estimates across the CONUS. We also calculated the distance between each MID and its associated COMID; when the distance was greater than 100 meters, the associated pCO₂ estimate(s) were excluded from our dataset. Finally, we spatially joined MID coordinates with Hydrologic Unit (HUC12)²⁸ polygons included in the NHD. As a result of the two spatial joins, pCO₂ estimates can be grouped at any Hydrologic Unit Code level, and sites within CDFLOW¹⁹ are defined by the COMID in which the estimate resides.
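The nearest-feature join with a 100 m cutoff can be sketched as follows. As a simplifying assumption, each stream feature is represented by a single point, whereas the actual NHD features are lines and polygons handled with geospatial tooling. The function names and data layout are hypothetical.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius, used for great-circle distance
MAX_DISTANCE_M = 100.0        # cutoff stated in the text


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def assign_comid(site_lat, site_lon, features):
    """Return (comid, distance_m); comid is None when the nearest feature is too far.

    features: iterable of (comid, lat, lon) representative points.
    """
    comid, distance = min(
        ((cid, haversine_m(site_lat, site_lon, lat, lon)) for cid, lat, lon in features),
        key=lambda pair: pair[1],
    )
    return (comid, distance) if distance <= MAX_DISTANCE_M else (None, distance)
```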

All data queries, manipulations, and calculations were done using the statistical program R version 4.1.2²⁹. A visual representation of the workflow to create CDFLOW can be found in Fig. 1.

Data Records

CDFLOW¹⁹ exists as a single CSV file that has 779,186 pCO₂ estimates (rows) and 10 variables (columns) across the CONUS from 1990 through 2020 (Table 1). All 48 states within the CONUS are represented across 35,855 sites. CDFLOW¹⁹ and all supporting code needed to generate and validate the dataset can be downloaded from a public repository on Figshare (https://doi.org/10.6084/m9.figshare.19787326).

Table 1 Description of the data included in CDFLOW¹⁹.

While CDFLOW¹⁹ has representation across all 48 states and 18 major watersheds within the CONUS, some areas are more represented than others. To display the spatial variability of CDFLOW, we grouped estimates by two-digit hydrologic unit codes (HUC2) and mapped them (Fig. 2). The South Atlantic Gulf and Mid Atlantic watersheds had the most representation in CDFLOW¹⁹, followed by the Missouri and Arkansas-White-Red watersheds. We also normalized the quantity of estimates within HUC2s by calculating the number of estimates per 5,000 km of stream distance within the HUC2. Total stream distance was calculated as the sum of COMID distances within the NHD for each HUC2. The normalized quantity of stream estimates followed patterns similar to the total number of estimates (Fig. 2), leading us to conclude that the number of estimates is not proportional to the quantity of water but is instead driven by other, non-environmental factors. We also looked at the temporal scale of CDFLOW¹⁹ (Fig. 3). Generally, the number of estimates increased from the 1990s to the early 2000s, remained roughly constant, and then started to decrease from 2015 to 2020. Finally, we inspected spatiotemporal trends of estimates across the CONUS by splitting CDFLOW¹⁹ into three decades (1990–2000, 2001–2010, 2011–2020). We found that the spatial patterns in the total number of estimates shown in Fig. 2 held across the three decades.
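The normalization is simple arithmetic, sketched below. The counts and stream lengths in the usage note are made-up numbers for illustration, not values from CDFLOW, and the function name is hypothetical.

```python
def estimates_per_5000_km(n_estimates, total_stream_km, per_km=5000.0):
    """Number of pCO2 estimates per 5,000 km of stream length in one HUC2."""
    if total_stream_km <= 0:
        raise ValueError("total_stream_km must be positive")
    return n_estimates / total_stream_km * per_km


def normalize_by_huc2(counts, stream_km):
    """Apply the normalization to every HUC2 present in both dictionaries."""
    return {
        huc2: estimates_per_5000_km(counts[huc2], stream_km[huc2])
        for huc2 in counts.keys() & stream_km.keys()
    }
```

For example, a hypothetical HUC2 with 12,000 estimates and 300,000 km of summed COMID length would be computed as 12,000 / 300,000 × 5,000.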

Technical Validation

Data validation

pCO₂ values in flowing freshwaters reported in the literature range widely, with typical values falling between 1,300 and 4,300 microatmospheres (µatm, the unit of the partial pressure of CO₂), although values in excess of 10,000 µatm have been reported^30,31,32,33. CDFLOW estimates are broadly consistent with this range, with mean HUC2 values ranging from 1,200 to 4,500 µatm and an overall interquartile range (25% to 75%) of 1,000 to 3,450 µatm. CDFLOW also contains values in excess of 10,000 µatm, as reported in the literature. Although CDFLOW estimates compare adequately with the literature, the majority of reported pCO₂ values (including those cited here) are themselves estimates obtained with methods similar to those used for CDFLOW. In a recent study, Liu et al.³⁴ assembled a dataset of direct measurements of pCO₂ from other published studies. Liu et al.³⁴ calculated average pCO₂ values of 1,810, 1,540, and 2,560 µatm in arctic, temperate, and tropical ecoregions, respectively, and CDFLOW again had similar averages.

We downloaded the dataset assembled by Liu et al.³⁴ and compared it with CDFLOW. First, we applied the same site join used for CDFLOW to assign COMIDs and Hydrologic Unit Codes to the direct measurements. We then filtered CDFLOW to the months and HUC8s represented in the direct measurements. Both datasets were then filtered so that each HUC8 had a minimum of 10 data points (to avoid comparing very small samples). We then ran a separate ANOVA for each HUC8 comparing the data from CDFLOW and Liu et al.³⁴, which resulted in 26 within-HUC8 comparisons. Of those comparisons, less than half (46%) were significantly different (p < 0.05), meaning that in just over half of the HUC8s our estimates could not be distinguished from the direct measurements of Liu et al.³⁴. We also inspected the direction of the bias between the estimates and direct measurements by calculating the difference between the median pCO₂ values in each HUC8. This is akin to examining residuals from a linear model, for which we expect the differences to be centered on 0 and normally distributed. We found that the differences between the medians (i.e., residuals) were homoscedastic, which we interpret as evidence that neither our data nor the Liu et al.³⁴ data systematically over- or under-estimated pCO₂.
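The median-difference part of this comparison can be sketched as below. The sketch covers only the minimum-sample filter and the per-HUC8 median difference, not the ANOVA. The sign convention (estimate minus direct measurement) and the data layout are assumptions of this example.

```python
from statistics import median

MIN_POINTS = 10  # minimum number of data points per HUC8 in each dataset


def median_bias_by_huc8(estimated, measured):
    """Difference in median pCO2 (estimated minus measured) for each shared HUC8.

    estimated, measured: dict mapping a HUC8 code to a list of pCO2 values.
    """
    bias = {}
    for huc8 in estimated.keys() & measured.keys():
        est, obs = estimated[huc8], measured[huc8]
        if len(est) >= MIN_POINTS and len(obs) >= MIN_POINTS:
            bias[huc8] = median(est) - median(obs)
    return bias
```

A set of differences centered on zero, with positive and negative values of similar magnitude, would indicate no systematic over- or under-estimation.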

Site ground truth

To test the accuracy of the site join procedure used to define sites in CDFLOW we created a procedure to ground truth the site join. The procedure worked by randomly choosing 50 CDFLOW sites and mapping the original latitude and longitude as well as the given COMID and all COMID stream features within 0.025 degrees latitude and 0.025 degrees longitude of the original coordinates in 50 separate plots. The resulting 50 plots were then checked manually by 2 observers to demonstrate how often the unsupervised procedure led to a reliable result. Both observers independently found that 50/50 (100%) of the random sites were correctly assigned. The R-script for the analysis is available at the Figshare link (https://doi.org/10.6084/m9.figshare.19787326).
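The sampling and windowing steps of this check can be sketched as follows. The actual analysis is in the authors' R script; the seed, function names, and data layout here are assumptions for illustration, and the plotting and manual inspection are not shown.

```python
import random

N_SITES = 50           # number of randomly chosen sites
HALF_WIDTH_DEG = 0.025  # window half-width in degrees of latitude and longitude


def sample_sites(site_ids, n=N_SITES, seed=0):
    """Reproducibly draw n distinct site identifiers."""
    rng = random.Random(seed)
    return rng.sample(sorted(site_ids), n)


def plot_window(lat, lon, half_width=HALF_WIDTH_DEG):
    """Bounding box (lat_min, lat_max, lon_min, lon_max) around a site."""
    return (lat - half_width, lat + half_width, lon - half_width, lon + half_width)


def in_window(window, lat, lon):
    """True when a feature coordinate falls inside the bounding box."""
    lat_min, lat_max, lon_min, lon_max = window
    return lat_min <= lat <= lat_max and lon_min <= lon <= lon_max
```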

Water quality data portal

The Water Quality Data Portal is a water quality data repository hosted by the United States Geological Survey²². Users can browse and download data via the Water Quality Data Portal website (https://www.waterqualitydata.us). The portal is a dynamic data repository with over 290 million standardized records, where a record is a single collected water quality metric. It includes all water quality records reported to the United States Geological Survey, the United States Department of Agriculture, and the Environmental Protection Agency.

National Hydrography Dataset

The National Hydrography Dataset (NHD) is a national geospatial surface water framework hosted by the Environmental Protection Agency and built in conjunction with the United States Geological Survey^20,26. NHD includes shapefiles mapping all flowing water systems throughout the United States.

StreamCat

The StreamCat dataset is incorporated into the NHD, which maps stream segments and their associated catchment within the CONUS²⁷.

PHREEQC

PHREEQC Version 3 is a computer program written in the C++ programming language that is designed to perform a wide variety of aqueous geochemical calculations²⁴. PHREEQC quantitatively accounts for the chemical composition of a solution by relying on mole-balance equations. It is freely available (e.g., https://www.usgs.gov/software/phreeqc-version-3).

Usage Notes

Estimation uncertainty

PHREEQC relies on the equilibrium of the carbonate system in water to estimate pCO₂²⁵, and uncertainty has been documented for pCO₂ estimates that rely on carbonate equilibrium. When error is present in pCO₂ estimates based on carbonate equilibria, it is usually overestimation^23,35,36. We applied filters to the data that went into pCO₂ estimation to mitigate overestimation (see Methods). At the discretion of the user, additional filters can be applied to further reduce the risk of overestimation; e.g., removing pCO₂ estimates greater than 100,000 parts per million by volume and removing alkalinity values below 1,000 microequivalents per kilogram of water³⁶. While absolute values of CDFLOW¹⁹ pCO₂ estimates may be subject to overestimation, relative values and trends are still valid.
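A user-side version of these optional filters might look like the sketch below. The field names `pco2` and `alkalinity` are hypothetical placeholders rather than confirmed CDFLOW column names, and the sketch assumes the values are already expressed in the units given in the text.

```python
MAX_PCO2_PPMV = 100_000.0        # upper bound on pCO2, parts per million by volume
MIN_ALKALINITY_UEQ_KG = 1_000.0  # lower bound on alkalinity, microequivalents per kg


def apply_optional_filters(rows, max_pco2=MAX_PCO2_PPMV, min_alk=MIN_ALKALINITY_UEQ_KG):
    """Keep rows within the optional bounds; field names are hypothetical."""
    return [
        row
        for row in rows
        if row["pco2"] <= max_pco2 and row["alkalinity"] >= min_alk
    ]
```

Both thresholds are exposed as parameters so that a stricter or looser cutoff can be tried without editing the function.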

Uncertainty estimates from PHREEQC are available as mole balance percent errors. However, when only three metrics are used to compute pCO₂, this error term is always quite high and does not necessarily reflect a poor estimate. Potter et al.³⁷, who compared modeled pCO₂ estimates from PHREEQC with direct measurements, concluded that although mole balance percent errors are high, PHREEQC still provides a good estimate of pCO₂ from pH, temperature, and alkalinity. We therefore excluded mole balance percent errors from the dataset, as they are not relevant for modeling purposes and do not negate the validity of CDFLOW pCO₂ estimates.

Extra parameters

PHREEQC allows extra parameters to be included when estimating pCO₂, specifically other dissolved inorganic species. However, data on other dissolved inorganic species that match the date, time, and location of the pH, temperature, and alkalinity measurements are available for only a limited number of observations, so these species were excluded from the PHREEQC estimation. Including other dissolved inorganic species would potentially allow for more robust pCO₂ estimates. For CDFLOW users interested in doing so, a supporting script that describes and gives examples of the required changes can be found at the Figshare link (https://doi.org/10.6084/m9.figshare.19787326).

Expanding data

Defining sites in CDFLOW¹⁹ by the COMID they fall into gives each site access to all the data that correspond to that COMID. COMID data can be accessed via the NHD (see Technical Validation) or via the R package nhdplusTools³⁸.

Code availability

Code for the creation of CDFLOW is available as a series of R scripts via a public repository on Figshare¹⁹ (https://doi.org/10.6084/m9.figshare.19787326).

References

1. Hartmann, D. L. et al. in Climate change 2013 the physical science basis: Working group I contribution to the fifth assessment report of the intergovernmental panel on climate change 159–254 (Cambridge University Press, 2013).
2. Doney, S. C. & Schimel, D. S. Carbon and Climate System Coupling on Timescales from the Precambrian to the Anthropocene. Annual Review of Environment and Resources 32, 31–66, https://doi.org/10.1146/annurev.energy.32.041706.124700 (2007).
3. Doney, S. C., Fabry, V. J., Feely, R. A. & Kleypas, J. A. Ocean acidification: the other CO₂ problem. Annual Review of Marine Science 1, 169–192 (2009).
4. Solomon, S., Manning, M., Marquis, M. & Qin, D. Climate change 2007-the physical science basis: Working group I contribution to the fourth assessment report of the IPCC. Vol. 4 (Cambridge University Press, 2007).
5. Munday, P. L., Jarrold, M. D. & Nagelkerken, I. Ecological effects of elevated CO₂ on marine and freshwater fishes: from individual to community effects. Fish Physiology 37, 323–368, https://doi.org/10.1016/bs.fp.2019.07.005 (2019).
6. Ross, P. M., Parker, L., O’Connor, W. A. & Bailey, E. A. The Impact of Ocean Acidification on Reproduction, Early Development and Settlement of Marine Organisms. Water 3, 1005–1030, https://doi.org/10.3390/w3041005 (2011).
7. Orr, J. C. et al. Anthropogenic ocean acidification over the twenty-first century and its impact on calcifying organisms. Nature 437, 681–686, https://doi.org/10.1038/nature04095 (2005).
8. Munday, P. L. et al. Replenishment of fish populations is threatened by ocean acidification. Proceedings of the National Academy of Sciences 107, 12930–12934, https://doi.org/10.1073/pnas.1004519107 (2010).
9. Flynn, K. J. et al. Changes in pH at the exterior surface of plankton with ocean acidification. Nature Climate Change 2, 510–513 (2012).
10. Hasler, C. T., Butman, D., Jeffrey, J. D. & Suski, C. D. Freshwater biota and rising pCO₂. Ecology Letters 19, 98–108, https://doi.org/10.1111/ele.12549 (2016).
11. Cole, J. J., Caraco, N. F., Kling, G. W. & Kratz, T. K. Carbon dioxide supersaturation in the surface waters of lakes. Science 265, 1568–1570 (1994).
12. Cole, J. J. et al. Plumbing the global carbon cycle: integrating inland waters into the terrestrial carbon budget. Ecosystems 10, 172–185 (2007).
13. Butman, D. & Raymond, P. A. Significant efflux of carbon dioxide from streams and rivers in the United States. Nature Geoscience 4, 839–842, https://doi.org/10.1038/ngeo1294 (2011).
14. Wetzel, R. G. Limnology: lake and river ecosystems. (Gulf Professional Publishing, 2001).
15. Sobek, S., Algesten, G., Bergström, A. K., Jansson, M. & Tranvik, L. J. The catchment and climate regulation of pCO₂ in boreal lakes. Global Change Biology 9, 630–641 (2003).
16. Lauerwald, R., Laruelle, G. G., Hartmann, J., Ciais, P. & Regnier, P. A. Spatial patterns in CO₂ evasion from the global river network. Global Biogeochemical Cycles 29, 534–554, https://doi.org/10.1002/2014gb004941 (2015).
17. Jones, J. B. Jr, Stanley, E. H. & Mulholland, P. J. Long-term decline in carbon dioxide supersaturation in rivers across the contiguous United States. Geophysical Research Letters 30, https://doi.org/10.1029/2003gl017056 (2003).
18. Liu, S. & Raymond, P. A. Hydrologic controls on pCO₂ and CO₂ efflux in US streams and rivers. Limnology and Oceanography Letters 3, 428–435 (2018).
19. Toavs, T. M. Steve; Hasler, Caleb; Suski, Cory. Figshare. https://doi.org/10.6084/m9.figshare.19787326 (2022).
20. McKay, L. et al. (ed US Environmental Protection Agency) (2012).
21. Wieczorek, M., Jackson, S. & Schwarz, G. Select attributes for NHDPlus version 2.1 reach catchments and modified network routed upstream watersheds for the conterminous United States. US Geological Survey. https://doi.org/10.5066/F7765D7V (2018).
22. Read, E. K. et al. Water quality data for national‐scale aquatic research: The Water Quality Portal. Water Resources Research 53, 1735–1745, https://doi.org/10.1002/2016wr019993 (2017).
23. Hunt, C. W., Salisbury, J. E. & Vandemark, D. Contribution of non-carbonate anions to total alkalinity and overestimation of pCO₂ in New England and New Brunswick rivers. Biogeosciences 8, 3069–3076, https://doi.org/10.5194/bg-8-3069-2011 (2011).
24. Parkhurst, D. L. & Appelo, C. in US geological survey techniques and methods Vol. 6 (ed USGS) 497 (2013).
25. Parkhurst, D. L. & Appelo, C. User’s guide to PHREEQC (Version 2): A computer program for speciation, batch-reaction, one-dimensional transport, and inverse geochemical calculations. Water-Resources Investigations Report 99, 312 (1999).
26. (ed U.S. Geological Survey) (USGS, 2018).
27. Hill, R. A., Weber, M. H., Leibowitz, S. G., Olsen, A. R. & Thornbrugh, D. J. The Stream-Catchment (StreamCat) Dataset: A Database of Watershed Metrics for the Conterminous United States. JAWRA Journal of the American Water Resources Association 52, 120–128, https://doi.org/10.1111/1752-1688.12372 (2016).
28. Seaber, P. R., Kapinos, F. P. & Knapp, G. L. (ed USGS) (1987).
29. Team, R. C. (2013).
30. Richey, J. E., Melack, J. M., Aufdenkampe, A. K., Ballester, V. M. & Hess, L. L. Outgassing from Amazonian rivers and wetlands as a large tropical source of atmospheric CO₂. Nature 416, 617–620 (2002).
31. Johnson, M. S. et al. CO₂ efflux from Amazonian headwater streams represents a significant fate for deep soil respiration. Geophysical Research Letters 35 (2008).
32. Humborg, C. et al. CO₂ supersaturation along the aquatic conduit in Swedish watersheds as constrained by terrestrial respiration, aquatic respiration and weathering. Global Change Biology 16, 1966–1978 (2010).
33. Cole, J. J. & Caraco, N. F. Carbon in catchments: connecting terrestrial carbon losses with aquatic metabolism. Marine and Freshwater Research 52, 101–110 (2001).
34. Liu, S. et al. The importance of hydrology in routing terrestrial carbon to the atmosphere via global streams and rivers. Proceedings of the National Academy of Sciences 119, e2106322119 (2022).
35. Golub, M., Desai, A. R., McKinley, G. A., Remucal, C. K. & Stanley, E. H. Large uncertainty in estimating pCO₂ from carbonate equilibria in lakes. Journal of Geophysical Research: Biogeosciences 122, 2909–2924 (2017).
36. Abril, G. et al. Large overestimation of pCO₂ calculated from pH and alkalinity in acidic, organic-rich freshwaters. Biogeosciences 12, 67–78, https://doi.org/10.5194/bg-12-67-2015 (2015).
37. Potter, L., Tollrian, R., Wisotzky, F. & Weiss, L. C. Determining freshwater pCO₂ based on geochemical calculation and modelling using PHREEQC. MethodsX 8, 101430, https://doi.org/10.1016/j.mex.2021.101430 (2021).
38. nhdplusTools: Tools for Accessing and Working with the NHDPlus (2022).

Author information

Authors and Affiliations

Department of Oceanography and Coastal Sciences, Louisiana State University, Baton Rouge, LA, 70803, USA
Timothy R. Toavs & Stephen R. Midway
Department of Biology, The University of Winnipeg, Winnipeg, Manitoba, Canada
Caleb T. Hasler
Department of Natural Resources, University of Illinois, Urbana, IL, 61801, USA
Cory D. Suski

Contributions

All authors were active in the development of this manuscript. S.R.M., C.T.H. and C.D.S. conceived of the idea and S.R.M. developed the estimates. T.R.T. led the effort for nationwide estimates, in addition to leading the writing of the manuscript. All authors were active in reviewing and editing the manuscript.

Corresponding author

Correspondence to Stephen R. Midway.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Toavs, T.R., Hasler, C.T., Suski, C.D. et al. A 30-year dataset of CO₂ in flowing freshwaters in the United States. Sci Data 10, 20 (2023). https://doi.org/10.1038/s41597-022-01915-0

Received: 02 June 2022
Accepted: 15 December 2022
Published: 11 January 2023
DOI: https://doi.org/10.1038/s41597-022-01915-0
