arising from: Z. Shao, Scientific Reports https://doi.org/10.1038/s41598-017-04584-x (2017).

Introduction

Complexity algorithms provide information about datasets that is radically different from classical moment statistics. Instead of focusing on deviations from central values, they quantify other characteristics such as order, pattern repetitions, or the existence of attractors. However, those analyses must be carried out with the proper statistical treatment, which is, unfortunately, not always the case. In this contribution, I provide an example of the hazards of applying complexity measures without sufficient care by correcting a previously published analysis that aimed to quantify the complexity of climate. I clarify some misconceptions about the use of Sample Entropy and revise the incorrect assessments and conclusions drawn from the previous misapplication of the methods.

An attempt to contrast the complexity of the climate of the past 122,000 years with that of the most recent 2,000 years was recently published1. The idea behind the research was to use the oxygen isotope (δ18O) record of ice cores, a proxy for the temperature at the accumulation site through history, to determine the levels of complexity of the two different periods. The author obtained values of Sample Entropy (SampEn) of 0.7 ± 0.1 for the long record and 2.2 ± 0.2 for the short one, and values of Lempel-Ziv complexity (LZC) of 0.29 ± 0.03 and 0.99 ± 0.05 for the long and short records respectively, attributing those differences to changes in the complexity of climate. In this paper, I use the same data to show that the reported differences are attributable to the incorrect use of the algorithms rather than to actual changes in climatic complexity.

Complexity algorithms are statistical tools, and as such, the conclusions drawn from them are only as good as the robustness of their application. The first issue with the previous research concerns the use of the data. The author used two different datasets of the δ18O record with different time resolutions: the record for the last 2,000 years had a time resolution of τ = 1 year, while the record for 122,000 years had a time resolution of τ = 20 years. The complexity algorithms used in this research require measurements equally spaced in time2,3 and, in general, comparing series with different time resolutions is not statistically meaningful. In the second figure of the same research, the author also determined the complexity of the recent past 2,000 years by sampling the annually resolved record every τ = 20 years, obtaining mean values of SampEn = 2.3 ± 0.5 and LZC = 1.16 ± 0.08. However, the NGRIP δ18O dataset spanning the entire past 122,000 years consists of 20-year binned averages of the higher-resolution measurements. To avoid erroneous conclusions, these data should instead have been compared to 20-year binned averages of the annually resolved record.
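As an illustration of the appropriate pre-processing, the short sketch below averages an annually resolved series into 20-year bins (a minimal sketch: the array annual_d18o is a hypothetical placeholder for the annually resolved record, not the actual NGRIP file).

```python
import numpy as np

def bin_average(values, bin_size=20):
    """Average consecutive blocks of `bin_size` samples (e.g. 20 annual values
    into one 20-year mean), discarding any incomplete final block."""
    values = np.asarray(values, dtype=float)
    n_bins = len(values) // bin_size
    return values[:n_bins * bin_size].reshape(n_bins, bin_size).mean(axis=1)

# Hypothetical stand-in for the annually resolved record of the last 2,000 years
rng = np.random.default_rng(0)
annual_d18o = -35.0 + rng.normal(0.0, 1.0, size=2000)

binned = bin_average(annual_d18o)   # 100 values at 20-year resolution
print(len(binned))
```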

The second issue with the previous research was the misapplication of the complexity algorithms. The complexity of climate was determined by making use of Sample Entropy4 and Lempel-Ziv complexity5.

SampEn was designed as a variation of Approximate Entropy (ApEn)2. These algorithms give higher values to complex and random systems and lower values to more predictable series. Both ApEn and SampEn are based on the probability of occurrence of patterns in the data and have proven able to distinguish normal from abnormal data in instances where moment-statistic approaches failed to show significant differences3. Furthermore, both statistics are relative measures, and comparisons are intended between data series of the same length with homogeneous generating processes.
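To make the pattern-counting idea concrete, a minimal Python sketch of SampEn is given below. The defaults m = 2 and r = 0.15σ are assumptions chosen to mirror the ApEn(2, 0.15σ, 1000) settings quoted later in the text, not necessarily the exact parameters of the original study.

```python
import numpy as np

def sample_entropy(x, m=2, r_factor=0.15):
    """Sample Entropy SampEn(m, r) with tolerance r = r_factor * std(x).

    B counts pairs of length-m templates whose Chebyshev distance is <= r,
    A counts pairs of length-(m + 1) templates within the same tolerance
    (self-matches excluded); SampEn = -ln(A / B)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    r = r_factor * x.std()

    def count_pairs(length):
        # use n - m templates for both lengths so that the counts are comparable
        templates = np.array([x[i:i + length] for i in range(n - m)])
        matches = 0
        for i in range(len(templates) - 1):
            dist = np.max(np.abs(templates[i + 1:] - templates[i]), axis=1)
            matches += int(np.sum(dist <= r))
        return matches

    b = count_pairs(m)
    a = count_pairs(m + 1)
    return np.nan if a == 0 or b == 0 else -np.log(a / b)
```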

Lamentably, it is generally assumed that “SampEn is largely independent of record length”4. That idea, introduced in the literature by the creators of Sample Entropy, is extremely misleading and requires clarification: in the case of a homogeneous data-generating process, SampEn is independent of the record length, whereas ApEn is biased in that sense. If the generating process does not change, a particular pattern occurs with a similar frequency and SampEn can be used independently of the number of observations, while ApEn requires a large number of observations to be accurate. However, as evidenced in the next section, climate has changed dramatically during the last 122,000 years, mixing heterogeneous epochs, and the results of the algorithm then depend on the length of the data record. A comprehensive tutorial and detailed analysis of the SampEn and ApEn equations can be found in6.

Similarly, the comparisons of Lempel-Ziv complexity (LZC) lack validity since they are made between heterogeneous epochs with different numbers of data points. In essence, this algorithm transforms the data record into a binary chain depending on whether each value is higher or lower than the median, and then analyzes the number of patterns. Given the highly variable shape of the original δ18O record in Fig. 1 (top), the comparison of each value with the median (corresponding to −39.88) translates into a binary chain with many ones or zeros in a row, indicating low complexity for the 122,000 years dataset. On the contrary, as the 2,000 years record is almost flat, the median divides the string into ones and zeros more diversely, yielding higher apparent complexity and randomness.
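A corresponding sketch of this procedure (median binarization followed by a Lempel-Ziv 1976 phrase count) is shown below; the Kaspar-Schuster normalization c(n)·log2(n)/n is an assumption about how the reported values were scaled.

```python
import numpy as np

def lz_complexity(values):
    """Median-binarize a series and return the normalized Lempel-Ziv (1976)
    complexity, c(n) * log2(n) / n (Kaspar-Schuster normalization, assumed)."""
    values = np.asarray(values, dtype=float)
    s = ''.join('1' if v > np.median(values) else '0' for v in values)
    n = len(s)
    i, c = 0, 0
    while i < n:
        l = 1
        # grow the current phrase while it has already appeared earlier in the string
        while i + l <= n and s[i:i + l] in s[:i + l - 1]:
            l += 1
        c += 1        # the phrase s[i:i + l] is new: count it and move on
        i += l
    return c * np.log2(n) / n
```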

Figure 1

Top: δ18O record of ice cores (years before 2000) for the last 122,000 years. Center: Sample Entropy rolling windows of 2,000 years length. Bottom: Lempel-Ziv Complexity rolling windows of 2,000 years length.

In the next section, I show that throughout the last 122,000 years we can find subsequences of 2,000 years with values of SampEn and LZC similar to those of the most recent 2,000 years. This indicates that the most recent 2,000 years do not stand out in terms of the complexity estimated from the δ18O record. Moreover, the results for the last 2,000 years are close to the mean values of the analysis. Therefore, the claim that the whole 122,000 years record is less complex than the last 2,000 years is unjustified and is nothing more than a statistical artifact.

Results

I analyze here three important issues that arise when applying complexity algorithms: (i) the mixing of heterogeneous epochs; (ii) the length of the sequences; and (iii) the non-stationarity of the data.

(i) Both ApEn and SampEn have their roots in the entropy concept from Information Theory, where the alphabet is the list of possible outcomes of a random variable. In an experiment with a six-sided die, the alphabet would be the integers from 1 to 6. If we exchanged that die for a 12-sided die, the alphabet would change and the comparison with the previous data record would be inaccurate; the frequency of occurrence of the different numbers has changed radically. The same situation occurs in the analysis of climate. Figure 1 (top) shows that the δ18O data changed suddenly around 11,700 years ago, corresponding to the beginning of the Holocene and the abrupt warming after the last glacial period. The assumption that the epochs before and after that event have the same alphabet may lead to incorrect comparisons, since both the Shannon (for ApEn) and Rényi (for SampEn) entropies depend on the alphabet.
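As a minimal numerical illustration of this alphabet dependence (not part of the original analysis), the Shannon entropy of a fair 6-sided die differs from that of a fair 12-sided die, even though both are maximally random for their own alphabet:

```python
import numpy as np

def shannon_entropy(probabilities):
    """Shannon entropy in bits, H = -sum(p * log2(p)), ignoring zero-probability outcomes."""
    p = np.asarray(probabilities, dtype=float)
    p = p[p > 0]
    return float(-np.sum(p * np.log2(p)))

print(shannon_entropy(np.full(6, 1 / 6)))    # ~2.585 bits for a fair 6-sided die
print(shannon_entropy(np.full(12, 1 / 12)))  # ~3.585 bits for a fair 12-sided die
```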

The goal of complexity algorithms is to characterize changes in similar data series. In this sense, we should remember that both algorithms were designed for clinical applications and have been used to analyze, for example, changes in the complexity of heartbeats. Continuing with that example, the comparison of results must be made between two time series of human heartbeats, since they have similar alphabets; the direct comparison of the complexity of the heartbeats of two different species would not provide meaningful information. For an application of complexity measures to series with different alphabets, see7.

In Fig. 1, the LZ complexity roughly indicates how many regime shifts occur in each window, since the algorithm compares each value with the median. Thus, values during the Holocene are comparable to values during glacial intervals in which there are almost no abrupt transitions.

(ii) Using sequences of the same length is fundamental, and the comparison of 2,000 years with 122,000 years is impractical and leads to misleading results. To show this, I analyze the changes of complexity through time using rolling windows of 2,000 years. If the complexity estimates for the last 2,000 years were substantially different, they should stand out from the range of values obtained in all other windows of the same length.

Instead of using two different time series, I have made the calculations using only the NGRIP δ18O record of the entire past 122,000 years on the GICC05modelext time scale in 20-year resolution. This record spans from 20 to 122,280 years b2k (before the year 2000) and contains N = 6114 observations, including the last 2,000 years. For the analysis, I divide the whole record into sequences of 2,000 years: the first sequence contains the data from [20, 2000] years b2k, the second sequence the data from [40, 2020] years b2k, etc. The time resolution is always τ = 20 years and the record length is N = 100 for all sequences.
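A sketch of this rolling-window analysis is given below, reusing the sample_entropy and lz_complexity functions from the earlier sketches. The loading step and the surrogate series are illustrative assumptions only, since the actual NGRIP file format is not reproduced here.

```python
import numpy as np

# Hypothetical loading step: `d18o` should hold the N = 6114 values of the
# 20-year-resolution NGRIP record, ordered from youngest (20 yr b2k) to oldest.
# d18o = np.loadtxt("ngrip_d18o_20yr.txt")       # file name is illustrative only
rng = np.random.default_rng(1)
d18o = -40.0 + rng.normal(0.0, 2.0, size=6114)   # placeholder for the real record

window = 100   # 100 points x 20 years = 2,000 years
sampen_values, lzc_values = [], []
for start in range(len(d18o) - window + 1):      # slide by one 20-year sample
    segment = d18o[start:start + window]
    sampen_values.append(sample_entropy(segment))
    lzc_values.append(lz_complexity(segment))

# nan-aware statistics, since SampEn is undefined when no length-(m+1) matches occur
print(np.nanmean(sampen_values), np.nanstd(sampen_values))
print(np.mean(lzc_values), np.std(lzc_values))
```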

Figure 1  shows the whole δ18O record (top), the Sample Entropy for the different sequences of 2,000 years (center), and the Lempel-Ziv complexity for the different sequences of 2,000 years (bottom). The mean value of SampEn for all the sequences is 1.76, and the standard deviation is 0.56. The mean value of LZC for the same sequences is 0.93, and the standard deviation is 0.24. The uncertainty in ApEn and SampEn is low; the standard deviation of ApEn(2, 0.15σ, 1000) determined through Monte Carlo simulations is less than 0.055 for a large class of models9.

It is clear from the figure that the first sequence, corresponding to the recent past 2,000 years, is not substantially different from the rest of the sequences, having a SampEn value of 2.18 and a LZC of 1.20. Therefore, there is no scientific argument to justify the claim that "By comparison, the recent 2000-year climate is further dominated anthropogenic forcing processes besides natural forcing processes"1. Furthermore, previous epochs have had higher levels of complexity, with values of SampEn around 4, i.e., four standard deviations above the mean. In those epochs, however, only natural forcing processes can be invoked.

As stated above, a general misconception about SampEn is that it is said to be independent of the record length4. In Fig. 1 (top) it can be seen that the climate has been very heterogeneous, displaying a δ18O minimum of −46.5 and a maximum of −32. To show that SampEn is not always independent of the record length, I generate a homogeneous gaussian set of random numbers contained within [−46.5, −32] and study the dependence of SampEn on the number of points considered. Figure 2 (red line) shows the results of that analysis. It can be seen that the SampEn of the homogeneous gaussian set behaves almost independently of the number of points considered. Therefore, it would be possible to compare a sequence of 1000 points with a sequence of 6000 points for that dataset. However, as the real δ18O record contains trends and changes in variance, SampEn is strongly dependent on the number of points (black line). If, instead of analyzing 2,000 years (100 points) of data as in1, we considered 10,000 years (500 points), the complexity of the data would be similar; but as soon as data older than 12,000 years are included (600 points), SampEn drops dramatically because the first regime shift (the deglaciation from the last glacial period to the Holocene) enters the record. The tolerance of SampEn is set as a fraction of the standard deviation, and abrupt variations in the data change the sensitivity of the complexity measures, making the results dependent on the number of points for heterogeneous datasets.
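The record-length experiment behind Fig. 2 can be sketched as follows, again reusing sample_entropy from above. The parameters of the Gaussian surrogate (center, spread, seed) are illustrative assumptions; applying the same loop to the first n points of the real record would produce the black curve.

```python
import numpy as np

rng = np.random.default_rng(2)
# Homogeneous Gaussian surrogate spanning roughly the observed δ18O range
gaussian_series = rng.normal(loc=-39.0, scale=2.5, size=6000)

lengths = range(100, 6001, 500)
sampen_vs_n = [sample_entropy(gaussian_series[:n]) for n in lengths]

for n, value in zip(lengths, sampen_vs_n):
    print(n, value)   # roughly flat: SampEn is nearly independent of n for homogeneous data
```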

Figure 2

Dependence of SampEn on the number of points considered for the δ18O GICC05 record (black) and for a homogeneous gaussian dataset with random numbers distributed in the range [−48, −32] (red).

The effects of a small sample size are evident in the gaussian set (red line) for fewer than 300 points.

(iii) Finally, I address the issue of the non-stationarity of the data. Before doing the analysis, the stationarity of the dataset can be assessed with tests such as the augmented Dickey-Fuller unit-root test or the Kwiatkowski–Phillips–Schmidt–Shin (KPSS) stationarity test. The non-stationarity of the δ18O record implies, for example, that the mean of the whole 122,000 years record (−39.94) is substantially different from the mean of the first subsequence of 2,000 years (−35.32). Following Pincus, who designed the ApEn algorithm in the first place8, “If the time series is nonstationary, that is, contains one or more pronounced trends upward or downward, little can be inferred from moment (mean, variability), ApEn, or power spectral calculations, because the trends tend to dominate all other features. Physiologically, data with trends suggest a collection of heterogeneous epochs as opposed to a single homogeneous state. From the statistical perspective, it is imperative that any trends be removed before meaningful interpretation can be made from the statistical calculations”.
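Both tests are available in standard statistical libraries; a minimal sketch using statsmodels is shown below (the call on the raw record is commented out because the data-loading step is not reproduced here).

```python
from statsmodels.tsa.stattools import adfuller, kpss

def stationarity_report(x):
    """Augmented Dickey-Fuller and KPSS tests for a one-dimensional series x.

    ADF: null hypothesis = unit root (non-stationary); a small p-value suggests stationarity.
    KPSS: null hypothesis = stationarity; a small p-value suggests non-stationarity."""
    adf_stat, adf_p = adfuller(x)[:2]
    kpss_stat, kpss_p = kpss(x, regression="c", nlags="auto")[:2]
    print(f"ADF  statistic {adf_stat:8.3f}, p-value {adf_p:.3f}")
    print(f"KPSS statistic {kpss_stat:8.3f}, p-value {kpss_p:.3f}")

# stationarity_report(d18o)   # the raw record is expected to fail these checks
```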

To properly quantify the complexity of the dataset, I transform the δ18O record by taking logarithms to stabilize the variance and then subtracting the previous value of the logarithm to eliminate the trend (log-ratio transformation, \(\log(x_t/x_{t-1})\))9. The augmented Dickey-Fuller and KPSS tests verify that the transformed dataset is stationary, and therefore the algorithms can be used to quantify its complexity. Figure 3 (top) shows the variability of the δ18O record along with the measures of complexity (center and bottom). The mean and standard deviation of SampEn are 2.11 ± 0.35 and those of LZC are 1.16 ± 0.07. It is clear from these figures that the last 2,000 years do not show any particularity regarding their complexity, because they are comparable to many other sequences of similar length throughout the entire last glacial period.
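A sketch of the transformation is shown below; because every δ18O value in the record is negative, the ratio x_t/x_{t−1} is positive and its logarithm is well defined. The commented calls reuse the hypothetical d18o array, the stationarity_report function, and the rolling-window sketch from above.

```python
import numpy as np

def log_ratio(x):
    """Log-ratio transformation log(x_t / x_{t-1}) of a one-dimensional series."""
    x = np.asarray(x, dtype=float)
    return np.log(x[1:] / x[:-1])   # ratios of same-sign values are positive

# transformed = log_ratio(d18o)
# stationarity_report(transformed)    # both tests should now indicate stationarity
# ...rolling SampEn / LZC windows are then computed on `transformed` as before
```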

Figure 3

Top: Stationary log-ratio δ18O record of ice cores for the last 122,000 years. Center: Sample Entropy rolling windows of 2,000 years length for the stationary sequences (N = 100) with mean and standard deviation of 2.110 ± 0.353. Bottom: Lempel-Ziv Complexity rolling windows of 2,000 years length for the stationary sequences (N = 100) with mean and standard deviation of 1.156 ± 0.073.

Data and Methods

This research uses the δ18O record of the NGRIP ice cores provided by the Centre for Ice and Climate of the University of Copenhagen, the same dataset used in the original research1. In particular, the data are those defined as: “Greenland Ice Core Chronology 2005 (GICC05) time scale for the NGRIP ice core (down to 60 ka b2k) extended to the base of the NGRIP ice core by merging with the ss09sea06bm model time scale. The resulting time scale is called GICC05modelext. Also supplied are 20 year means of δ18O data from NGRIP back to 123 ka b2k/3085 m depth.” The dataset is freely accessible at http://www.iceandclimate.nbi.ku.dk/data and described in10,11,12,13,14.

The methods used in this paper are also those described in the original research.

Conclusions

In this paper, I reanalyzed the complexity of the last 122,000 years using the δ18O record of the NGRIP ice core to check the validity of the claims made in1. I explained some of the major misconceptions that arise when applying complexity measures and specifically showed that SampEn is not independent of the length of the dataset in general situations.

To demonstrate that the previously published analysis was inaccurate, instead of only comparing the last 2,000 years with the previous 122,000 years, I examined all consecutive sequences of 2,000 years and found that the complexity of the recent climate does not stand out from earlier segments of the Holocene or the last glacial period. The values obtained for the recent past are within one standard deviation of the mean.

By making proper use of the algorithms, I showed how different epochs have had varying levels of complexity, and some periods stand out with a SampEn more than three standard deviations above the mean. The ability to specify which epochs had greater complexity is useful for paleoclimate research to investigate the causes behind those changes, providing helpful knowledge to the geophysical community. However, based on the results of this paper, the claim that anthropogenic forcing processes have changed the complexity of the climate of the last 2,000 years, as measured by the δ18O record, to a level different from previous epochs appears unjustified.

Complexity algorithms are an excellent tool for obtaining insights about nonlinear dynamic processes and characterizing the changes in a system. However, as in any other statistical analysis, it is essential to use the algorithms judiciously: checking that heterogeneous epochs are not being mixed, verifying the effect of the length of the sequences, and studying the stationarity of the data. In general, the algorithms should be used with stationary data. However, it is not uncommon to find applications of ApEn and SampEn to raw data; in those instances, the comparisons should be made between similar systems with the same kinds of cycles and trends.