Dimension reduction of thermoelectric properties using barycentric polynomial interpolation at Chebyshev nodes

Chung, Jaywan; Ryu, Byungki; Park, SuDong

doi:10.1038/s41598-020-70320-7

Article
Open access
Published: 10 August 2020

Jaywan Chung¹,
Byungki Ryu¹ &
SuDong Park¹

Scientific Reports volume 10, Article number: 13456 (2020)

Abstract

The thermoelectric properties (TEPs), consisting of Seebeck coefficient, electrical resistivity and thermal conductivity, are infinite-dimensional vectors because they depend on temperature. Accordingly, projecting them into a finite-dimensional space is unavoidable for use in computers. In this paper, as a dimension reduction method, we validate the use of high-order polynomial interpolation of TEPs at Chebyshev nodes of the second kind. To avoid the numerical instability of high-order Lagrange polynomial interpolation, we use the barycentric formula. Numerical tests on 276 sets of published TEPs show that at least 8 nodes are recommended to preserve the positivity of electrical resistivity and thermal conductivity. With 11 nodes, the interpolation causes about 2% error in TEPs and only 0.4% error in thermoelectric generator module performance. The robustness of our method against noise in TEPs is also tested; as the relative error caused by the interpolation of TEPs is almost the same as the relative size of the noise, the interpolation does not cause unnecessarily high oscillation at unsampled points. The accuracy and robustness of the interpolation indicate that digitizing infinite-dimensional univariate material data is practicable with tens of data points or fewer. Furthermore, since a large interpolation error comes from a drastic change of data, the interpolation can be used to detect an anomaly such as a phase transition.

Introduction

A relationship between two physical properties is usually represented by a function of a real variable. If an analytic formula of the function is unavailable, the relationship cannot be completely described by a finite number of values since the function is an infinite-dimensional vector. Sharing the relationship for use in computers is also demanding because computers can handle only a finite number of values. The best remaining option is to project the infinite-dimensional data into a finite-dimensional space and then recover the data from the projection. For example, physical properties from observations are given only at a finite number of data points, and at all other points a presumption must be made. The physical properties are often estimated by regression or linear interpolation. But regression does not preserve the observed data since it is a compromise between a regression model and the observed data. Furthermore, achieving high accuracy by regression is difficult because it requires the correct choice of a regression model. If the observed data is unreliable, regression may be preferable because a regression model alleviates anomalies. But here we assume the observed data is reliable and valuable. This is particularly true when the data is obtained from numerical simulations. On the other hand, linear interpolation is a simple method to find a curve preserving the observed data, but the curve is not differentiable at the data points. If one demands a smooth curve preserving the observed data, another approach is necessary. In this paper, we demonstrate an interpolation method to reconstruct a smooth curve from a finite number of data points, exemplified by thermoelectric material properties.

The thermoelectric effect¹, a direct and reversible energy conversion between electricity and heat, is governed by three thermoelectric material properties (TEPs): Seebeck coefficient $\alpha$, electrical resistivity $\rho$, and thermal conductivity $\kappa$. The performance parameters of thermoelectric power generation modules such as power output and efficiency are numerically computable with a given set of TEPs². However, since all the TEPs depend on temperature T, they are infinite-dimensional vectors as functions of a real variable $T{:}\,\alpha =\alpha (T), \rho =\rho (T)$, and $\kappa =\kappa (T)$. The infinite dimensionality of the TEPs hinders the numerical computations since computers can accept only a finite number of values as input. Therefore it is unavoidable to describe the TEPs with a finite number of values, i.e., to project the infinite-dimensional material properties into a finite-dimensional space and reconstruct them.

One way to reduce the dimension is to extract TEP values at a finite number of temperature values. Then the full TEP curves are reconstructed by interpolation which preserves the raw data. Suppose $n+1$ sample values $f_j, j=0,1,\ldots ,n,$ of a TEP are extracted at $n+1$ distinct temperature values $T_j, j=0,1,\ldots ,n$ where $T_j$’s are strictly increasing: $T_0< T_1< \cdots < T_n$. Among the many interpolation methods, here we focus on polynomial interpolation because it is computationally cheap, and the derivatives and integrals of polynomials are directly obtainable. The ease of differentiation can help to calculate significant transport properties such as the effective masses of electrons and holes. A well-known formula for polynomial interpolation is the Lagrange formula:

$$\begin{aligned} p_n(T) = \sum _{j=0}^n f_j \ell _j(T) \end{aligned}$$

(1)

where $\ell _j$ is the Lagrange polynomial

$$\begin{aligned} \ell _j(T) := \frac{\prod _{k=0, k\not =j}^n (T-T_k)}{\prod _{k=0, k\not = j}^n(T_j-T_k)} \end{aligned}$$

(2)

which satisfies $\ell _j(T_j)=1$ and $\ell _j(T_k)=0$ for $j\not = k$. The subscript n of the $p_n(T)$ denotes the degree of the polynomial.
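The Lagrange formula (1) with the basis polynomials (2) can be sketched directly in code. The following is a minimal illustration, not the authors' implementation; the function names and the three sample temperatures are hypothetical, and the test function $f(T)=T^2$ is chosen only because a degree-2 interpolant must reproduce it.

```python
def lagrange_basis(j, nodes, t):
    """Evaluate the Lagrange polynomial l_j(t) of Eq. (2)."""
    value = 1.0
    for k, t_k in enumerate(nodes):
        if k != j:
            value *= (t - t_k) / (nodes[j] - t_k)
    return value


def lagrange_interpolate(nodes, values, t):
    """Evaluate p_n(t) of Eq. (1) from samples values[j] taken at nodes[j]."""
    return sum(f_j * lagrange_basis(j, nodes, t) for j, f_j in enumerate(values))


# Hypothetical samples of f(T) = T**2 at three distinct temperatures (n = 2).
nodes = [300.0, 400.0, 500.0]
values = [t**2 for t in nodes]
p_450 = lagrange_interpolate(nodes, values, 450.0)
```

Each evaluation costs $O(n^2)$ operations in this direct form, which is one practical reason to prefer the barycentric form for repeated evaluation.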

A popular choice of $T_j$ is equidistant nodes:

$$\begin{aligned} T_j := \frac{T_n-T_0}{n}j + T_0, \quad j=0,1,\ldots ,n. \end{aligned}$$

However, polynomial interpolation at equidistant nodes generates spurious oscillations near the boundaries of the interval $[T_0, T_n]$ for large n and can even diverge as $n\rightarrow \infty$, as Runge³ first showed with the function $f(x)=(1+x^2)^{-1}$, $x \in [-5,5]$. Runge’s phenomenon arises naturally for many continuous curves. As an example, consider the Ag-doped $\hbox {Mg}_2\hbox {Si}_{0.6}\hbox {Ge}_{0.4}$ thermoelectric material in⁴. The top of Fig. 1 shows that the polynomial interpolation of the TEPs deviates strongly from the exact curve near the boundaries of the temperature intervals.

To alleviate Runge’s phenomenon, the choice of sample nodes $T_j$ is critical; the sample points should follow an asymptotic density proportional to $(1-x^2)^{-1/2}$ when the interval of x is $[-1,1]$. Hence the density should be higher near the boundaries of the interval than in its interior. One such choice is the Chebyshev nodes of the second kind:

$$\begin{aligned} T_j := \frac{T_0-T_n}{2} \cos \frac{j\pi }{n} + \frac{T_0+T_n}{2}, \quad j=0, 1, \ldots , n. \end{aligned}$$

(3)

A mathematical theory⁵ shows that Runge’s phenomenon is not severe when the Chebyshev nodes are used. The bottom of Fig. 1 shows that the polynomial interpolation at the Chebyshev nodes gives a substantially better result than the top of Fig. 1, overcoming Runge’s phenomenon.
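The contrast between the two node choices can be reproduced on Runge's own example $f(x)=(1+x^2)^{-1}$ on $[-5,5]$. The sketch below is only an illustration under stated assumptions: the helper names, the 17-node setting ($n=16$) and the 1001-point evaluation grid are choices made here, and the maximum error is estimated on that grid rather than computed exactly.

```python
import math


def equidistant_nodes(t0, tn, n):
    """Equidistant nodes T_j = (T_n - T_0) j / n + T_0."""
    return [t0 + (tn - t0) * j / n for j in range(n + 1)]


def chebyshev_nodes(t0, tn, n):
    """Chebyshev nodes of the second kind on [t0, tn], Eq. (3)."""
    return [(t0 - tn) / 2 * math.cos(j * math.pi / n) + (t0 + tn) / 2
            for j in range(n + 1)]


def lagrange_interpolate(nodes, values, t):
    """Direct evaluation of the Lagrange formula, Eqs. (1)-(2)."""
    total = 0.0
    for j, f_j in enumerate(values):
        basis = 1.0
        for k, t_k in enumerate(nodes):
            if k != j:
                basis *= (t - t_k) / (nodes[j] - t_k)
        total += f_j * basis
    return total


def max_error(f, nodes, a, b, samples=1001):
    """Estimate the maximum interpolation error on a uniform grid."""
    values = [f(t) for t in nodes]
    grid = [a + (b - a) * i / (samples - 1) for i in range(samples)]
    return max(abs(f(t) - lagrange_interpolate(nodes, values, t)) for t in grid)


def runge(x):
    return 1.0 / (1.0 + x * x)


n = 16
err_equi = max_error(runge, equidistant_nodes(-5.0, 5.0, n), -5.0, 5.0)
err_cheb = max_error(runge, chebyshev_nodes(-5.0, 5.0, n), -5.0, 5.0)
print(f"equidistant: {err_equi:.3g}, Chebyshev: {err_cheb:.3g}")
```

With these settings the equidistant error is dominated by the oscillations near $x=\pm 5$, while the Chebyshev error stays small across the interval.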

But there is still a computational issue in the Lagrange formula. When n is large, the direct computation of the Lagrange formula (1) is numerically unstable due to the high degree of the Lagrange polynomials (2); the numerator of $\ell _j(T)$ contains the $T^n$ term, so even for a moderate T the numerator becomes too large to evaluate accurately for large n. We use the barycentric formula of polynomial interpolation⁶,⁷ as a numerically stable algorithm.

In this paper, using experimental thermoelectric data, we validate the use of the barycentric polynomial interpolation at the Chebyshev nodes of the second kind as an accurate dimension reduction method for thermoelectric material property curves. The interpolation is tested for 276 sets of TEPs acquired from published papers. Information on the TEP dataset can be found in Supplementary Information (SI). In the following section, the barycentric formula and its derivation are given. In subsequent sections, the accuracy of the interpolation on TEPs and module performance (figure of merit zT, power density, and efficiency) is studied. Then the effect of noise on the accuracy is tested. We conclude that the interpolation is accurate and robust for continuous TEPs, allowing its further application to various curves of scientific data.

Methods

Barycentric formula of polynomial interpolation

Since the barycentric formula has not been emphasized in elementary numerical analysis, here we include its derivation by following⁸. Let us define $\ell (T) := \prod _{k=0}^n (T-T_k)$ and the barycentric weights by

$$\begin{aligned} w_j := \frac{1}{\prod _{k=0,k\not =j}^n (T_j-T_k)}. \end{aligned}$$

(4)

Then obviously $\ell _j(T) = \ell (T) \frac{w_j}{T-T_j}$ from (2). Hence from the Lagrange formula (1) we have

$$\begin{aligned} p_n(T) = \ell (T) \sum _{j=0}^n \frac{w_j}{T-T_j} f_j. \end{aligned}$$

(5)

If the constant function 1 is interpolated, all the $f_j$’s are 1 and the right-hand side of (5) should be 1. Hence the modified Lagrange formula (5) yields

$$\begin{aligned} 1 = \ell (T) \sum _{j=0}^n \frac{w_j}{T-T_j}. \end{aligned}$$

Inserting this relation into (5), we have the barycentric formula:

$$\begin{aligned} p_n(T) = \frac{\displaystyle \sum _{j=0}^n \frac{w_j}{T-T_j} f_j}{\displaystyle \sum _{j=0}^n \frac{w_j}{T-T_j}}. \end{aligned}$$

(6)

Since the numerator and denominator in (6) have the same barycentric weights $w_j$, any scaling of (4) can be used instead. For equidistant nodes, the barycentric weights can be explicitly computed by $w_j = (-1)^j \left( {\begin{array}{c}n\\ j\end{array}}\right)$ with a proper scaling⁸, where $\left( {\begin{array}{c}n\\ j\end{array}}\right)$ is the binomial coefficient. For the Chebyshev nodes (3),

$$\begin{aligned} w_j = {\left\{ \begin{array}{ll} (-1)^j \frac{1}{2} &{} \text {if }j=0\text { or }j=n,\\ (-1)^j &{} \text {otherwise,} \end{array}\right. } \end{aligned}$$

(7)

with a proper scaling⁹. This simplicity of the $w_j$’s makes the Chebyshev nodes (3) particularly attractive among other choices of nodes. The barycentric polynomial interpolation (6) with (7) is explicit, hence computationally cheap.

Due to the singular term $1/(T-T_j)$ in (6), the barycentric formula (6) is not defined at $T=T_j$ and needs to be treated specially there by returning the sample value $f_j$. However, when $T \simeq T_j$, because the numerator and the denominator have the same singular terms $1/(T-T_j)$, the inaccuracies due to the singular terms may cancel out⁸. If we use the Chebyshev nodes (3), the barycentric formula is indeed numerically forward stable¹⁰ and no severe inaccuracy arises at $T \simeq T_j$.
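The barycentric formula (6) with the Chebyshev weights (7) takes only a few lines of code. The following is a minimal sketch rather than the authors' implementation; the function names and the quadratic test curve (a hypothetical resistivity-like function in arbitrary units) are assumptions made for illustration. An evaluation point that coincides with a node is handled by returning the sample value.

```python
import math


def chebyshev_nodes(t0, tn, n):
    """Chebyshev nodes of the second kind on [t0, tn], Eq. (3); needs n >= 1."""
    return [(t0 - tn) / 2 * math.cos(j * math.pi / n) + (t0 + tn) / 2
            for j in range(n + 1)]


def chebyshev_weights(n):
    """Barycentric weights of Eq. (7), defined up to a common scaling."""
    return [(-1.0) ** j * (0.5 if j in (0, n) else 1.0) for j in range(n + 1)]


def barycentric_interpolate(nodes, weights, values, t):
    """Evaluate p_n(t) by the barycentric formula, Eq. (6)."""
    numerator = 0.0
    denominator = 0.0
    for t_j, w_j, f_j in zip(nodes, weights, values):
        if t == t_j:
            return f_j  # Eq. (6) is singular at a node: return the sample
        ratio = w_j / (t - t_j)
        numerator += ratio * f_j
        denominator += ratio
    return numerator / denominator


def curve(t):
    """Hypothetical smooth test curve (arbitrary units), not measured data."""
    return 2.0 + 0.01 * t + 1.0e-5 * t**2


n = 10  # 11 Chebyshev nodes
nodes = chebyshev_nodes(300.0, 900.0, n)
weights = chebyshev_weights(n)
values = [curve(t) for t in nodes]
p_555 = barycentric_interpolate(nodes, weights, values, 555.0)
```

Because the test curve is a polynomial of degree 2, the degree-10 interpolant reproduces it up to rounding error; each evaluation costs $O(n)$ operations once the samples are stored.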

The derivative of the barycentric formula (6) can be written in the same form. Using the barycentric representation of $\ell _j(T)$, we can show⁸ that

$$\begin{aligned} \ell _j'(T_i) = {\left\{ \begin{array}{ll} \displaystyle \frac{w_j/w_i}{T_i-T_j} &{} \text {if }j \not = i,\\ \displaystyle -\sum _{k\not = i} \frac{w_k/w_i}{T_i-T_k} &{} \text {if }j=i. \end{array}\right. } \end{aligned}$$

From the Lagrange formula (1), $p_n'(T_i) = \sum _{j=0}^n f_j \ell _j'(T_i)$ hence

$$\begin{aligned} p_n'(T_i) =\sum _{j\not = i} \frac{(w_j/w_i)(f_j-f_i)}{T_i-T_j}. \end{aligned}$$

(8)

Considering the $p_n'(T_i)$ above as a new sample value at $T_i$, we obtain the barycentric formula of $p_n'(T)$ by simply replacing $f_j$ in (6) by $p_n'(T_j)$ from (8):

$$\begin{aligned} p_n'(T) = \frac{\displaystyle \sum _{j=0}^n \frac{w_j}{T-T_j} p_n'(T_j)}{\displaystyle \sum _{j=0}^n \frac{w_j}{T-T_j}}. \end{aligned}$$
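Equation (8) and the barycentric form of the derivative can be checked numerically. The sketch below is an illustration under stated assumptions: the function names are hypothetical, and the cubic $f(T)=T^3$ on $[0,2]$ is used only because its derivative $3T^2$ is known, so the interpolated derivative can be compared against it.

```python
import math


def chebyshev_nodes(t0, tn, n):
    """Chebyshev nodes of the second kind on [t0, tn], Eq. (3)."""
    return [(t0 - tn) / 2 * math.cos(j * math.pi / n) + (t0 + tn) / 2
            for j in range(n + 1)]


def chebyshev_weights(n):
    """Barycentric weights of Eq. (7), up to a common scaling."""
    return [(-1.0) ** j * (0.5 if j in (0, n) else 1.0) for j in range(n + 1)]


def node_derivatives(nodes, weights, values):
    """Return p_n'(T_i) at every node T_i using Eq. (8)."""
    slopes = []
    for i, (t_i, w_i, f_i) in enumerate(zip(nodes, weights, values)):
        total = 0.0
        for j, (t_j, w_j, f_j) in enumerate(zip(nodes, weights, values)):
            if j != i:
                total += (w_j / w_i) * (f_j - f_i) / (t_i - t_j)
        slopes.append(total)
    return slopes


def barycentric_interpolate(nodes, weights, values, t):
    """Evaluate the barycentric formula, Eq. (6), for the given samples."""
    numerator = denominator = 0.0
    for t_j, w_j, f_j in zip(nodes, weights, values):
        if t == t_j:
            return f_j
        ratio = w_j / (t - t_j)
        numerator += ratio * f_j
        denominator += ratio
    return numerator / denominator


n = 6
nodes = chebyshev_nodes(0.0, 2.0, n)
weights = chebyshev_weights(n)
values = [t**3 for t in nodes]
slopes = node_derivatives(nodes, weights, values)
# The derivative between nodes: feed the nodal slopes back into Eq. (6).
slope_at_1p3 = barycentric_interpolate(nodes, weights, slopes, 1.3)
```

Since $p_n'$ has degree at most $n-1$, interpolating the nodal slopes at the same $n+1$ nodes recovers it exactly in exact arithmetic.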

Results

Accuracy of interpolation

We test the barycentric polynomial interpolation (6) at the Chebyshev nodes (3) by reconstructing 276 sets of TEPs from published papers. The list of the papers is given in the SI. The TEP dataset was previously used to validate a theory of thermoelectric conversion efficiency in¹¹,¹².

To assess the accuracy of interpolating curves, an exact curve should be known, but this is not possible since the determination of the exact curve requires an infinite (uncountable) number of measurements. Hence we assume that the Seebeck coefficient curve $\alpha (T)$ is given by a second-order spline (a spline is a piecewise polynomial; see, e.g.,¹³), and the electrical resistivity $\rho (T)$ and thermal conductivity $\kappa (T)$ curves are given by first-order splines (i.e., piecewise linear curves). With this assumption, exactly one exact curve is obtained for each TEP from the raw data points. We use a second-order spline for $\alpha$ because our evaluation of thermoelectric module performance requires the temperature derivative of the Seebeck coefficient $\alpha '(T)=\frac{d\alpha }{dT}(T)$; this point will become clear in the next section. We use first-order splines for $\rho$ and $\kappa$ to secure the strict positivity of $\rho$ and $\kappa$; higher-order splines can give unphysical zero or negative values of $\rho$ and $\kappa$ due to spurious oscillations. Also note that the choice of a piecewise linear exact curve makes polynomial interpolation even harder than the choice of higher-order splines; it is not an easy task for smooth polynomials to imitate non-differentiable piecewise linear curves.

The top of Fig. 2 shows the superiority of Chebyshev nodes over equidistant nodes. In the figure, the relative error is measured by the $L^1$-norm:

$$\begin{aligned} \frac{\Vert f-{\hat{f}}\Vert _1}{\Vert f\Vert _1}, \quad \Vert f\Vert _1 := \int _{T_0}^{T_n} |f(T)|\,dT \end{aligned}$$

where f is an exact function and ${\hat{f}}$ is an interpolating function. As the number of nodes increases, the error of the interpolation at Chebyshev nodes consistently decreases. With 11 Chebyshev nodes one may expect relative $L^1$-norm errors of 0.5% for $\alpha$ and $\kappa$, and 1% for $\rho$; see the bottom of Fig. 2. Meanwhile, the error of the interpolation at equidistant nodes increases significantly for large numbers of nodes: the error is at its minimum with 7 nodes and exceeds 6% with 16 nodes.

As shown in the top of Fig. 3, the superiority of Chebyshev nodes is more apparent if the relative $L^\infty$-norm

$$\begin{aligned} \frac{\Vert f-{\hat{f}}\Vert _\infty }{\Vert f\Vert _\infty }, \quad \Vert f\Vert _\infty := \max _{T \in [T_0,T_n]} |f(T)| \end{aligned}$$

is considered. The error for equidistant nodes grows severely and exceeds 50% with 16 nodes, while the error for Chebyshev nodes consistently decreases. With 11 Chebyshev nodes, one may expect relative $L^\infty$-norm errors of 2% for $\alpha$ and $\kappa$, and 2.5% for $\rho$, as shown in the bottom of Fig. 3.
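Both relative error measures can be approximated numerically on a fine grid. The sketch below is one possible discretization, not necessarily the one used for the figures: the function name, the trapezoidal rule for the $L^1$-norm, the grid maximum for the $L^\infty$-norm and the grid size are all assumptions made here.

```python
def relative_errors(f, f_hat, t0, tn, samples=2001):
    """Approximate the relative L1 and L-infinity errors of f_hat against f."""
    h = (tn - t0) / (samples - 1)
    grid = [t0 + h * i for i in range(samples)]
    exact = [abs(f(t)) for t in grid]
    diff = [abs(f(t) - f_hat(t)) for t in grid]

    def trapezoid(y):
        # Composite trapezoidal rule on the uniform grid.
        return h * (sum(y) - 0.5 * (y[0] + y[-1]))

    rel_l1 = trapezoid(diff) / trapezoid(exact)
    rel_linf = max(diff) / max(exact)
    return rel_l1, rel_linf


# Hypothetical check: f(T) = T against a copy shifted by 1 on [0, 2].
rel_l1, rel_linf = relative_errors(lambda t: t, lambda t: t + 1.0, 0.0, 2.0)
```

In this toy check the $L^1$ ratio is $2/2$ and the $L^\infty$ ratio is $1/2$, which illustrates that the two norms can rank the same discrepancy differently.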

Accuracy of module performance

Here we examine how much error in performance of thermoelectric modules is caused by the polynomial interpolation of $\alpha (T)$, $\rho (T)$ and $\kappa (T)$. We consider a single-material single-leg thermoelectric power generation module with the length L of $1 \,\mathrm {mm}$ and cross-sectional area A of $1 \,\mathrm {mm^2}$. Then the temperature distribution T(x) inside the module with a spatial coordinate $x \in [0,L]$ is given by the following second-order ordinary differential equation called the thermoelectric equation (for derivation, refer to²):

$$\begin{aligned} \frac{d}{dx}\Big (-\kappa (T)\frac{dT}{dx}\Big ) = \rho (T) J^2 - \frac{d\alpha }{dT}(T) \,T \frac{dT}{dx} J, \end{aligned}$$

(9)

where J is a given electric current density: $J=I/A$ for a given electric current I. We assume the thermoelectric module is under fixed temperatures at the boundaries: $T(0)=T_h$ and $T(L)=T_c$. The hot-side temperature $T_h$ and the cold-side temperature $T_c$ are chosen as the maximum and minimum temperature values in the TEP data where all the $\alpha (T), \rho (T)$ and $\kappa (T)$ are available. As before, the exact curves are assumed to be a second-order spline for $\alpha (T)$, and first-order splines for $\rho (T)$ and $\kappa (T)$. We avoided using a first-order spline for $\alpha (T)$ because the thermoelectric equation (9) contains the derivative of $\alpha (T)$; if $\alpha (T)$ is a first-order spline, then its derivative is discontinuous so the computation of a numerical solution of (9) becomes difficult.

The power P generated by the thermoelectric module is given by

$$\begin{aligned} P = I (V_{\mathrm {OC}} - IR) \end{aligned}$$

where $V_{\mathrm {OC}}$ is the open-circuit voltage and R is the electrical resistance inside the module: $V_{\mathrm {OC}} = \int _{T_c}^{T_h}\alpha (T)\,dT$ and $R = \frac{1}{A}\int _0^L \rho (T(x))\,dx$. The energy conversion efficiency of the module is given by

$$\begin{aligned} \eta = \frac{P}{-\kappa (T_h)A\frac{dT}{dx}(0) + I \alpha (T_h) T_h}. \end{aligned}$$

Since the power and the efficiency depend on the given electric current I, we can maximize the power or efficiency by choosing a suitable I. Such maximum values are referred to as the maximum power and maximum efficiency. Another popular performance parameter is the thermoelectric figure of merit zT. Here z is defined by

$$\begin{aligned} z(T)=\frac{\alpha ^2(T)}{\rho (T)\kappa (T)}. \end{aligned}$$

(10)

The quantity $zT_m := z \frac{T_h+T_c}{2}$ is proportional to the efficiency for temperature-independent TEPs¹⁴. Although the proportional relation is no longer valid for temperature-dependent TEPs¹¹,¹²,¹⁵,¹⁶,¹⁷, zT has been widely used to assess thermoelectric materials due to the conciseness of the formula. Because zT depends on T, the maximum value of zT over $T \in [T_c,T_h]$ is often used for material evaluation.
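The definition (10) and the maximum of zT over $[T_c,T_h]$ can be sketched as follows. The function names, the grid search and the constant property values are hypothetical choices made for illustration; in practice the three callables would be the interpolated TEP curves.

```python
def figure_of_merit(alpha, rho, kappa, t):
    """Return zT = alpha(T)^2 T / (rho(T) kappa(T)), using Eq. (10)."""
    return alpha(t) ** 2 / (rho(t) * kappa(t)) * t


def max_zT(alpha, rho, kappa, t_c, t_h, samples=1001):
    """Grid search for the maximum zT on [t_c, t_h]; returns (zT, T)."""
    grid = [t_c + (t_h - t_c) * i / (samples - 1) for i in range(samples)]
    return max((figure_of_merit(alpha, rho, kappa, t), t) for t in grid)


# Hypothetical temperature-independent TEPs in SI units (not measured data).
def alpha(t):
    return 200.0e-6   # Seebeck coefficient, V/K


def rho(t):
    return 1.0e-5     # electrical resistivity, ohm m


def kappa(t):
    return 1.5        # thermal conductivity, W/(m K)


t_c, t_h = 300.0, 600.0
best_zT, best_T = max_zT(alpha, rho, kappa, t_c, t_h)
zT_m = figure_of_merit(alpha, rho, kappa, (t_h + t_c) / 2)
```

For constant properties zT grows linearly with T, so the maximum sits at $T_h$; the grid search requires positive $\rho$ and $\kappa$, since a sign change in the denominator makes zT meaningless.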

Here we consider three performance parameters of thermoelectric power generation modules: the maximum power density P/A, maximum efficiency $\eta$, and maximum zT. The errors in those parameters, caused by the interpolation, are given in Fig. 4. In the figure, 8 or more nodes are considered because fewer nodes did not guarantee the positivity of $\rho (T)$ for some of the 276 TEP sets. At least 8 nodes are recommended to secure the strict positivity of $\rho (T)$ and $\kappa (T)$. With 11 Chebyshev nodes, one may expect a relative error of 0.4% for the maximum power density, the maximum efficiency, and the corresponding electric currents where the maximum values are attained. The low error is in accordance with the $L^1$-norm error in the interpolation of TEPs. This is not surprising because the solution T(x) of (9) is mainly affected by integrated quantities of the TEPs rather than the TEPs themselves; refer to an integral formulation of the thermoelectric equation in¹¹,¹²,¹⁵. In contrast, the maximum zT and the corresponding T show a higher error of 1%. Since zT depends on the TEPs directly, the error is in accordance with the $L^\infty$-norm error in the interpolation of TEPs.

Robustness on noise

We have assumed so far that the values extracted at Chebyshev nodes are exact. If there is noise in the sampling, how much is the accuracy of the interpolation affected? To assess the robustness against noise, we add random noise to the extracted values in (6) by replacing $f_j$ with

$$\begin{aligned} f_j + U\Big (-f_j \times \frac{p}{100}, +f_j \times \frac{p}{100}\Big ), \end{aligned}$$

where U(a, b) is the uniform random variable whose probability density function is given by $f_U(x) = \frac{1}{b-a}$ if $x \in (a,b)$ and $f_U(x) = 0$ otherwise. Let us call this process $p\%$ noise for convenience.
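The $p\%$ noise process can be sketched with the standard library. The function name and the sample values are hypothetical. One detail is an assumption made here rather than something stated in the text: the half-width uses $|f_j|$, so that the interval stays ordered when a sample is negative, as can happen for the Seebeck coefficient of an n-type material.

```python
import random


def add_noise(values, p, rng):
    """Return the samples perturbed by p% uniform noise, one draw per sample."""
    noisy = []
    for f_j in values:
        # Assumption: abs() keeps the interval (-w, +w) ordered when f_j < 0.
        half_width = abs(f_j) * p / 100.0
        noisy.append(f_j + rng.uniform(-half_width, half_width))
    return noisy


rng = random.Random(0)                        # fixed seed for reproducibility
samples = [-150.0e-6, -180.0e-6, -210.0e-6]   # hypothetical Seebeck values, V/K
noisy = add_noise(samples, 5.0, rng)
```

Each perturbed sample then differs from the original by at most $p\%$ of its magnitude.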

Figure 5 shows that the error caused by the interpolation of TEPs increases linearly with the degree of noise. The percentage of the error is almost the same as the percentage of noise. This implies that the interpolation does not cause unnecessarily high oscillation at unsampled points even when there is sampling noise. A large number of nodes is slightly detrimental when there is a high degree of noise. This is because a high-order polynomial strains needlessly to interpolate the noise-added curves. But such degradation due to the choice of the number of nodes is less than 1%.

Figure 6 shows similar, linearly increasing error trends for module performance. The result is better for maximum power density and maximum efficiency; the percentage of error is below half of the percentage of noise. For example, with 11 Chebyshev nodes, the average relative error in maximum power density and maximum efficiency is below 2% under 5% noise. On the other hand, the result for the maximum zT is worse; the percentage of error is about one and a half times the percentage of noise. Also note that the error trends for the maximum zT are only valid when the interpolation preserves the positivity of $\rho$ and $\kappa$. If the noise breaks this positivity, the error soars because $\rho \times \kappa$ is the denominator of z as in (10). But negative $\rho$ and $\kappa$ are so unphysical and conspicuous that one would fix them by resampling. For that reason, we deliberately excluded such negative $\rho$ and $\kappa$ cases from the numerical simulation.

Physical implication of small and large interpolation error cases

We have verified that tens of Chebyshev nodes or fewer provide an accurate polynomial interpolation on average. But an even smaller number of nodes is enough for many TEP curves. For example, with 7 nodes, 45% of the interpolation results have less than 1.5% error in the relative $L^\infty$-norm, and 80% of the results have less than 4.8% error (see Table S17 in SI). Let us call such well-interpolated curves normal and the other curves abnormal.

As an example of a normal curve, Fig. 7 shows that the nanostructured Bi-Sb-Te bulk alloys¹⁸ have slowly varying measured TEP values in the observed temperature range below 550 K. The electron and phonon Boltzmann transport equations (BTE) with the relaxation time approximation (RTA) within first-principles calculations¹⁹,²⁰,²¹ may predict the TEP curves of this material; refer to SI for the details of the method. The predicted curves in Fig. 7 not only follow the variation with temperature but also have TEP values of a reliable magnitude. Within this semi-classical theory, the electron and phonon quasiparticles are responsible for the charge and heat transport in crystalline solids²². Within the RTA, the interaction between fundamental particles and various imperfections leads to particle scattering and enhanced resistivity²³,²⁴,²⁵. Hence, when there is no phase transition by temperature change, the thermoelectric properties predicted by the BTE with RTA are smooth functions of temperature T because the charge and heat carrier densities as well as the relaxation time are smooth functions of T. The strong agreement between the measured and computed TEPs suggests that the nanostructured Bi-Sb-Te bulk alloys in¹⁸ have no material phase transition and their TEP curves are smooth.

On the other hand, abnormal TEP curves have a large interpolation error with a small number of nodes because they undergo a drastic change with temperature. As a phase transition can be manifested through a drastic change or a discontinuity in TEP curves, temperature points where a large interpolation error occurs can be a phase transition temperature. Using this idea, we search for large interpolation error points to detect phase transition behaviors. Figure 8(a)–(c) show three examples of phase transition successfully detected by this method without using any domain knowledge. The detailed algorithm is given in SI.
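The idea of searching for large interpolation error points can be illustrated with a toy example. The sketch below is not the algorithm given in the SI, whose details are not reproduced here; it is a hypothetical simplification that interpolates a curve with a few Chebyshev nodes and reports the grid temperature where the pointwise error, normalized by the maximum of the curve, is largest. The step-like test curve with a jump at 400 K is invented for the demonstration.

```python
import math


def chebyshev_interpolant(f, a, b, n):
    """Return p_n as a callable: barycentric interpolation, Eqs. (3), (6), (7)."""
    nodes = [(a - b) / 2 * math.cos(j * math.pi / n) + (a + b) / 2
             for j in range(n + 1)]
    weights = [(-1.0) ** j * (0.5 if j in (0, n) else 1.0) for j in range(n + 1)]
    values = [f(t) for t in nodes]

    def p(t):
        numerator = denominator = 0.0
        for t_j, w_j, f_j in zip(nodes, weights, values):
            if t == t_j:
                return f_j
            ratio = w_j / (t - t_j)
            numerator += ratio * f_j
            denominator += ratio
        return numerator / denominator

    return p


def largest_error_point(f, a, b, n, samples=601):
    """Return (T, error) where the normalized pointwise error is largest."""
    p = chebyshev_interpolant(f, a, b, n)
    grid = [a + (b - a) * i / (samples - 1) for i in range(samples)]
    scale = max(abs(f(t)) for t in grid)
    errors = [abs(f(t) - p(t)) / scale for t in grid]
    worst = max(range(samples), key=errors.__getitem__)
    return grid[worst], errors[worst]


def step_like(t):
    """Hypothetical property with an abrupt jump at 400 K (arbitrary units)."""
    return 1.0 if t < 400.0 else 2.0


t_star, err = largest_error_point(step_like, 300.0, 600.0, 6)
```

In this toy case the reported temperature falls at the jump, because a continuous polynomial cannot follow both one-sided values there.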

In contrast to $\hbox {Bi}_2\hbox {Te}_3$-based thermoelectric materials, other tellurides can have phase transitions. One case is the $\hbox {Ag}_2$Te-based materials: at room temperature $\hbox {Ag}_2$Te has a low-symmetry monoclinic phase and it transforms into a high-temperature fcc phase above 417 K²⁶. Consistent with this fact, our algorithm detects an abnormal temperature for $\hbox {Ag}_2\hbox {Se}_{0.5}\hbox {Te}_{0.5}$²⁷ near 380 to 400 K; see Fig. 8(a). The drastic change of the TEP curves occurs in a narrow temperature range. It is argued that, due to the alloying effect between $\hbox {Ag}_2\hbox {Te}$ and $\hbox {Ag}_2\hbox {Se}$, the phase transition occurs gradually in the narrow temperature range from 397 to 424 K²⁸,²⁹. Notably, such phase-transition-induced abnormal transport behaviors are detected for all $\hbox {Ag}_2\hbox {Te}$-containing materials in our dataset.

Figure 8(b) shows the abnormal transport behavior of $\hbox {Cu}_2$Se³⁰ known to exhibit a continuous phase transition. The continuous change of lattice angle in the temperature range from 340 to 410 K was confirmed by HTXRD data³⁰. The key temperature points are well detected by our algorithm.

SnSe is a well-known thermoelectric material having a high zT of 2.6 along the b-direction in the single crystalline phase³¹. Figure 8(c) shows that our data analysis detects several abnormal temperature points for this material. There are two important temperature points, $\sim$ 600 K and $\sim$ 800 K, where the slopes of the Seebeck coefficient and electrical resistivity change rapidly. The detected temperature point in 600–800 K coincides well with the range of thermally activated generation of hole charge carriers in SnSe reported in³². Density functional theory calculations show that the formation of the acceptor Sn-vacancy is responsible for the electrical conductivity increasing with temperature³²: $n_h = N \exp ( - E_{\mathrm {form}}/(kT))$ where $n_h$ is the hole carrier density, N is the site density of Sn, and $E_{\mathrm {form}}$ is the defect formation energy of the Sn-vacancy. The next critical point is at about 800 K. The origin of the derivative discontinuity near $T = 800$ K is considered to be the phase transition from Pnma to Cmcm³¹,³².

Mathematical interpretation of small and large interpolation error cases

We have observed that many TEP curves are accurately interpolated with even a small number of Chebyshev nodes, and that a large interpolation error may indicate a drastic change in the curves due to a phase transition. There is a mathematical basis for why the polynomial interpolation can perform well for many TEPs. For a given continuous function on a closed interval, there is a polynomial arbitrarily close to the given function in the $L^\infty$-norm, by the Stone-Weierstrass theorem (see, e.g., Theorem 7.26 in³³). The issue is to find such a polynomial under a limited number of sampling nodes, and our approach here is the sampling at Chebyshev nodes.

On the other hand, for a discontinuous function, there is no reason for a polynomial interpolation to work. Suppose f(T) is a real-valued, $(n+1)$-times continuously differentiable function on a closed interval $[T_0, T_n]$. Then by Taylor’s theorem (see, e.g., Theorem 5.15 in³³), the interpolating polynomial $p_n(T)$ of f(T) at $n+1$ nodes $\{T_j\}_{j=0}^{n}$ satisfies

$$\begin{aligned} f(T) - p_n(T) = \frac{f^{(n+1)}(\xi (T))}{(n+1)!} \prod _{j=0}^n (T-T_j), \end{aligned}$$

where $\xi (T) \in (T_0, T_n)$ is some value depending on T. If $\{T_j\}_{j=0}^{n}$ is the sequence of Chebyshev nodes of the second kind in (3), the normalized variable $s := \frac{T-T_0}{T_n-T_0}$ gives

$$\begin{aligned} f(T) - p_n(T) = \frac{{\hat{f}}^{(n+1)}({\hat{\xi }}(s))}{(n+1)!} \prod _{j=0}^n \Big (s - \frac{1}{2}\Big (1-\cos \frac{j \pi }{n}\Big )\Big ), \end{aligned}$$

where ${\hat{f}}(s) := f(T)$ and ${\hat{\xi }}(s) := \frac{\xi (T)-T_0}{T_n-T_0}$ is the correspondingly normalized intermediate point. The right-hand side shows that the error depends on the normalized highest-order derivative ${\hat{f}}^{(n+1)}$. This explains why a drastically changing or discontinuous curve ruins the interpolation. Also note that the error does not depend on the size of the interval $T_n-T_0$ as long as the normalized highest-order derivative is the same. Hence the error is affected by the oscillation shape of the curves rather than the size of the interval. If the function f is continuous, there is a high-order polynomial arbitrarily close to f. Since the derivatives of a polynomial vanish beyond its degree, our Chebyshev-node interpolation becomes close to this polynomial as the number of nodes becomes large. Therefore by using more nodes, one can accurately recover the continuous function f. But for a discontinuous function, the discontinuity is not treatable with a high-order polynomial.
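The independence from the interval length follows from two elementary identities, spelled out here as a short worked step. Writing $T = T_0 + (T_n-T_0)s$ and inserting the Chebyshev nodes (3), each factor of the product becomes

$$\begin{aligned} T - T_j = (T_n-T_0)\Big (s - \frac{1}{2}\Big (1-\cos \frac{j \pi }{n}\Big )\Big ), \quad j=0,1,\ldots ,n, \end{aligned}$$

while the chain rule applied to ${\hat{f}}(s) = f(T_0 + (T_n-T_0)s)$ gives

$$\begin{aligned} f^{(n+1)}(T) = \frac{{\hat{f}}^{(n+1)}(s)}{(T_n-T_0)^{n+1}}. \end{aligned}$$

The product of the $n+1$ factors contributes $(T_n-T_0)^{n+1}$, which cancels the same power in the denominator of the derivative, so only the normalized quantities remain in the error formula.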

A simple method to handle a drastically changing or discontinuous curve is to first spot the problematic points and then apply our interpolation method on each interval where the curve is continuous and mildly changing. Some examples of this approach are given in Fig. 8(a)–(c); the interpolations consisting of multiple polynomials (red lines) are noticeably improved over the single-polynomial interpolations (dark blue dashed lines). The problematic points used there are the previously found abnormal temperature points.
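The piecewise approach can be sketched by building one Chebyshev interpolant per sub-interval. The code below is a hypothetical illustration, not the procedure used for Fig. 8: the function names, the kinked test curve $|T-400|$ and the single breakpoint at 400 K are assumptions. For a truly discontinuous curve, the sample taken at a breakpoint would additionally need a one-sided value for each piece.

```python
import math


def chebyshev_interpolant(f, a, b, n):
    """Return p_n as a callable: barycentric interpolation, Eqs. (3), (6), (7)."""
    nodes = [(a - b) / 2 * math.cos(j * math.pi / n) + (a + b) / 2
             for j in range(n + 1)]
    weights = [(-1.0) ** j * (0.5 if j in (0, n) else 1.0) for j in range(n + 1)]
    values = [f(t) for t in nodes]

    def p(t):
        numerator = denominator = 0.0
        for t_j, w_j, f_j in zip(nodes, weights, values):
            if t == t_j:
                return f_j
            ratio = w_j / (t - t_j)
            numerator += ratio * f_j
            denominator += ratio
        return numerator / denominator

    return p


def piecewise_interpolant(f, breakpoints, n):
    """Interpolate separately on each [breakpoints[i], breakpoints[i+1]]."""
    pieces = [(a, b, chebyshev_interpolant(f, a, b, n))
              for a, b in zip(breakpoints, breakpoints[1:])]

    def p(t):
        for a, b, piece in pieces:
            if a <= t <= b:
                return piece(t)
        raise ValueError("t is outside the interpolation range")

    return p


def kinked(t):
    """Hypothetical curve with a slope discontinuity at 400 K."""
    return abs(t - 400.0)


grid = [300.0 + 300.0 * i / 600 for i in range(601)]
single = chebyshev_interpolant(kinked, 300.0, 600.0, 8)
pieces = piecewise_interpolant(kinked, [300.0, 400.0, 600.0], 4)
single_err = max(abs(kinked(t) - single(t)) for t in grid)
piece_err = max(abs(kinked(t) - pieces(t)) for t in grid)
```

Here both variants sample the curve a comparable number of times (9 nodes versus two sets of 5 sharing the breakpoint), yet only the piecewise version can follow the kink.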

Conclusion

To reduce the infinite dimension of TEPs into a finite dimension, we propose the use of high-order polynomial interpolation at Chebyshev nodes of the second kind. To evaluate polynomials in a numerically stable way, the barycentric formula is used. The tests on 276 sets of published TEPs show that our interpolation method is accurate and robust against noise. For example, with 11 Chebyshev nodes, the error caused by the interpolation is about 2% for TEPs and 0.4% for module performance parameters of maximum power density and maximum efficiency. Even if there is noise in sampling, the interpolation does not cause any further error beyond the degree of noise.

While a small number of nodes is enough for many TEP curves, drastically changing or discontinuous curves can have a high interpolation error. A simple remedy to enhance the accuracy is to confine the temperature range to where the set of TEPs is continuous and mildly changing. Meanwhile, as the drastic change may indicate a phase transition, the interpolation method can be used to detect an unidentified phase transition.

Since our polynomial interpolation method is robust against noise and presumes no physical model, it should perform well for various curves other than TEPs. The method projects infinite-dimensional vectors acquired from scientific experiments into a finite-dimensional space. Then a smooth, raw-data-preserving curve is efficiently constructed using the barycentric formula. Our empirical validation shows that the dimension reduction method allows digitizing infinite-dimensional univariate material data with tens of data points or fewer. Hence the method enables using the infinite-dimensional data for material informatics and machine learning.

Computational method

All the numerical computations in this paper are performed using the Python programming language (version 3.6). The thermoelectric equation (9) with the Dirichlet boundary condition is solved by the fourth-order collocation method implemented in the solve_bvp function³⁴ of the SciPy library (version 1.2.1). The maximum power density and maximum efficiency are found by the Brent-Dekker optimization method implemented in the minimize_scalar function³⁵ of the SciPy library.
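As a rough sketch of how the thermoelectric equation (9) can be passed to solve_bvp, the example below rewrites it as a first-order system in $T$ and the heat flux $q=-\kappa \,dT/dx$. It is not the authors' code (that is in the repository cited below): the constant property values, the current density and the initial guess are hypothetical, and temperature-independent TEPs are assumed so that the result can be compared with the closed-form solution of $\kappa T'' = -\rho J^2$.

```python
import numpy as np
from scipy.integrate import solve_bvp

# Hypothetical, temperature-independent TEPs in SI units (assumptions).
ALPHA_PRIME = 0.0        # d(alpha)/dT, V/K^2
RHO = 1.0e-5             # electrical resistivity, ohm m
KAPPA = 1.5              # thermal conductivity, W/(m K)
LENGTH = 1.0e-3          # leg length L, m
T_H, T_C = 600.0, 300.0  # boundary temperatures, K
J = 1.0e6                # current density, A/m^2 (1 A through 1 mm^2)


def rhs(x, y):
    """First-order form of Eq. (9): y[0] = T, y[1] = q = -kappa dT/dx."""
    temp, q = y
    dT_dx = -q / KAPPA
    dq_dx = RHO * J**2 - ALPHA_PRIME * temp * dT_dx * J
    return np.vstack([dT_dx, dq_dx])


def boundary(ya, yb):
    """Dirichlet conditions T(0) = T_h and T(L) = T_c."""
    return np.array([ya[0] - T_H, yb[0] - T_C])


x = np.linspace(0.0, LENGTH, 11)
guess = np.vstack([np.linspace(T_H, T_C, x.size),
                   np.full(x.size, KAPPA * (T_H - T_C) / LENGTH)])
solution = solve_bvp(rhs, boundary, x, guess)
T_mid = solution.sol(LENGTH / 2)[0]
```

With interpolated TEPs, the constants in `rhs` would be replaced by calls to the interpolants of $\rho$, $\kappa$ and $d\alpha /dT$ evaluated at `temp`.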

Data availability

The data generated during the current study is available in the GitHub repository https://github.com/jaywan-chung/tep-chebyshev. More information on the TEP dataset is given in SI. The numeric values of the confidence intervals and standard deviations in Figs. 2–6 are given in SI.

Code availability

The code used to generate data and figures in this paper is available in the GitHub repository https://github.com/jaywan-chung/tep-chebyshev.

References

1. Goldsmid, H. J. Introduction to Thermoelectricity 2nd edn. (Springer, Berlin, 2016).
2. Goupil, C. (ed.) Continuum Theory and Modeling of Thermoelectric Elements (Wiley, New York, 2016).
3. Runge, V. C. Über empirische funktionen und die interpolation zwischen äquidistanten ordinaten. Zeitschrift f. Mathematik u. Physik 46, 224–243 (1901).
4. Noda, Y., Kon, H., Furukawa, Y., Nishida, I. A. & Masumoto, K. Temperature dependence of thermoelectric properties of ${\rm Mg}_2{\rm Si}_{0.6}{\rm Ge}_{0.4}$. Mater. Trans. JIM 33, 851–855 (1992).
5. Brutman, L. On the Lebesgue function for polynomial interpolation. SIAM J. Numer. Anal. 15, 694–704 (1978).
6. Taylor, W. J. Method of Lagrangian curvilinear interpolation. J. Res. Natl. Bur. Stand. 35, 151–155 (1945).
7. Dupuy, M. Le calcul numérique des fonctions par l’interpolation barycentrique. C. R. Acad. Sci. 226, 158–159 (1948).
8. Berrut, J.-P. & Trefethen, L. N. Barycentric Lagrange interpolation. SIAM Rev. 46, 501–517 (2004).
9. Salzer, H. E. Lagrangian interpolation at the Chebyshev points $x_{n,\nu }\equiv \cos (\nu \pi /n)$, $\nu =0(1)n$; some unnoted advantages. Comput. J. 15, 156–159 (1972).
10. Higham, N. J. The numerical stability of barycentric Lagrange interpolation. IMA J. Numer. Anal. 24, 547–556 (2004).
11. Ryu, B., Chung, J. & Park, S. Thermoelectric efficiency has three degrees of freedom. Preprint at https://arxiv.org/abs/1810.11148 (2018).
12. Ryu, B., Chung, J. & Park, S. General efficiency theory of thermoelectric conversion. Preprint at https://arxiv.org/abs/1910.11132 (2019).
13. de Boor, C. A Practical Guide to Splines revised edn (Springer, Berlin, 2001).
14. Ioffe, A. F. Semiconductor Thermoelements and Thermoelectric Cooling (Infosearch, London, 1957).
15. Kim, H. S., Liu, W., Chen, G., Chu, C.-W. & Ren, Z. Relationship between thermoelectric figure of merit and energy conversion efficiency. Proc. Natl. Acad. Sci. USA 112, 8205–8210 (2015).
16. Borrego Larralde, J. M. Optimum Impurity Concentration in Semiconductor Thermoelements. Ph.D. thesis, Massachusetts Institute of Technology (1961).
17. Ryu, B. et al. Counterintuitive example on relation between ZT and thermoelectric efficiency. Appl. Phys. Lett. 116, 193903 (2020).
18. Poudel, B. et al. High-thermoelectric performance of nanostructured bismuth antimony telluride bulk alloys. Science 320, 634–638 (2008).
19. Madsen, G. K. H. & Singh, D. J. BoltzTraP. A code for calculating band-structure dependent quantities. Comput. Phys. Commun. 175, 67–71 (2006).
20. Togo, A. phono3py software homepage. https://phonopy.github.io/phono3py/ (2015).
21. Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996).
22. Ashcroft, N. W. & Mermin, N. D. Solid State Physics (Cengage Learning, Brooks/Cole, 1976).
23. Ryu, B. & Oh, M.-W. Computational simulations of thermoelectric transport properties. J. Korean Ceram. Soc. 53, 273–281 (2016).
24. Qiu, B. et al. First-principles simulation of electron mean-free-path spectra and thermoelectric properties in silicon. EPL Europhys. Lett. 109, 57006 (2015).
25. Togo, A., Chaput, L. & Tanaka, I. Distributions of phonon lifetimes in Brillouin zones. Phys. Rev. B 91, 094306 (2015).
26. Kashida, S., Watanabe, N., Hasegawa, T., Iida, H. & Mori, M. Electronic structure of ${\rm Ag}_2{\rm Te}$, band calculation and photoelectron spectroscopy. Solid State Ion. 148, 193–201 (2002).
27. Drymiotis, F., Day, T. W., Brown, D. R., Heinz, N. A. & Snyder, G. J. Enhanced thermoelectric performance in the very low thermal conductivity ${\rm Ag}_2{\rm Se}_{0.5}{\rm Te}_{0.5}$. Appl. Phys. Lett. 103, 143906 (2013).
28. Massalski, T., Okamoto, H., Subramanian, P. & Kacprzak, L. (eds.) Binary Alloy Phase Diagrams 2nd (3 volumes) edition (ASM International, Philadelphia, 1990).
29. Aramov, N., Odin, I. & Boncheva-Mladenova, Z. Study of the silver selenide–silver telluride system. Thermochim. Acta 20, 107–113 (1977).
30. Liu, H. et al. Ultrahigh thermoelectric performance by electron and phonon critical scattering in Cu$_2$Se$_{1-x}$I$_x$. Adv. Mater. 25, 6607–6612 (2013).
31. Zhao, L.-D. et al. Ultralow thermal conductivity and high thermoelectric figure of merit in SnSe crystals. Nature 508, 373–377 (2014).
32. Dewandre, A. et al. Two-step phase transition in SnSe and the origins of its high power factor from first principles. Phys. Rev. Lett. 117, 276601 (2016).
33. Rudin, W. Principles of Mathematical Analysis 3rd edn. (McGraw-Hill, New York, 1976).
34. The SciPy community. solve_bvp Function Description. https://docs.scipy.org/doc/scipy/reference/generated/scipy.integrate.solve_bvp.html (2019).
35. The SciPy community. minimize_scalar Function Description. https://docs.scipy.org/doc/scipy/reference/optimize.minimize_scalar-brent.html (2019).

Acknowledgements

This work was supported by the Korea Institute of Energy Technology Evaluation and Planning (KETEP) funded by the Ministry of Trade, Industry & Energy (MOTIE), Republic of Korea: Grant No. 20172010000830. It was also supported by the Korea Electrotechnology Research Institute (KERI) Primary Research Program through the National Research Council of Science and Technology (NST) funded by the Ministry of Science and ICT (MSIT) of the Republic of Korea: Grant No. 20A01025.

Author information

Authors and Affiliations

Energy Conversion Research Center, Korea Electrotechnology Research Institute (KERI), Changwon-si, Gyeongsangnam-do, 51543, Republic of Korea
Jaywan Chung, Byungki Ryu & SuDong Park

Authors

Jaywan Chung
Byungki Ryu
SuDong Park

Contributions

J.C. and B.R. planned the research. J.C. performed the numerical computation. All authors discussed the results. All authors contributed to the writing and revision of this manuscript.

Corresponding author

Correspondence to Jaywan Chung.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Chung, J., Ryu, B. & Park, S. Dimension reduction of thermoelectric properties using barycentric polynomial interpolation at Chebyshev nodes. Sci Rep 10, 13456 (2020). https://doi.org/10.1038/s41598-020-70320-7

Received: 15 May 2020
Accepted: 27 July 2020
Published: 10 August 2020
DOI: https://doi.org/10.1038/s41598-020-70320-7
