Clustering method for time-series images using quantum-inspired digital annealer technology

Inoue, Tomoki; Kubota, Koyo; Ikami, Tsubasa; Egami, Yasuhiro; Nagai, Hiroki; Kashikawa, Takahiro; Kimura, Koichi; Matsuda, Yu

doi:10.1038/s44172-023-00158-0

Download PDF

Article
Open access
Published: 10 January 2024

Clustering method for time-series images using quantum-inspired digital annealer technology

Communications Engineering volume 3, Article number: 10 (2024) Cite this article

3905 Accesses
38 Altmetric
Metrics details

Subjects

Abstract

Time-series clustering is a powerful data mining technique for time-series data in the absence of prior knowledge of the clusters. Here we propose a time-series clustering method that leverages an annealing machine, which accurately solves combinatorial optimization problems. The proposed method facilitates an even classification of time-series data into closely located clusters while maintaining robustness against outliers. We compared the proposed method with an existing standard method for clustering an online distributed dataset and found that both methods yielded comparable results. Furthermore, the proposed method was applied to a flow measurement image dataset containing noticeable noise with a signal-to-noise ratio of approximately unity. Despite a small signal variation of approximately 2%, the proposed method effectively classified the data without any overlaps among the clusters. In contrast, the clustering results of the existing methods exhibited overlapping clusters. These results indicate the effectiveness of the proposed method.

Equivalence partition based morphological similarity clustering for large-scale time series

Article Open access 11 April 2023

A self-organizing, living library of time-series data

Article Open access 07 July 2020

Objective comparison of methods to decode anomalous diffusion

Article Open access 29 October 2021

Introduction

The collection of large-sized datasets has drastically increased with advancements in data storage and data acquisition technologies. Time-series data containing one or multiple variables (e.g., images) that vary with time is extensively recorded and analyzed in various fields, such as science, engineering, medical science, economics, and finance^1,2,3. Clustering is a powerful data mining technique for classifying these data into related groups in the absence of sufficient prior knowledge of the groups^4,5,6. In particular, when dealing with time-series data, the clustering technique is referred to as time-series clustering^7,8,9. Many studies on time-series clustering have been summarized in review papers^{2,7,8,9,10,11}. In addition, several libraries for time-series clustering have been made available on the web^{12,13,14,15,16} and are widely used. Following the literature^7,8, time-series clustering is defined as “the process of unsupervised partitioning a given time-series dataset into clusters, in such a way that homogenous time-series data are grouped together based on a certain similarity measure, is called time-series clustering.” Three main methods are commonly employed for time-series clustering: raw-data-based/shape-based, feature-based, and model-based approaches^7,8. As an example, the raw-data-based/shape-based approach is illustrated in Fig. 1. These methods differ in their initial calculation procedures. The raw-data-based/shape-based approach directly uses the raw data for clustering, whereas the feature-based approach transforms the raw data into a low-dimensional feature vector. The model-based approach assumes that the time-series data are generated from a stochastic process model, and the parameters of the model are estimated from the data. The raw-data-based and feature-based approaches are more commonly used because the performance of the model-based approach degrades when clusters are close to each other^2,8. The subsequent step involves calculating the similarity or distance between two data points, feature-vectors, or models. Then, the data is grouped into clusters based on the measured similarity or distance using machine learning methods. Clustering algorithms commonly employed for time-series data include partitioning, hierarchical, model-based, and density-based clustering algorithms^7,8. Among partitioning clustering algorithms, k-means clustering is one of the most widely used algorithms^5,6,17,18. Its main advantage lies in its low computational cost. However, the method requires user to pre-determine a number of clusters. In a hierarchical clustering algorithm, the number of clusters does not need to be pre-determined. However, once clusters are split or merged using the divisive or agglomerative methods, they cannot be adjusted. Neural network approaches such as self-organizing maps¹⁹ and hidden Markov model⁵ are employed as model-based clustering approaches. In addition to the above-mentioned disadvantage of the performance degradation for close clusters, these approaches suffer from high computational costs. Density-based methods, such as density-based spatial clustering of applications with noise (DBSCAN)²⁰, do not require users to pre-determine a number of clusters and is robust to outliers. However, in density-based methods, an appropriate choice of parameters is difficult, and it is known that they suffer from the curse of dimensionality. Overall, each method has its advantages and disadvantages. A definitive method that can be used for all datasets does not exist, and an appropriate method should be employed depending on the purpose and dataset to be processed. Recently, continued attempts have been made to improve the performance of each method. As examples, studies published in the last three years are introduced as follows: the extension of dynamic time warping (DTW)^21,22, the measure based on quantile cross-spectral density²³, and the measure of two linear fuzzy information granule time-series²⁴ have been proposed to calculate the similarity or distance. A clustering method that focuses on the time-varying moment was proposed for financial time-series data²⁵. A model-based approach based on the mixture of linear Gaussian state space model was proposed²⁶. A notable research trend is approaches based on deep learning^27,28,29, which is different from the previously mentioned unsupervised methods. As a contrasting approach, a computationally efficient approach based on single-template matching was proposed³⁰. However, to the best of the authors’ knowledge, no method has been reported to concurrently achieve a high clustering performance (e.g., classification of data points that are close to each other, robustness to outliers, etc.) and low computational cost.

**Fig. 1: Typical clustering procedure of raw-data-based/shape-based approach.**

In this study, we propose a time-series clustering method that can achieve a higher clustering performance and lower computational cost. To achieve this goal, we focused on clustering algorithms using an annealing machine. As mentioned above, research has not included the study of clustering algorithms as much as the calculation of similarity or distance, with the exception of the use of deep learning. Annealing/Ising machines, such as quantum annealing and digital Ising machines, solve combinatorial optimization problems faster and more accurately than conventional computers^31,32,33,34. Therefore, we expect that our proposed method can achieve clustering tasks that are challenging to achieve with existing methods. A unique characteristic of the proposed method, which is not found in existing methods, is its ability to evenly classify time-series data into closely related clusters while maintaining robustness against outliers. More specifically, the method can equally classify periodic time-series images into several phase ranges by assuming a sufficient number of images for each phase, given the long duration of the time-series data relative to the period. This paper provides a comprehensive explanation of our proposed method. We used the third-generation Fujitsu Digital Annealer (DA3), which is a quantum-inspired computing technology, for the clustering calculation. DA3 can solve quadratic unconstrained binary optimization (QUBO) problems, and the clustering problem can be formulated as an Ising model that is equivalent to a QUBO problem^35,36. DA3 provides a solution in a large-scale problem space of up to 100 kbits. Subsequently, we applied our proposed method to two time-series datasets: one obtained from “the UEA & UCR time-series classification repository”^37,38,39, and the other consisted of flow measurement image data capturing the Kármán vortex street, periodic wakes, obtained in our previous data^40,41,42. We specifically chose flow measurement data because it is typically high dimensional (~10⁶) and contains a measurement noise. For the clustering process, we employed raw-data-based and feature-based approaches. Furthermore, we compared our results with those obtained from existing standard methods, specifically “tslearn”¹² available online, and the conditional image sampling (CIS) method^43,44 (only for flow measurement data).

Results and discussion

Clustering of online available time-series dataset

We demonstrated the application of the proposed method to classify the “crop” dataset available from the UEA & UCR time-series classification repository^37,38,39. The clustering results obtained using the “TimeSeriesKMeans” function in “tslearn” and the proposed methods are shown in Fig. 2. The “crop” dataset contained 24 clusters. However, we present the results of two representative clusters. In this dataset, the correct classifications were known and displayed in Fig. 2. In addition, ensemble-averaged data for each method were calculated. As shown in Fig. 2a, the proposed method successfully classified the data, whereas the results obtained by the standard existing method (tslearn) exhibited some unfavorable classifications. We calculated the root mean squared error (RMSE) between the ensemble-averaged data of the correct data and those obtained by the proposed method and “tslearn”. The RMSEs of the proposed and existing methods shown in Fig. 2a were 0.115 and 0.121, respectively. This further confirmed that the proposed method surpassed the standard existing method. On the other hand, the RMSEs of the proposed and the existing methods shown in Fig. 2b were 0.117 and 0.096, respectively. In this condition, the result obtained by the proposed method was inferior to that of the existing method. However, since the variance of the correct data is large, as shown in Fig. 2b, the classification is inherently difficult. The demonstrations for other datasets are provided in Supplementary Note 1. Consequently, we can conclude that the results of the proposed method are comparable to those of conventional methods.

**Fig. 2: Typical clustering results for “crop” dataset from the UEA & UCR time-series classification repository using the proposed and existing methods.**

Clustering of flow measurement time-series dataset

We applied our method to the flow measurement dataset of the Kármán vortex street to demonstrate its effectiveness for noisy data. A typical data of a snapshot is shown in Fig. 3a, and the image shows that the data contain noticeable noise with a signal-to-noise ratio (SNR) of approximately 1. The dimensions of the measurement area are shown in Fig. 3b. This periodic time-series dataset should be equally classified into each phase range because a sufficient number of images were acquired for each phase owing to the long duration of the measurement relative to the period. Therefore, this is a typical dataset to demonstrate the effectiveness of this method. In this study, we classified this time-series data into nine clusters using the proposed method, “tslearn,” and the CIS method. The clustering results are shown in Fig. 4, where the data are presented on a two-dimensional scatter plot using multi-dimensional scaling (MDS). In the MDS calculation, the distance between the data x_i and x_j is represented as $|\sin ({\theta }_{i,j}/2)|$, where |a| represents the absolute value of a, and θ_i,j corresponds to the angle between data vectors x_i and x_j. Since the Kármán vortex street dataset used in the analysis is a periodical phenomenon with a maximum distance of unity, the data points were distributed along a circle with a radius of 1/2. As illustrated in Fig. 4a, the proposed method successfully classified the data points without overlaps. The data points were evenly classified into each cluster, and the cluster sizes were similar, which is a favorable result. The data points outside the circle with a radius of 1/2 were considered outliers, which is reasonable because these data points were considered disturbances deviating from periodic phenomena. However, the outliers were classified into one of the clusters in the standard existing method (Fig. 4b). This will be inappropriate when calculating the ensemble average of the data. The CIS method only classified the data points on the circle as shown in Fig. 4c. However, some clusters exhibited overlapping regions and did not form discrete clusters. Density-based methods, such as DBSCAN, are known as powerful clustering methods. However, the data points on the circle were classified into a single cluster in DBSCAN.

**Fig. 3: Typical raw data of PSP measurement and calculation condition.**

**Fig. 4: Clustering results shown in two-dimensional scatter plot based on MDS.**

The ensemble-averaged pressure distributions are shown in Figs. 5–7. The proposed method (Fig. 5) and the CIS method (Fig. 7) effectively extract a periodic vortex generation despite a small pressure variation of approximately 2%. On the other hand, the pressure distribution obtained from the standard method failed to accurately extract the periodic motion. For example, the vortex located at the upper side suddenly disappeared from phase 2 to phase 3, and the vortex at the upper side reversed its flow direction from phase 5 to phase 6 (Fig. 6). This discrepancy can be attributed to the overlapping clusters observed in Fig. 4b. As the pressure decreases when the vortex comes, we compared the minimum pressure at the center of the vortex between the proposed and CIS methods. The ensemble-averaged pressure values were p/p_ref = 0.982 ± 0.001 and p/p_ref = 0.984 ± 0.002 for the proposed and CIS methods, respectively, where the error represents the standard deviation and p_ref denotes the atmospheric pressure. The pressure obtained by the CIS method was slightly higher than that of the proposed method, which aligned with the observations in Figs. 5 and 7. The difference indicates that the vortex was weakened in the CIS method because of the previously mentioned overlapping clusters, where data from different phases were also included in the ensemble averaging process. These findings provide further evidence that the proposed method is a powerful clustering approach for analyzing periodic phenomena.

**Fig. 5: Ensemble-averaged pressure distribution for the proposed method.**

**Fig. 6: Ensemble-averaged pressure distribution for the existing standard method (tslearn).**

**Fig. 7: Ensemble-averaged pressure distribution for the conditional image sampling (CIS) method.**

Conclusions

We propose a novel clustering method using an annealing machine. We added a term that adjusts the number of data classified into each cluster to a QUBO model. In this study, we applied our proposed method to two distinct datasets: one is the “crop” dataset available from the UEA & UCR time-series classification repository and the other is a flow measurement image dataset obtained in our previous study. For the clustering of “crop” dataset, we also employed a standard existing method distributed as “tslearn,” in which the distance between each data was calculated based on the Euclidean distance and the clustering was calculated by the k-means++ algorithm. Comparing the results obtained from our proposed method and the existing method, we observed that the variation of the data points obtained by the proposed method was smaller than that by the existing method. In this dataset, the correct clustering result was provided. Then, we calculated the ensemble-averaged data, and the root mean squared errors (RMSEs) between the correct data and the ensemble-averaged data were compared. Our findings indicate that both methods provide similar results for this dataset.

Next, we applied our clustering method to the flow measurement image dataset, which consisted of the time-series pressure distributions induced by the Kármán vortex street. This dataset exhibited periodicity. Another characteristic of this data is that the dataset contains a noticeable noise with a signal-to-noise ratio of approximately 1. For comparison, the dataset was also classified using the standard existing method and the conditional image sampling (CIS) method, which is specifically designed for flow measurement data. The proposed method successfully classified the data without any overlap between the clusters in spite of the small pressure variation of approximately 2%. On the other hand, both the existing and the CIS methods exhibited overlapping of clusters, failing to form discrete clusters. In particular, the overlap between the clusters calculated by the existing method was large; thus, the vortex suddenly disappeared at times and exhibited reverse flow at other times in the ensemble-averaged pressure distribution. It was also found that the vortex was weakened in the ensemble-averaged pressure distribution obtained by the CIS method. These outcomes highlight the superior performance of the proposed method in the clustering periodic phenomena. The clustering algorithm using an annealing machine is a promising algorithm for large dataset. However, the calculation of similarity or distance is conducted by conventional computers. This is considered to be a major limitation that needs to be resolved when handling large datasets.

Methods

Proposed method for time-series clustering

We propose a clustering method using an annealing machine. We focused on the raw-data-based and feature-based approaches for time-series data analysis. We considered a clustering problem that a given dataset of n time-series data X = {x₁, x₂,⋯, x_n}, where x_i is a column vector, is classified to ${{{{{\mathcal{k}}}}}}$ clusters ${{{{{\mathcal{c}}}}}}=\left\{{c}_{1},{c}_{2},\cdots ,{c}_{{{{{{\mathcal{k}}}}}}}\right\}$. Since DA is designed to solve QUBO problems, an objective function is expressed as a QUBO problem. The Hamiltonian for the clustering problem is described as follows^45,46:

$${{{{{{\mathcal{H}}}}}}}=\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}\mathop{\sum }\limits_{i \neq j}{d}_{i,j}{q}_{{{{{{\mathcal{g}}}}}},i}{q}_{{{{{{\mathcal{g}}}}}},j}-{\lambda }_{1}\mathop{\sum }\limits_{{{{{{\bf{X}}}}}}}{\left(\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{q}_{{{{{{\mathcal{g}}}}}},j}-1\right)}^{2}$$

(1)

where ${q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}i}=1$ when x_i belongs to cluster ${{{{{{\mathcal{c}}}}}}}_{{{{{{\mathcal{g}}}}}}}$ and ${q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}i}=0$ when ${{{{{{\bf{x}}}}}}}_{i}$ does not belong to the cluster ${{{{{{\mathcal{c}}}}}}}_{{{{{{\mathcal{g}}}}}}}$, that is,

$${q}_{{{{{{\mathcal{g}}}}}},i}=\left\{\begin{array}{c}1:{{{{{{\bf{x}}}}}}}_{i}\in {{{{{{\mathcal{c}}}}}}}_{{{{{{\mathcal{g}}}}}}}\\ 0:{{{{{{\bf{x}}}}}}}_{i}\notin {{{{{{\mathcal{c}}}}}}}_{{{{{{\mathcal{g}}}}}}}\hfill\end{array}\right.$$

(2)

The similarity or inverse of the distance between ${{{{{{\bf{x}}}}}}}_{i}$ and ${{{{{{\bf{x}}}}}}}_{j}$ is denoted as ${d}_{i,{j}}$, and ${\lambda }_{1}$ is a hyperparameter. The sum ${\sum }_{i\ne j}{d}_{i,{j}}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}i}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}$ represents the sum of the similarity or the inverse of the distance between two data points belonging to a cluster. The sum ${\sum }_{{{{{{\mathcal{c}}}}}}}$ represents the sum over all clusters in the first term of Eq. (1). Clustering can be calculated by minimizing $-{{{{{{\mathcal{H}}}}}}}$, i.e.,

$$\min -\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}\mathop{\sum }\limits_{i \neq j}{d}_{i,j}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}i}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}+{\lambda }_{1}\mathop{\sum }\limits_{{{{{{\bf{X}}}}}}}{\left(\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}-1\right)}^{2}$$

(3)

The second term in Eq. (3) represents a constrained term ensuring each data point belongs to only one cluster^45,46. The value λ₁ determines the strictness of this constraint, where a smaller value enables some data points to be treated as outliers and not assigned them to any cluster. This study considered the following minimization problem:

$$\min -\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}\mathop{\sum }\limits_{i \neq j}{d}_{i,j}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}i}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}+{\lambda }_{1}\mathop{\sum }\limits_{{{{{{\bf{X}}}}}}}{\left(\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}-1\right)}^{2}+{\lambda }_{2}\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{\left(\mathop{\sum }\limits_{j}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}\right)}^{2}$$

(4)

where the third term in Eq. (4) adjusts the number of data points classified into each cluster. We denote ${S}_{{{{{{\mathcal{g}}}}}}}={\sum }_{j}{q}_{{{{{{\mathcal{g}}}}}}{{{{{\mathscr{,}}}}}}j}$ to simplify the notation, indicating the number of data points belonging to the cluster ${{{{{{\mathcal{c}}}}}}}_{{{{{{\mathcal{g}}}}}}}$. Then, the third term of Eq. (4) is written as

$$\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{\left(\mathop{\sum }\limits_{j}{q}_{{{{{{\mathcal{g}}}}}},j}\right)}^{2}=\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{S}_{{{{{{\mathcal{g}}}}}}}^{2}$$

(5)

The mean number of data points and the variance of data points belonging to each cluster are represented by μ and σ², respectively. Eq. (5) is written as

$$\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}{S}_{{{{{{\mathcal{g}}}}}}}^{2}=\mathop{\sum }\limits_{{{{{{\mathcal{c}}}}}}}\left\{{\left({S}_{{{{{{\mathcal{g}}}}}}}-\mu \right)}^{2}+2{S}_{{{{{{\mathcal{g}}}}}}}\mu -{\mu }^{2}\right\}={{{{{\mathcal{k}}}}}}\left({\sigma }^{2}+{\mu }^{2}\right)$$

(6)

When m data points are classified into one of ${{{\mathcal{k}}}}$ clusters, the mean $\mu =m/{{{{{\mathcal{k}}}}}}$ is a constant. Then, as the variance decreases, i.e., the third term in Eq. (4) becomes smaller, the data points are evenly classified into each cluster. As the number of data points classified into each cluster decreases, the mean μ decreases and the third term also becomes smaller. In other words, adding this term enables us to easily adjust the number of data points in each cluster by only varying λ₂. The effect of λ₂ on the clustering of the flow measurement dataset is discussed in Supplementary Note 2. This adjustment is difficult for many existing clustering algorithms.

Time-series dataset for demonstration

We applied the proposed clustering method to two time-series datasets. One of the datasets, named “crop,” was obtained from the UEA & UCR time-series classification repository^37,38,39. These time-series data were derived from images taken by the FORMOSAT-2 satellite. The dataset consists of 24 classes corresponding to an agricultural land-cover map, and each data point corresponds to its temporal evolution. The time-series length was 46, and the data were one-dimensional. The data were standardized to have a mean of 0 and a variance of 1. We compared the clustering results obtained by the proposed method and those obtained by “tslearn.”¹² In this study, we used the “TimeSeriesKMeans” function in “tslearn.” The parameters in the function were set to general settings as follows: the number of clusters was 24, the metric (distance between each data) was Euclidean, the method for initialization was k-means++, and the other parameters were employed default values. This is a standard time-series clustering method. In the proposed method, the Euclidean distance was also used as the metric, and the inverse of the metric was used to minimize the first term in Eq. (4). The data were multiplied by 10⁴ before being transferred to DA3 because it can only handle integer values. Since all data points should belong to one of the clusters in this dataset, the parameter λ₁ was approximately 100 times larger than λ₂. The actual values used for the calculation are shown in the code attached in Supplementary Note 3. In this condition, a solution that all data points belonged to one of the clusters (the second term of Eq. (4) was 0) was obtained.

The second dataset used in this study was the flow image data obtained in our previous study^40,41,42, which were measured using the pressure-sensitive paint (PSP) method^47,48,49. The PSP method is a pressure distribution measurement technique based on the oxygen quenching of the phosphorescence emitted from the dyes incorporated into the PSP coating. The measured data were the pressure distribution induced by the Kármán vortex street behind a square cylinder as shown in Fig. 3a. The data size was 780 × 780 spatial grids. The flow velocity was 50 m/s, and the Reynolds number was 1.1 × 10⁵. The number of data points was 720. The pressure difference was too small to be detected using the PSP technique because of the small variation in the phosphorescence intensity. Then, the measured pressure contained noticeable noise, and the noise should be reduced from the data. It is well known that the Kármán vortex is a periodic phenomenon. The data were classified into several phase groups and averaged within these groups to reduce the noise and extract useful patterns, which is one of the purposes of time-series clustering. The cosine similarity measure was used to assess the similarity between the data because we focused on the phase information of the vortex. Since the PSP data were a time-series image data with two spatial dimensions and one temporal dimension, the pressure distribution data were reshaped into a column vector. Consequently, the time-series PSP data are written as n time-series data ${{{{{\bf{X}}}}}}=\{{{{{{{\bf{x}}}}}}}_{1},{{{{{{\bf{x}}}}}}}_{2},\cdots ,{{{{{{\bf{x}}}}}}}_{n}\}$, where ${{{{{{\bf{x}}}}}}}_{i}$ is a vector corresponding to a reshaped pressure distribution. Since the measured PSP data contains significant noise of SNR ~ 1, the denoised data was used for the calculation of the similarity. Following the literature⁵⁰, the dataset with small noise can be obtained by considering the truncated singular value decomposition (SVD). We considered a data matrix ${{{{{\bf{Y}}}}}}=[{{{{{{\bf{x}}}}}}}_{1}\,{{{{{{\bf{x}}}}}}}_{2}\cdots {{{{{{\bf{x}}}}}}}_{n}]$, where the data matrix Y was obtained by arranging vectors x_i in time-series order. SVD provides the following representation:

$${{{{{\bf{Y}}}}}}={{{{{\bf{U}}}}}}{{{{{\boldsymbol{\Sigma }}}}}}{{{{{{\bf{V}}}}}}}^{{{{{{\rm{\top }}}}}}}$$

(7)

where the matrices U and V are unitary matrices, and the superscript ⊤ shows the transpose. The matrix Σ is a diagonal matrix of singular values arranged in descending order. It is well known that the data can be approximated by a truncated SVD⁵¹ as follows:

$$\widetilde{{{{{{\bf{Y}}}}}}}=\widetilde{{{{{{\bf{U}}}}}}}\widetilde{{{{{{\boldsymbol{\Sigma }}}}}}}{\widetilde{{{{{{\bf{V}}}}}}}}^{{{{{{\rm{\top }}}}}}}$$

(8)

where $\widetilde{{{{{{\boldsymbol{\Sigma }}}}}}}$ is a first r × r diagonal matrix and r is a truncation rank. The matrices $\widetilde{{{{{{\bf{U}}}}}}}$ and $\widetilde{{{{{{\bf{V}}}}}}}$ are reduced matrices corresponding to $\widetilde{{{{{{\boldsymbol{\Sigma }}}}}}}$. Then, we obtained the noise-reduced time-series data matrix of $\widetilde{{{{{{\bf{Y}}}}}}}=\left[{\widetilde{{{{{{\bf{x}}}}}}}}_{1}\,{\widetilde{{{{{{\bf{x}}}}}}}}_{2}\cdots {\widetilde{{{{{{\bf{x}}}}}}}}_{n}\right]$ or the time-series data of $\widetilde{{{{{{\bf{X}}}}}}}=\left\{{\widetilde{{{{{{\bf{x}}}}}}}}_{1},{\widetilde{{{{{{\bf{x}}}}}}}}_{2},\cdots ,{\widetilde{{{{{{\bf{x}}}}}}}}_{n}\right\}$. We set r = 5, which is a commonly used truncation value. Subsequently, the cosine similarity $\cos {\theta }_{i,j}$ was calculated as follows:

$$\cos {\theta }_{i,j}{{{{{\boldsymbol{=}}}}}}\frac{\left\langle {\widetilde{{{{{{\bf{x}}}}}}}}_{i},{\widetilde{{{{{{\bf{x}}}}}}}}_{j}\right\rangle }{{{{{{{\boldsymbol{||}}}}}}{\widetilde{{{{{{\bf{x}}}}}}}}_{i}{{{{{\boldsymbol{||}}}}}}}_{2}{{{{{{\boldsymbol{||}}}}}}{\widetilde{{{{{{\bf{x}}}}}}}}_{j}{{{{{\boldsymbol{||}}}}}}}_{2}}$$

(9)

where $\left\langle {\widetilde{{{{{{\bf{x}}}}}}}}_{i},{\widetilde{{{{{{\bf{x}}}}}}}}_{j}\right\rangle$ is the inner product of ${\widetilde{{{{{{\bf{x}}}}}}}}_{i}$ and ${\widetilde{{{{{{\bf{x}}}}}}}}_{j}$, and ${{{{{{\boldsymbol{||}}}}}}{\widetilde{{{{{{\bf{x}}}}}}}}_{i}{{{{{\boldsymbol{||}}}}}}}_{{{{{{\boldsymbol{2}}}}}}}$ is the ${{{{{\mathcal{l}}}}}}2$ norm of ${\widetilde{{{{{{\bf{x}}}}}}}}_{i}$. In the similarity calculation, we only considered the pressure distribution behind the square cylinder to reduce the computational cost (see Fig. 3b). Substituting ${d}_{i,j}=\cos {\theta }_{i,j}$ in Eq. (4), we calculated the clustering using DA3. Since the data were also multiplied by 10⁴ before being transferred to DA3, ${d}_{i,{j}} \sim {10}^{3}$. The parameter ${\lambda }_{1}$ was 40 times larger than λ₂ (λ₁ = 1 × 10⁶), ensuring that each term in Eq. (4) was of a similar magnitude. In this condition, some data were classified as outliers. The images within the same cluster were ensemble averaged to extract useful patterns. Here, we note that the original image data X was averaged to extract the patterns, while the truncated dataset of $\widetilde{{{{{{\bf{X}}}}}}}$ was not used.

Considering that $\cos {\theta }_{i,j}$ lies within the range of −1 to 1, we introduced the following relation r_i,j, which range from 0 to 1:

$${r}_{i,j} =\frac{\cos {\theta }_{i,j}+1}{2}\\ ={\cos }^{2}\frac{{\theta }_{i,j}}{2}\\ =1-{\sin }^{2}\frac{{\theta }_{i,j}}{2}$$

(10)

where $1-{r}_{i,j}=0$ when i = j (the same data). Then, we define the distance metric between the data as $|\sin \left({\theta }_{i,j}/2\right)|$, where |a| represents the absolute value of a and ${\theta }_{i,j}$ is the angle between data vectors. The maximum value of the distance is unity in this distance metric. This distance metric was used in the MDS calculation.

The time-series data were also classified by the “TimeSeriesKMeans” function in “tslearn” described above. In addition, we used the CIS method^43,44, which is a specialized methods designed specifically for PSP measurements. In the CIS method, the time-series data were classified into several phase groups based on the pressure data measured by a pressure transducer sensor which is a point sensor with a higher sampling rate than PSP. In other words, the CIS method requires an additional sensor for clustering. This reliance on an extra sensor can be considered one of the limitations of the CIS method.

Data availability

The dataset, named “crop,” was obtained from the UEA & UCR time-series classification repository http://timeseriesclassification.com/description.php?Dataset=Crop. The flow measurement dataset is available in Zenodo with identifier https://doi.org/10.5281/zenodo.10215642.

Code availability

The code for the time-series clustering developed in this study is included in Supplementary Note 3. The code for computing the observation matrix from time-series data is available in Zenodo with identifier https://doi.org/10.5281/zenodo.10215642.

References

Brockwell, P. J. & Davis, R. A. Time Series: Theory and Methods (Springer, New York, 1991).
Mitsa, T. Temporal Data Mining (Chapman & Hall/CRC, Boca Raton, 2010).
Kitagawa, G. Introduction to Time Series Modeling (CRC Press, Boca Raton, 2010).
Aggarwal, C. C. Data Mining: The Textbook (Springer, Cham, 2015).
Bishop, C. M. Pattern Recognition and Machine Learning (Information Science and Statistics) (Springer-Verlag New York, Inc., 2006).
Hastie, T., Tibshirani, R. & Friedman, J. H. The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer, 2009).
Warren Liao, T. Clustering of time series data—a survey. Pattern Recognit. 38, 1857–1874 (2005).
Article Google Scholar
Aghabozorgi, S., Seyed Shirkhorshidi, A. & Ying Wah, T. Time-series clustering – A decade review. Inf. Syst. 53, 16–38 (2015).
Article Google Scholar
Ali, M., Alqahtani, A., Jones, M. W. & Xie, X. Clustering and classification for time series data in visual analytics: a survey. IEEE Access 7, 181314–181338 (2019).
Article Google Scholar
Keogh, E. & Kasetty, S. On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min. Knowl. Discov. 7, 349–371 (2003).
Article MathSciNet Google Scholar
Ezugwu, A. E. et al. A comprehensive survey of clustering algorithms: state-of-the-art machine learning applications, taxonomy, challenges, and future research prospects. Eng. Appl. Artif. Intell. https://doi.org/10.1016/j.engappai.2022.104743 (2022).
Romain Tavenard, J. F. et al. A machine learning toolkit for time series data. J. Mach. Learn. Res. 21, 1–6 (2020).
Google Scholar
Paparrizos, J. & Gravano, L. Fast and accurate time-series clustering. ACM Trans. Database Syst. 42, 1–49 (2017).
Article MathSciNet Google Scholar
Bianchi, F. M., Scardapane, S., Løkse, S. & Jenssen, R. Reservoir computing approaches for representation and classification of multivariate time series. IEEE Trans. Neural Netw. Learn. Syst. 32, 2169–2179 (2021).
Article Google Scholar
Laurinec, P. TSrepr R package: Time Series Representations. J. Open Source Softw. https://doi.org/10.21105/joss.00577 (2018).
sktime: A. Unified Interface for Machine Learning with Time Series v. v0.13.4 (Zenodo, 2022).
Forgy E, W. Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 21, 768–769 (1965).
Google Scholar
Macqueen, J. Some methods for classification and analysis of multivariate observations. Proc. Fifth Berkeley Symp. Math. Statist. Prob. 1, 281–297 (1967).
MathSciNet Google Scholar
Kohonen, T. The self-organizing map. Proc. IEEE 78, 1464–1480, https://doi.org/10.1109/5.58325 (1990).
Article Google Scholar
Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. in Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. 226–231 (AAAI Press).
Li, H. et al. Adaptively constrained dynamic time warping for time series classification and clustering. Inf. Sci. 534, 97–116 (2020).
Article MathSciNet Google Scholar
Li, H. Time works well: dynamic time warping based on time weighting for time series data mining. Inf. Sci. 547, 592–608 (2021).
Article MathSciNet Google Scholar
López-Oriona, Á. & Vilar, J. A. Quantile cross-spectral density: a novel and effective tool for clustering multivariate time series. Expert Syst. Appl. https://doi.org/10.1016/j.eswa.2021.115677 (2021).
Yang, X., Yu, F., Pedrycz, W. & Li, Z. Clustering time series under trend-oriented fuzzy information granulation. Appl. Soft Comput. https://doi.org/10.1016/j.asoc.2023.110284 (2023).
Cerqueti, R., D’Urso, P., De Giovanni, L., Giacalone, M. & Mattera, R. Weighted score-driven fuzzy clustering of time series with a financial application. Expert Syst. Appl. https://doi.org/10.1016/j.eswa.2022.116752 (2022).
Umatani, R., Imai, T., Kawamoto, K. & Kunimasa, S. Time series clustering with an EM algorithm for mixtures of linear Gaussian state space models. Pattern Recognit. https://doi.org/10.1016/j.patcog.2023.109375 (2023).
Lee, C. & Schaar, M. V. D. Temporal phenotyping using deep predictive clustering of disease progression. in Proceedings of the 37th International Conference on Machine Learning (eds Daumé, H. III & Singh, A.) 5767–5777 (PMLR, 2020).
Lafabregue, B., Weber, J., Gançarski, P. & Forestier, G. End-to-end deep representation learning for time series clustering: a comparative study. Data Min. Knowl. Discov. 36, 29–81 (2021).
Article MathSciNet Google Scholar
Eskandarnia, E., Al-Ammal, H. M. & Ksantini, R. An embedded deep-clustering-based load profiling framework. Sustain. Cities Soc. https://doi.org/10.1016/j.scs.2021.103618 (2022).
Okawa, M. Time-series averaging and local stability-weighted dynamic time warping for online signature verification. Pattern Recognit. https://doi.org/10.1016/j.patcog.2020.107699 (2021).
Boixo, S., Albash, T., Spedalieri, F. M., Chancellor, N. & Lidar, D. A. Experimental signature of programmable quantum annealing. Nat. Commun. 4, 2067 (2013).
Article Google Scholar
Boixo, S. et al. Evidence for quantum annealing with more than one hundred qubits. Nat. Phys. 10, 218–224 (2014).
Article Google Scholar
Aramon, M. et al. Physics-inspired optimization for quadratic unconstrained problems using a digital annealer. Front. Phys. https://doi.org/10.3389/fphy.2019.00048 (2019).
Matsubara, S. et al. Digital annealer for high-speed solving of combinatorial optimization problems and its applications. in 2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC). 667–672. https://doi.org/10.1109/ASP-DAC47756.2020.9045100 (2020).
Lucas, A. Ising formulations of many NP problems. Front. Phys. https://doi.org/10.3389/fphy.2014.00005 (2014).
Chapuis, G., Djidjev, H., Hahn, G. & Rizk, G. Finding maximum cliques on the D-wave quantum annealer. J. Signal Process. Syst. 91, 363–377 (2019).
Article Google Scholar
Bagnall, A., Lines, J., Vickers, W. & Keogh, E. The UEA & UCR Time Series Classification Repository www.timeseriesclassification.com (2017).
Bagnall, A., Lines, J., Bostrom, A., Large, J. & Keogh, E. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min. Knowl. Discov. 31, 606–660 (2017).
Article MathSciNet Google Scholar
Tan, C. W., Webb, G. I. & Petitjean, F. Indexing and classifying gigabytes of time series under time warping. in Proceedings of the 2017 SIAM International Conference on Data Mining. 282–290. https://doi.org/10.1137/1.9781611974973.3 (2017).
Egami, Y., Hasegawa, A., Matsuda, Y., Ikami, T. & Nagai, H. Ruthenium-based fast-responding pressure-sensitive paint for measuring small pressure fluctuation in low-speed flow field. Meas. Sci. Technol. https://doi.org/10.1088/1361-6501/abb916 (2021).
Inoue, T. et al. Data-driven approach for noise reduction in pressure-sensitive paint data based on modal expansion and time-series data at optimally placed points. Phys. Fluids https://doi.org/10.1063/5.0049071 (2021).
Inoue, T. et al. Data-driven optimal sensor placement for high-dimensional system using annealing machine. Mech. Syst. Signal Process. https://doi.org/10.1016/j.ymssp.2022.109957 (2023).
Yorita, D., Nagai, H., Asai, K. & Narumi, T. Unsteady PSP technique for measuring naturally-disturbed periodic phenomena. in 48th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition. https://doi.org/10.2514/6.2010-307 (2010).
Asai, K. & Yorita, D. Unsteady PSP measurement in low-speed flow – overview of recent advancement at Tohoku University. in 49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition. https://doi.org/10.2514/6.2011-847 (2011).
Kumar, V., Bass, G., Tomlin, C. & Dulny, J. Quantum annealing for combinatorial clustering. Quantum Inf. Process. https://doi.org/10.1007/s11128-017-1809-2 (2018).
Matsumoto, N., Hamakawa, Y., Tatsumura, K. & Kudo, K. Distance-based clustering using QUBO formulations. Sci. Rep. 12, 2669 (2022).
Article Google Scholar
Liu, T., Sullivan, J. P., Asai, K., Klein, C. & Egami, Y. Pressure and Temperature Sensitive Paints. 2 edn (Springer, Cham, 2021).
Bell, J. H., Schairer, E. T., Hand, L. A. & Mehta, R. D. Surface pressure measurements using luminescent coatings. Annu. Rev. Fluid Mech. 33, 155–206 (2001).
Article Google Scholar
Huang, C.-Y., Matsuda, Y., Gregory, J. W., Nagai, H. & Asai, K. The applications of pressure-sensitive paint in microfluidic systems. Microfluid. Nanofluidics 18, 739–753 (2015).
Article Google Scholar
Pastuhoff, M., Yorita, D., Asai, K. & Alfredsson, P. H. Enhancing the signal-to-noise ratio of pressure sensitive paint data by singular value decomposition. Meas. Sci. Technol. 24, 075301 (2013).
Article Google Scholar
Brunton, S. L. & Kutz, J. N. Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control (Cambridge University Press, 2019).

Download references

Acknowledgements

The authors would like to express their gratitude to Dr Yasuhumi Konishi, Hiroyuki Okuizumi and Yuya Yamazaki for their assistance during the wind tunnel testing conducted at the Institute of Fluid Science, Tohoku University. The authors would also like to acknowledge Takafumi Oyama for the insightful discussions. We also gratefully appreciate Tayca Corporation for providing the titanium dioxide. Finally, we would like to thank Editage (www.editage.com) for English language editing. Part of the work was conducted under the Collaborative Research Project J23I041 of the Institute of Fluid Science, Tohoku University.

Author information

Authors and Affiliations

Department of Modern Mechanical Engineering, Waseda University, 3-4-1 Ookubo, Shinjuku-ku, Tokyo, 169-8555, Japan
Tomoki Inoue, Koyo Kubota & Yu Matsuda
Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai, Miyagi-prefecture, 980-8577, Japan
Tsubasa Ikami & Hiroki Nagai
Department of Mechanical Engineering, Aichi Institute of Technology, 1247 Yachigusa, Yakusa-Cho, Toyota, Aichi-prefecture, 470-0392, Japan
Yasuhiro Egami
Quantum Application Core Project, Quantum Laboratory, Fujitsu Research, Fujistu Ltd, Kawasaki, Kanagawa, 211-8588, Japan
Takahiro Kashikawa & Koichi Kimura

Authors

Tomoki Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Koyo Kubota
View author publications
You can also search for this author in PubMed Google Scholar
Tsubasa Ikami
View author publications
You can also search for this author in PubMed Google Scholar
Yasuhiro Egami
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Nagai
View author publications
You can also search for this author in PubMed Google Scholar
Takahiro Kashikawa
View author publications
You can also search for this author in PubMed Google Scholar
Koichi Kimura
View author publications
You can also search for this author in PubMed Google Scholar
Yu Matsuda
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T. Inoue: conceptualization, data curation, formal analysis, investigation, methodology, software, validation, visualization. K. Kubota: formal analysis, investigation, methodology, software, validation, visualization. T. Ikami: data curation, investigation, resources, visualization, writing—review and editing. Y.E.: data curation, formal analysis, investigation, resources, writing—review and editing. H.N.: investigation, resources, writing—review and editing. T.K.: methodology, software, validation. K. Kimura: methodology, software, validation. Y.M.: conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, project administration, resources, software, supervision, validation, visualization, writing—original draft, writing—review and editing.

Corresponding author

Correspondence to Yu Matsuda.

Ethics declarations

Competing interests

The authors declare the following competing interests: T.K. and K. Kimura are employees of Fujitsu Ltd. All other authors declare no competing interests.

Peer review

Peer review information

Communications Engineering thanks Absalom Ezugwu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Or Perlman and Mengying Su. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Inoue, T., Kubota, K., Ikami, T. et al. Clustering method for time-series images using quantum-inspired digital annealer technology. Commun Eng 3, 10 (2024). https://doi.org/10.1038/s44172-023-00158-0

Download citation

Received: 05 July 2023
Accepted: 27 December 2023
Published: 10 January 2024
DOI: https://doi.org/10.1038/s44172-023-00158-0