Forecasting of coronavirus active cases by utilizing logistic growth model and fuzzy time series techniques

Mahanty, Chandrakanta; Patro, S Gopal Krishna; Rathor, Sandeep; Rachapudi, Venubabu; Muzammil, Khursheed; Islam, Saiful; Razak, Abdul; Khan, Wahaj Ahmad

doi:10.1038/s41598-024-67161-z

Download PDF

Article
Open access
Published: 04 August 2024

Forecasting of coronavirus active cases by utilizing logistic growth model and fuzzy time series techniques

Chandrakanta Mahanty¹,
S Gopal Krishna Patro²,
Sandeep Rathor³,
Venubabu Rachapudi⁴,
Khursheed Muzammil⁵,
Saiful Islam⁶,
Abdul Razak⁷ &
…
Wahaj Ahmad Khan⁸

Scientific Reports volume 14, Article number: 18039 (2024) Cite this article

1076 Accesses
15 Altmetric
Metrics details

Subjects

Abstract

Coronavirus has long been considered a global epidemic. It caused the deaths of nearly 7.01 million individuals and caused an economic downturn. The number of verified coronavirus cases is increasing daily, putting the whole human race at danger and putting strain on medical experts to eradicate the disease as rapidly as possible. As a consequence, it is vital to predict the upcoming coronavirus positive patients in order to plan actions in the future. Furthermore, it has been discovered all across the globe that asymptomatic coronavirus patients play a significant part in the disease’s transmission. This prompted us to incorporate similar examples in order to accurately forecast trends. A typical strategy for analysing the rate of pandemic infection is to use time-series forecasting technique. This would assist us in developing better decision support systems. To anticipate COVID-19 active cases for a few countries, we recommended a hybrid model utilizing a fuzzy time series (FTS) model mixed with a non-linear growth model. The coronavirus positive case outbreak has been evaluated for Italy, Brazil, India, Germany, Pakistan, and Myanmar through June 5, 2020 in phase-1, and January 15, 2022 in phase-2, and forecasts active cases for the next 26 and 14 days respectively. The proposed framework fitting effect outperforms individual logistic growth and the fuzzy time series techniques, with R-scores of 0.9992 in phase-1 and 0.9784 in phase-2. The proposed model provided in this article may be utilised to comprehend a country’s epidemic pattern and assist the government in developing better effective interventions.

A new time-varying coefficient regression approach for analyzing infectious disease data

Article Open access 06 September 2023

TW-SIR: time-window based SIR for COVID-19 forecasts

Article Open access 31 December 2020

EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics

Article Open access 04 August 2022

Introduction

Numerous epidemics and infectious viral outbreaks, like as Zika, Ebola, MERS, and pneumonia, have already erupted, wreaking havoc on the human lives and economies of the world’s main countries and regions¹. The globe is now dealing with a contagious disease triggered by SARS-CoV-2, which was identified in the China and has since spread throughout all continents². As of August 1, 2022, 228 countries and territories worldwide had reported a total of 600.74 million active coronavirus cases and 6.47 million deaths³. The infectious disease is rapidly spreading and has claimed the lives of billions of individuals worldwide. Discovering the transmission pattern and making forecasts using scientific approaches may help policymakers frame measures and limit the epidemic.

Forecasting time series is among the most important statistical analytic methodologies that play a significant part in making decisions under uncertainty. Time series is described as a set of observations and historical records that are related to each other for a phenomenon and for specific periods of time and are typically equal in length⁴. Fluctuation is a time series feature that may be used to evaluate the pattern and make inferences. FTS gives semantic significance for fluctuating data. FTS is more informative than regular time series. Many authors have used fuzzy time series techniques and non-linear growth models separately for COVID-19 active cases prediction. Their COVID-19 active cases prediction is not very much close to the actual COVID-19 active cases. Using FTS and logistic growth models, the current paper developed a hybrid FTS technique for forecasting upcoming coronavirus positive cases in many countries. For better COVID-19 active cases prediction we have combined fuzzy time series techniques with the logistic non-linear growth model. The current model’s forecast aims to give a foundation for decision makers to create a protective strategy during the coronavirus pandemic. Coronavirus prediction models may also aid with medical resource planning, such as the number of ventilators, beds, and emergency healthcare facilities allocated to coronavirus patients.

The objectives and contributions of the present study are outlined below:

1. Development of a logistic growth technique to estimate infection rates.
2. Design of hybrid technique by using fuzzy time series technique and non-liner growth model.
3. Forecasting of coronavirus infected people for trained and untrained data in several countries.

Literature review

Time-series forecasting is vital throughout epidemics because it gives critical data that might lead to disease suppression. Various forecasting methods are being utilized to anticipate future coronavirus infections.

The official statistics for COVID-19 is very ambiguous⁵ and fuzzy logic is a term that signifies unpredictability and can appropriately represent it⁶. Fuzzy logic techniques that can effectively handle the ambiguity in the coronavirus data must be used in this situation. Numerous researchers have utilized fuzzy logic in their prediction models in order to do this. In order to anticipate the coronavirus time series in Mexico, Patra et al.⁷ suggested several fuzzy ensemble neural network models. To estimate and predict the confirmed cases of coronavirus in China, Al-Qaness et al.⁸ suggested using an adaptive neurofuzzy inference system (ANFIS) optimized using flower pollination and salp swarm algorithm. Fuzzy logic inference was suggested by Dhiman and Sharma⁹ for the treatment and detection of coronavirus. For the investigation of the coronavirus epidemic, Fong et al.¹⁰‘s proposal of hybridized deep learning and fuzzy rule generation was made. Fatima et al.¹¹ used IOT techniques to allow smart monitoring of coronavirus disease with a related fuzzy inference technique. Van Tinh et al.¹² used in conjunction with particle swarm optimization (PSO) for coronavirus prediction. Verma et al.¹³ combined ARIMA and FTS techniques. Castillo et al.¹⁴‘s hybrid intelligent technique, which combines fuzzy logic and fractal mathematics for predicting coronavirus active cases. Zhao et al.¹⁵ utilized SEIR technique to estimate the number of coronavirus patients in US. Chowdhury et al.¹⁶ have suggested the use of the FPASSA-ANFIS technique to predict the number of positive coronavirus cases in China. A coronavirus optimum control technique was given by Khatua et al.¹⁷ employing fuzzy dynamical framework. For estimating the trend of coronavirus in India, Bedi et al.¹⁸ suggested a modified SEIRD model. In order to mimic the coronavirus epidemic in Malaysia, Hassan et al.¹⁹ effectively implemented a fuzzy graph technique utilizing the c-FACS technique. Asl et al.²⁰ suggested a COVID-19-specific interval type-2 fuzzy expert system for predicting ICU admittance. Iwendi et al.²¹ suggested the usefulness of an ANFIS in COVID-19 infection categorization based on symptoms. Yogachi et al.²² predicted the percentage rise in coronavirus confirmed patients in West Java using the fuzzy logic mamdani approach. Kumar et al.²³ used PSO approach to optimize all three hyperparameters for the coronavirus pandemic FTS forecasts. Kumar and Kaur et al.²⁴ suggested a hybrid approach for future prediction of COVID-19 cases in India based on self-organized maps and FTS. Luo et al.²⁵ utilized XGBoost and long short-term memory algorithms to forecast coronavirus positive patients in America. Recently various authors^{26,27,28,29,30,31,32} used fuzzy time series techniques and non-linear growth models for prediction of COVID-19 active, death and recovery cases.

Methodology

Fuzzy time series (FTS) model representation

FTS is a popular statistical model in analytics that examines both prior and recent data to create predictions. Utilization of fuzzy logic for time series prediction has been implemented in the literature and provides a considerable improvement over standard statistical approaches since it is non-linear and can approximate any complicated nonlinear system more accurately than linear statistical approaches. According to the fuzzy logic given by Zadeh et al.³³, an element may concurrently belong to a set and not belong to it at other levels if its membership value falls within the range [0, 1]. A linguistic variable to portray the numerical time series was proposed by Chen et al.³⁴, and these regions will serve as the linguistic words of our variable. Chen et al.³⁴ proposed the idea of trying to divide the universe of discourse into several intervals (fuzzy sets) and seeing how each region behaves (deriving rules using time series sequences).

Definition 1:

Fuzzy Set: Universe of discourse can be depicted as $U=\{{u}_{1}, {u}_{12},\dots .{u}_{n}\}$. ${A}_{i}={f}_{{A}_{i}}$(${u}_{1})/ {u}_{1}$ $+ {f}_{{A}_{i}}$(${u}_{2})/ {u}_{2}$ ${+\dots +f}_{{A}_{i}}$(${u}_{n})/ {u}_{n}$, where ${A}_{i}$ represented as fuzzy set of $U$ and ${f}_{{A}_{i}}$ denoted as the membership function of ${A}_{i}$. The range of ${f}_{{A}_{i}}$ is in between 0 to 1. Let ${u}_{j}$ should be treated as one of the elements of fuzzy set ${A}_{i}$ and ${f}_{{A}_{i}}$(${u}_{j})$ is the degree to which ${u}_{j}$ belongs. f_Ai (u_j) ∊ [0,1] where 1 ≤ j ≤ n. The following is a basic overview of the definition of FTS:

Definition 2:

FTS: Let $X\left(t\right)\left(t ranges from \text{0,1},2..\right),$ be a subset of R. Let universe of discourse can be represented as $X\left(t\right)$ which is described by fuzzy set ${f}_{i}\left(t\right).$ $F\left(t\right)$ is specified as FTS on $X\left(t\right)\left(t ranges from \text{0,1},2..\right)$ iff $F\left(t\right)$ is an assemblage of ${f}_{i}\left(t\right) \left(i ranges from\text{1,2}..\right)$.

Definition 3:

FLR: $F\left(t\right)$ is assumed to be generated by $F\left(t-1\right)$, if there is a fuzzy relation $R\left(t-1,t\right),$ so that $F\left(t\right)=F\left(t-1\right)*R\left(t-1,t\right),$ in which x act as an operator. The relationship among $F\left(t\right)$ and $F\left(t-1\right)$ is treated as fuzzy logical relationship (FLR) denoted by ${A}_{i}{\to A}_{j}$, when $F\left(t\right)= {A}_{j}$ and $F\left(t-1\right)= {A}_{i}$.

Definition 4:

FLRG: Utilizing the same fuzzy set on the left, FLR may be generated into fuzzy logical relationship groups (FLRG). Consider the case where ${A}_{i}{\to A}_{j1}$ , ${A}_{i}{\to A}_{j2}$ , ${A}_{i}{\to A}_{jm}$ are fuzzy logical relationships. They may have formed a FLRG ${A}_{i}{\to A}_{j1}$,${A}_{j2},\dots {A}_{jm}$. We used Chen's model ³⁴ to estimate the forecast values of COVID-19 data sets, however Hurang's algorithm ³⁵ average based length is used to establish interval's length. FTS model's algorithm is described below.

In first step, we have to define or describe the universe. The range of discourse is denoted by the notation ${U [L}_{c}-{D}_{1}$, ${U}_{c}+{D}_{2}]$, where ${U}_{c}$ and ${L}_{c}$ are the upper and lower bounds of the data and ${D}_{2}$ and ${D}_{1}$ are the constants that are obtained by taking 10% of the ${L}_{c}$ and ${U}_{c}$. Therefore, as a security margin, we have expanded the upper and lower boundaries by 10%. In second step, based on Hurang et al.³⁵ we split the universe of discourse into ${\prime}l{\prime}$ length intervals by utilizing average based length algorithm. In third step, we achieve ${u}_{1}, {u}_{12},\dots .{u}_{n}$, define fuzzy sets ${A}_{i}$; ${A}_{1}, {A}_{2},{\dots .A}_{n}$ and fuzzify the time series data. In fourth step, we identify FLRs and FLRGs using definitions 3 and 4. In fifth step, we utilized Chen’s model’s approach for defuzzification and estimate the predicted values. In sixth step, we utilized Root Mean Square Error to evaluate the performance analysis.

Logistic growth model

The COVID-19 epidemic may be followed using a logistic growth model³⁶. It has been commonly used to simulate population growth where there are physical and resource constraints. Epidemiology frequently uses logistic models. It is normal practice to examine the risk factors linked with a particular disease and to estimate the likelihood that the disease would manifest given the risk factors. Through logistic regression analysis, we can roughly forecast how epidemiology will evolve and spread.

$${Y}_{t}=a/1+\text{exp}(b-c)*(t-{t}_{0}))$$

a is the anticipated maximum number of confirmed cases, and ${Y}_{t}$ is is the total number of cumulative active cases. The fitting coefficients are b and c. The number of days after the first instance is represented as t. The first instance of the case happened at time ${t}_{0}$.

Calculate R² score (regression coefficient)

The following formula may be used to get the regression coefficient ${R}^{2}$, which is used to assess how well different approaches fit data.

$$R^{2} = 1 - \sum\limits_{i = 1}^{n} {\left[ {y_{i} - \hat{y})} \right]}^{2} /\sum\limits_{i = 1}^{n} {\left[ {y_{i} - \overline{y})} \right]}^{2}$$

ŷ represented as the predicted cumulative positive cases; y_i and y denoted as the actual cumulative positive cases and average of the actual cumulative positive cases. The fitting coefficient is closer to 1.

Hybrid model

We combined the FTS and logistic growth models in the hybrid model. It is assumed that a time series consists of two components: a linear autocorrelation framework element and a non-linear element. The hybrid approach summarized as given below.

${T}_{t}{={L}_{t}+N}_{t}$, Where ${T}_{t}$ denoted as actual time series, ${N}_{t}$ and ${L}_{t}$ denoted as non-linear and element. Here we have to evaluate the residual values (${E}_{t })$ of logistic growth model by ${E}_{t}={T}_{t}-\widehat{{ L}_{t}}$ formula. Using the FTS model, we have estimated non-linear residuals and getting the non-linear element of the hybrid model by ${E}_{t}={f(E}_{t-1},{E}_{t-2},{\dots .E}_{t-n})+ {\varepsilon }_{t}$ formula. ${\varepsilon }_{t}$ and $f$ denoted as the random error and non-linear function evaluated for the model. By utilizing the above formula, we get the non-linear element of the time series $\widehat{N}$, so that the forecast values of the time series are evaluated by $\widehat{{T}_{t} }= \widehat{{ L}_{t}}+\widehat{{ N}_{t}}$.

Experimental results and discussion

For this study, we utilized time series information on COVID-19 infection from John Hopkins University that was made available on GitHub³⁷. Phase 1 data collection runs from 28 January 2020 through 5 June 2020. Phase 2 data collection runs from 10 October 2021 until 15 January 2022. All countries’ positive, recovered and death cases are included in the statistics. The models’ packages are in the “pyFTS” library and the methods are implemented using Python 3.7³⁸. The Python Matplotlib package is used to create the plots. The forecast of coronavirus positive cases has been studied for Italy, Brazil, India, Germany, Pakistan, and Myanmar till June 5, 2020 in phase-1, and January 15, 2022 in phase-2, and projections for the number of active cases for the next 26 and 14 days have been produced. COVID-19 active cases for the above countries are represented in Table 1and Table 2 as phase-1 and phase-2 respectively for better visualization. In this article, we recommended a hybrid model which is a combination of both logistic growth model and fuzzy time series techniques. The value of fitting coefficient is closer to 1 is required for accurate estimation of coronavirus active cases. In order to get a higher ${R}^{2}$ value that is closer to 1 and a lower RMSE value, we thus presented this technique. The recommended model’s major objective is to obtain the best fit analysis when contrast to various models.

Table 1 The recommended model forecasted coronavirus patients in Italy, Brazil, India, Germany, Pakistan, and Myanmar over the following 26 days beginning from June 6, 2020.

Full size table

Table 2 The recommended model forecasted coronavirus patients in Italy, Brazil, India, Germany, Pakistan, and Myanmar over the following 14 days beginning from 16 January 2022.

Full size table

Phase-1 Analysis

Analysis of coronavirus active cases for Italy

As demonstrated in Fig. 1, the proposed model’s fitting impact outperforms other models (FTS and Logistic growth) used in Italy. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 240,760, while the proposed model predicts 240,502. The x-axis represents the number of days. The y-axis represents the number of COVID-19 active cases.

Analysis of coronavirus active cases for Brazil

As demonstrated in Fig. 2, the proposed model’s fitting impact outperforms other models used in Brazil. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 1,459,494, while the proposed model predicts 1,455,432.

Analysis of coronavirus active cases for India

As demonstrated in Fig. 3, the proposed model’s fitting impact outperforms other models used in India. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 604,641, while the proposed model predicts 601,480.

Analysis of coronavirus active cases for Germany

As demonstrated in Fig. 4, the proposed model’s fitting impact outperforms other models used in Germany. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 194,725, while the proposed model predicts 194,269.

Analysis of coronavirus active cases for Pakistan

As demonstrated in Fig. 5, the proposed model’s fitting impact outperforms other models used in Pakistan. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 217,809, while the proposed model predicts 216,681.

Analysis of coronavirus active cases for Myanmar

As demonstrated in Fig. 6, the proposed model’s fitting impact outperforms other models used in Myanmar. As per our forecast, the number of coronavirus positive patients on 1 July 2020 is 304, while the proposed model predicts 299.

Phase-2 Analysis

Analysis of coronavirus active cases for Italy

As demonstrated in Fig. 7, the proposed model’s fitting impact outperforms other models used in Italy. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 10,821,375, while the proposed model predicts 10,819,855.

Analysis of coronavirus active cases for Brazil

As demonstrated in Fig. 8, the proposed model’s fitting impact outperforms other models used in Brazil. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 25,256,198, while the proposed model predicts 25,254,306.

Analysis of coronavirus active cases for India

As demonstrated in Fig. 9, the proposed model’s fitting impact outperforms other models used in India. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 41,092,522, while the proposed model predicts 41,090,386.

Analysis of coronavirus active cases for Germany

As demonstrated in Fig. 10, the proposed model’s fitting impact outperforms other models used in Germany. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 9,618,245, while the proposed model predicts 9,617,381.

Analysis of coronavirus active cases for Pakistan

As demonstrated in Fig. 11, the proposed model’s fitting impact outperforms other models used in Pakistan. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 1,417,991, while the proposed model predicts 1,417,481.

Analysis of coronavirus active cases for Myanmar

As demonstrated in Fig. 12, the proposed model’s fitting impact outperforms other models used in Myanmar. As per our forecast, the number of coronavirus positive patients on 29 January 2022 is 535,080, while the proposed model predicts 534,898.

Performance analysis

In comparison to previous models, the proposed model has the highest ${R}^{2}$ values, 0.9992 in phase 1 and 0.9784 in phase 2. The recommended ${R}^{2}$ value is closer to 1, which denotes that the forecast is accurate for the COVID-19 positive patients. The logistic growth model has a higher ${R}^{2}$ than the FTS model.

Discussion

Strengths and weaknesses of the proposed model

The proposed hybrid model combining the FTS technique with the nonlinear logistic growth model stands out for its capabilities of making accurate predictions of the active cases of COVID-19. However, the proposed model is highly efficient in terms of predictive capacities because of its ability to identify linear and nonlinear trends in the data. The model’s R-scores of 0.9992 in phase-1 and 0.9784 in phase-2 underscore its robustness and reliability. Additionally, the model’s adaptability to different countries with varying epidemic dynamics showcases its flexibility and generalizability. The FTS component, while adept at handling nonlinearity, can be sensitive to the choice of intervals and the fuzzification process, potentially impacting prediction accuracy. Moreover, the logistic growth model assumes a saturation point which might not be applicable in scenarios with fluctuating infection rates due to external interventions like lockdowns or mass vaccination drives.

Limitations and differences in prediction results

Further research should be focused on the reasons for variation in predictions for different countries and different time intervals. The differences in the prediction accuracy can be attributed to several factors:

Government interventions

The measures like the implementation of lock down, social distancing measures, and vaccination programs differ greatly in terms of stringency and timing across nations. These interventions can cause sudden changes in infection trends, which can be difficult for the model to follow.

Healthcare Infrastructure

Differences in the health care systems and their capacity around the world help infect the disease and affects the ability to treat it thereby affecting the ability of the model to predict the disease. Countries with well-established health care systems may therefore have different epidemic trends than those with constrained health care systems.

Cultural and behavioral factors

Literature also indicates that people’s adherence to advised health measures, cultural practices, and social norms also influence disease transmission dynamics and thus vary the prediction results.

Phase-specific factors

The two phases analyzed include January 28, 2020, to June 5, 2020 (phase-1), and October 10, 2021 to January 15, 2022 (phase-2), which are two distinct eras of the pandemic. Factors affecting the model during the early phase include high transmission rate and few treatments, while factors during the late phase include vaccination and improved management techniques.

To overcome these limitations and improve the reliability of our model in the future, more variables including population mobility, social distance, and vaccination data, humidity, and average temperature which are known to affect virus spread will be included. We also envisaged the use of more complex algorithms such as neuro-fuzzy systems and weighted sum algorithms in order to increase prediction precision.

Conclusion

Considering the exponential rise in COVID-19 cases globally, nations must arm themselves with the necessary tools to combat this pandemic. This may be accomplished by making accurate forecasts that will enable governments to make decisions appropriately and build additional infrastructure as needed. When we examine the impact of lockdowns in various nations, we discover that total lockdowns considerably lowered the number of affected incidents. Despite the fact that the lockdown has gradually been lifted, the use of the media and social media to disseminate awareness and information about preventative measures, monitoring, running tests, implementing policy proposals in various countries, as well as the power of health care services, has been more effective in reducing disease spread. The current research focuses on providing patterns and forecasts for the COVID-19 outbreak in Italy, Brazil, India, Germany, Pakistan, and Myanmar. In this paper, we used logistic growth and FTS techniques and proposed model for analyzing and predicting the COVID-19 epidemic. Proposed model predicts better outcomes than other models, with R-scores of 0.9992 in phase 1 and 0.9784 in phase 2. As of 1 July 2020, positive coronavirus patients for Italy, Brazil, India, Germany, Pakistan and Myanmar forecasted as 240,502, 1,455,432, 601,480, 194,269, 216,681 and 299 respectively by using proposed model in phase-1. As of 29 January 2022, positive coronavirus patients for Italy, Brazil, India, Germany, Pakistan and Myanmar forecasted as 10,819,855, 25,254,306, 41,090,386, 9,617,381, 1,417,481 and 534,898 respectively by using proposed model in phase-2. In future work, we will evaluate the estimated number of isolation beds and ICU needs for every region. We could try different advanced algorithms like weighted sum algorithms, neuro fuzzy, etc. The impact of humidity and temperature and on the number of coronavirus infection cases was not taken into account by any of the models presented. We might experiment with several cutting-edge algorithms, such as neuro fuzzy and weighted sum algorithms.

Data availability

The dataset is shared open source by Johns Hopkins CSSE. Availability Link: https://github.com/CSSEGISandData/.

References

Batistela, C. M., Correa, D. P., Bueno, Á. M. & Piqueira, J. R. C. SIRSi compartmental model for COVID-19 pandemic with immunity loss. Chaos Solitons Fractals 142, 110388. https://doi.org/10.1016/j.chaos.2020.110388 (2021).
Article MathSciNet PubMed Google Scholar
World Health Organization (WHO). URL: https://www.who.int/emergencies/diseases/novelcoronavirus-2019 [Accessed on April 14, 2022].
COVID-19 CORONAVIRUS PANDEMIC. URL: https://www.worldometers.info/coronavirus/ [Accessed on August 21, 2022].
Hajirahimi, Z. & Khashei, M. Hybrid structures in time series modeling and forecasting: A review. Eng. Appl. Artif. Intell. 86, 83–106. https://doi.org/10.1016/j.engappai.2019.08.018 (2019).
Article Google Scholar
Anastassopoulou, C., Russo, L., Tsakris, A. & Siettos, C. Data-based analysis, modelling and forecasting of the COVID-19 outbreak. PLoS One 15(3), e0230405. https://doi.org/10.1371/journal.pone.0230405 (2020).
Article CAS PubMed Google Scholar
Eyoh, I., John, R. & De Maere, G. Time series forecasting with interval type-2 intuitionistic fuzzy logic systems. In 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) (eds Eyoh, I. et al.) (IEEE, 2017).
Google Scholar
Patra, G. R., Das, A. & Mohanty, M. N. A time-series prediction model using long-short term memory networks for prediction of Covid—19 data. Int. J. Adv. Sci. Technol. 29(12), 2179–2183 (2020).
Google Scholar
Al-Qaness, M. A., Ewees, A. A., Fan, H. & Abd El Aziz, M. Optimization method for forecasting confirmed cases of COVID-19 in China. J. Clin. Med. 9(3), 674. https://doi.org/10.3390/jcm9030674 (2020).
Article CAS PubMed Google Scholar
Dhiman, N. & Sharma, M. Fuzzy logic inference system for identification and prevention of Coronavirus (COVID-19). Int. J. Innov. Technol. Explor. Eng. https://doi.org/10.3594/ijitee.F4642.049620 (2020).
Article Google Scholar
Fong, S. J., Li, G., Dey, N., Crespo, R. G. & Herrera-Viedma, E. Composite Monte Carlo decision making under high uncertainty of novel coronavirus epidemic using hybridized deep learning and fuzzy rule induction. Appl. Soft Comput. 93, 106282. https://doi.org/10.1016/j.asoc.2020.106282 (2020).
Article PubMed Google Scholar
Fatima, S. A. et al. IoT enabled smart monitoring of coronavirus empowered with fuzzy inference system. Int. J. Adv. Res. Ideas Innov. Technol. 6(1), 188–194 (2020).
Google Scholar
Van Tinh, N. Forecasting of COVID-19 confirmed cases in Vietnam using fuzzy time series model combined with particle swarm optimization. Comput. Res. Progr. Appl. Sci. Eng. 6(2), 114–120 (2020).
Google Scholar
Verma, P., Khetan, M., Dwivedi, S., & Dixit, S. (2020). Forecasting the covid-19 outbreak: an application of arima and fuzzy time series models. https://doi.org/10.21203/rs.3.rs-36585/v1
Castillo, O. & Melin, P. Forecasting of COVID-19 time series for countries in the world based on a hybrid approach combining the fractal dimension and fuzzy logic. Chaos Solitons Fractals 140, 110242. https://doi.org/10.1016/j.chaos.2020.110242 (2020).
Article MathSciNet PubMed Google Scholar
Zhao, H. et al. COVID-19: Short term prediction model using daily incidence data. PloS One 16(4), e0250110. https://doi.org/10.1371/journal.pone.0250110 (2021).
Article CAS PubMed Google Scholar
Chowdhury, A. A., Hasan, K. T. & Hoque, K. K. S. Analysis and prediction of COVID-19 pandemic in Bangladesh by using ANFIS and LSTM network. Cogn. Comput. 13(3), 761–770. https://doi.org/10.1007/s12559-021-09859-0 (2021).
Article Google Scholar
Khatua, D., De, A., Kar, S., Samonto, S., Seikh, A. A., & Guha, D. A fuzzy dynamic optimal model for COVID-19 epidemic in India based on granular differentiability. Available at SSRN 3621640. Journal of Shanghai Jiaotong University (Science) https://doi.org/10.2139/ssrn.3621640 (2020).
Bedi, P., Dhiman, S., Gole, P., Gupta, N. & Jindal, V. Prediction of COVID-19 trend in India and Its four worst-affected states using modified SEIRD and LSTM models. SN Comput. Sci. 2(3), 1–24. https://doi.org/10.1007/s42979-021-00598-5 (2021).
Article Google Scholar
Hassan, N. et al. A fuzzy graph approach analysis for COVID-19 outbreak. Results Phys. 25, 104267. https://doi.org/10.1016/j.rinp.2021.104267 (2021).
Article PubMed Google Scholar
Asl, A. A. S., Ershadi, M. M., & Sotudian, S. (2021). Fuzzy Expert Systems for Prediction of ICU Admission in Patients with COVID-19. arXiv preprint arXiv:2104.12868. https://doi.org/10.48550/arXiv.2104.12868
Iwendi, C. et al. Classification of COVID-19 individuals using adaptive neuro-fuzzy inference system. Multimed. Syst. 28(4), 1223–1237. https://doi.org/10.1007/s00530-021-00774-w (2022).
Article PubMed Google Scholar
Yogachi, E. F., Nasution, V. M., & Prakarsa, G. (2021, March). Design and Development of Fuzzy Logic Application Mamdani Method in Predicting The Number of Covid-19 Positive Cases in West Java. In IOP Conference Series: Materials Science and Engineering (Vol. 1115, No. 1, p. 012031). IOP Publishing. https://doi.org/10.1088/1757-899X/1115/1/012031
Kumar, N. & Susan, S. Particle swarm optimization of partitions and fuzzy order for fuzzy time series forecasting of COVID-19. Appl. Soft Comput. 110, 107611. https://doi.org/10.1016/j.asoc.2021.107611 (2021).
Article PubMed Google Scholar
Kumar, A. & Kaur, K. A hybrid SOM-Fuzzy time series (SOMFTS) technique for future forecasting of COVID-19 cases and MCDM based evaluation of COVID-19 forecasting models. In 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) (ed. Kumar, A.) (IEEE, 2021).
Google Scholar
Luo, J., Zhang, Z., Fu, Y. & Rao, F. Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms. Results Phys. 27, 104462. https://doi.org/10.1016/j.rinp.2021.104462 (2021).
Article PubMed Google Scholar
Patria, L. Fuzzy time series application in predicting the number of confirmation cases of Covid-19 patients in indonesia. Int. J. Quant. Res. Model. 2(4), 193–200. https://doi.org/10.46336/ijqrm.v2i4.194 (2021).
Article ADS Google Scholar
Siregar, A. R., Sari, R. F. & Widyasari, R. forecasting the number of covid-19 sufferers in north Sumatra using the automatic clustering fuzzy time series Markov chain method. J. Math. Sci. Comput. Appl. 2(1), 46–54 (2021).
Google Scholar
Kumar, N. & Kumar, H. A novel hybrid fuzzy time series model for prediction of COVID-19 infected cases and deaths in India. ISA Trans. 124, 69–81. https://doi.org/10.1016/j.isatra.2021.07.003 (2022).
Article PubMed Google Scholar
Husain, A. A., Surarso, B., & Irawanto, B. (2021, July). Forecasting model of Covid-19 cases using fuzzy time series using persentage change. In Journal of Physics: Conference Series (Vol. 1943, No. 1, p. 012127). IOP Publishing. https://https://doi.org/10.1088/1742-6596/1943/1/012127
Xian, S., Chen, K. & Cheng, Y. Improved seagull optimization algorithm of partition and XGBoost of prediction for fuzzy time series forecasting of COVID-19 daily confirmed. Adv. Eng. Softw. https://doi.org/10.1016/j.advengsoft.2022.103212 (2022).
Article PubMed Google Scholar
Kibria, H. B., Jyoti, O. & Matin, A. Forecasting the spread of the third wave of COVID-19 pandemic using time series analysis in Bangladesh. Inf. Med. Unlocked 28, 100815. https://doi.org/10.1016/j.imu.2021.100815 (2022).
Article Google Scholar
Salman, S. A. & Aboudi, E. H. A hybrid ARFIMA-fuzzy time series (FTS) model for forecasting daily cases of Covid-19 in Iraq. Int. J. Nonlinear Anal. Appl. 13(1), 627–641. https://doi.org/10.2207/IJNAA.2022.5553 (2022).
Article Google Scholar
Zadeh, L. A. Fuzzy sets. Inf. Control 8(3), 338–353. https://doi.org/10.1016/S0019-9958(65)90241-X (1965).
Article Google Scholar
Chen, S. M. Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst. 81(3), 311–319. https://doi.org/10.1016/0165-0114(95)00220-0 (1996).
Article Google Scholar
Huarng, K. Effective lengths of intervals to improve forecasting in fuzzy time series. Fuzzy Sets Syst. 123, 387–394. https://doi.org/10.1016/S0165-0114(00)00057-9 (2001).
Article MathSciNet Google Scholar
Tsoularis, A. & Wallace, J. Analysis of logistic growth models. Math. Biosci. 179(1), 21–55. https://doi.org/10.1016/S0025-5564(02)00096-2 (2002).
Article MathSciNet CAS PubMed Google Scholar
Novel coronavirus COVID-19 (2019-nCoV), (2020), Data repository by Johns Hopkins CSSE. Retrieved from URL: https://github.com/CSSEGISandData/ [Accessed on April 14, 2022].
pyFTS - Fuzzy Time Series for Python. URL: https://pyfts.github.io/pyFTS/build/html/index.html [Accessed on January 10, 2022].

Download references

Acknowledgements

The authors extend their appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through a Large Research Project under grant number RGP2/220/45.

Author information

Authors and Affiliations

Department of Computer Science & Engineering, GITAM School of Technology, GITAM Deemed to Be University, Visakhapatnam, 530045, India
Chandrakanta Mahanty
School of Technology, Woxsen University, Hyderabad, Telangana, 502345, India
S Gopal Krishna Patro
Department of CEA, GLA University, Mathura, 281406, India
Sandeep Rathor
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, 522302, India
Venubabu Rachapudi
Department of Public Health, College of Applied Medical Sciences, Khamis Mushait Campus, King Khalid University, Abha, Kingdom of Saudi Arabia
Khursheed Muzammil
Civil Engineering Department, College of Engineering, King Khalid University, 61421, Abha, Saudi Arabia
Saiful Islam
Department of Mechanical Engineering, P. A. College of Engineering (Affiliated to Visvesvaraya Technological University, Belagavi), Mangaluru, India
Abdul Razak
School of Civil Engineering & Architecture, Institute of Technology, Dire-Dawa University, 1362, Dire Dawa, Ethiopia
Wahaj Ahmad Khan

Authors

Chandrakanta Mahanty
View author publications
You can also search for this author in PubMed Google Scholar
S Gopal Krishna Patro
View author publications
You can also search for this author in PubMed Google Scholar
Sandeep Rathor
View author publications
You can also search for this author in PubMed Google Scholar
Venubabu Rachapudi
View author publications
You can also search for this author in PubMed Google Scholar
Khursheed Muzammil
View author publications
You can also search for this author in PubMed Google Scholar
Saiful Islam
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Razak
View author publications
You can also search for this author in PubMed Google Scholar
Wahaj Ahmad Khan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

“Chandrakanta Mahanty. S Gopal Krishna Patro and Sandeep Rathor wrote the main manuscript text and Venubabu Rachapudi. Khursheed Muzammil and Saiful Islam prepared Figs. 1–3. All authors reviewed the manuscript.”

Corresponding author

Correspondence to Wahaj Ahmad Khan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Mahanty, C., Patro, S.G.K., Rathor, S. et al. Forecasting of coronavirus active cases by utilizing logistic growth model and fuzzy time series techniques. Sci Rep 14, 18039 (2024). https://doi.org/10.1038/s41598-024-67161-z

Download citation

Received: 14 March 2024
Accepted: 08 July 2024
Published: 04 August 2024
DOI: https://doi.org/10.1038/s41598-024-67161-z

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A new time-varying coefficient regression approach for analyzing infectious disease data

TW-SIR: time-window based SIR for COVID-19 forecasts

EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics

Introduction

Literature review

Methodology

Fuzzy time series (FTS) model representation

Definition 1:

Definition 2:

Definition 3:

Definition 4:

Logistic growth model

Calculate R2 score (regression coefficient)

Hybrid model

Experimental results and discussion

Phase-1 Analysis

Analysis of coronavirus active cases for Brazil

Analysis of coronavirus active cases for India

Analysis of coronavirus active cases for Germany

Analysis of coronavirus active cases for Pakistan

Analysis of coronavirus active cases for Myanmar

Phase-2 Analysis

Analysis of coronavirus active cases for Italy

Analysis of coronavirus active cases for Brazil

Analysis of coronavirus active cases for India

Analysis of coronavirus active cases for Germany

Analysis of coronavirus active cases for Pakistan

Analysis of coronavirus active cases for Myanmar

Performance analysis

Discussion

Strengths and weaknesses of the proposed model

Limitations and differences in prediction results

Government interventions

Healthcare Infrastructure

Cultural and behavioral factors

Phase-specific factors

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links

Calculate R² score (regression coefficient)