Introduction

Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disorder with a variable natural history, which affects 0.6–3.8 per 100,000 person/year worldwide1. The initial clinical presentation and evolution of ALS is heterogeneous2. First symptoms usually occur in the fifth or sixth decade of life1. Most patients die within 3–5 years after onset, generally due to respiratory failure3. Thus, monitoring respiratory function is of significant importance for ALS patients4, 5.

Recently, several biochemical parameters have been demonstrated to be related to respiratory impairment6,7,8,9,10. EMPOWER study founded a weak longitudinal correlation between vital capacity and plasma creatinine6. Adriano Chiò et. al. suggested that lower albumin and creatinine (CREA), levels were strongly related to forced vital capacity (FVC)7. And cross-sectional study from Japan suggested that FVC was associated with serum levels of total cholesterol, low-density lipoprotein cholesterol (LDL), CREA, and urate8. Study from USA and Poland also showed that plasma CREA correlated with FVC9, 10. These studies suggested that there were some correlations between hematologic and biochemical laboratory parameters and respiratory function. However, the latest study did not confirm a correlation between plasma CREA and FVC11. The contradictory results suggested that further studies of blood biomarkers and FVC are needed.

The aim of the present study was to analyze a potential utility of readily available, relatively inexpensive, and rapid to determine laboratory parameters in the assessment of respiratory impairment in patients with ALS; and to develop and validate a model of hematologic and biochemical parameters for predicting respiratory impairment for patients with ALS.

Methods

Ethics statement

This study protocol was reviewed and approved by Ethics Committee of West China Hospital, Sichuan University, approval number 2020-842 and 2021-799, and all the participants signed written informed consent. All protocols and procedures of our research were conducted ethically in accordance with the World Medical Association Declaration of Helsinki. All methods were performed in accordance with the relevant guidelines and regulations.

Patients

Patients from two cohorts were included in our study. In the retrospective cohort, 319 patients with ALS were enrolled from 1 January 2015 to 30 December 2020. And in the prospective cohort, 97 participants from 1 March 2017 to 30 October 2021 were enrolled. Our medical center was a large, regional referral ALS clinic located in Southwest China. The diagnosis of ALS was made according to the revised-El Escorial criteria for probable or definite ALS12, 13. Patients with other medical or neurological diseases and patients with missing baseline hematologic and biochemical values were excluded. Patients with manifestations of neoplastic disorders or patients who could not perform respiratory function were also excluded.

Clinical variables

The clinical analysis included age, sex, age at ALS onset, disease duration, onset body region (limb/bulbar), diagnostic level (probable/definite), blood pressure and comorbidities. We extracted data on comorbidities present at diagnosis as listed in medical records. Charlson comorbidity index (CCI) was used to evaluate comorbid conditions14. Weight and height were measured following international guidelines, and body mass index (BMI) was calculated as weight (kg)/height (m)2 at the time of pulmonary function tests. High blood pressure was defined as systolic blood pressure (SBP) above 140 mmHg and/or diastolic blood pressure (DBP) above 90 mmHg in this study.

Venous blood parameters were obtained using an automated hematology analyzer (KX-21 N, Sysmex America, Lincolnshire, IL, USA). Hematologic data on platelet count (PLT), white blood count (WBC), neutrophil count (NEUT#), lymphocyte count (LYMPH#), monocyte count (MONO#), eosinophil count (EO#), and basophil count (BASO#) as well as and biochemical parameters on albumin (ALB), glucose (GLU), urea (UREA), creatinine (CREA), uric acid (URIC), triglyceride (TG), cholesterol (CHOL), low-density lipoprotein cholesterol (LDL), high-density lipoprotein cholesterol (HDL), and creatine kinase (CK) were obtained. PLR was calculated as the ratio of PLT to LYMPH# and NLR as the ratio of NEUT# to LYMPH#. LDL/HDL was measured as the ratio of LDL to HDL. FVC was measured using a standard volumetric spirometer, with the patient in a standing position. Spirometer was quality controlled using 2005 criteria15. The spirometry methods produce comparable measurements of forced vital capacity (FVC)16. At least three acceptable manoeuvres were performed. Obstructive physiology was defined as the presence of airflow limitation with FEV1/FVC < 0.7017. For participants without airflow limitation (FEV1/FVC ≥ 0.7), restrictive physiology was operationalized as FVC < 80% predicted17.

Statistical analysis

(1) Correlation analysis: Continuous variables with normal distribution were reported as mean ± standard deviation; non-normal variables were presented as median (interquartile range [IQR]). The linear correlations of these variables and the FVC were calculated by Spearman. For categorical variables, the correlations were performed by the nonparametric test (Wilcoxon rank sum test). For normal distribution data, differences were performed by Independent Samples T-Test. For non-normal distribution, differences were assessed for significance using the Mann–Whitney test or chi-squared tests. (2) The training set and the test set: The retrospective cohort was divided into two independent sets by random sampling: a training set and a test set. The training set included 70% (220/319) of the retrospective cohort. Four participants with missing data were deleted. The test set included 30% (95/319) of the retrospective cohort. Models were developed using the characteristics in the training set. To verify the model, the test set was used to validate the models for internal validation, and the prospective cohort for external validation, respectively. In order to improve the robustness of the model, fivefold cross-validation was used. And down sampling method was used to deal with the class imbalance problem. (3) Logistic regression: Logistic regression was used to construct a model to predict FVC. For factors among each other (NEUT# vs. WBC, onset.age vs. age, LDL vs. CHOL) with rho > 0.75 in correlation analysis, only one factor was screened out in the next stage of the analysis. For groups of factors (weigh, height and BMI; NEUT#, LYMPH#, and PLR; PLT, LYMPH#, and NLR; LDL, HDL and LDL/HDL; SP, DP and hypertension) with strong clinical correlation, only BMI, PLR, NLR, LDL/DHL and hypertension were screened out for the next stage of analysis. Correlation analysis between each variable and FVC was carried out to determine factor for logistic regression. Then, a total of 23 variables including gender, disease duration, onset age, site of onset, diagnostic level, CCI, blood pressure category (hypertension or not), BMI, WBC, NLR, PLR, MONO#, EO#, BASO#, ALB, GLU, UREA, CREA, URIC, TG, CHOL, LDL/HDL, and CK were included to construct the model. Based on the results of backward stepwise regression method combined with Akaike information criterion (AIC), the equation of FVC for predicting respiratory impairment in ALS patients were established. Heatmap of FVC was depicted. Receiver operating characteristic (ROC) curves was used to assess the prediction accuracy of the model. All statistical analyses were performed using SPSS 24.0 (IBM, Chicago, IL, USA) and R 4.1.0 (www.rproject.org).

Results

There were 92 male patients (69.70%) with FVC < 3.062 and 40 female patients (30.30%) with FVC < 2.266 in the retrospective cohort (Table 1). In the prospective cohort, there were a similar percentage of male (66.67%) and female (33.33%) ALS individuals with abnormal FVC (Table 1). There were significant differences in age, onset age, weight,

Table 1 Comparison of the analyzed clinical and laboratory data in ALS patients stratified by FVC.

BMI, PLR, TG and HDL when stratified by FVC either in the retrospective cohort or in the prospective cohort (all P < 0.05, Table 1). Statistically significant correlations were observed between older age, later onset age, lower weight, lower BMI, lower levels of PLR and TG, and higher levels of HDL with reduced FVC.

To further analyze the correlation between these factors and FVC, we developed a multivariable logistic regression model for the association between the hematologic, biochemical laboratory parameters, clinical factors and FVC in ALS patients in the test set. The regression equation was created from the Estimate values obtained by z-value and was presented in Table 2. Moreover, we constructed a heatmap to show the relation between these variables in the equation and FVC (Fig. 1).

Table 2 Variable in the prediction model by backward stepwise regression method combined with AIC for logistic regression equation.
Figure 1
figure 1

Heatmap of FVC in this study. The grey histogram represents values of FVC. Below the histogram are factors in the logistic regression model equation. The brightness of the color varied upon the value for continuous variables. FVC forced vital capacity, BMI body mass index, BASO# basophil count, EO# eosinophil count, CREA creatinine, URIC uric acid, LDL low-density lipoprotein cholesterol, HDL high-density lipoprotein cholesterol, LDL/HDL the ratio of LDL to HDL.

Model FVC = 0.022 + 0.199 BMI + 1.029 EO# + 16.960 BASO# + 0.043 CREA level + 0.004 URIC level + 0.260 LDL/HDL + 0.021 disease duration + 0.101 Onset.age + 0.839 Site.of.onset 2 (bulbar onset) (Table 2).

Receiver operating characteristic (ROC) curve with 9 predictive variables revealed that the area under the curve was 73.5% in the test set (Table 3, Fig. 2). And the ROC curve has standard error of 0.013 with 95% confidence interval as 0.710–0.760 in the test set. To validate the model, we tried to predict the data from the prospective cohort and calculated the ROC curve, which yielding a concordance statistic of 0.721 (95% CI 0.616–0.825) (Table 3, Fig. 3).

Table 3 ROC curve for the test set and the validation set.
Figure 2
figure 2

Receiver operating characteristic (ROC) curve for the backward stepwise regression method combined with AIC model for predicting FVC in ALS patients in the test set.

Figure 3
figure 3

Receiver operating characteristic (ROC) curve of FVC in the validation set.

Discussion

Respiratory failure is the main cause of death in ALS patients3. It is of significance to find some easily accessible, accurate, and low-cost parameters to assess the respiratory function. We created and validated a multivariable logistic regression model for the association between FVC and the venous blood biomarkers and clinical factors in ALS patients. Our single-site study found that FVC related to hematologic and biochemical laboratory parameters. EO#, BASO#, CREA, LDL /HDL and URIC are easily accessible, accurate, and low-cost parameters useful in assessment of the FVC in ALS.

To the best of our knowledge, there are few studies about models predicting respiratory function in ALS patients11, 18. In our model, onset age, site of onset, disease duration, BMI, CREA, LDL /HDL and URIC were factors previously reported to predict respiratory impairment6,7,8,9, 18. In addition, EO# and BASO# were two new features predicting to FVC in ALS. This prediction model could be useful in clinical settings in which the respiratory function is not available.

Older age and bulbar onset are consistently reported to have poorer outcomes19. BMI, onset age and disease duration were factors correlated with ALS prognostication either in retrospective or in prospective studies20,21,22,23. Recently, a large multinational study with participants aged 40 years and over founded that low BMI was one of the most influential risk factors for chronic airflow obstruction24, which implicated a potential reason for low BMI associated with impaired FVC in this study.

Lower level of CREA was related to impaired FVC in ALS in this study, which was consistent with previous studies7,8,9. In these studies, Ken Ikeda et.al found that the annual decline of FVC ≥ 30% was significantly linked to baseline serum levels of CREA8. Serum CREA is a product of nonenzymatic catabolism of creatine phosphate in muscles, and is transported from muscle through the circulation to the kidneys25.

A decreased level of plasma CREA in ALS patients is expected because of the variation in muscle mass observed in these patients. Moreover, CREA levels are correlated with lean body mass in healthy individuals26 and with BMI in ALS individuals7.

The mean TG level was significantly lower among patients with a lower FVC but not statistical significance at multivariable analysis. We confirmed that decreased LDL/HDL ratio was correlated with decreased FVC, as reported by previous studies8, 18. Whereas in a Dutch study, authors found that mean CHOL and LDL levels were lower in patients with FVC < 70%. Higher serum LDL/HDL ratio was correlated with increased survival27. Study from French reported that serum levels of CHOL and LDL were significantly increased in ALS patients, and the elevation of LDL /HDL ratio was associated with prolonged survival28. Ethical and environmental backgrounds may lead to different lipid levels in patients with ALS. Multicenter, prospective study are needed to elucidate the relationship between lipid and respiratory function.

Study from Japan showed that the rapid worsening of annual FVC was associated with serum levels of URIC8. Interestingly, the level of URIC was not statistically significant between FVC normal group and FVC lower group but independently related to decreased FVC at multivariable analysis in our study. Study from USA found strong hazard ratios relating plasma CREA and ALSFRS-R when using trajectories of all three measures of plasma CREA, plasm URIC, and ALSFRS-R to predict time to death. These studies suggest complex biochemical interactions exist in ALS patients. More studies are warranted to better understand the metabolic mechanisms of disease progression in ALS.

Peripheral EO# is an important clinical biomarker in the management of asthma and chronic obstructive pulmonary disease (COPD)29, 30. For persistent childhood asthma patients, normal lung function and serum EO# at baseline are clinical prognostic indicators of remission by adulthood31. For COPD patients, counts of 4% or greater or 300 cells per μL or more might identify a deleterious effect of inhaled corticosteroids withdrawal32. Furthermore, retrospective cohort study also shown that ALS patients concomitant with COPD relates to poor outcome23. One possibility of EO# predicting respiratory in this study is that some ALS patients coexisting with COPD, the other possibility is that ALS patient concomitant with some indirect factors which correlates with EO# and FVC. Further studies are warranted to understand the mechanism underpinning the association of EO# and FVC in ALS.

In respiratory disease, BASO# plays a distinct role in the pathogenesis of allergic and nonallergic33. Recently, authors found that the number of circulating BASO# was significantly elevated in patients with aspirin-exacerbated respiratory disease34. Except for respiratory disease, Katie Lunnon et al. have found increased numbers of BASO# in people with MCI and AD, Yet the basophil counts are within the normal acceptable range35. More studies are needed to clarify the mechanisms of BASO# and respiratory impairment in ALS and other neurodegenerative disorders.

Our results should be interpreted with caution and a number of limitations should be borne in mind. Firstly, the prediction model derived from the retrospective cohort could not analyze data potentially important to risk of FVC. For example, eosinophils and uric acid values measured at admission may be affected by the other conditions coexisted with ALS, which we did not analyze. Secondly, possible ALS patients were excluded in this study, thus the prediction model is only suitable for probable and definite ALS patients. Thirdly, the model should be tested on multicenter, prospective cohorts to study the validity and predictability.

In conclusion, our study indicates that EO#, BASO#, CREA, LDL/HDL and URIC might be important ALS biomarkers and may be used to assess FVC in ALS patients when lung function was not available. This finding needs to be validated in prospective, multi-site studies.