Introduction

Clinical decision making is expected to be grounded in high-quality scientific evidence. However, the presence of significant, unwarranted variation in treatment suggests that decision making is also influenced by patients' geographical location rather than their individual needs and preferences1. Hence, practice variation might partly be driven by differences in physicians’ beliefs about the effectiveness of the treatment. Previous studies have reported substantial regional variation in surgical rates for Lumbar Radicular Syndrome (LRS) caused by degenerative disc disease (DDD) within and between regions and countries2.

LRS, resulting from degenerative disc disease, can be attributed to a herniated disc, spinal stenosis, or spondylolisthesis. It is a prevalent disorder in routine healthcare practice leading to disability, sick leave, and high societal and healthcare cost3,4. High-quality evidence on the effectiveness and timing of surgical treatment for  low back disorders improved significantly in the last decades5,6,7,8. In the Netherlands, recommendations from high-quality research have been incorporated into new evidence-based guidelines published first in 20089,10,13. These guideline recommendations have further been integrated into the "Wise choices concerning a lumbar disc herniation" initiative as part of the Dutch "Choosing Wisely Campaign." However, the quality of evidence regarding the effectiveness of instrumented fusion surgery in patients with degenerative spondylolisthesis, degenerative low back pain or recurrent herniation is notably limited9. This was also stated in the 2019 Knowledge Agenda of the Dutch Association of Neurosurgeons (NvVN).

Previously, we observed a decline in the number of procedures for lumbar disc herniations in the Netherlands between 2007 and 202011. Consequently, we are now interested in investigating the current regional variation in surgical treatment for sciatica resulting from lumbar degenerative disc disease. Given the availability of evidence-based guidelines, we anticipate observing low variation12. Furthermore, we hypothesize that there will be lower variation in laminectomies and discectomies for lumbar DDD compared to instrumented fusion surgery due to the presence of high-quality evidence supporting the former procedures and the lack of evidence supporting the latter9,10,13. Secondary, we aim to explore the influence of both hospital and patient factors in explaining the observed regional variation in surgical rates.

Methods

Data sources and population

For the purpose of this study, we utilized and merged two databases. Firstly, we accessed a nationwide claims database provided by VEKTIS. This database collects claims data from all healthcare insurance companies in the Netherlands, ensuring comprehensive coverage of over 99% of the Dutch population. It has been determined to be over 95% accurate when compared to hospital records14. Secondly, we obtained a database from Statistics Netherlands (CBS), which includes detailed patient characteristics15. The VEKTIS database contained data on all Dutch individuals aged 18 years and older who sought hospital care for lumbar degenerative disc disease between January 1, 2016, and December 31, 2019. The CBS databases encompassed information on all Dutch inhabitants aged 18 years and older. CBS data was provided through the Extramural LUMC Academic Network (ELAN) data infrastructure project16.

Definitions and variables

In the VEKTIS database, we extracted specific data related to lumbar degenerative disc disease. This included diagnosis codes that are associated with sciatica, as well as surgical care products for discectomy, laminectomy, and fusion surgery (Supplemental files, Tables S1, S2). To ensure consistency in the registration process, we used multiple codes to capture indications for surgical treatment, minimizing potential variation in registration among physicians. We excluded patients who had cervical degenerative disc disease, spinal infections, fractures or trauma, malignancies, or congenital diseases. Additionally, individuals who had undergone back surgery within the past year were excluded from the analysis.

To account for physician registration differences and their potential impact on surgical rates, we also combined data on laminectomies and discectomies. We categorized patients into two age groups: those younger than 55 years and those older than 54 years. This distinction allowed us to differentiate between lumbar disc herniations, which are more prevalent in younger individuals, and lumbar spinal stenosis, which is commonly observed among the elderly. We separately analyzed the rates of instrumented fusion surgery.

We collected additional information on opioid prescriptions, which were recorded during the same year as the Diagnosis Treatment Combination (DBC) code registration. These prescription data were based on mandatory basic health insurance records for all Dutch inhabitants. Furthermore, we obtained details on lumbar injection therapy, including the corresponding DBC codes and procedure codes (found in the supplemental files). Lastly, we gathered data on the type of hospital (general hospital, teaching hospital, university hospital, private clinic), and the number of hospitals visited (indicating referrals from other hospitals if more than one).

From the CBS database, we extracted patient-specific information, including age, sex, comorbidities assessed using the Charlson Comorbidity Index, ZIP code, average household income, level of education (categorized as low, middle, high), type of income (employer, employee, retired, student, unemployed), and migration background (whether they were born in the Netherlands or another country).

Calculation of indirect adjusted surgical rates

To account for potential confounding factors, surgical rates were adjusted for age, sex, Charlson Comorbidity Index (CCI), household income, and level of education17. Advanced age is associated with increased incidence of spine disease, while females and individuals with lower socioeconomic status exhibit an elevated likelihood of experiencing suboptimal outcomes following sciatica18,19,20. Moreover, patients with comorbidities face an elevated risk of perioperative complications, prompting a hesitancy among surgeons towards surgical interventions21. The above mentioned factors are also associated with regional variation in the use of surgery in general12. Adjusted rates were calculated per 10,000 inhabitants in two-digit postal code areas and per 10,000 patients in neurosurgical spine clusters, which represent hospital service areas (HSAs) in the Netherlands. Referrals within these clusters were considered in the analysis. Private clinics were analyzed separately. The calculation of indirectly adjusted surgical rates per 10,000 inhabitants or hospital visitors followed the following formula:

$$\begin{aligned} &\frac{{Observed \;number \;of \;procedures \;in \;HSA, \;Postal \;code \;area \;or \;neurosurgical \;cluster }_{z}}{{Expected \;number \;of \;procedures \;in \;HSA, \;Postal \;code \;area \;or \;neurosurgical \;cluster}_{z}}\\ &\times mean \;procedure \;rate \;per \;\mathrm{10,000} \;inhabitants \;or \;per \;1000 \,hospital \,visitors \end{aligned}$$

Hereafter, we will refer to surgical or referral rates per 10,000 inhabitants in two-digit postal code areas as ‘regional surgical rates’ and to surgical rates per 10,000 inhabitants in neurosurgical clusters as HSA surgical rates.

Analysis

For all rates, we calculated the Extreme Quotient (EQ, Highest/Lowest rate), EQ5-95 (95th percentile/5th percentile), the interquartile range (IQR, 75th percentile/25th percentile), the Coefficient of Variation (CoV, Mean/SD (Standard Deviation)), the Systematic Component of Variation (SCV). The SCV estimated the systematic variation between areas that cannot be account for by the random variation within each area and is calculated by the following formula:

$$\mathrm{SCV }=\frac{1}{N} \left({\sum }_{k=1}^{i}\frac{{(Oi-Ei)}^{2}}{{Ei}^{2}} - {\sum }_{k=1}^{i}\frac{1}{Ei}\right) \times 100$$

An SCV below 5 was considered ‘low variation’, an SCV between 5 and 10 was considered high variation, and an SCV of 10 or higher was considered very high variation22.

We utilized mixed-effects logistic regression analyses to investigate the factors underlying the observed variation in regional surgical rates and HSA surgical rates. The analysis of regional rates included patient characteristics such as age, sex, CCI, level of education, household income, type of income, and migration background. For the HSA rates analysis, we further considered variables including referral from another hospital (referred patients), referral center (which were indicated by authors and neurosurgeons WP and WM), neurosurgical regions, prescription of opioids, and lumbar injection therapy. Two-digit postal code areas and HSAs were treated as random effects in the analyses. We opted against the inclusion or examination of interaction terms for variables, primarily due to the extensive size of our database and challenges associated with interpreting outcomes when incorporating such terms. It is important to note that we acknowledge the potential influence of this decision on the odds ratio.

Analytic approach

All analyses were conducted at the patient level, ensuring that each patient was included only once per year. Continuous variables were presented as mean (SD) or median (IQR) for nonparametric data. Missing data were examined, and multiple imputation with 10 imputation sets was employed to address missing values. Age, sex, CCI, level of education, household income, and postal code area were utilized as predictors. Statistical significance was defined as p-value < 0.05. SPSS Statistics (version 26) was employed for all statistical analyses, while Python was used to generate maps illustrating the two-digit postal code areas.

Ethical approach

The current study adheres to the principles outlined in the Declaration of Helsinki. Our study protocol (N20.075) underwent thorough review and approval by the Medical Ethics Committee Leiden Den Haag and Delft. The committee determined that official approval was not required because participants were not directly involved in the study and patient anonymity was safeguarded in the database. Thereby, the informed consent was waived by the Medical Ethics Committee of Leiden Den Haag and Delft.

Results

Study population

The Netherlands had an average population of 13.8 million adults aged 18 years or older between 2016 and 2019 (Table 1). Among them, we identified 119,148 adult individuals who visited the hospital annually for lumbar degenerative disc disease (Table 2). The annual number of procedures ranged from 14,348 to 15,492. Detailed information on specific diagnoses and procedures can be found in Table 1 of the supplemental files. The overall rate of missing data for all variables was less than 5%, except for the variable 'level of education,' which had 41% missing data. The missing data were determined to be missing at random and were imputed as described in the methods section.

Table 1 Baseline characteristics for our study population between 2016 and 2019.
Table 2 Baseline characteristics of hospital visitors for lumbar DDD between 2016 and 2019.

Variation in surgical rates

On average, 108 per 100,000 inhabitants age 18 years or older received surgery for lumbar DDD annually between 2016 and 2019. Higher surgical rates were observed in patients aged 55 years and older compared to the younger age group (Fig. 1). Furthermore, laminectomy and discectomy rates were much higher than instrumented fusion surgical rates. Figure 1 and Fig. 2 show that postal code area ‘45’ (which is the white area in the southwest on the geographic maps) has a very low surgical rate, which has a huge impact on the extreme quotient for regional differences. For laminectomy/discectomy rates, low variation was observed between 2016 and 2019 in postal code areas as well as neurosurgical clusters, whereas very high variation was observed for instrumented fusion rates in postal code areas and high variation in neurosurgical clusters as shown by the SCV (Table 3, Fig. 3). Removing outliers, as shown by the EQ5-95 in Table 3, surgical rates in both postal code areas and neurosurgical clusters differ more than twofold for instrumented fusion surgery. Surgical rates for laminectomy and discectomy differed between 1.3- and 1.8-fold when removing outliers.

Figure 1
figure 1

Surgical rates per 100,000 inhabitants in 2-digit postal code areas in the Netherlands.

Figure 2
figure 2

Relative deviation (in percentages) of surgical rates compared to the median rate per 100,000 inhabitants in 2-digit postal code areas in the Netherlands.

Table 3 Practice variation outcomes on surgical rates between 2016 and 2019.
Figure 3
figure 3

Surgical rates per 1000 hospital visitors in neurosurgical clusters. Bars represents the 11 HSAs in the Netherlands. The red line represents the median surgical rate.

Explanatory analyses

The risk of receiving a laminectomy or discectomy was 30% lower for female patients compared to male patients (Table 4). Furthermore, the risk was 30% lower for welfare recipients compared to employees, 20% lower for employers, and 10% lower for students. Patients with a migration background had a 40% lower risk of receiving laminectomy or discectomy. Furthermore, the odds of receiving a laminectomy or discectomy were 2.5 times higher for patients who received an opioid prescription and 5.5 times higher for referred patients. The risk of receiving laminectomy or discectomy was 30% lower for patients who already had a lumbar epidural injection.

Table 4 Odds ratios for patient and hospital factors associated with surgical treatment.

The odds of receiving instrumented fusion surgery was 1.2 times higher for females compared to males. Moreover, retired patients had a 30% lower risk of receiving instrumented fusion surgery compared to employees, whereas patients with a migration background had a 40% lower risk. The odds of receiving instrumented fusion surgery were 4.5 times higher for patients who received an opioid prescription, 3.0 times higher for referred patients, and 13.5 times higher for patients in a referral center. Lastly, patients who already received a lumbar epidural injection had a 40% lower risk of receiving instrumented fusion surgery.

Discussion

This study analyzed practice variation outcomes for surgical treatment of lumbar DDD as a part of a quality-of-care cycle. Low variation in adjusted surgical rates between regions was expected due to widely available national evidence-based guidelines and efforts to implement the guidelines.

Summary of principal outcomes

This study showed overall low variation for discectomies and laminectomies in patients with lumbar DDD in 2-digit postal code areas as well as in HSAs. We found low variation in both the younger age group (more likely suffering from HNP) and the older age group (more likely suffering from lumbar stenosis). However, high variation was observed for instrumented fusion surgical rates in 2-digit postal code areas and in HSAs. Surgical rates were influenced by both patients and hospital factors, where patient’s referral status seemed to be the most important factor for both discectomies/laminectomies and instrumented fusion surgery. Prescription of opioids was another important predictor for receiving surgery. It is noticeable that the risk of receiving laminectomy and discectomy was significantly higher for males, whereas the risk of receiving instrumented fusion surgery was higher for females.

Strengths and limitations

The use of administrative data helped us performing a population-based study containing data from all inhabitants in the Netherlands over a timeframe of four years. However, due to the use of administrative data, it was impossible to include disease severity, disease symptoms (i.e. cauda equina syndrome), and patient outcomes. Also, specific indications for surgery remain unclear. We hope that analyzing age groups could help by distinguishing between lumbar stenosis (older population) and lumbar disc herniation (younger population), but the choice for younger and older than 55-years was made arbitrary. Moreover, we were not able to include variation in timing of surgical treatment for lumbar DDD in our analyses, since data on first visit to the General Practitioner (GP) were not available. Hereby, it is not possible to make strong conclusions on guideline implementation nor on the effect on quality of care. Also, disease severity might still explain a part of the observed variation for which we cannot account in this study. Our outcomes clearly show that variation between regions for these procedures was highly influenced by outliers such as postal code area ‘45’, in which patients probably receive surgical treatment in Belgium. Due to the (Dutch) boundaries of our administrative database, these procedures were not included in our database.

Findings in relation to other studies

In 2013, a report was written on practice variation in the Netherlands between 2009 and 201123. Ten-fold variation between regions was observed in surgical rates for patients suffering from lumbar disc herniations, with an interquartile range of 1.5. Removing outliers, we observed less than two-fold variation between regions and hospital service areas and a considerably lower interquartile range compared to the 2009–2011 data from the 2013 report.

Clinical implications

The finding of low variation in laminectomies and discectomies for lumbar DDD suggests that the outcomes of the Sciatica trial were implemented nationwide. The delineated process concerning lumbar disc herniations serves as an exemplar of a knowledge and quality-of-care cycle. In this iterative cycle, high-quality clinical research initiates the formulation of guidelines. Subsequently, new knowledge gaps emerge during the guideline development process prompting revisions to the guidelines, and ultimately leading to their implementation. However, this study did not investigate the extent of dissemination of the guidelines, since we did not investigate potential factors influencing variation and no methodological methods or statistical analysis was used to evaluate (long-term) impact of guideline implementation.

Consequently, within the scope of the current study, it remains unclear regarding the extent to which the guidelines, alongside other factors, have contributed to the observed decrease in practice variation. Therefore, no causal effect of evidence-based research and guideline implementation on practice variation can be derived from the current results. Nevertheless, the finding of high variation for instrumented fusion surgery can be seen as such a knowledge gap and suggests that high-quality research on effectiveness is needed on this topic to improve quality of care for this patient group as well. In 2019, this was already stated at the Knowledge Agenda of the Dutch Association of Neurosurgeons (NvVN).

We observed that patient’s referral status was the most important factor for receiving surgical treatment. In the Netherlands, healthcare insurance companies head for regional collaboration between hospitals. The neurosurgical spine clusters are a good example of such a collaboration and the fact that referral status was such an important factor in our analyses matches this. The observation that opioid prescription emerged as a significant predictor for undergoing surgery may suggest increased severity of pain within this cohort. Nevertheless, given the recommended caution in prescribing opioids for chronic pain conditions, like lumbar stenosis, the correlation implies potential variability in opioid prescription practices, suggesting deviations from established guidelines. It is important to note, however, that the precise timing of opioid prescription remains unknown. This ambiguity could arise from the possibility of opioids being prescribed post-surgery, consequently indicating suboptimal outcomes and hampering the interpretation of this outcome. In literature, conflicting data on a higher risk for females on developing spondylolisthesis compared to males are described24,25. However, it was previously found that women have a slower rate of recovery and a higher risk of unsatisfactory outcomes after surgery, which might eventually lead to the need for fixation surgery19. The question arises as to why women exhibit an increased propensity for suboptimal outcomes following surgery. While empirical data to substantiate this notion are currently unavailable, one could posit that surgeons may defer surgical intervention in females, potentially resulting in a heightened susceptibility to unsatisfactory outcomes, as delineated in the study conducted by Bailey et al.6. This matches with the fact that we found females to have a lower risk of receiving a discectomy/laminectomy, but a higher risk of receiving instrumented fusion surgery compared to males.

Conclusion

Our data revealed low regional and hospital variation in laminectomy and discectomy rates, but very high variation in instrumented fusion rates. Further studies are needed to investigate the extent of guideline implementation and its impact on practice variation.