Introduction

Studies on dependence on nicotine-containing products have primarily focused on cigarettes. With increased availability of multiple smoke-free tobacco and nicotine products (TNPs) (e.g. smokeless tobacco, e-cigarettes) that deliver nicotine in a less harmful manner, there is a clear need to develop alternative tools for assessing perceived dependence across a wide spectrum of products to better understand the public health impact of these smoke-free products (SFPs)1,2,3,4.

Leveraging self-reported symptoms as listed under the World Health Organization’s International Classifications of Diseases and Injuries (ICD) 10th Revision5 or the American Psychiatric Association’s Diagnostic and Statistical Manual Fifth Edition (DSM-5)6 to diagnose nicotine dependence, researchers have traditionally relied on self-report measures to assess perceived dependence on TNPs. Numerous measures have thus been developed, with the vast majority assessing perceived dependence in cigarette smokers7,8,9,10,11, including the very widely used Fagerström Test for Nicotine Dependence (FTND; now known as the Fagerström Test for Cigarette Dependence, FTCD)12,13,14. Such measures are not substitutes for clinical diagnoses of tobacco use disorder or nicotine dependence as defined in the DSM-5 and ICD-10 respectively, but they serve as key tools to support research on the use of TNPs.

Some of these measures have been adapted and validated for other TNPs, for example, the FTCD adapted for smokeless tobacco15. In contrast, other attempts have been made to develop non-cigarette-specific dependence measures for smokeless tobacco, e-cigarettes, and waterpipes16,17,18,19. Product-specific measures are required to consider product characteristics, including pharmacology, associated behaviors, and stimuli1. However, these measures do not allow product comparability, as the dependence scores derived from each product-specific measure are not necessarily based on the same metric. This raises the need for a measure to ensure appropriate comparability of the degree of perceived dependence across different TNPs. As poly-use (concurrent use of several TNPs) has become more frequent20,21, studies are now comparing levels of perceived dependence as a function of the number and type of product use2,22,23. This underscores the need to have a reliable and valid measure of perceived dependence for every product but also to assess the degree of self-reported dependence irrespective of the type and number of TNPs used.

There are two avenues toward achieving the goal of comparable measurement of perceived dependence among exclusive and poly-users: first, resort to existing measures and empirically derive their common core using quantitative methods; second, initiate a new scale development process starting from scratch and supported by qualitative and quantitative evidence. Both approaches have pros and cons. The first leverages what already exists, which may facilitate faster results. However, a possible limitation is that some items may not represent the concept of interest as they are selected from existing measures and do not stem from a dedicated conceptual framework. Even more problematic is the absence of relevant context-specific items to inform the conceptualization. In psychometrics, this issue is referred to as content validity, which is the cornerstone for any outcome measure development24,25. While the second approach is more time consuming because of its comprehensiveness, it ensures a better qualitative underpinning of the concept of interest and a closer match between the measure and conceptual framework. The first approach was recently used to compare the prevalence of well-known indicators of perceived dependence across different types of tobacco users22. Based on established cigarette dependence measures, Strong et al.22 derived a 16-item measure to enable perceived dependence comparison among cigarette, cigar, smokeless tobacco, e-cigarette, and hookah users. However, as noted by Morean et al.26, the proposed measure may have some limitations due to the lack of evidence that all items represent the concept of interest across all TNPs26. It is therefore worthwhile to develop a new scale that accurately reflects the evolving TNP landscape.

We undertook the development of a new self-report measure, the ABOUT-Dependence. It is part of the ABOUT Toolbox (Assessment of Behavioral OUtcomes related to Tobacco and nicotine products) initiative27, which promotes the use of best practice guidelines to develop fit-for-purpose self-report measures to assess perceptions and behaviors associated with the use of a wide range of TNPs. The present paper provides a detailed description of the multi-phase approach followed to develop and validate the measure.

Basic conceptual foundation

We developed an initial conceptual framework based on a comprehensive literature review of theoretical contributions, empirical studies, and existing self-report questionnaires13,14,28,29,30,31,32,33,34. A literature search was conducted on EMBASE in February 2017 to identify existing self-reported measures and conceptualizations of dependence. In addition to the electronic search, 70 articles were identified manually. Articles and measures were selected for further review if they were relevant to tobacco or nicotine dependence. In total, twenty-five self-report measures7,8,9,10,11,12,15,16,17,18,19,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52 and three interview-based assessments22,53,54,55 of perceived dependence were identified and reviewed. Supplementary Table S1 summarizes, for each of the 28 measures, the number of domains and items, the TNP for which it was initially developed and validated, and some key features of development and validation (namely, item generation, item reduction and psychometric analyses).

From these sources, perceived lack of control emerged as the core characteristic of perceived dependence, along with seven other sub-concepts as shown in the preliminary conceptualization (Fig. 1). Two experts in tobacco and nicotine addiction (JR and KF) and two experts in self-report measure development (TS and SC) reviewed the proposed conceptualization and contributed to the development of the first draft measure, which included nine items that characterized the degree of perceived dependence on three different response scales (intensity, frequency, or duration) adapted to the characteristics of the individual symptoms (see Supplementary Table S2). This preparatory work constituted the foundation for the subsequent qualitative and quantitative research to develop further and examine the psychometric properties of the new measure.

Figure 1
figure 1

Evolution of conceptualization of perceived dependence for the ABOUT-Dependence Measure.

Results

Qualitative research

Table 1 presents the participants’ demographics. Over half (52%) of the sample were poly-TNP users.

Table 1 Overview of the sample demographics for both qualitative study and quantitative cross-sectional survey.

Concept elicitation data suggested several concepts directly linked to perceived dependence on a wide range of TNPs that largely confirmed the draft conceptualization developed in the early foundational work (Table 2). Key topics included amount of use, cravings, failure to quit, reasons for use, risky behaviors, or use despite negative consequences. Overall, there were no differences in concepts between poly- and exclusive users. However, when discussing amount and frequency of use, those who used e-cigarettes, pipes, or waterpipes reported they had greater difficulty quantifying the amount or times they used their products in a day than those who smoked cigarettes or used smokeless tobacco. Those who used cigarettes felt that the stigma of use was much greater than that of other TNPs, which corresponded with a greater likelihood of hiding or “sneaking” their use.

Table 2 Key themes and concepts elicited in the qualitative study.

Cognitive debriefing interviews with the first 20 participants in Wave 1 led to the removal of a problematic item focused on the “proportion of available time” the person used the product because participants had difficulty conceptualizing and estimating the idea of available time to use their product. Revisions were also made to some items and response options through the two waves of interviews to enhance comprehension. In total, 11 items that were specifically elicited by the participants in the concept elicitation part were added to the draft measure (see Supplementary Table S2 for more details on concepts, their definitions, related items, and examples of indicative quotes from participants). Incorporation of the findings from the qualitative research and review by the four experts (SC, TS, KF, and JR) resulted in an updated conceptualization (see Fig. 1) and 19-item revised draft measure for the psychometric assessment carried out in subsequent quantitative research.

Quantitative research

Table 1 presents the participants’ demographics, highlighting the appropriate implementation of quotas in the study.

Outcomes of the Rasch Measurement Theory (RMT) analyses against the acceptability criteria outlined in Supplementary Table S3 are summarized in Table 3. All but one participant completed all items in the measure, which supports acceptability. The initial psychometric analyses revealed some problems in the 19-item draft measure; in particular, thresholds for 3 items were disordered, 13 out of 19 items were found to have fit residuals out of the recommended guidelines (this was solely due to the large sample size as graphical inspection did not reveal any hint of unequal item discrimination), and 9 pairs of items had residual correlations > 0.30, which indicated local dependence and thus redundancy. Importantly, RMT analyses suggested that the 19-item measure did not form a strictly unidimensional set. Further item deletion or combining items into subtests to account for response dependency did not result in statistical indicators demonstrating a strictly unidimensional scale. As a consequence, perceived dependence was reconceptualized (see Fig. 1), based on a discussion with the four experts, as a multidimensional construct including three domains: Extent of Use, covering the timing, urgency, or pervasiveness with which the product(s) is/are used; Behavioral Impact, comprising aspects of how perceived dependence impacts daily activities; and Signs and Symptoms of perceived dependence experienced by TNP users.

Table 3 Psychometric Performance Summary (Rasch Measurement Methods [RMT]).

Six items were removed from the 19-item draft measure based on conceptual redundancy (local dependence), while one was omitted based on psychometric misfit (see Supplementary Table S4 for item reduction details. The reduced 12-item measure retained full content coverage of perceived dependence, and the three domains showed good psychometric performance according to RMT criteria (Tables 3 and 4).

Table 4 Final item calibration and fit (Rasch measurement methods [RMT]).

Suitability of the response format and targeting: Response option thresholds were properly ordered, suggesting a meaningful response format. In terms of targeting, person measurements were well covered (80–90%).

Item hierarchy: Regarding the Behavioral Impact domain, we expected some order relations between the items based on conceptual considerations and qualitative research findings. Being a basic manifestation of perceived dependence, “using more of the product than intended” should be most easily endorsed. Endorsing “stopping an activity to use a TNP” was supposed to require less perceived dependence compared to avoiding an activity altogether. For the Signs and Symptoms domain, “a strong desire” was supposed to be more readily endorsed than finding it “hard to control the urge,” as the latter implies the former but not vice versa. The RMT analysis confirmed these expected relationships. See Table 4 for item calibration, location, and fit statistics for all 12 items within each domain.

Reliability: Person separation indices (PSIs) for all three domains were satisfactory (0.73 to 0.89) with the Signs and Symptoms domain having a slightly higher reliability compared to the other domains.

Item invariance: For all three domains, item invariance, crucial for comparability of measurements, was supported in that there was no differential item functioning (DIF) by type of user (exclusive vs. poly) or key sociodemographics (age, sex, education).

Unidimensionality: Separately, all three domains were unidimensional. As the three domain measures were correlated in the order of 0.5 to 0.8 in the entire sample, the ABOUT-Dependence index (composite score) capturing overall perceived dependence was formed covering all three domains, to be considered in addition to the individual scale scores. However, the relationship of the Extent-of-use domain and the two remaining domains varied by TNP and user type. Given the same dependence, DIF was noted, as e-cigarette users reported a greater extent of use, waterpipe and cigar users reported a lower extent of use, and other TNPs (cigarettes, smokeless tobacco, pipes, nicotine replacement therapy [NRT]) were very similar regarding the extent of use (Fig. 2). The DIF was accounted for by estimating separate subtest parameters for some TNPs.

Figure 2
figure 2

Item characteristic curves (ICC) for the extent of use domain across different tobacco and nicotine products (TNPs) once the item (subtest) was split to account for Differential Item Functioning (DIF). DIF was accounted for by estimating separate subtest parameters for some TNPs.

Classical Test Theory (CTT) criteria Table 5 provides a summary of the CTT analyses (according to the criteria defined in Supplementary S5), supporting the psychometric validity and reliability of the item-reduced measure. Moreover, Pearson and intra-class correlations showed very good test–retest reliability (> 0.70).

Table 5 Scaling assumptions (classical test theory [CTT]).

Convergent validity The validity of the new measure was supported by moderately high correlations with existing product-specific perceived dependence measures among exclusive users (Table 6).

Table 6 Convergent validity and correlations between ABOUT–Dependence and existing dependence measures.

Known group validity: As expected, domain scores differed by key product use patterns. Very light users (< 5 units per day) scored lowest on the ABOUT-Dependence index, while heavy (20 + units per day) users scored highest. While there was no difference between light (5–10 units per day) and moderate (10–20 units per day) users (Scheffé post-hoc test, p = 0.17), all other pairwise comparisons showed significant differences in the expected direction (Scheffé post-hoc tests, all p < 0.001). These results are consistent with previous findings that greater TNP consumption is associated with higher perceived dependence56. Poly-users reported higher perceived dependence than exclusive users (t = 15.7, df = 2430, p < 0.001), which is also in line with previous studies22,57. A difference in the amount of total TNP use partly explains the difference. However, including total use as a covariate still showed a significant difference between exclusive and poly-users (one-way analysis of variance F = 169.1, df1 = 1, df2 = 2431, p < 0.001 for user type).

Perceived dependence across product types: Scores by type of TNP for exclusive users in the study (Table 7) showed that cigarette smokers reported higher perceived dependence scores, followed by users of NRT, e-cigarettes, smokeless tobacco, waterpipe, pipes, and cigars/cigarillos.

Table 7 ABOUT-Dependence measure score of perceived dependence across different products and user types.

Scoring and estimation of measures: To facilitate appropriate use of the ABOUT-Dependence measure, a calibrated scoring scheme was developed to enable conversion of unweighted sum scores into linear measures based on weighted likelihood estimation (WLE)58, and the calibration was done with the unrestricted Rasch model for polytomous responses59,60. For complete data, the resulting conversion table transfers sum scores to logit measures, which are mapped to a 0–100 scale for convenience. Depending on the research question and focus of interest, the measure can be scored and interpreted using separate scores for each domain and/or total ABOUT-Dependence index across the three domains in the measure61. Full details and information on the access, use, and scoring of the measure are provided through eProvide by Mapi Research Trust (https://eprovide.mapi-trust.org/).

While the new measure is primarily meant for comparative measurement across TNPs, it can also be used for exclusive product use. As we expect existing product-specific measures to remain in use, to ensure comparability of research findings over time, it is important to link the new measure and its metric to existing scales. Salzberger et al. provide further information on how various existing measures (e.g., FTCD) can be linked to the newly established metric from the ABOUT-Dependence62. This equating procedure establishes crosswalks that provide the opportunity to transform raw scores on various existing product-specific measures to one another and to a common metric established by the new measure.

Discussion

Psychometric performance of the new ABOUT-Dependence measure was strong across both RMT and CTT analyses, supporting the conclusion that the 12-item measure, consisting of a 2-item Extent of Use scale, a 5-item Behavioral Impact, and a 5-item Signs and Symptoms scale, is a psychometrically reliable and valid self-report measure. In addition, a composite ABOUT-Dependence Index can be formed as a summary measure across all three scales. Compared to the individual scales, the index exhibited even smaller floor and ceiling effects.

There are four key strengths of this new measure. First, the content validity was evidenced by comprehensive information gathered from a literature review, expert opinions, and concept elicitation and cognitive debriefing interviews with TNP users in line with best practices and guidelines63. Such a comprehensive approach has not always been applied consistently in the development and validation strategies of TNP self-report dependence measures. For example, as identified in the literature review, the development of existing measures varied widely, with TNP users’ involvement in the item generation and reduction only sporadically included.

Second, the diversity of subpopulations of TNP users sampled in the studies provided a broad frame of reference that demonstrated the validity of the measure. In total, seven TNP categories were explored, allowing for the assessment of measurement comparability across a large spectrum of TNPs and providing a solid foundation for the measure to be used with other products (e.g., heated tobacco products and nicotine pouches).

Third, the fit of the data to the Rasch measurement model and the lack of DIF by TNP for the Behavioral Impact and Signs and Symptoms domains demonstrate that these items worked as a set, representing manifestations of unidimensional experienced symptoms and perceived burden of dependence, respectively, for diverse products. At the same time, the presence of DIF by product type for the Extent of Use scale as part of the composite variable may be linked to the high variability in urgency and frequency of use when considering products such as cigarettes and e-cigarettes versus waterpipes and cigars.

Finally, compared to existing measures, the new measure provides a more interpretable measure of perceived dependence by referring to the hierarchy of individual items and their response categories reflected in their locations on the continuum of perceived dependence. Thus, for each level of perceived dependence, the most likely response to each item provides a detailed characterization of the degree of dependence perceived by a respondent. In addition, the ABOUT-Dependence index allows for consolidation of the ABOUT-Dependence measure and existing product-specific measures by providing crosswalks that bridge the metrics of different measures62.

There are also some limitations to our development. First, when examining scores across products with the ABOUT-Dependence measure, it is important to realize that it assesses the perception of existing perceived dependence on a specific TNP or—in the case of poly-users—a bundle of TNPs. In terms of the findings on perceived dependence estimates by TNP type in this study, caution should be exercised when interpreting the estimates as population statistics. This study’s purpose was scale development; it was not designed to provide general population estimates of perceived dependence on different TNPs, nor can it indicate the physiological addiction or abuse liability of TNPs. Second, while the implementation of quotas per TNP type and key sociodemographics aimed at broad coverage of the population and possible applications of the measure, an adequate representation was not fully achieved for products such as NRTs. When looking at the small NRT user subgroup, only one NRT user was recruited in the qualitative study, limiting the assessment of content validity of the measure in this group. In addition, in the quantitative study, it appears that this group may not be representative of exclusive NRT users and those who use NRTs as intended; some were long-term users of NRTs with no intention to quit, and some reported occasional use of other TNPs during the past 12 months. This could explain why the level of perceived dependence of NRT users was not very different from what was observed in cigarette smokers. This clearly calls for further research, particularly in different categories of NRT users, to better contextualize the interpretation of perceived dependence scores and further validation of the measure. In addition, it is worth noting that in the quantitative research, the GkF KnowledgePanel® utilized probability-based sampling aimed to be representative of the US population and whilst this was fairly reflected in the sample of respondents in terms of age, gender, race, and educational attainment64, greater diversity of the sample could have further supported the universal applicability of the measure across different demographics and the broader population of TNP users.

Finally, it is worth noting that Strong et al.22 also demonstrated that a 16-item measure of perceived dependence adapted for use with specific TNPs identified a common dimension perceived of dependence that could be used to assess variability in perceived dependence across different TNP users. Nevertheless, the ABOUT-Dependence measure has a broader conceptual coverage, especially since it captures the domain of behavioral impact, which is notably missing from Strong et al.’s measure but was shown in the current findings to be an important aspect of perceived dependence. The behavioral impact domain may be worth investigating further to better understand the effect of perceived dependence on different TNPs on behavioral aspects of daily life.

In all, based on a conceptual framework of perceived dependence, the interplay of insights from qualitative research, and psychometric analysis, a new three-domain measure of perceived dependence could be empirically supported. Applied longitudinal studies will further reveal how perceived dependence may evolve over time and the predictive validity of the measure with regard to other TNP use behaviors65. Additional cross-validation studies are also necessary to provide further evidence of the utility of the new measure to advance the understanding and interpretation of self-reported measurement of dependence across the whole spectrum of TNPs, user types, and cultural contexts. The measure could be used to provide additional standardized assessment and data for use in clinical research and practice to better understand impact of the use of multiple and different TNPs, the psychological and behavioral aspects of tobacco and nicotine dependence and evaluate the effectiveness of related clinical harm reduction or cessation interventions. With the outlook of increasing use of the ABOUT-Dependence in tobacco and nicotine research studies, we envision a rapidly expanding knowledge base to support tobacco harm reduction and help inform regulators and health and public policy communities when assessing the potential public health impact of SFPs.

Methods

Qualitative research

Design overview

The qualitative research served the purpose of expanding our conceptual understanding of perceived dependence in TNP users and examining to what extent the instructions and item content of the initially drafted measure were appropriate, relevant, comprehensive, and understandable to different types of TNP users. The research comprised two waves of concept elicitation and cognitive debriefing interviews (August 2017) and consultations with the same four subject matter experts from the foundational work (September 2017).

Study population and recruitment process

The research was performed in accordance with relevant guidelines/regulations, including the Declaration of Helsinki. Forty individuals (20 exclusive users and 20 poly TNP users) in the United States (US) participated in an open-ended concept elicitation interview combined with cognitive debriefing. Participants were recruited through a proprietary database maintained by the market research company AOC Marketing Research (Charlotte, NC, US). The sample was purposively recruited to produce diversity in terms of characteristics such as sex, age, race, educational attainment, and a balanced representation of TNPs (cigarettes, cigar/cigarillos, e-cigarettes, and smokeless tobacco) among exclusive TNP users. Participants received USD $100 as compensation for their participation.

The main inclusion criteria for eligibility in the study were participants to be of legal age of smoking or above and a regular current TNP user who have used their product at least 100 times in their lifetime and used their product at least once in the past 7 days. The product use criteria were based on existing classifications66,67 for current TNP users and to ensure participants were using their product at the time of the study. The main exclusion criteria were self-reported use of illegal drugs in the last 12 months; participation in a market research study in the past 6 months; and employment in certain sectors related to advertising, media, public relations, or marketing; a company that manufactures tobacco and nicotine products; or healthcare, government, and clinical organizations.

Interview procedure

The study was carried out in two waves over the course of 4 days. In the first 2 days (Wave 1), 20 interviews were conducted. Following some modifications to the measure based on the first 20 interviews, a second wave of 20 interviews was executed similarly (Wave 2).

Each interview was divided into two parts and followed a semi-structured interview guide. In the first part, concept elicitation techniques were used to identify aspects of perceived dependence relevant to different TNP users. This was geared toward developing new items to supplement the initial nine-item draft measure. In the second part of the interview, cognitive debriefing was used to ensure relevance, clarity, and ease of completion of the previously generated nine items. In Wave 2, the draft supplementary items were developed based on the Wave 1 concept elicitation. A “think aloud” process68,69 was followed where participants were asked to complete the items while saying out loud what they were thinking, and specifically to note any problematic areas (instructions, items, rating scales) and suggest ways to improve the measure. Interviews were audio-taped and transcribed verbatim.

Data analysis: concept elicitation

Transcripts were analyzed thematically using Microsoft Excel (Seattle, WA, US) and MAXQDA software (VERBI, Berlin, Germany; https://www.maxqda.com/). Participant-generated concepts were inductively categorized into concepts of the initial conceptualization. Saturation (the point at which no new information is arising from the qualitative data) was also assessed by comparing concepts arising in the first five interviews to those emerging in the next five interviews, for a total of eight sets of data to compare. If no new information arose in the eighth set, saturation was said to be achieved.

Data analysis: cognitive debriefing

The analysis aimed to identify wording ambiguities and assess the relevance and acceptability of each item, the response scale, and the instructions. Additional items suggested by Wave 1 participants that could further expand the measurement of perceived dependence were also explored and tested with participants of Wave 2.

Quantitative research

Design overview

The psychometric evaluation study was designed as a cross-sectional, two-wave, internet-based survey with purposive stratified sampling of legal age TNP users in the US. Participants who completed the first survey (Wave 1; Baseline) were invited to complete a second similar but shorter survey 7–10 days later to determine the test–retest reliability of the new measure (Wave 2; 7–10-day follow-up).

The sampling provided for quotas in terms of TNP use behavior (50% exclusive and 50% poly-users). Exclusive TNP users were equally split into cigarette, smokeless tobacco, e-cigarette, cigars/cigarillos, and other TNPs (pipe, waterpipe, and NRTs). At least 40% of TNP poly-users had to smoke cigarettes. In addition, quota sampling based on age, sex, and education was applied to the intended total sample size of 2500 for Wave 170. The target sample size for the Wave 2 follow-up survey was 1000 completers (500 exclusive TNP users and 500 poly-TNP users). The survey fieldwork took place from April to May 2018.

Study population and recruitment process

The research was performed in accordance with relevant guidelines/regulations, including the Declaration of Helsinki. Participants were identified via the proprietary Growth from Knowledge (GfK, Nuremberg, Germany) consumer online panel KnowledgePanel®, which aimed to be representative of the US population. The same inclusion and exclusion criteria used in the qualitative research were also applied here. In terms of product use specifically, exclusive TNP users should self-report having used only one product at least 100 times in their lifetime and have used this product at least once in the past 7 days and poly-users should self-report as current users of at least two TNPs, have used at least each of the products 100 times in their lifetime, and have used each at least once in the past 7 days. Participants were compensated for their participation according to the GfK KnowledgePanel® points systems incentive program in which respondents are credited points which can be redeemed for cash or other rewards for completing surveys.

Measures and survey procedure

Upon participants’ enrollment in the study, the cognitively debriefed draft version of the measure was administered first. Depending on the TNP(s) poly-user participants declared using in the last 7 days at the time of the survey, they were administered additional product-specific dependence measures for assessment of convergent validity: FTCD for cigarette and cigar users12, smokeless tobacco FTCD (SLT-FTCD) for users of smokeless tobacco products16, the Penn State e-cigarette dependence index (PS-ECDI) for e-cigarette users18, the Lebanon Waterpipe Dependence Scale (LWDS) for waterpipe users19; the cigarette dependence scale (CDS) for cigarette smokers or an adaptation for users of other TNPs7, and the brief Wisconsin Inventory of Smoking Dependence Motives (WISDM) for cigarette smokers46. A shorter but otherwise similar survey to Wave 1 was administrated at Wave 2 (7 to 10 days after Wave 1).

Data analysis

Item reduction and internal construct validity: Assessed through the application of the RMT analyses71 aiming at comparing observed data against the stringent criteria of the Rasch model (see Supplementary Table S3 for more details on the definition and acceptability criteria for RMT analysis). Analyses rooted in CTT72 were also conducted on the item-reduced scales to provide a comprehensive psychometric assessment (see Supplementary Table S5 for more details on the definition and acceptability criteria for CTT analysis).

Test–retest reliability: Pearson and intra-class correlations were calculated between the scores of the item-reduced measure at Wave 1 and Wave 2.

Convergent validity: Spearman correlations between the item-reduced measure and the existing product-specific dependence measures.

Known group validity: The following group comparisons were conducted: heavy (≥ 20 units/day) vs. moderate (10–20 unit/day) vs. light (5–10 units/day) vs. very light users (less than 5 units/day) (as the number of units consumed per day increases, perceived dependence was expected to increase)56; poly- versus exclusive users (with level of perceived dependence in poly-users expected to be higher)22,26.

RMT analyses were conducted using RUMM2030 (RUMM Laboratory Pty Ltd., Duncraig, Australia; https://www.rummlab.com.au/rumm-2030)73, and CTT analyses, test–retest, known group, and convergent validity analyses were conducted using IBM SPSS 21 (IBM Corp., Armonk, NY, US)74.

Ethics approval and consent to participate

The qualitative study was approved by the New England Institutional Review Board (IRB) (NEIRB# 120170184) on July 12, 2017, and written informed consent was obtained from participants. The quantitative study was approved by the New England IRB (NEIRB# 120180022) on March 14, 2018, and electronic informed consent was obtained from participants.