Introduction

In today’s environment of a knowledge-based economy, it is not easy for businesses to assess the usefulness of acquiring or developing customer relationship management (CRM) tools to implement the corresponding strategies. Companies find it difficult to quantify this usefulness clearly and simply and to determine the economic value of the information generated and managed from the perspective of managing the customer base (Heidemann et al., 2013). It is increasingly common to require those responsible for the different business areas to know the detailed level of return on investments in commercial tools to enjoy greater credibility inside and outside the organization.

Although the conceptual foundations of a CRM strategy are not currently in question, its implementation faces many challenges (Rodríguez and Boyer, 2020). One of the cornerstones of this research is the concept of CRM as a strategic bridge between IT and marketing strategies aimed at building profitable customer relationships. IT management in customer profitability analysis can be considered a resource that enables companies to collect much more specific data, leading to competitive advantage, which in turn translates into superior performance. These resources facilitate the free flow of information, allowing the company to transform and exploit it, providing potential resources and capabilities to create economic value by implementing successful strategies, mainly through the mediating influence of customer service and understanding customers’ needs better (Harrigan et al., 2020; Krizanova et al., 2018).

There is abundant literature related to the quantification of the economic value of a company’s customer portfolio. However, there is no clear link between the needed IT tools and the management of information as an asset, or with the relationships that have arisen with the different dimensions of CRM. Almost all studies are limited to proposing exclusively theoretical models or models that, due to their technical complexity, can hardly be put into practice in a real work environment (Bauer and Jannach, 2021). Despite its growing importance, little research has focused on developing a valid measurement scale and empirically testing the CRM concept concerning business performance and knowledge management leveraging the use of IT (Krizanova et al., 2018). As an approach to the valuation of customers as assets, we highlight the work of Kumar (2018) and his Customer Valuation Theory, which brings together an extensive compendium of factors that influence customer valuation. However, an integrated deterministic quantitative model has not yet been proposed.

The main problem indicated by some authors is the lack of rigorous studies that demonstrate the value that Business Intelligence (BI) brings to the company because the expected benefits are largely intangible (Xie et al., 2016). In his work, Trieu (2017) reviewed the literature on how business value is obtained from BI systems, underscoring that very few cases focus on customer value management.

Much CRM research addresses the need to develop a comprehensive customer profitability model, as knowing which customers are profitable is a critical starting point (Hwang et al., 2004). Consequently, for effective CRM, it is essential to have information about customers’ potential value. Until now, only a few studies have quantified the connection between soft and hard metrics so managers can use the resulting information (Hanssens and Pauwels, 2016). Thus, there remains a need for holistic and interdisciplinary research to help both managers and researchers assess the performance of integrating marketing, information, and technology systems in organizations (Iacobucci et al., 2019).

Finally, of particular interest for this research, as a previous scenario from a marketing perspective, is the work carried out by Glazer (1991), the precursor of the concept of “Vic” or value of the information associated with the exchanges between a company and its customers. This seminal paper treats information or knowledge as an asset that helps examine the consequences of increasing information intensity for some key strategy components.

The priority objective of this research is the conceptualization and empirical verification of an exploratory model that quantifies the economic value of the information and knowledge generated by the different computer tools that help implement CRM solutions. Specifically, it is intended to be applied in B2C business environments and from a customer value management perspective. More specifically, this study has three goals: (1) to propose and test a new model to measure the value of business information in CRM environments rooted in the information valuation theory. This theory is based on the fact that information, like any asset, has a value according to the benefits obtained through its use (Moody and Walsh, 1999); (2) to compare the performance of the model with traditional methods; and (3) to analyze the influence/sensitivity of each component in the model. From the managerial point of view, small and medium-sized enterprises (SMEs) companies may be the most interested in this research because the use of CRM is particularly beneficial in an organizational context such as SMEs (Li and Mao, 2012).

In this paper, we expand the traditional definition of CLV models to reflect the customer’s actual contribution to the value of the company more accurately. Beyond the due mathematical rigor applied, it has been shown in practice what measures or indicators can be useful to assess customers with the enormous amount of information companies handle. New segmentations compared to other traditional models, a guide to developing IT CRM or awareness of the great potential of customer information, among others, are relevant findings of this research.

Work proposals

Then, the three working propositions that support the materialization of the model are developed.

  1. (1)

    The use of information creates value for companies. Information is previously conceptualized as an asset generated by CRM tools that can create value. This is based on the theories of the value generated for the company (Value to Firm) (Verhoef et al., 2015) through the theory of general utility, derived from the belief that a commodity’s value is given by its utility (Iyengar, 1997). Customers are the company’s main source of profitability; therefore, if it is possible to estimate the value of current and future customers, it is also possible to estimate a large part of the value of a company (Ferrentino et al., 2016).

  2. (2)

    Existence of a direct and clear relationship between CRM tools and the information they generate for the company. Thanks to the existence of this relationship, it is feasible to generate a model that estimates the value that the combined use of both terms will have for a company (Krush et al., 2016), in this case, from a customer value management approach as a consequence of the knowledge generated in organizations (Toriani, and Angeloni, 2011).

  3. (3)

    Analysis of the constructs involved in the model. Below is an explanation of why the following constructs involved in the model named VICRM have been specifically proposed.

Figure 1 represents the general structure of the model VICRM as follows:

Fig. 1: General structure of the model VICRM.
figure 1

A diagram that shows the dependency relationships of the components of the VICRM model.

The Customer Lifetime Value (CLV) has been and is increasingly a fundamental concept to consider in the field of marketing as a discipline (Bauer and Jannach, 2021) and from a professional or managerial point of view (Ferrentino et al., 2016). It has been shown to be a good predictor of future customer profitability. Experts believe that in the next 5 or 10 years, it will be the most important metric for companies. However, despite its success, it has not yet been accepted in all industries (Kumar, 2018).

Among the multitude of definitions in the literature, this concept can be defined synthetically as the present value of future profits generated by a client throughout their life with the company (Kumar, 2018), or also as the discounted value of cash flows generated by a client.

The base formula is proposed for calculating the transactional CLV. In the following expression, for the present moment, t = 0:

$${{{\mathrm{CLV}}}} = \mathop {\sum}\limits_{t = 0}^n {\frac{{\left( {{\mathrm{GP}}c;s} \right)}}{{\left( {1 + d} \right){{{\mathrm{t}}}}}}}$$
(1)

where

  • GPc;s\({\mathbb{R}}\) = gross profit for a customer c or a segment s (including the total costs of serving them)

  • t \({\mathbb{R}}\)+ = annual period of occurrence of the cash flow

  • n\({\mathbb{R}}\)+ = total number of projected annual periods

  • d\({\mathbb{R}}\)+ = annual discount factor or rate.

The traditional CLV is a dependent variable that considers the expected potential economic value of strictly transactional relationships that a customer can bring to the company.

There is no doubt that customers can interact and create value for companies in various ways. Organizations are beginning to realize that different customers bring different economic value and are, therefore, moving away from product-centric or brand-centric marketing toward a customer-centric approach as the primary asset (Heidemann et al., 2013). Assessing the value of customers based solely on their transactions does not seem to be enough, and valuing this engagement correctly is crucial to avoid both undervaluing and overvaluing customers (Tomczyk, 2016). This context has led to the emergence of customer valuation theory, which can be defined as a mechanism for measuring the future value of each customer based on three aspects: (1) The contribution of the customer’s direct economic value, expressed as a margin contribution or net profit. (2) The depth of that direct value, referring to the intensity of the contributions to the company through their purchases, including acquisition and retention of profitable customers based on their future value and purchase potential across multiple categories. (3) The amplitude of the indirect economic value contribution, expressed as the volatility of customer cash flows. The latter refers to customers’ contributions through their referral behavior, their influence on the purchases of potential customers and on existing customers, and their comments on the company’s offers (Kumar, 2018).

Aligning the disciplines of marketing and IT is no longer enough. An extension to a financial management perspective is required to address new situations. Thus emerges the new concept of a triad (marketing, IT, and finance) in the business world. The methods used in finance have become important because of their similarity to CRM systems, as both are based on the construction and management of a portfolio (Gneiser, 2010). For their part, accounting and finance are not as precise disciplines as one might imagine, and they are also becoming increasingly qualitative (Persson and Ryals, 2010). Recent literature has shown that the inclusion of nonfinancial and attitudinal measures in market response models increases their predictive and diagnostic power. They are the so-called customer mindset metrics, that is, self-reported measures of customers’ perceptions, attitudes, and intentions (Matsuoka, 2020; Srinivasan et al., 2010).

Considering the above, a reformulation of the CLV is proposed as a new function represented by the following expression.

$$f\left( {{\mathrm{CLV}}} \right) \supset \left[ { {\mathrm{+ CEV \pm CKV \pm CSV}}} \right]$$
(2)

ƒ(CLV) is a function of the CLV that includes the expected economic value, on the one hand, of the customer’s transactional operations together with the mediating effect of the three factors (endogenous variables) or constructs that influence the result. The extended CLV is explained in detail below.

Customer engagement value (CEV)

It is a diffuse concept, contradictory, and difficult to measure (Kumar et al., 2010). Although it has been conceptualized as a multidimensional construct that collects the client’s different activities that affect the company, providing added value, in this research, we will treat it in a more restricted way. The reason is that some of its dimensions (indirect contributions), as defined by Pansari and Kumar (2017), such as the value of references, and the influence and knowledge a company can derive from the client, are included in other constructs used here, such as customer social value (CSV) and customer knowledge value (CKV). Of these authors’ ideas about customer engagement, the part they call direct contribution is maintained. Its antecedent is satisfaction, and its result is an increase in customers’ purchases (Yoon et al., 2018).

Satisfaction becomes the main driver of loyalty, which in turn, is a function of commitment (Plangger, 2012). Loyalty represents the main key to measure customer retention (Mohamed, 2016). Repurchase intention is often known as loyalty, with high levels of repurchase being a characteristic of the most loyal customers (Singh and Saini, 2016). However, these relationships among satisfaction, retention, and profitability are not always necessarily immediate and linear and are usually moderated by the characteristics of the customer and the industry (Sánchez García and Curras-Perez, 2019).

Given the absence, in academic literature, of a clear and fully generalizable quantitative criterion at the time of weighing the intervening variables, the following metric is proposed:

$${{{\mathrm{CEV}}}} = \left( {{{{\mathrm{NPS}}}} + {{{T}}}_{{\mathrm{on}}}} \right)/2 \ast \underbrace {\left( {{{{V}}}_{{{\mathrm{o}}}} + {{{V}}}_{{{\mathrm{i}}}} + {{{P}}}_{{{{\mathrm{op}}}}}} \right)}_{{{{\mathrm{Normalized}}}}/{{{\mathrm{standardized}}}}\,{{{\mathrm{quantity}}}}\,{{{\mathrm{variables}}}}}$$
(3)

where

  • NPS  [−1;1] = metric based on a customer’s willingness to recommend the company to others. At the individual level, it can also be used to quantify an individual’s propensity to recommend the company.

  • Ton (Tone)  [0;1] = sentiment analysis towards a company measured through text mining techniques on a customer’s messages about the company (Ho et al., 2021). Direct metrics can also be used, such as the CSAT (customer satisfaction). A direct influence of these indicators has been shown with trust in the brand and this, in turn, with customer loyalty.

  • Vo (Volume) \({\mathbb{R}}\)+ = number of comments made through digital social media.

  • Vi (Virality) \({\mathbb{R}}\)+ = number of comments shared on social media by the client, both made by the company itself and comments where it is mentioned.

  • Pop (Popularity) \({\mathbb{R}}\)+ = number of “likes” made by the customer in comments made by the company or in those that mention it.

CEV is a mediating variable of the retention rate that collects the value of the customer’s engagement with the company measured by the level of affinity towards it.

Subject to:

$$0 \le {{{r}}} \ast {{{\mathrm{CEV}}}} \le 1 \to {{{r}}} \ast {{{\mathrm{CEV}}}} \in \left[ {0;1} \right]$$

CKV

Global knowledge of customers helps create value for companies (Wu et al., 2013) if it is processed correctly because in some cases, it can become a value destroyer (Tomczyk, 2016). This contribution to value creation can be defined as CKV and must be captured and included in any model that aims to measure the customers’ value (Kumar et al., 2010). This customer value is captured, among other ways, through their comments and the information available in the CRM (Kumar and Reinartz, 2016).

It has been shown that the quality of information positively affects the relationship with the customer, which consequently improves business performance (Soltani and Navimipour, 2016), as well as being a significant antecedent of both perceived usefulness and user satisfaction (Kim and Lee, 2014). When measuring its usefulness, the level of recency, immediacy, or renewal of the information is also important because old knowledge has limited usefulness due to its obsolescence (Crié and Micheaux, 2007).

Following the ideas collected above, the following indicators are suggested in a reasoned way to measure the CKV variable:

$${{{\mathrm{CKV}}}} = {{{Q}}}_{{{\mathrm{i}}}} \ast \left[ {\left( {{{{\mathrm{Act}}}} + {{{K}}}_{{{\mathrm{i}}}}} \right)/2} \right]$$
(4)

where

  • Qi (Quality)  [−1;1] = metric based on the level of usefulness found by a user in the information about a client when achieving their business objectives. As with research on credibility, quality can be directly assessed by asking respondents to indicate whether the information is credible or, in this case, of quality (Wathen and Burkell, 2002).

  • Act (Recency)  [0;1] = renewal or update level of the information that the company has about the customer.

  • Ki (Quantity)  [0;1] = amount of information about a client that the company has for its analysis and exploitation.

CKV  [−1;1]: is an endogenous variable, a mediator of the CLV, which considers the value that customer knowledge brings to a company measured by the amount and recency of the available data based on the quality perceived by the user who manages the data. Due to the fact that it has been impossible to find a satisfactory explanation of the degree of impact of each variable of the equation on the dependent variable, it has been proposed that the quantity and the level of recency have the same weight, using “quality” as a moderating variable that indicates the sign (direction) of the metric.

CSV

Most research has implicitly assumed that a customer’s value is independent of others. However, in an increasingly interconnected society, in many cases, the effects of the network can be strong and ignoring them can lead to underestimating the CLV, as occurs in traditional models (Gupta et al., 2006). We begin to find this explicitly in the recent literature that tries to analyze the social value a client brings (CSV as analyzed in this paper). That is, the potential value that a customer contributes or detracts thanks to the influence they exert on the environment in which customers operate. Weinberg and Berger (2011) deal with an analogous concept; the customer social media value (CSMV), which refers to the value of the customer in social media but is limited to the influence exerted on digital social media. This indicator ignores the part of the influence that a person exercises outside the digital environment through traditional channels, which we think must also be considered within the social component of value. We found other concepts and metrics in the literature related to this construct, such as the customer’s referral value (Kumar and Reinartz, 2016) or the effect of the customer’s influence on people (Kumar, 2018). Both the customer reference value and the customer’s influence value are generally combined in the CSV concept (Tomczyk, 2018), in the same line as followed in this work. The explanation is that it seems more reasonable to group into a single concept everything related to an individual’s interactions with their social environment.

There is no doubt that as a network adds members to its structure, its value increases because the potential links increase with each new user (Van Hove, 2016). Previous research shows that this form of interconnection positively affects cooperation, knowledge transfers, and the final performance of companies (Palmatier, 2008). In this context, a very popular heuristic is Metcalfe’s law, which emerged in the early 1980s, stating that the value of a network is proportional to the square of the number of users. That is, the added value of the network grows quadratically: n*(n−1) ≈ n2 when n → ∞ (Van Hove, 2016). This is true for any business process (Alavi et al., 2012), where the quantity and quality of participation in a network are considered the main indicators of success, as well as the individual’s level of credibility and reputation, which act as precursors to their level of influence.

In this way, based on the well-proven Metcalfe’s law, we can estimate a customer’s CSV as the potential value that their network of contacts can bring to the company due to the effect of their influence:

$${{{\mathrm{CSV}}}} = {{{Q}}}_{{{\mathrm{m}}}} \ast {{{N}}}^2$$
(5)

where

  • Qm [−1;1] = variable that measures the quality of contacts; as a coupling force between the size of the network and the value generated thanks to the client’s authority in their network, measured by the level of influence exerted for the company and the volume of participation in the network, Qm = 0: no authority and/or no social activity, Qm = 1; maximum social activity and authority with a level of positive influence for the company and Qm = −1: maximum social activity and authority with a level of negative influence.

  • N \({\mathbb{R}}\)+ = number of contacts who have a close relationship with the customer and whom the customer can influence sensitively

$${{{\mathrm{CSV}}}} \in {\Bbb R}$$

CSV is an endogenous variable, a mediator of the CLV, which considers the potential value that a client’s social relationships can contribute or remove from the company, both through the acquisition/loss of potential customers and through the improvement or worsening of unit revenues and the retention of those who already were customers.

Finally, Table 1, which synthesizes the metrics that establish each of the constructs of f(CLV), is presented for a better understanding.

Table 1 Summary of the metrics of the constructs of f(CLV).

Model construction

The structure of the model can be explained mathematically as follows:

$${{{V}}}_{{{{\mathrm{ICRM}}}}} = {\sum} {f\left( {{{{\mathrm{CLV}}}}} \right) \supset \left[ { + {{{\mathrm{CEV}}}} \pm {{{\mathrm{CKV}}}} \pm {{{\mathrm{CSV}}}}} \right]}$$
(6)

From the above equation, it follows that the value of information in CRM environments (VICRM) is the result of the sum of the CLV functions of all the customers or segments that make up the company’s portfolio. Continuing with the development of the previous formula, it is extended with each of the components that the CLV function contains: ƒ(CLV):

$${{{V}}}_{{{{\mathrm{ICRM}}}}} = \sum [\underbrace {{\mathrm{CLV}} \ast {{{r}}}\,{\mathrm{CEV}}}_{{{{\mathrm{Engagement}}}}\,{{{\mathrm{value}}}}} + \underbrace {\left| {{\mathrm{CLV}}} \right| \ast {{{r}}}\,{\mathrm{CKV}}}_{{{{\mathrm{Knowledge}}}}\,{{{\mathrm{value}}}}} + \underbrace {{{{\mathrm{CLV}}}} \ast {{{\mathrm{CSV}}}} \ast r\,{\mathrm{CE}}}_{{{{\mathrm{Social}}}}\,{{{\mathrm{value}}}}}V]$$
(7)

Each of the components contained in the ƒ(CLV) will be developed and explained, analyzing the different constructs and how they are predictably related to the CLV, which ultimately allows us to provide each component with units of economic measurement (see Fig. 2).

Fig. 2: Table with the relations of the ƒ(CLV) and its components.
figure 2

The correlation between the constructs allows us to carry out a first analysis that will later be validated.

o Engagement value component measures the potential that a customer’s engagement will have for the company, measured by the probability that they will continue generating all the CLV possible, or, on the contrary, will leave the company or not continue consuming in the next periods, with the consequent loss of profit. It is estimated by multiplying the CLV of the client or segment by the CEV, adjusted by the retention rate of the company or of a particular segment or, failing that, based on the data of the industry or of a comparable company. That is:

$$\left[ {{{{\mathrm{CLV}}}} \ast {{{r}}}\,{{{\mathrm{CEV}}}}} \right]$$

o Knowledge value component aims to measure the potential that a customer’s knowledge will have for the company. It is understood as the probability that the customer will continue generating all the theoretical CLV if the information is perfect or, on the contrary, due to the bad information available, no additional value is provided. It is estimated by multiplying the absolute value of the CLV of a customer (i) or of a “typical” customer who is representative at the company/segment level (m). It is estimated by multiplying the absolute value of the CLV by its corresponding CKV together with the retention rate of a “typical” customer who is representative at the company/segment level. Unlike the rest of the constructs that make up the ƒ(CLV), we will use the absolute value of the CLV because in those cases in which the CLV could turn negative by its own definition, as it would not be logical for a customer with a negative CLV who also had a negative CKV, which would result in a positive knowledge value. The generic expression would be the following:

$$\left[ {\left| {CLVi\_m} \right| \ast {{{\mathrm{r}}}}\,{{{\mathrm{CKV}}}}} \right]$$

o Social value component of a customer for a company is understood as the potential value that their network of contacts can bring to the company. Measurement of this value is a result of multiplying the theoretical CLV by the CSV variable, considering the corresponding retention rate, adjusted by the CEV of the company. The estimation of this construct can be approached using the CLV, the CEV, and the “r” of an average customer of a cohort or of a “typical” client considered representative. Mathematically expressed as follows:

$$\left[ {{{{\mathrm{CLV}}}}_{{{{\mathrm{i}}}}\_{{{\mathrm{m}}}}} \ast {{{\mathrm{CSV}}}} \ast {{r{\mathrm{CEV}}}}_{{\mathrm{i}}\_{\mathrm{m}}}} \right]$$

Finally, simplifying the VICRM equation, which includes the extended formula with each of the components and their respective metrics, the following equation is obtained for its individual calculation:

$$\begin{array}{l}{{{V}}}_{{{{\mathrm{ICRM}}}}} = \mathop {\sum}\limits_{I = 1}^N {\left[ {\mathop {\sum}\limits_{t = 0}^n {\frac{{\left( {\overline {{\mathrm{GP}}} c:s} \right)}}{{\left( {1 + d} \right)^{{{\mathrm{t}}}}}}} } \right. \times {{{r}}}^{{{t}}}\left( {\left[ {\left( {{{{\mathrm{NPS}}}} + {{{\mathrm{Ton}}}}} \right)/2} \right] \ast \left( {{{{V_{\mathrm{o}}}}} + {{{V_{\mathrm{i}}}}} + {{{\mathrm{Pop}}}}} \right)} \right)} \\\qquad\quad\;\;\; + \mathop {\sum}\limits_{t = 0}^n {\frac{{\left| {\overline {{\mathrm{GP}}} c:s} \right|}}{{\left( {1 + d} \right)^{{{t}}}}} \times {{{r}}}^{{{t}}}\left( {{{{Q_{\mathrm{i}}}}} \ast \left[ {\left( {{{{A}}}_{{{{\mathrm{cti}}}}} + {{{K}}}_{{{\mathrm{i}}}}} \right)/2} \right]} \right)} \\\qquad\quad\;\;\; + \mathop {\sum}\limits_{t = 0}^n {\frac{{\left( {\overline {{\mathrm{GP}}} c:s} \right)}}{{\left( {1 + d} \right)^{{{t}}}}}} \times \left( {{{{Q}}}_{{{\mathrm{m}}}} \ast {{{N}}}^2} \right) \times {{{r}}}^{{{t}}}\left. {\left( {\left[ {\left( {{{{\mathrm{NPS}}}} + {{{\mathrm{Ton}}}}} \right)/2} \right] \ast \left( {{{{V}}}_{{{\mathrm{o}}}} + {{{V}}}_{{{\mathrm{i}}}} + {{{\mathrm{Pop}}}}} \right)} \right)} \right]\end{array}$$
(8)

Main characteristics of the model

Deterministic models may be suitable for many different systems and situations. The inclusion of greater complexity involving a larger number of variables allows us to develop a stochastic approach model (Ríos, 1995). These models accurately explain the results of the different parameters, relationship states, and initial conditions. Normally, they are used to study different business processes, such as customer acquisition, retention, and profitability, among others (Kumar and Reinartz, 2016). For analyses that aim to study differences between customers, such as segmentation research, data collection at a single point in time is appropriate (Blozis, 2022).

This model is based on the multiplicative effect of each construct as a function of a potential transactional CLV that supports the economic value of a client’s information. This formulation is proposed in recognition of the fact that there is a potential interaction effect between the different constructs in the model. This is what we refer to in marketing as an “interaction effect” (Weinberg and Berger, 2011).

Methodology

Given the complexity of the processes involved in the analysis and the objectives sought, the methodology is structured in two clearly differentiated stages. Firstly, a bibliographic review has been carried out that aims to lay the foundations for the construction of the model. Second is the preparation of an empirical framework. Considering the casuistry and the complex context of the business phenomena that arise, as well as the possible ambiguities of the different concepts treated, an empirical case is approached that provides a holistic vision of the investigated phenomenon based on a deductive philosophy (Saarijärvi et al., 2014). The successful implementation of conceptual models and heuristics in real companies is an additional advantage for these investigations (Fischer et al., 2011).

Applied research data

To apply this mathematical model, we have selected a company that is part of the financial services industry, specifically the insurance sector. At present, this industry is an important part of the financial sector, not only because insurers are significant investors in the financial markets but also because they provide various important services to consumers and companies (Gaganis et al., 2019). The insurance industry is mature and highly competitive. Success in this industry depends, in addition to the traditional control of financial metrics, on the insurer’s ability to monitor other intangible assets. However, there are only a few papers that address this problem in this field that specifically consider its uniqueness (Marcos et al., 2018).

To choose a paradigmatic company, we look for these characteristics: a high degree of organizational maturity, medium size, high availability of data, high willingness to commit the necessary resources for the investigation, and the lack of useful prior information about the value of its customers, which has led managers to receive new information with interest. A Spanish insurance company (hereinafter, “the company”) met all these requirements.

The research phase was collaborative and involved a series of individual interviews and group workshops with the different areas related to customer information management. Client-specific costs and revenues were considered using a longitudinal approach to estimate product margins and other financial metrics necessary to capitalize and discount cash flows. We also apply the conventional assumption that after customers are lost, they do not return, which is reasonable given the contractual relationships being considered (Ryals, 2005).

From the entire client portfolio, it was finally decided to choose 60 randomly, 10 for each previously defined archetype by the company together with a consulting company. The criterion chosen for the segmentation followed consumption patterns and socio-demographic characteristics. In this way, the different segments forming the customer portfolio are represented through the sample. This decision seeks a balance between the operability of the calculations and the goals pursued. All these customers met the requirement of having answered one of the surveys that the company sends regularly. We emphasize that although it is true that in terms of representativeness, no substantial differences were detected between those who answered and those who did not, there are significant differences in terms of other characteristics, as those who responded seem to have greater engagement with the company according to several metrics, including roughly double savings on average.

Table 2 summarizes the metrics used in the model where a brief definition of them and how they have been operationalized in this context have been included.

Table 2 Summary of the model variables applied in the company.

Results

The results shown in Table 3 were obtained by substituting the corresponding values in Eq. (8). In each case, the corresponding trend measure has been highlighted depending on whether the sample distribution is normal (mean) or not (median) after using the corresponding Shapiro–Wilk and d’Agostino–Pearson tests.

Table 3 Model results for sample N = 60.

About V ICRM model

There is substantial heterogeneity among different segments when observing any of the measures of central tendency and dispersion analyzed. On the other hand, the great differences in value between the different constructs that make up the model are also observed. Similar results are obtained in the heterogeneity between the different segments and components of the model if we analyze the rest of the measures described.

To confirm the differences indicated in the previous section, beyond the direct observation of the results, a statistical analysis is carried out that helps to confirm the heterogeneity both at the component level of the model and at the segment level to check whether significant differences are found. More than the differences between each factor, what is interesting to verify with this analysis is that the factors are different from each other and that all of them provide differentiated value to the dependent variable.

As we found that the relationships between the samples are basically independent and given the non-normal nature of the data, the Scheirer–Ray–Hare test was justified. This test is a non-parametric equivalent of the two-way analysis of variance (Šimkovic and Träuble, 2019). Assumptions for the test to be used satisfactorily: the samples should have the same size and at least five observations each.

The results obtained reveal (see Table 4) that both factors show significant differences because, for both values of the H statistic, very small p-values are associated (0.0170 and 3.74E−19, respectively), which are lower than the significance levels (α) of 5% and even of 1% if we examine the components of the model. Therefore, in both cases, with a confidence level of 95%, we can reject the null hypothesis that the mean ranks of all groups are equal. This implies that there are significant differences in the results depending on which segment the customers belong to and that the value of the different components of the model differs according to the construct analyzed. On the other hand, regarding the analysis of the interaction between the two factors, it cannot be concluded that there are differences (intra-subject analysis).

Table 4 Scheirer ray hare test results.

It can be concluded that the segments are not homogeneous concerning the value of the information captured by the model, which confirms what has already been pointed out by some researchers (Mccarthy et al., 2017). This implies that it may be interesting for the company to carry out new segmentations based on the economic value of CRM information beyond the use of already established archetypes.

Two correlation matrices among the different items that make up the model are shown to examine the relationships that may arise:

The first thing that emerges is the differences in significance according to the two correlation coefficients. Of the 20 association relations found in the two regular matrices (M5 × 5), only one of the relations is significant. Additionally, two other relationships migrate to different levels of significance.

Given the definition of the dependent variable as a sum (VICRM), all the intervening constructs have very significant levels of association and vary in the same direction. This is especially remarkable if we observe the value of engagement. It is also found that, as anticipated by Pansari and Kumar (2017), there is a positive relationship between the level of commitment and the amount of information provided to the company, measured by the Spearman correlation (rho = 0.468). This commitment is manifested in more information, of better quality, and more updated, which is how it has been measured in this research. Finally, we highlight the negative, almost null, relationship found between the social and knowledge part. The relationship is not significant in either case. Thus, all the relationships anticipated in Fig. 2 that were based on the literature and the propositions that support the model have been confirmed.

Focusing the analysis of the VICRM and its relationship with the different components on which it depends, it is observed (see Table 3) that the one with the greatest weight is the value of engagement, although if we examine the medians of the sample, the results are significantly different, and now, the highest weight is the value of knowledge. This difference is explained by the existence of extreme values within the compromise value throughout the sample. In all cases, the contribution of social value is very residual. This can be explained by the fact that very few relationships were reported in the system. It is expected, and even desirable that the social value can collect all the potential for the business shown in other research.

It is observed that the results do not conform to the well-known Pareto principle and follow another somewhat different distribution, which some authors, such as Brynjolfsson et al. (2011), call the long tail, where they find that, thanks to the use of IT, the participation of niche customers increases. This topic may be interesting for the company, as it will concentrate less risk by having better-distributed profitability per customer.

Once the analysis has been completed both at the level of individuals and segments, the following Table 5 summarizes the different final values of the constructs according to the previously configured segments. The theoretical value that implementing the model would have had on the total customer base is established.

Table 5 Model results according to customer archetypes.

Given that other studies have found a relationship between models based on customer value and company value in the markets and their link with business results (Kumar and Reinartz, 2016), it would be interesting to compare the data obtained with the available company results. Due to the lack of representativeness of the sample, such a comparison is not reliable because the result of the aggregate model will surely be inflated.

Finally, given the results shown in Table 6, the usual maxim in traditional customer lifetime value models, where the projected business accumulates significantly in the first few years, does not seem to hold in this case. According to Berger and Nasr (1998), this happens in the first 4 or 5 years. For Kumar et al. (2007), approximately half of a customer’s total lifetime value is generated in the first year, although it depends on the industry. With the implementation of this model, it would take approximately 7 years to reach 50 percent of the expected value, which shows that it is clearly less regressive than other traditional valuation models.

Table 6 Annual relative weight of the value of each model.

In relation to the lower prospective value of the first years, the fundamental reason lies in the effect of the engagement value, which increases over time due to how it is calculated, including the restriction that must be fulfilled: CEV*r[0;1]. Apart from the justification by the mathematical logic, the rationale for the long-term effect of customer engagement is that retention and relationships produce several important benefits. It is also consistent with existing literature highlighting that the gains generated tend to accelerate over time. The underlying evidence is that customers increase a company’s profitability because they are willing to pay higher prices, buy more, and recommend the company to others (Chang et al., 2012).

Comparison of the V ICRM model with a traditional alternative model

The calculations are available for what has been called an exclusively transactional CLV model throughout the paper, very similar to the one used in many studies. It only considers information on income and expenses together with the annual retention rate. A comparative analysis has been carried out to understand the present model better.

To confirm the differences indicated in the previous section, a statistical analysis is carried out to assess the heterogeneity both at the level of the models and of the segments. As the assumptions and conditions are the same, the Scheirer–Ray–Hare test is used again. The results obtained also confirm that both factors show significant differences. It follows that there are differences in the results depending on whether customers belong to one segment or another and that the value of the models differs.

Due to the importance of confirming the significant difference between the two models, the results of Mood’s median test have been added to verify that the two samples are significantly different (see Table 7). Therefore, it would be justified to use the new proposed model for the entire customer base, at least in terms of systematically obtaining results that are different and that promote different decision-making.

Table 7 Results of the Scheirer–Ray–Hare test and Mood’s median test for the two models.

At this point, we continue reviewing the correlations, including the relationships between all the components of the complete model of VICRM and the traditional model.

Directly comparing the two models (see Table 8), it is verified that there is a significant correlation with a confidence level of 99%, both through linear and Spearman correlations. In both cases, it is a large and positive intense correlation (r & rho > 0.5). On the other hand, both correlations are significant and positive between all VICRM model components and the traditional CLV model, except for the linear correlation for the knowledge value part.

Table 8 Correlation matrix of the components of VICRM, including comparison with traditional CLV.

If the results of the two models are directly compared (see Table 9), it is verified that the traditional model calculates a value more than twice the new one (∆142%). This may mean that, in general, the new model undervalues customers or that customers are overvalued by the traditional model. In any case, in the new model, we found that there is not such a high relationship between income and final value due to the mediating effect of the ƒ(CLV) compared to traditional models, which are much more dependent on operating margins. The overvaluation of the customers in traditional models is an issue that has been extensively studied in the literature. In particular, the segments with the higher and recurring income are the ones that suffer the greatest overestimation. Gupta and Lehmann (2003) and Jasek et al. (2018) show that the use of these traditional static models generally overestimates CLV, sometimes very substantially, because they ignore other important dynamics in customer relationships that impact their value.

Table 9 Results in CLV and VICRM models for the total sample.

Fixing the analysis unit in the client, the following results can be observed in Table 9.

Despite the fact that the total economic value of the CLV model is much higher than that estimated through the VICRM, it is verified that, of the 60 individuals, 14 would have a higher value when analyzed under the prism of the VICRM, shown in bold in Table 9 (23.3%). Therefore, the company could risk undervaluing almost a quarter of the portfolio with a traditional model. However, it is also true that the difference in these clients’ values is not very substantial compared to the total value.

Subsequently, we verified whether the sample data are equally homogeneous. The goal is to check whether a model better captures the heterogeneity of the customers, given the high observed variability. Several statistics frequently used in previous studies have been used to calculate the degree of dispersion (see Table 10).

Table 10 Results of the dispersion statistics in the CLV and VICRM models.

Except for the coefficient of variation, which is lower for the traditional CLV, for the rest of the metrics, the results are higher in all cases for the traditional CLV. Therefore, it can be concluded that the new model is more homogeneous and robust and that, consequently, it will be less exposed to outliers’ effects.

To verify that the new model is less regressive than the traditional ones, the following Table 6 shows how the two models behave over time. Although the behavior of the traditional CLV is more like the base models, it still does not concentrate as much value in the first years as other examples. This situation is due to the nature of the well-established contractual business of this insurance company.

Finally, the analysis of the effect of the retention rate is interesting. A 5% drop implies an economic value drop in the traditional model of 32.7%, whereas, in the new model, this same descent would only imply a decrease of 9.2%, caused exclusively by the effect on the part of the value of knowledge, because the part of the engagement and the part of the social value is subject to the constraint CEV*r[0,1]. A drop of 10% in the same rate represents a drop in the base model of 41.2%, whereas, in the new model, this same drop would mean a decrease of 14.1%. Therefore, the new model partially dissipates the effect that the retention rate variation has on traditional models, which are highly exposed to its effects. Several authors have begun to question whether the dependence between retention and CLV is as high as traditionally considered (Badri and Tran, 2022).

In the proposed model, the effect of retention is managed individually for everyone, which seems to mitigate the cumulative effect that the retention rate has on customer value in traditional models. The introduction of a concept such as the value of the normalized engagement for each client helps companies not only look at the retention rate to consider the probability of whether a customer will continue to generate economic flows in the coming years.

Contributions

From an academic perspective, we delve into a field of knowledge where there are few studies that propose novel quantitative models. To our knowledge, this is the first research with a theoretical-practical approach that seeks to unify in a single deterministic model the different components that reflect the value of customer information in the financial industry. Future research requires interdisciplinary collaboration between academics in accounting, marketing, management, and finance. This paper sheds more light on essential topics in customer valuation, such as the effect of retention and the relationships between different components of customer value that, despite having been mentioned in the literature, had not been clearly grounded in the form of soft and hard metrics. We contribute to marketing literature by proposing and testing a set of assumptions through real practice, using a new model that allows comparisons with traditional models in real-time. Thanks to this comparison, we could validate some theories about the overvaluation of some clients, the problems with retention rates, the distribution of value among customers, and how traditional models are too regressive.

From a professional perspective, starting from the traditional knowledge of the financial sector on asset valuation, this model can be used in multiple B2C industries without many adaptations. It serves to assign a specific value to CRM solutions that, until now, were difficult to quantify, providing a tool that allows objectifying the decision-making processes at the managerial level (for example, improving regular segmentation processes, including the partial values of each component separately, offering a guide to know what and how to measure the information available from your customers and to enhance the accountability of IT investments in CRM). Customer misclassification can cause companies to provide high service to current high-return customers who become less profitable in the future and/or low service to current low-return customers who are likely to become more profitable later. This model is simple and intuitive to apply and cheap to implement compared to other alternatives, as many executives and managers demand (Hanssens and Pauwels, 2016). These requirements represent a particularly critical barrier for SMEs. It has been shown that more sophisticated models are not always better or more beneficial for the user (Holm et al., 2012). Measuring the value of information as a new strategic metric can be helpful for the following purposes: (1) becoming aware of the value of information as an organizational asset to be managed, (2) improving the accountability of business areas, and (3) assessing the effectiveness of IT (Moody and Walsh, 1999).

Conclusions

Metrics are essential for advancing research and practice in an area, providing a basis for empirical validation of theories and relationships among concepts. On the other hand, for professionals, metrics are a way to learn what does and does not work when used in real environments. Companies should improve flexibility in the use and distribution of information in order to make the appropriate decisions more safely and quickly, helping to provide a higher level of customer service by personalizing business–customer relationships, which results in greater satisfaction and loyalty over time (Chen and Tsou, 2012).

The use of a longitudinal case study has allowed an empirical application of the proposed model. Analyzing customers’ values from other perspectives with real data is not an easy task. It can help to establish new successful strategies for companies in the future, where traditional financial and accounting tools coexist harmoniously with the use of new ones managed by marketing.

It has been proven that the exclusive use of traditional assessment models may be insufficient to show a complete image of customers. No approach to measuring customer profitability is universally superior. Instead, companies must balance the degree of sophistication of CPA and CLV with the complexity of customer service and customer behavior in their work environment (Holm et al., 2012). The use of the VICRM model serves as a new, more holistic, and comprehensive approach that helps to capture the particularities of the information value of their customers that the companies possess in their databases and that many are still incapable of making profits.

Limitations and directions for future research

More field research should be done in other contexts that can help to fit the different metrics better or even investigate new mediating or moderating variables. Due to the difficulty of establishing generalizable and contrasted metrics, researchers, together with the companies, should establish their metrics according to their preferences because different companies or industries may have different concerns about the importance of each variable (Safari et al., 2016). An attempt has been made to use the minimum number of variables and items to measure the necessary constructs, always seeking a reasonable balance between operability and rigor. However, this is a frequent discussion in the literature (Petrescu, 2013). Extending the customer value model to include new data-intensive and actionable metrics is a critical requirement for future research. For this reason, the model’s versatility allows certain adjustments to capture the essence of a particular business.

The choice of the multiplicative design of the model and some of the metrics may seem somewhat arbitrary until enough real-world evidence is available to choose a clearly superior choice, including the possible variation of the weight or range of each construct within the f(CLV). In any case, the weights may be adapted depending on the context. That said, we consider our proposal well-grounded in the literature, reasonably sophisticated, and especially practical to cover the research objective. Similar models have been used for other issues. We have responded to the call of the authors who demanded innovative and practical research on the subject (Matsuoka, 2020), as companies are reluctant to adopt complex models (Óskarsdóttir et al., 2018).

One of the main limitations found in this research has been the composition and, to a lesser extent, the size of the sample, which implies that the extrapolation of the results to calculate the entire portfolio must be done with caution. However, the rest of the conclusions should continue to remain valid. Other similar investigations have used much smaller sample sizes and more limited compositions (Ryals, 2005).

These and other possible limitations detected become useful lines of research for the future. This paper proposes a comprehensive model. Although exhaustive in the analysis, it is more than a conclusive model and can be adapted, for example, in B2B environments or in clearly non-contractual industries. Thus, we can extend the model to almost any industry or sector of economic and business activity.