Identification and functional characterization of new missense SNPs in the coding region of the TP53 gene

Doffe, Flora; Carbonnier, Vincent; Tissier, Manon; Leroy, Bernard; Martins, Isabelle; Mattsson, Johanna S. M.; Micke, Patrick; Pavlova, Sarka; Pospisilova, Sarka; Smardova, Jana; Joerger, Andreas C.; Wiman, Klas G.; Kroemer, Guido; Soussi, Thierry

doi:10.1038/s41418-020-00672-0

Download PDF

Article
Open access
Published: 30 November 2020

Identification and functional characterization of new missense SNPs in the coding region of the TP53 gene

Flora Doffe^1,2,
Vincent Carbonnier¹,
Manon Tissier¹,
Bernard Leroy³,
Isabelle Martins^1,4,
Johanna S. M. Mattsson⁵,
Patrick Micke⁵,
Sarka Pavlova^6,7,
Sarka Pospisilova^6,7,
Jana Smardova⁸,
Andreas C. Joerger^9,10,
Klas G. Wiman ORCID: orcid.org/0000-0002-7113-524X²,
Guido Kroemer^1,4,11,12 &
…
Thierry Soussi ORCID: orcid.org/0000-0001-8184-3293^1,2,3^nAff13^nAff14

Cell Death & Differentiation volume 28, pages 1477–1492 (2021)Cite this article

6589 Accesses
21 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Infrequent and rare genetic variants in the human population vastly outnumber common ones. Although they may contribute significantly to the genetic basis of a disease, these seldom-encountered variants may also be miss-identified as pathogenic if no correct references are available. Somatic and germline TP53 variants are associated with multiple neoplastic diseases, and thus have come to serve as a paradigm for genetic analyses in this setting. We searched 14 independent, globally distributed datasets and recovered TP53 SNPs from 202,767 cancer-free individuals. In our analyses, 19 new missense TP53 SNPs, including five novel variants specific to the Asian population, were recurrently identified in multiple datasets. Using a combination of in silico, functional, structural, and genetic approaches, we showed that none of these variants displayed loss of function compared to the normal TP53 gene. In addition, classification using ACMG criteria suggested that they are all benign. Considered together, our data reveal that the TP53 coding region shows far more polymorphism than previously thought and present high ethnic diversity. They furthermore underline the importance of correctly assessing novel variants in all variant-calling pipelines associated with genetic diagnoses for cancer.

Identifying proteomic risk factors for cancer using prospective and exome analyses of 1463 circulating proteins and risk of 19 cancers in the UK Biobank

Article Open access 15 May 2024

A deep catalogue of protein-coding variation in 983,578 individuals

Article 20 May 2024

Analysis of somatic mutations in whole blood from 200,618 individuals identifies pervasive positive selection and novel drivers of clonal hematopoiesis

Article Open access 14 May 2024

Introduction

The evaluation of germline and/or somatic TP53 status is becoming mandatory in a number of clinical situations. In patients with Li–Fraumeni syndrome or families with hereditary breast and ovarian cancer syndrome, the early identification of germline TP53 mutations has been shown to be highly beneficial for disease surveillance [1]. In chronic lymphocytic leukemia, acute myeloid leukemia, and myelodysplastic syndrome, somatic TP53 variants are associated with poor prognosis, and furthermore, they are now investigated in routine clinical practice with the aim of identifying patients who would benefit from specific treatment [2, 3].

It is therefore essential that the identification and classification of TP53 variants meet quality requirements for clinical diagnostics used in personalized medicine. However, the analysis of somatic or germline TP53 variants faces a range of challenges. In tumors, the somatic origin of mutations can be inferred via comparison with matched normal DNA from the same patient. Unfortunately, this material is often missing, necessitating indirect assumptions made using databases that include the most frequent germline variants observed in the human population. For the analysis of germline variants, data are directly compared to population databases. In both cases, the accuracy of the results will depend on the characteristics of the population databases. Among those characteristics are not only the increasing number of rare variants found in the human population but also the ethnic diversity of this latter.

The advent of massively parallel sequencing (next-generation sequencing, or NGS) has shown that the human genome includes far more genetic variation than anticipated, with many variants detected at frequencies far below the formerly used 1% limit [4]. To illustrate, the most frequent build (151) of dbSNP includes 335 million variants. However, several studies have questioned the quality of the entries, among which both pathogenic and nonpathogenic TP53 variants are included [5, 6]. In contrast, data from the 1000 Genomes Project have been highly curated, but the small number of individuals (2504) included in the project does not enable the full coverage of low-frequency variants. To circumvent this issue, the Exome Aggregation Consortium released the ExAC database, which compiles whole-exome sequencing data from 60,706 individuals [7]. ExAC includes not only data from the 1000 Genomes Project but also constitutional data from unrelated individuals sequenced as part of various disease-specific and population genetics studies. In 2016, ExAC became the Genome Aggregation Database (gnomAD) with an enlarged dataset providing information from 123,136 exome sequences and 15,496 whole-genome sequences from unrelated individuals (http://gnomad.broadinstitute.org/). Both ExAC and gnomAD have been widely used as a substitute for or a complement to dbSNP and are currently used in multiple analytical pipelines.

Two missense SNPs in the TP53 gene have been identified and extensively characterized. SNP rs1042522 (p.P72R) is common in all world populations but the ancestral allele (Pro) shows a north-south gradient ranging between 0.2 and 0.7 [8]. The second SNP, rs1800371 (p.P47S), has been shown to be specific to the African population, albeit at a lower frequency (0.01–0.02) [9]. Both SNPs are included in ClinVar and considered benign according to American College of Medical Genetics and Genomics (ACMG) criteria.

In the present study, using a combination of in silico, in vitro, structural and genetic approaches, we addressed a two-tiered question: (i) is it possible to identify novel and infrequent TP53 SNPs in the human population and (ii) what is the potential pathogenicity of these variants? Using aggregated and population-specific databases, in silico and functional studies, and clinical data, we identify 19 new exonic TP53 SNPs including five variants specific to the Asian population.

Results

Identification of new TP53 SNPs in the human population

To identify novel infrequent TP53 SNPs in the human population, fourteen independent datasets compiled from individuals without neoplastic disease (202,767 individuals) were screened for variants located anywhere in the entire TP53 gene (Fig. 1A and Table S1). Three datasets included aggregated data from multiple studies, whereas 11 comprised data from specific countries. These latter were globally distributed to increase the identification of population-specific variants. Furthermore, data overlap between these datasets was minimal, which avoided excessive redundancy in the analysis (“Materials and methods”).

**Fig. 1: Identification of new recurrent *TP53* variants in the human population.**

We thus uncovered 6001 different TP53 variants at frequencies ranging from 0.8 to 10⁻⁶, and, as expected, most of them were located in noncoding sequences of the TP53 gene (5628 noncoding and 373 coding) (Table S2a and Fig. S1). For the present work, we focused our analysis on missense variants. Other variants such as intronic, synonymous, or specific TP53 isoform-targeting variants are fully described in the Supplementary Information. The particular case of rs138729528, (p.R175C), a passenger mutation found at high frequency in human tumors and in the normal population is also discussed in the Supplementary Information. We identified 247 missense variants in the 14 datasets, with 4, 6, and 14 variants found in more than 5, 4, and 3 datasets, respectively (Figs. 1A, S1b and c, and Table S2c). Their frequencies ranged from 0.8 to 4 × 10⁻⁶. The two most common benign TP53 SNPs found in the human population, i.e., rs1042522 (p.P72R) and rs1800371 (p.P47S), are included in this list (found in 14 and 7 datasets, respectively) (Figs. 1B, C and S1b and c). The statuses of the remaining variants were unknown, but their recurrences in various independent datasets suggested that they could also be rare constitutional TP53 SNPs.

The gnomAD dataset includes population data that enables subdivision into eight classes according to ancestry. rs1042522 (p.P72R) was found in all ethnic groups at a frequency ranging from 0.4 to 0.7 (Fig. 2A). Our analysis confirmed previous observations of a lower frequency for rs1042522 in the African population (0.4 in our analysis). Those observations have led some authors to hypothesize that the frequency of this polymorphism may be latitude-dependent [8]. In the present analysis, we also confirmed the quasi-exclusivity of rs1800371 (p.P47S) in the African population, as previously observed in other studies [10] (Figs. 2A and S2). Unexpectedly, our analysis uncovered five novel variants that were present in only the East Asian (EAS) population of gnomAD (rs201753350; (p.V31I), rs587780728; (p.D49H), rs201382018; (p.E11Q), rs121912665; (p.A189V), and rs773553186; (p.E358V)) (Fig. 2B). This specificity was confirmed by the analysis of datasets specific to Asian populations (Taiwan, Korea, China, and Japan) (Figs. 2B and S2). These five variants are also included in such cancer mutation databases as Cosmic, UMD, and IARC. Because normal DNA is usually not available, the constitutional origin of these variants may have been missed. The UMD_TP53 database included 146 entries for these five variants published in 64 reports. For all of these latter, the ethnicity of all patients was carefully checked and shown to be highly predominantly Asian (139 out of 146 patients) (Fig. 2C and Table S3). Notably, in two American multicenter studies that included patients from various geographical regions, these variants were only identified in Asian patients. We also noticed that the nine cell lines expressing any one of these variants originated from Japanese patients (Table S3). Taken together, these results clearly showed that the five novel constitutional variants were indeed specific to the Asian population. Whether any of these infrequent variants originated from a single event, i.e., a founder mutation, is currently unknown. A sixth rare variant, rs72661119; (p.N263D), was identified specifically in the South Asian (SAS) population of gnomAD with multiple reports originating from India (Fig. 2C and Table S3).

**Fig. 2: Identification of specific *TP53* variants in the Asian population.**

The various TP53 mutations databases include both somatic variants from multiple types of cancer and germline variants from a range of families or patient cohorts with Li–Fraumeni syndrome or hereditary breast and ovarian cancer syndrome. As discussed above, normal tissue was not available for many of the 80,000 tumors included in the UMD_TP53 database; thus assessments of variant origin had often been done by comparison with dbSNP, which includes a limited number of TP53 population variants. It is therefore likely that infrequent germline variants were mistakenly identified as somatic variants, leading to database contamination. We therefore evaluated the germline-to-somatic ratio for each TP53 variant in the UMD_TP53 database. As shown in Fig. 2D, the distribution of this ratio for the majority of TP53 variants, including all TP53 mutation hot spots, was quite low, ranging from 0.03 to 0.056. Ten outlier variants displayed a high ratio, suggesting that they may be undetected germline variants (Fig. 2E). Indeed, nine of these outlier variants are included in the 247 missense variants described above, with seven of them identified in three or more datasets. They also included three of the variants detected in the Asian population.

In a study published by our group, p.N235S was detected in a patient with lung cancer and defined as somatic [11]. However, when we reanalyzed this patient by sequencing normal DNA, we found that the variant was germline (Fig. S3a). Similarly, reanalysis of normal DNA from chronic lymphocytic leukemia patients confirmed the germline origin of this variant as well as variants p.R290H, p.R283C, and p.G360A (Fig. S3b–f).

Our analysis suggests that the coding region of the TP53 gene includes far more ethnically distributed SNPs than was previously thought, and that some of them are mistakenly included as somatic variants in a number of databases. It is therefore essential to determine whether or not the p53 protein encoded by these SNPs is functional and how they must be classified according to ACMG criteria.

Predictive and functional analysis of TP53 variants; in silico analysis

Computational algorithms to predict the functional effect of missense variants were performed on 27 SNPs (set 27) found in at least three datasets, and on the outlying germline variants identified in the analysis of the UMD_TP53 database described in the previous section (Table S2c). First, we used the prediction scores from dbNSFP, a database that compiles scores from several popular algorithms along with a conservation score and other related information for every potential single nucleotide variant in the human genome [12] (“Material and methods”). Most of these algorithms are based on sequence homology and/or the physical properties of amino acids (Table S4). They are widely used for a global estimation of the deleteriousness of missense variants but they are not specific to any protein. We also used new in silico mutation prediction tools such as Revel or Envision that have been trained on known pathogenic and neutral nsSNVs. Among the 27 analyzed TP53 variants, 4 were shown to be included in the cancer shared dataset (CSD), including the hot spot variant p.R248Q (Fig. 3). Although the majority of individuals included in the various datasets were selected for nonneoplastic disease, it is known that the frequency of de novo germline TP53 mutation in the human population is high, ranging from 1/5000 to 1/20,000 [13, 14]. It is therefore not surprising that among the 138,632 individuals in gnomAD as well as those in other population datasets, some may have been carrying such de novo mutations, although without cancer at the time of inclusion [15]. The rank scores of TP53 variants found in set 27 were always associated with a low pathogenic score, in contrast to the scores of the CSD-associated variants (Fig. 3). We note too that except for a few variants, there was considerable heterogeneity among the employed predictive tools.

**Fig. 3: Predictive and functional profile of *TP53* variants.**

A unique benefit TP53 has over other genes is the availability of quantitative functional data for all missense variants. In a pioneer study, Kato et al. engineered 2100 TP53 variants and assessed their transactivation activity in a yeast assay using eight different TP53 response elements. Their approach yielded a quantitative score for each variant [16]. Our analysis of the residual activity of the variants showed clearly that most TP53 variants, with the exception of those included in the CSD, were still functional (Fig. 3).

Two recent studies looked at large libraries to explore the loss of function of TP53 variants in mammalian cells and extend the functional analysis to a range of properties including growth arrest, apoptosis, dominant negative effect, or response to etoposide stress [17, 18]. In both studies, each TP53 variant was given a score indicating its potency to alter TP53 function. These novel scores agreed well with the transcriptional data collected in the yeast studies and showed clearly that there was good correlation between yeast and mammalian data (Fig. 3). Ranking data from the work of Kotler et al. showed that the 11 variants in the central DNA-binding domain (DBD) did not cause loss of activity (Fig. S4) [17].

This analysis using functional data from large-scale, saturating mutagenesis showed that most of these variants were likely to be functional, in contrast to their CSD-associated counterparts. Furthermore, as expected, using TP53-functional criteria improved the classification of the different variants compared to predictive algorithms.

Functional analysis

Although our population and prediction analyses suggested strongly that these newly discovered variants were functional, we nonetheless chose a subset of 21 TP53 variants (set 21) for a detailed functional analysis (Tables 1 and S1). Transcriptional activities were tested using three different assays. FASAY, a yeast assay, initially described by Iggo et al. has been widely used in the clinic for rapid screening of TP53 status and was furthermore used by Kato et al. to define TP53 activity as described in the previous section [19]. FASAY enables the detection of variants with particular temperature sensitivities (Fig. S5). In our work, all tested variants were shown to be fully active at three temperatures (25, 30, and 35 °C) using the WAF1 promoter [20]. Cancer hot spot variants p.I254T (temperature-sensitive) and p.R175H (totally defective) were used as controls (Tables 1 and S5). TP53 variants were also expressed in the p53-null cell line H1299 to determine their function based on transactivation of a TP53-dependent luciferase reporter with various TP53 response elements. These variants had profiles indistinguishable from that of wild-type TP53 (Figs. 4A–C, S6 and Tables 1, S5).

Table 1 Functional analysis and classification of 21 TP53 variants found in the human population^a.

Full size table

**Fig. 4: *TP53* variants found in the human population do not display loss of activity.**

To confirm the validity of this analysis on TP53 regulation, the expression levels of endogenous TP53 target genes were also analyzed by qPCR after transfection of several variants into H1299 cells (Fig. S7a, b and Table S5). Genes involved in several TP53 pathways such as cellular response to DNA damage (WAF1, ZMAT3, MDM2), DNA repair (DDB2), metabolism (TIGAR), apoptosis (FAS, APAF1), or reactive oxygen species regulation (SESN1) were fully activated compared with pathogenic TP53 variants.

Two other functional assays, colony-formation assay and apoptosis, were also performed to assess the biological function of these variants (Figs. 4D, E and S8, S9a–c, and Table S5). Except for CSD TP53 variants used as a negative control, no loss of activity was observed for the various SNPs. The collective results of our analysis showed that these latter were indistinguishable from wild-type TP53.

Location of TP53 variants and predicted impact

We mapped the identified TP53 variants within the structure of p53. Quite a few SNPs were located in the intrinsically disordered N-terminal transactivation domain (TAD) and proline-rich region. They did not modify the molecular recognition feature for binding of the negative regulators MDM2/X or the TAZ1 domain of transcriptional coactivator p300. Interestingly, Asp49 within the nascent helix of the p53 TAD forms an intermolecular salt bridge with an arginine in complex with the nuclear receptor coactivator binding domain of the CREB binding protein (Fig. 5A), which has been suggested to contribute to binding specificity [21]. This specific salt-bridge partner was lost in p.D49H variants. The p.P72R variant reduced slightly the rigidity of the proline-rich region, which may serve as a stiff linker for projecting the TADs away from the core of the p53-DNA complex for optimal interaction with coactivators [22]. Whether this variant has an effect on cancer predisposition remains controversial/inconclusive [23].

**Fig. 5: Location of *TP53* SNPs in the protein structure.**

All eight DBD variants affected residues on or close to the protein surface (Figs. 5B and S10a–f). Four of them, p.Y107H, p.G154S, p.P219L, and p.P222L, clustered on a surface patch, distant from the DNA-binding surface (and other functional interfaces), around the site of the cancer hot spot mutation p.Y220C. This latter creates an extended surface crevice and is highly destabilizing; it lowers the melting temperature of the DBD by about 8 °C, causing it to rapidly unfold and aggregate at body temperature [24]. Structural models of the variant DBDs indicated minor local structural rearrangements at the mutation sites. Stability estimates using the HoTMuSiC server predicted only relatively small changes of <2 °C in the melting temperature of the DBD for all DBD SNP variants, except for p.G154S and p.P219S, which were predicted to be destabilized by 3–4 °C compared with the wild type (Table S6). This remains significantly below the stability loss observed for the most deleterious structural cancer mutations, such as p.Y220C or the classical temperature-sensitive TP53 variant p.V143A. Interestingly, Tyr107 is conserved in mammalian p53, but a histidine is found in many fish species and in the more thermodynamically stable sister proteins p63/p73. In the structure of the p.Y107H variant, the imidazole side chain of His107 potentially forms a hydrogen bond with the backbone oxygen of Ser149 (Fig. S10a).

The stability predictions were consistent with the observation of full activity at body temperature for all the variants. Although small changes in stability may have no functional impact, they may predispose carriers of the respective SNP to functional inactivation by other mildly-destabilizing mutations that would otherwise be harmless. Comparisons of the predictions with experimentally determined melting temperatures for a panel of cancer hot spot variants showed that they generally agreed well qualitatively, but that there were also notable exceptions. For example, in the case of the thermolabile p.R282W and zinc-binding deficient p.R175H cancer variants, the program failed to predict the drastic stability loss observed experimentally (Table S6). It will thus be of interest to determine the stability of the DBD SNP variants experimentally, to fully assess their impact on cancer risk.

The side chain of Ala189 in the extended L2 loop faces inwards, and the model of the p.A189V variant displayed altered hydrophobic packing against Arg196 and Tyr205 (Fig. S10c). The side chain of Asn235 in beta-strand S8 formed hydrogen bonds with two backbone oxygens (Lys139 and Tyr234) and the carboxylate group of Glu198 (Fig. S10d). This polar interaction network was lost in the p.N235S variant but may have been compensated for, at least in part, by alternative, potentially water-mediated hydrogen bonds formed by the serine hydroxyl. In the mouse structure, there is a lysine at this position, and a serine is found in various fish species.

The other two DBD variants, p.R283C and p.R290H, were located on the C-terminal helix that binds to the major groove of the p53 response elements (Fig. 5B). Arg283 sits close to the protein-DNA interface, but in most p53-DNA complexes, it does not interact directly with the phosphate backbone [25, 26], so no significant effect on DNA-binding affinity and specificity is expected. The closest distance between the guanidinium group and the phosphate backbone of DNA in Protein Data Bank (PDB) entry 3KZA, for example, is more than 6 Å. In some fish species and the paralogs p63/p73, a lysine is found at this position. The Arg290 side chain does not interact with the rest of the DBD and is generally disordered in DBD crystal structures, as for example, in PDB entries 2XWR [27] and 3KZ8 [28]. This residue is not conserved across vertebrate p53; a histidine is found at this position in beluga whale and feline p53.

Variant p.T312S was located in the adjacent flexible linker between the DBD and the tetramerization domain. Only one variant, p.E339K, was located in the tetramerization domain itself. Glu339 sits on the surface of the tetramer, facing the solvent, and is not involved in any inter-subunit contacts stabilizing the tetramer (Fig. 5C). Therefore, no major effects on the oligomerization equilibrium of p53 were expected for the lysine variant. This variant is, for example, also found in hamsters.

The remaining three variants affected the C-terminal regulatory region, an intrinsically disordered promiscuous binding module that can adopt different structural motifs, depending on posttranslational modification patterns and binding partners [22, 29]. Interestingly, one of those variants, p.S366A, removed a C-terminal phosphorylation site. Phosphorylation of Ser366 (and other serines in this region) by CHK1 and CHK2 modulates p53 C-terminal acetylation and the transactivation of p53 target genes [30].

ACMG classification of TP53 variants

Variant classification according to guidelines proposed by the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology are very useful for clinicians assessing genetic analyses [31]. However, the classification of TP53 variants according to these guidelines remains highly controversial. In the wake of heated debates, several ACMG-derived criteria have been suggested (Table S5) [32, 33]. Furthermore, TP53 variant classification according to the ClinVar platform shows that only a single TP53 variant (p.P72R) has been classified benign with three stars, whereas the entries for other variants have remained controversial (Table S5). This debate is partially due to the fact that the ACMG criteria were initially defined for Mendelian disorders in a specific clinical context. Also, they include many parameters that are not suitable for cancer. Furthermore, the prediction software, as well as population criteria used for an analysis, can influence the various scores. As TP53 variants may be somatic or germline, the ACMG guidelines must be adapted to ensure accurate classification. Allele frequency in the human population is a strong criterion to define a variant as benign. A minor allele frequency (MAF) threshold of 0.001 and minimal allele count (AC) of five in the population are currently used to define the stand-alone BA1 criterion, which is sufficient to classify a variant as benign. In the present study, using the three aggregated datasets, only rs1042522 (p.P72R) could be ranked as BA1 (Table 1 and Fig. 6A). One of the main issues with an aggregated database including multiple ethnic populations is the dilution of the signals for variants specific to an ethnic group. Furthermore, these databases are skewed toward European ancestry (more than 50% for gnomAD), an aspect that can bias analyses. Using the aforementioned criteria, we analyzed the TP53 variants using the eight population subsets from gnomAD and identified only two new BA1 variants, i.e., p.V31I in the Asian population (AF = 0.003 and AC = 65) and p.Y107H in the African population (AF = 0.00116 and AC = 23). The analysis of the 11 national datasets was far more informative, leading finally to the identification of nine TP53 variants that should be defined as benign using ACMG criteria (Tables 1, S5 and Figs. 6A, S11). Among those nine variants were the five EAS-specific SNP variants described above. Similarly, the analysis for the strong BS1 criterion (MAF ≥ 0.0003 and AC ≥ 5) with the whole-gnomAD database did not uncover new variants, whereas that of the various datasets led to BS1 classification for eight variants (Tables 1, S5 and Figs. 6A, S11). BS1 classification requires a supporting criterion, such as BS3 defined by the loss of activity using “well established in vitro or in vivo assays”. Combining these criteria based on population frequency and multiple functional studies, including the present work, led to the classification of 16 of the set 21 variants as benign (Tables 1, S5 and Figs. 6A, S11). For the remaining six TP53 variants, other ACMG criteria were too heterogeneous for a clear-cut classification. We thus defined them as variants of uncertain significance (VUS).

**Fig. 6: New *TP53* SNPs are spread out in the human population.**

Discussion

Until recently, human genetic variation studies were largely restricted to constitutional variants occurring at frequencies above roughly 1–5%. That threshold was due to not only technological and cost limitations but also our own lack of knowledge of human genome diversity. Consequently, most medical genetic analyses were developed based on information derived from these common variants. Today, high-throughput DNA sequencing technologies enable the analysis of thousands of samples and the identification of infrequent SNPs. Recent explosive population growth has led to a great number of rare SNPs occurring at frequencies below 0.5% [34, 35]. These low-frequency variants are rich in protein-coding sequences that could predispose to disease. For genes with high clinical impact, it is therefore essential to (i) identify all relevant variants present in the human population and (ii) define their potential pathogenicity.

The TP53 gene is a paradigm for such analyses. Indeed, the accurate diagnosis of somatic mutations in it is mandatory in multiple clinical settings, for example in chronic lymphocytic leukemia, where it is used to improve patient stratification and optimize therapeutic decisions [2]. Furthermore, the diagnosis of germline TP53 mutations for genetic counseling is proving pertinent in such pathologies as hereditary breast and ovarian cancer syndrome and pediatric cancer [1, 36].

Recently, the ACMG—Association for Molecular Pathology provided a systematic method for the interpretation of sequence variants for genetic disorders [31]. Unfortunately, a high fraction of variants is relegated to uncertain significance (VUS), often due to the rarity and conflictual nature of results in existing databases and the primary literature. Several recent publications sought to improve the classification of TP53 variants but their results were heterogeneous and thus a source of discord.

Historically, two exonic missense TP53 SNPs have been identified and defined as benign constitutional variants [37]. Somatic TP53 gene mutations have been described in more than 100,000 patients. However, in most of these cases, matched normal DNA was not available and, thus, the somatic origin of the mutation was determined by consulting reference databases. It is therefore highly likely that some of those variants were infrequent constitutional variants mistakenly identified as somatic mutations.

In the present study, using a combination of in silico, in vitro, structural, and genetic approaches, we addressed a two-tiered question: (i) is it possible to identify novel and infrequent TP53 SNPs in the human population and (ii) what is the potential pathogenicity of these variants?

The use of multiple independent population databases has proven very helpful for the selection of novel potential SNPs. Although gnomAD is an outstanding resource for population analyses, the addition of novel datasets of individuals from different ethnic groups has led to a more accurate selection of potential SNPs [38].

In silico predictive software enables the assessment of the effects of amino acid substitutions on protein structure or function with, in principle, no imperative need for functional studies. However, in silico and functional approaches each have their own strengths and weaknesses. Depending on the algorithm, the performance of in silico approaches will vary based on the specific gene and protein being considered. As all algorithms focus on protein function based on the homology of sequences and/or the physical properties of amino acids, confusion has arisen between deleteriousness and pathogenicity, which should be considered as two different features. Great caution has been advised for the use of these algorithms, including for TP53 [39,40,41]. Concerning this latter, mutations in the mdm2 binding domain, localized in the amino-terminus of the protein, are predicted to be deleterious and consequently classified as pathogenic. In fact, these variants are known to impair cell growth and ultimately lead to cell death, which is the opposite of what would be expected for cell transformation. This aspect is supported by the total absence of cancer-associated mutation in the two highly conserved codons 19 and 23 of this TP53 protein domain [42].

One major advantage for TP53 is the availability of three unbiased large-scale functional studies for all potent TP53 missense variants [16,17,18]. A correlation analysis showed excellent agreement between the three datasets despite their different readouts [43]. Although predicted as “nonfunctional” by several algorithms, variants such as p.N235S or p.E11Q do not display loss of activity. This confusion has led to a controversial classification of several TP53 variants. p.N235S has been defined as pathogenic, likely pathogenic or VUS in different studies [32, 44,45,46]. It remains defined as pathogenic in the COSMIC database (version 92) despite the fact that it is considered as likely benign in Clinvar since spring 2020 and was found to be thus in the present study as well. For TP53, we strongly advocate for the use of functional studies to assess TP53 variant deleteriousness.

The consequences of TP53 mutation on protein function and their repercussions on the various pathways regulated by TP53 are highly heterogeneous. Although a few variants can be considered as amorph with a total loss of function, the complex structure of the TP53 protein is known to give rise to variants with multiple consequences with hypomorph, neomorph or antimorph properties [47,48,49]. TP53 regulates transcription through specific binding to highly degenerated response elements in promoters or introns of target genes [50]. The affinity of TP53 for these various biological sites is variable, and several TP53 variants can display only partial loss of their DNA-binding activity, allowing the mutant protein to bind only to a subset of TP53 response elements. This feature is linked to differential transactivation activities that may depend on p53 protein levels, as well as target sequences [51, 52]. This heterogeneity for both wild-type and mutant TP53 is reflected by our transcriptional analysis performed on endogenous genes (Fig. S7). Genes activated by wild-type TP53 have been extensively analyzed and scrutinized in normal and tumor cells. However, recent studies have shown that genes repressed by TP53 are also key players in the tumor-suppressive effect of TP53. Within this latter aspect is the downregulation of cell-cycle genes via the p53–WAF1–DREAM–E2F/CHR pathway (p53–DREAM pathway). Upon p53 induction, p21^WAF1/CIP1 activates the transcriptional repressor DREAM, leading to the shutdown of more than 200 cell-cycle genes [53, 54]. Derepression of these genes has been shown to be of better prognostic value than TP53 mutations [42]. Whether or not the loss of function of TP53 variants toward the WAF1 promoter and the DREAM pathway is essential for promoting tumorigenicity remains to be elucidated. As of this writing, there is no “gold standard” assay to define a TP53, suppressive effect, but the observation of defective transcription in the large majority of TP53 variants found in human cancer indicates that this activity is a key target. Although none of the variants in the present study displayed any obvious functional defects, we cannot exclude that any one of them may be partially defective for a particular function. Nonetheless, their high frequency in the human population and low frequency in cancer mutation databases suggest that any deleterious clinical impacts will be very low.

Using different functional assays, we showed that a subset of 21 germline TP53 variants found recurrently in multiple population datasets from different ethnic groups was indistinguishable from wild-type TP53. Through the analysis of multiple national datasets, we were able to definitively validate nine of the 21 variants as BA1, i.e., the ACMG stand-alone criterion for benignity. Seven other variants met the ACMG criterion BS1 + BS3, also indicating benignity. Thus, we identified 16 variants that may only be defined as natural polymorphisms. Among those variants, we identified for the first time five variants that were specific to the Asian population (present only at very low frequency in other populations). Our analysis of the UMD_TP53 database confirmed the Asian origin of the five variants and furthermore showed that they were mistakenly defined as somatic pathogenic mutations. We also identified another TP53 variant specific to the Indian population. Our analysis emphasizes the need to exercise great caution when using population datasets to check patient cohorts. Matching the ethnicity of the patient or the cohort to the reference datasets appears essential for increasing the accuracy of the analysis [55]. Also, supplementing these datasets with data from normal populations and from all geographical areas is necessary to minimize variant misclassification. For example, the Asian variant rs201753350 (p.V31I), classified as benign (BA1) in our analysis using multiple criteria, had been classified as pathogenic in a family with gastric cancer in the Japanese population [56]. Such “false positive” results could have detrimental consequences for patients and families, who may end up experiencing anxiety, undergoing unnecessary examinations, etc.

We cannot exclude that a number of the variants described in the present study may be associated with specific hereditary traits associated or not with cancer, or with other genetic characteristics. To illustrate this point, variant rs1042522 (p.P72R) does not impair TP53 activity but whether it is associated with an increased risk of cancer remains highly controversial [23]. Similarly, variant rs1800371 (p.P47S), restricted to the African population, has been shown to be associated with an increased risk of breast cancer [57].

Most of the variants described in set 21 are described as pathogenic or likely pathogenic in multiple databases and pipelines for the assessment of pathogenicity. To improve the accuracy of TP53 mutation diagnostics, they should be updated with the present data. The 2020 release of the UMD_TP53 database as well as the web application Seshat have been updated with these novel data.

This analysis was made possible by the numerous functional and structural analyses available for TP53. Unfortunately, the quantity of research for other cancer genes is much more limited. Nevertheless, the availability of large, independent datasets representing ethnic populations should be sufficient to identify novel benign variants and emphasize genetic heterogeneity. For an accurate assessment of somatic and germline variants, human genome reference databases used in analytical pipelines must include accurate ethnicity information to ensure that genetic analyses provide the most pertinent information possible for patients.

Materials and methods

Details on the cell lines, western blot, luciferase assay, plasmid construction, and flow cytometry analysis for the experimental validation of TP53 variants identified in this study are provided in the Supplementary Information.

Population datasets

TP53 variants from a range of population databases were downloaded from their respective portals (Table S1). The origin of each dataset was carefully checked to avoid redundancies. Because a part of the 1000 Genomes Project was included in gnomAD, the data of the former were not used in the present study. To ensure accurate comparisons between the various datasets, data from all databases were obtained using the genomic coordinates of the entire TP53 gene as defined by RefSeq. In a first step, minimal genomic data, such as genomic coordinates and genetic events, were extracted from each dataset to define the correct annotation according to HGVS recommendations. In a second step, variant annotation was validated by using the Name Checker tool developed by Mutalyzer (https://mutalyzer.nl/). This latter handles all types of TP53 gene variations, such as substitutions, insertions, duplications, deletions, and more complex insertion/deletions [58]. The current version (2.0.26) of Mutalyzer uses the stable NCBI sequence NG_017013.2 as reference for TP53. This is an essential issue for avoiding problems due to the use of multiple genome references (NCBI Build 36.1/hg18, Genome Reference Consortium GRCh37/hg19 or Genome Reference Consortium GRCh38/hg38) by the various NGS pipelines as well as noncompliant nomenclature. To minimize database contamination by potential carriers of germline TP53 variants, the non-cancer version of gnomAD was used in the present analysis. Taken together, exonic TP53 variants issued from gnomAD correspond to 56% of the variants issued from the 14 cohorts.

Cancer-associated TP53 variants were downloaded from the UMD_TP53 database (Released 2017), which includes 80,402 mutations (6870 different TP53 variants) identified in tumors, in cell lines (somatic mutations) or in patients with hereditary cancers (germline mutations) [37]. This release includes mutations identified by conventional sequencing or by NGS (Table S1).

The sequence nomenclature used for TP53 variants in this work is in accordance with the Human Genome Variation Society’s guidelines using the NM_000546.5 transcript sequence and the full-length protein NP_000537.3 [59].

We previously defined and validated a specific set of pathogenic TP53 variants called the “CSD” [15]. Briefly, TP53 variants were extracted from four different nonoverlapping datasets on various types of cancer. The Sanger dataset includes 4299 variants extracted from the latest version of the UMD_TP53 mutation database excluding all NGS studies (Leroy et al. [37]). The other three sets of TP53 variants were compiled from large independent cancer sequencing projects: TCGA [60], ICGC [61], and MSK-IMPACT [62]. Each of these studies described more than 2000 different TP53 variants occurring at various frequencies in the coding regions or splice sites (+2/−2) of TP53. We created the CSD by combining these four datasets to define a core of 471 recurrent TP53 variants found at least once in each database [15]. The four datasets resulted from independent studies using different patients and different methodologies. We showed that shared variants were true recurrent pathogenic variants [15].

Functional analysis from large-scale mutagenesis data

The UMD_TP53 database includes three sets of functional data for TP53 variants. The first set was described in detail in a previous report [16, 63]. Briefly, haploid yeast transformants containing 2314 TP53 variants and a green fluorescent protein reporter plasmid were constructed. TP53 activity was tested by measuring the fluorescent intensity of green fluorescent protein that is controlled by eight different promoter sequences regulated by the p53 protein after 3 days of growth at 37 °C. The second set of functional data integrated in the UMD_TP53 database corresponds to the analysis of 5300 TP53 variants performed by Kotler et al. [17]. The growth activity of all variants localized in the DBD of TP53 was assessed in H1299 cells. The third set corresponds to the study by Giacomelli et al. [18]. In this study, dominant negative activities, losses of function, and responses to etoposide were analyzed in mammalian cells for 8258 TP53 variants (Table S4). Functional scores were extracted from these 12 datasets associated with the three studies and normalized for all the analyses of TP53 variants.

Selection of TP53 variants for in silico and in vitro analysis

A subset of 27 TP53 variants (set 27) were chosen for in silico analyses based on their occurrence in multiple datasets (more than 2). We also added the five Asian-specific variants and eight of the ten outlier germline variants found in the UMD_TP53 database. The variant p.T125R, found at high frequency as a germline variant in the database, was excluded because missense and synonymous mutations in this codon close to a splice site are known to impair TP53 splicing [64, 65]. The pathogenic variant p.R337H (known as the Brazilian mutation) associated with adrenocortical carcinoma was also excluded [66].

Set 21, used for in vitro analyses, is derived from set 27 using more stringent criteria. It includes variants found in multiple datasets (more than 4) at frequencies higher than 0.0005. Asian and outlier germline variants were retained but variants p.R248Q, p.R283H, and p.R156H found, respectively, in five, four, and four datasets were not included as they are well-known pathogenic variants included in the CSD. Set 21 thus included 19 novel TP53 variants as well as rs1042522 (p.P72R) and rs1800371 (p.P47S).

Molecular modeling and stability predictions

Models of p53 DBD mutants were generated using the SWISS-MODEL server [67], with PDB entry 2XWR as a template [27]. Differences in the melting temperatures of the mutant proteins compared with the wild-type protein were predicted using the HoTMuSiC server [68] based on PDB entry 2XWR and a wild-type DBD melting temperature of 46 °C.

Variant effect prediction

dbNSFP v3.5 was downloaded from the Jpopgen website (https://sites.google.com/site/jpopgen/dbNSFP). This database compiles prediction or conservation scores from multiple prediction algorithms [12]. Prediction scores from each algorithm were available as normalized data from 0 (less deleterious) to 1 (most deleterious).

Five other predictors not included in dbNSFP were also used: Revel [69], PON-P2 [70], MutPred2 [71], Mut_Pred [72], and Envision [73]. Brief descriptions of these predictors are provided in Table S4.

Literature survey and ethnicity analysis

The 2017 release of the UMD_TP53 database includes data from 4600 publications. A manual survey of the database was performed for the five TP53 variants suspected to be constitutional variants specific to the Asian population. Publications describing any one of these variants were retrieved for analysis. In most cases, the hospitals and/or institutions used for patient recruitment were available.

References

Ballinger ML, Best A, Mai PL, Khincha PP, Loud JT, Peters JA, et al. Baseline surveillance in Li-Fraumeni syndrome using whole-body magnetic resonance imaging: a meta-analysis. JAMA Oncol. 2017;3:1634–9.
Article PubMed Google Scholar
Malcikova J, Tausch E, Rossi D, Sutton LA, Soussi T, Zenz T, et al. ERIC recommendations for TP53 mutation analysis in chronic lymphocytic leukemia-update on methodological approaches and results interpretation. Leukemia 2018;32:1070–80.
Article CAS PubMed PubMed Central Google Scholar
Döhner H, Estey E, Grimwade D, Amadori S, Appelbaum FR, Büchner T, et al. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood 2017;129:424–47.
Article PubMed PubMed Central CAS Google Scholar
Karki R, Pandya D, Elston RC, Ferlini C. Defining “mutation” and “polymorphism” in the era of personal genomics. BMC Med Genom. 2015;8:37.
Article CAS Google Scholar
Musumeci L, Arthur JW, Cheung FS, Hoque A, Lippman S, Reichardt JK. Single nucleotide differences (SNDs) in the dbSNP database may lead to errors in genotyping and haplotyping studies. Hum Mutat. 2010;31:67–73.
Article CAS PubMed PubMed Central Google Scholar
Arthur JW, Cheung FS, Reichardt JK. Single nucleotide differences (SNDs) continue to contaminate the dbSNP database with consequences for human genomics and health. Hum Mutat. 2015;36:196–9.
Article CAS PubMed Google Scholar
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 2016;536:285–91.
Article CAS PubMed PubMed Central Google Scholar
Beckman G, Birgander R, Sjalander A, Saha N, Holmberg PA, Kivela A, et al. Is p53 polymorphism maintained by natural selection? Hum Hered. 1994;44:266–70.
Article CAS PubMed Google Scholar
Jennis M, Kung CP, Basu S, Budina-Kolomets A, Leu JI, Khaku S, et al. An African-specific polymorphism in the TP53 gene impairs p53 tumor suppressor function in a mouse model. Genes Dev. 2016;30:918–30.
Article CAS PubMed PubMed Central Google Scholar
Felley-Bosco E, Weston A, Cawley HM, Bennett WP, Harris CC. Functional studies of a germ-line polymorphism at codon 47 within the p53 gene. Am J Hum Genet. 1993;53:752–9.
CAS PubMed PubMed Central Google Scholar
Edlund K, Larsson O, Ameur A, Bunikis I, Gyllensten U, Leroy B, et al. Data-driven unbiased curation of the TP53 tumor suppressor gene mutation database and validation by ultradeep sequencing of human tumors. Proc Natl Acad Sci USA. 2012;109:9551–6.
Article CAS PubMed PubMed Central Google Scholar
Liu X, Wu C, Li C, Boerwinkle E. dbNSFP v3.0: a one-stop database of functional predictions and annotations for human nonsynonymous and splice-site SNVs. Hum Mutat. 2016;37:235–41.
Article PubMed PubMed Central CAS Google Scholar
Peng G, Bojadzieva J, Ballinger ML, Li J, Blackford AL, Mai PL, et al. Estimating TP53 mutation carrier probability in families with Li-Fraumeni syndrome using LFSPRO. Cancer Epidemiol Biomark Prev. 2017;26:837–44.
Article CAS Google Scholar
Gonzalez KD, Noltner KA, Buzin CH, Gu D, Wen-Fong CY, Nguyen VQ, et al. Beyond Li Fraumeni syndrome: clinical characteristics of families with p53 germline mutations. J Clin Oncol. 2009;27:1250–6.
Article CAS PubMed Google Scholar
Soussi T, Leroy B, Devir M, Rosenberg S. High prevalence of cancer-associated TP53 variants in the gnomAD database: a word of caution concerning the use of variant filtering. Hum Mutat. 2019;40:516–24.
CAS PubMed Google Scholar
Kato S, Han SY, Liu W, Otsuka K, Shibata H, Kanamaru R, et al. Understanding the function-structure and function-mutation relationships of p53 tumor suppressor protein by high-resolution missense mutation analysis. Proc Natl Acad Sci U S A. 2003;100:8424–9.
Article CAS PubMed PubMed Central Google Scholar
Kotler E, Shani O, Goldfeld G, Lotan-Pompan M, Tarcic O, Gershoni A, et al. A systematic p53 mutation library links differential functional impact to cancer mutation pattern and evolutionary conservation. Mol Cell. 2018;71:178–90.e8.
Article CAS PubMed Google Scholar
Giacomelli AO, Yang X, Lintner RE, McFarland JM, Duby M, Kim J, et al. Mutational processes shape the landscape of TP53 mutations in human cancer. Nat Genet. 2018;50:1381–7.
Article CAS PubMed PubMed Central Google Scholar
Ishioka C, Frebourg T, Yan YX, Vidal M, Friend SH, Schmidt S, et al. Screening patients for heterozygous p53 mutations using a functional assay in yeast. Nat Genet. 1993;5:124–9.
Article CAS PubMed Google Scholar
Grochova D, Vankova J, Damborsky J, Ravcukova B, Smarda J, Vojtesek B, et al. Analysis of transactivation capability and conformation of p53 temperature-dependent mutants and their reactivation by amifostine in yeast. Oncogene 2008;27:1243–52.
Article CAS PubMed Google Scholar
Lee CW, Martinez-Yamout MA, Dyson HJ, Wright PE. Structure of the p53 transactivation domain in complex with the nuclear receptor coactivator binding domain of CREB binding protein. Biochemistry. 2010;49:9964–71.
Article CAS PubMed Google Scholar
Joerger AC, Fersht AR. Structural biology of the tumor suppressor p53. Annu Rev Biochem. 2008;77:557–82.
Article CAS PubMed Google Scholar
Dahabreh IJ, Schmid CH, Lau J, Varvarigou V, Murray S, Trikalinos TA. Genotype misclassification in genetic association studies of the rs1042522 TP53 (Arg72Pro) polymorphism: a systematic review of studies of breast, lung, colorectal, ovarian, and endometrial cancer. Am J Epidemiol. 2013;177:1317–25.
Article PubMed PubMed Central Google Scholar
Joerger AC, Ang HC, Fersht AR. Structural basis for understanding oncogenic p53 mutations and designing rescue drugs. Proc Natl Acad Sci U S A. 2006;103:15056–61.
Article CAS PubMed PubMed Central Google Scholar
Kitayner M, Rozenberg H, Rohs R, Suad O, Rabinovich D, Honig B, et al. Diversity in DNA recognition by p53 revealed by crystal structures with Hoogsteen base pairs. Nat Struct Mol Biol. 2010;17:423–9.
Article CAS PubMed PubMed Central Google Scholar
Schmidt M, Bachhuber A, Victor A, Steiner E, Mahlke M, Lehr HA, et al. p53 expression and resistance against paclitaxel in patients with metastatic breast cancer. J Cancer Res Clin Oncol. 2003;129:295–302.
Article CAS PubMed Google Scholar
Natan E, Baloglu C, Pagel K, Freund SM, Morgner N, Robinson CV, et al. Interaction of the p53 DNA-binding domain with Its N-terminal extension modulates the stability of the p53 tetramer. J Mol Biol. 2011;409:358–68.
Article CAS PubMed PubMed Central Google Scholar
Kitayner M, Rozenberg H, Kessler N, Rabinovich D, Shaulov L, Haran TE, et al. Structural basis of DNA recognition by p53 tetramers. Mol Cell. 2006;22:741–53.
Article CAS PubMed Google Scholar
Joerger AC, Fersht AR. The p53 pathway: origins, inactivation in cancer, and emerging therapeutic approaches. Annu Rev Biochem. 2016;85:375–404.
Article CAS PubMed Google Scholar
Ou YH, Chung PH, Sun TP, Shieh SY. p53 C-terminal phosphorylation by CHK1 and CHK2 participates in the regulation of DNA-damage-induced C-terminal acetylation. Mol Biol Cell. 2005;16:1684–95.
Article CAS PubMed PubMed Central Google Scholar
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–24.
Article PubMed PubMed Central Google Scholar
de Andrade KC, Frone MN, Wegman-Ostrosky T, Khincha PP, Kim J, Amadou A, et al. Response to: concern regarding classification of germlineTP53 variants as likely pathogenic. Hum Mutat. 2019;40:832–3.
Article PubMed Google Scholar
Evans DG, Turnbull C, Woodward ER. Concern regarding classification of germline TP53 variants as likely pathogenic. Hum Mutat. 2019;40:828–31.
Article PubMed Google Scholar
Keinan A, Clark AG. Recent explosive human population growth has resulted in an excess of rare genetic variants. Science 2012;336:740–3.
Article CAS PubMed PubMed Central Google Scholar
Coventry A, Bull-Otterson LM, Liu X, Clark AG, Maxwell TJ, Crosby J, et al. Deep resequencing reveals excess rare recent variants consistent with explosive population growth. Nat Commun. 2010;1:131.
Article PubMed CAS Google Scholar
Gröbner SN, Worst BC, Weischenfeldt J, Buchhalter I, Kleinheinz K, Rudneva VA, et al. The landscape of genomic alterations across childhood cancers. Nature 2018;555:321–7.
Article PubMed CAS Google Scholar
Leroy B, Ballinger ML, Baran-Marszak F, Bond GL, Braithwaite A, Concin N, et al. Recommended guidelines for validation, quality control, and reporting of TP53 variants in clinical practice. Cancer Res. 2017;77:1250–60.
Article CAS PubMed PubMed Central Google Scholar
Koch L. Exploring human genomic diversity with gnomAD. Nat Rev Genet. 2020;21:448.
Article CAS PubMed Google Scholar
Woods NT, Baskin R, Golubeva V, Jhuraney A, De-Gregoriis G, Vaclova T, et al. Functional assays provide a robust tool for the clinical annotation of genetic variants of uncertain significance. NPJ Genom Med. 2016;1:16001.
Article CAS PubMed PubMed Central Google Scholar
Flanagan SE, Patch AM, Ellard S. Using SIFT and PolyPhen to predict loss-of-function and gain-of-function mutations. Genet Test Mol Biomark. 2010;14:533–7.
Article CAS Google Scholar
Fortuno C, James PA, Young EL, Feng B, Olivier M, Pesaran T, et al. Improved, ACMG-compliant, in silico prediction of pathogenicity for missense substitutions encoded by TP53 variants. Hum Mutat. 2018;39:1061–9.
Article CAS PubMed PubMed Central Google Scholar
Donehower LA, Soussi T, Korkut A, Liu Y, Schultz A, Cardenas M, et al. Integrated analysis of TP53 gene and pathway alterations in The Cancer Genome Atlas. Cell Rep. 2019;28:1370–84.e5.
Article CAS PubMed PubMed Central Google Scholar
Carbonnier V, Leroy B, Rosenberg S, Soussi T. Comprehensive assessment of TP53 loss of function using multiple combinatorial mutagenesis libraries. Sci Rep. 2020; in press.
de Andrade KC, Mirabello L, Stewart DR, Karlins E, Koster R, Wang M, et al. Higher-than-expected population prevalence of potentially pathogenic germline TP53 variants in individuals unselected for cancer history. Hum Mutat. 2017;38:1723–30.
Article PubMed PubMed Central CAS Google Scholar
de Andrade KC, Frone MN, Wegman-Ostrosky T, Khincha PP, Kim J, Amadou A, et al. Variable population prevalence estimates of germline TP53 variants: a gnomAD-based analysis. Hum Mutat. 2019;40:97–105.
Article PubMed CAS Google Scholar
Fortuno C, Cipponi A, Ballinger ML, Tavtigian SV, Olivier M, Ruparel V, et al. A quantitative model to predict pathogenicity of missense variants in the TP53 gene. Hum Mutat. 2019;40:788–800.
Article CAS PubMed Google Scholar
Oren M, Rotter V. Mutant p53 gain-of-function in cancer. Cold Spring Harb Perspect Biol. 2010;2:a001107.
Article PubMed PubMed Central CAS Google Scholar
Soussi T, Wiman KG. TP53: an oncogene in disguise. Cell Death Differ. 2015;22:1239–49.
Article CAS PubMed PubMed Central Google Scholar
Kastenhuber ER, Lowe SW. Putting p53 in Context. Cell 2017;170:1062–78.
Article CAS PubMed PubMed Central Google Scholar
Riley T, Sontag E, Chen P, Levine A. Transcriptional control of human p53-regulated genes. Nat Rev Mol Cell Biol. 2008;9:402–12.
Article CAS PubMed Google Scholar
Inga A, Storici F, Darden TA, Resnick MA. Differential transactivation by the p53 transcription factor is highly dependent on p53 level and promoter target sequence. Mol Cell Biol. 2002;22:8612–25.
Article CAS PubMed PubMed Central Google Scholar
Campomenosi P, Monti P, Aprile A, Abbondandolo A, Frebourg T, Gold B, et al. p53 mutants can often transactivate promoters containing a p21 but not Bax or PIG3 responsive elements. Oncogene 2001;20:3573–9.
Article CAS PubMed Google Scholar
Engeland K. Cell cycle arrest through indirect transcriptional repression by p53: I have a DREAM. Cell Death Differ. 2018;25:114–32.
Article CAS PubMed Google Scholar
Fischer M, Grossmann P, Padi M, DeCaprio JA. Integration of TP53, DREAM, MMB-FOXM1 and RB-E2F target gene analyses identifies cell cycle gene regulatory networks. Nucleic Acids Res. 2016;44:6070–86.
Article CAS PubMed PubMed Central Google Scholar
Sirugo G, Williams SM, Tishkoff SA. The missing diversity in human genetic studies. Cell 2019;177:26–31.
Article CAS PubMed PubMed Central Google Scholar
Yamada H, Shinmura K, Okudela K, Goto M, Suzuki M, Kuriki K, et al. Identification and characterization of a novel germ line p53 mutation in familial gastric cancer in the Japanese population. Carcinogenesis 2007;28:2013–8.
Article CAS PubMed Google Scholar
Murphy ME, Liu S, Yao S, Huo D, Liu Q, Dolfi SC, et al. A functionally significant SNP in TP53 and breast cancer risk in African-American women. NPJ Breast Cancer. 2017;3:5.
Article PubMed PubMed Central CAS Google Scholar
Wildeman M, van Ophuizen E, den Dunnen JT, Taschner PE. Improving sequence variant descriptions in mutation databases and literature using the Mutalyzer sequence variation nomenclature checker. Hum Mutat. 2008;29:6–13.
Article CAS PubMed Google Scholar
den Dunnen JT, Dalgleish R, Maglott DR, Hart RK, Greenblatt MS, McGowan-Jordan J, et al. HGVS recommendations for the description of sequence variants: 2016 update. Hum Mutat. 2016;37:564–9.
Article CAS Google Scholar
Network CGAR, Weinstein JN, Collisson EA, Mills GB, Shaw KR, Ozenberger BA, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45:1113–20.
Article CAS Google Scholar
Hudson TJ, Anderson W, Artez A, Barker AD, Bell C, Bernabe RR, et al. International network of cancer genome projects. Nature 2010;464:993–8.
Article CAS PubMed Google Scholar
Zehir A, Benayed R, Shah RH, Syed A, Middha S, Kim HR, et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat Med. 2017;23:703–13.
Article CAS PubMed PubMed Central Google Scholar
Soussi T, Kato S, Levy PP, Ishioka C. Reassessment of the TP53 mutation database in human disease by data mining with a library of TP53 missense mutations. Hum Mutat. 2005;25:6–17.
Article CAS PubMed Google Scholar
Holmila R, Fouquet C, Cadranel J, Zalcman G, Soussi T. Splice mutations in the p53 gene: case report and review of the literature. Hum Mutat. 2003;21:101–2.
Article CAS PubMed Google Scholar
Supek F, Miñana B, Valcárcel J, Gabaldón T, Lehner B. Synonymous mutations frequently act as driver mutations in human cancers. Cell 2014;156:1324–35.
Article CAS PubMed Google Scholar
Ribeiro RC, Sandrini F, Figueiredo B, Zambetti GP, Michalkiewicz E, Lafferty AR, et al. An inherited p53 mutation that contributes in a tissue-specific manner to pediatric adrenal cortical carcinoma. Proc Natl Acad Sci U S A. 2001;98:9330–5.
Article CAS PubMed PubMed Central Google Scholar
Bienert S, Waterhouse A, de Beer TA, Tauriello G, Studer G, Bordoli L, et al. The SWISS-MODEL Repository-new features and functionality. Nucleic Acids Res. 2017;45:D313–19.
Article CAS PubMed Google Scholar
Pucci F, Bourgeas R, Rooman M. Predicting protein thermal stability changes upon point mutations using statistical potentials: introducing HoTMuSiC. Sci Rep. 2016;6:23257.
Article CAS PubMed PubMed Central Google Scholar
Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. 2016;99:877–85.
Article CAS PubMed PubMed Central Google Scholar
Niroula A, Urolagin S, Vihinen M. PON-P2: prediction method for fast and reliable identification of harmful variants. PLoS ONE. 2015;10:e0117380.
Article PubMed PubMed Central CAS Google Scholar
Pejaver V, Urresti J, Lugo-Martinez J, Pagel KA, Lin GN, Nam H, et al. MutPred2: inferring the molecular and phenotypic impact of amino acid variants. bioRxiv. https://doi.org/10.1101/134981.
Li B, Krishnan VG, Mort ME, Xin F, Kamati KK, Cooper DN, et al. Automated inference of molecular mechanisms of disease from amino acid substitutions. Bioinformatics 2009;25:2744–50.
Article CAS PubMed PubMed Central Google Scholar
Gray VE, Hause RJ, Luebeck J, Shendure J, Fowler DM. Quantitative missense variant effect prediction using large-scale mutagenesis data. Cell Syst. 2018;6:116–24.e3.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This study was supported by the Radiumhemmets Forskningsfonder (to TS), the Czech Science Foundation (GA19-15737S) (to SP), and the German Research Foundation (DFG) grant JO 1473/1-1 (to ACJ). We would also like to thank the core facility at Novum, BEA, Bioinformatics and Expression Analysis, which is supported by the board of research at the Karolinska Institute and the research committee at the Karolinska hospital. We thank Vladimir Bykov (Bioclinicum, Karolinska Institute) for help with the FACS analysis, Patrick Muller (BEA, Karolinska Institute) for help in QPCR analyses, and Alan Sauvat (INSERM U1138, Centre de Recherche des Cordeliers) for data analyses.

Author information

Thierry Soussi
Present address: Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
Thierry Soussi
Present address: Cell Death and Drug Resistance in Lymphoproliferative Disorders Team, INSERM U1138, Centre de Recherche des Cordeliers, Paris, France

Authors and Affiliations

Equipe Labellisée par la Ligue Contre le Cancer, Université Paris Descartes, Université Sorbonne Paris Cité, Université Paris Diderot, Sorbonne Université, INSERM U1138, Centre de Recherche des Cordeliers, Paris, France
Flora Doffe, Vincent Carbonnier, Manon Tissier, Isabelle Martins, Guido Kroemer & Thierry Soussi
Department of Oncology–Pathology, Bioclinicum, Karolinska Institutet, Stockholm, Sweden
Flora Doffe, Klas G. Wiman & Thierry Soussi
Department of Life Science, Sorbonne Université, Paris, France
Bernard Leroy & Thierry Soussi
Metabolomics and Cell Biology Platforms, Institut Gustave Roussy, Villejuif, France
Isabelle Martins & Guido Kroemer
Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
Johanna S. M. Mattsson & Patrick Micke
Department of Internal Medicine—Hematology and Oncology, University Hospital Brno and Faculty of Medicine, Masaryk University, Brno, Czech Republic
Sarka Pavlova & Sarka Pospisilova
Central European Institute of Technology, Masaryk University, Brno, Czech Republic
Sarka Pavlova & Sarka Pospisilova
Faculty of Science, Department of Experimental Biology, Masaryk University, Brno, Czech Republic
Jana Smardova
Institute of Pharmaceutical Chemistry, Johann Wolfgang Goethe University, Max-von-Laue-Str. 9, 60438, Frankfurt am Main, Germany
Andreas C. Joerger
Buchmann Institute for Molecular Life Sciences and Structural Genomics Consortium (SGC), Max-von-Laue-Str. 15, 60438, Frankfurt am Main, Germany
Andreas C. Joerger
Pôle de Biologie, Hôpital Européen Georges Pompidou, AP-HP, Paris, France
Guido Kroemer
Department of Women’s and Children’s Health, Karolinska University Hospital, Stockholm, Sweden
Guido Kroemer

Authors

Flora Doffe
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Carbonnier
View author publications
You can also search for this author in PubMed Google Scholar
Manon Tissier
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Leroy
View author publications
You can also search for this author in PubMed Google Scholar
Isabelle Martins
View author publications
You can also search for this author in PubMed Google Scholar
Johanna S. M. Mattsson
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Micke
View author publications
You can also search for this author in PubMed Google Scholar
Sarka Pavlova
View author publications
You can also search for this author in PubMed Google Scholar
Sarka Pospisilova
View author publications
You can also search for this author in PubMed Google Scholar
Jana Smardova
View author publications
You can also search for this author in PubMed Google Scholar
Andreas C. Joerger
View author publications
You can also search for this author in PubMed Google Scholar
Klas G. Wiman
View author publications
You can also search for this author in PubMed Google Scholar
Guido Kroemer
View author publications
You can also search for this author in PubMed Google Scholar
Thierry Soussi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

KGW, GK, and TS designed research. FD, MT, JS, and IM performed research; JSMM, PM, SP, and SP performed clinical analyses. VC and BL contributed new reagents/analytic tools. ACJ performed structural analyses; FD and TS analyzed data. TS wrote the paper with input from all authors.

Corresponding author

Correspondence to Thierry Soussi.

Ethics declarations

Conflict of interest

KGW is a cofounder and a shareholder of Aprea Therapeutics AB, a company that develops p53-based cancer therapies including APR-246. KGW is a member of its Clinical Advisory Board and has received salary and research funding from Aprea Therapeutics AB.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Edited by G. Melino

Supplementary information

Supplementary Figures and Tables

Table S2

Table S3

Table S5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Doffe, F., Carbonnier, V., Tissier, M. et al. Identification and functional characterization of new missense SNPs in the coding region of the TP53 gene. Cell Death Differ 28, 1477–1492 (2021). https://doi.org/10.1038/s41418-020-00672-0

Download citation

Received: 24 March 2020
Revised: 03 November 2020
Accepted: 04 November 2020
Published: 30 November 2020
Issue Date: May 2021
DOI: https://doi.org/10.1038/s41418-020-00672-0

This article is cited by

COSMIC-based mutation database enhances identification efficiency of HLA-I immunopeptidome
- Fangzhou Wang
- Zhenpeng Zhang
- Shichun Lu
Journal of Translational Medicine (2024)