FUT1 variants responsible for Bombay or para-Bombay phenotypes in a database

Soejima, Mikiko; Koda, Yoshiro

doi:10.1038/s41598-023-44731-1

Download PDF

Article
Open access
Published: 14 October 2023

FUT1 variants responsible for Bombay or para-Bombay phenotypes in a database

Mikiko Soejima¹ &
Yoshiro Koda¹

Scientific Reports volume 13, Article number: 17447 (2023) Cite this article

1540 Accesses
1 Citations
Metrics details

Subjects

Abstract

Rare individuals with Bombay and para-Bombay phenotypes lack or have weak expression of the ABO(H) antigens on surface of red blood cells due to no or very weak H-type α(1,2)fucosyltransferase activity encoded by FUT1. These phenotypes are clinically important because subjects with these phenotypes can only accept transfusions of autologous blood or blood from subjects with the same phenotypes due to the anti-H antibody. To survey FUT1 alleles involved in Bombay and para-Bombay phenotypes, the effect of 22 uncharacterized nonsynonymous SNPs in the Erythrogene database on the α(1,2)fucosyltransferase activity were examined by transient expression studies and in silico analysis using four different online software tools. Two nonfunctional alleles (FUT1 with c.503C>G and c.749G>C) and one weakly functional allele (with c.799T>C) were identified in transient expression studies, while the software predicted that the proteins encoded by more alleles including these would be impaired. Because both nonfunctional FUT1 alleles appear to link to the nonsecretor alleles, homozygotes of these alleles would be of the Bombay phenotype. The present results suggest that functional assays are useful for characterization of nonsynonymous SNPs of FUT1 when their phenotypes are not available.

Systematic sequence analysis of the FUT3 gene identifies 11 novel alleles in the Sindhi and Punjabi populations from Pakistan

Article Open access 26 March 2020

Survey and characterization of nonfunctional alleles of FUT2 in a database

Article Open access 04 February 2021

Novel variants in Krueppel like factor 1 that cause persistence of fetal hemoglobin in In(Lu) individuals

Article Open access 17 September 2021

Introduction

The H blood group antigen is synthesized by α(1,2)fucosyltransferase and is an essential precursor for the synthesis of A and B antigens in the presence of the corresponding A or B transferases¹. Humans have two types of α(1,2)fucosyltransferase, encoded by FUT1 and FUT2. FUT1 encodes the H-type α(1,2)fucosyltransferase (H enzyme) which determines expression of the H antigen in the erythroid lineage, whereas FUT2 encodes a secretor-type α(1,2)fucosyltransferase (Se enzyme) that controls expression of the H antigen in a variety of secretory epithelia and saliva^2,3. FUT1 and FUT2 have approximately 70% DNA sequence similarity and are located 35-kb from each other on chromosome 19q13.3^4,5,6.

Two H-deficient red cell phenotypes due to H enzyme deficiency have been recognized: the Bombay phenotype (H– phenotype), in which H, A, and B antigens are completely absent on erythrocytes, saliva, and body fluids, and the para-Bombay phenotype (weak H phenotype; H + w), in which the amount of H antigen (and thereafter, A and B antigens) on erythrocytes is very low. H-deficient red cell phenotypes are extremely rare (the frequency is 1/8000 in Taiwanese with a predominantly para-Bombay phenotype, 1/10,000 in India with a predominantly Bombay phenotype, and 1/1,000,000 in Europe), whereas nonsecretors due to Se enzyme deficiency are present in about 25% of many populations^3,7. The Bombay phenotype has only nonfunctional alleles of both FUT1 (h) and FUT2 (se), whereas the para-Bombay phenotype has only nonfunctional alleles of FUT1 (h) but at least one functional allele, FUT2 (Se), so that the H antigen produced by the Se enzyme is adsorbed from the serum into the erythrocytes, or has very low H enzyme activity encoded by the weak-functional FUT1 allele (H^w) in the presence of a nonfunctional FUT2 allele, resulting in negligible H antigen production^1,8,9. The Bombay phenotype was first recognized by the presence of anti-H in the serum in addition to anti-A and anti-B¹⁰. Because anti-H produced by subjects with Bombay phenotype carries the risk of severe hemolytic transfusion reactions, subjects with Bombay phenotype require autologous blood donation or blood from other subjects with same phenotype^9,11. On the other hand, anti-H produced by subjects with para-Bombay phenotype is usually not clinically significant¹. Therefore, it is clinically important to correctly determine Bombay or para-Bombay phenotypes.

The coding sequence of FUT1 resides only in exon 4, which encodes a 365-amino acid protein^4,12. This structural feature of the gene makes it easy to determine the sequences, haplotypes of SNPs of the protein coding region or get expression constructs. Since the cloning of FUT1⁴, molecular analysis of H-deficient red cell phenotypes has identified a number of nonfunctional or weak-functional FUT1 alleles^9,13,14. At present, the ISBT allele table for the H blood group system (018 H; FUT1FUT2) v6.1 31-MAR-2023 (https://www.isbtweb.org/resource/018h.html) lists 77 FUT1 alleles, including two functional alleles (FUT1*01 and FUT1*01.02, producing the H + phenotype), 37 alleles involved in phenotype H + weak, i.e., para-Bombay phenotypes (FUT1*01W.01-4, FUT1*01W.05.01 and 0.02, FUT1*01W.07-24, FUT1*01W.26-29, FUT1*01W.31-39), and 38 alleles involved in phenotype H–, i.e., the Bombay phenotype (FUT1*01N.01-37 and FUT1*0N.01).

However, not every nonsynonymous substitution affects the function of the encoded protein, and we need to estimate the impact of each SNP through in silico analysis in the absence of information on the phenotype¹⁵, but the prediction results are not always accurate. Alternatively, the enzyme activity has been experimentally determined by transient expression in cultured cells and then measurement of the α(1,2)fucosyltransferase activity by using ¹⁴C-labeled fucose and its acceptor¹³. Another strategy to examine enzyme activity is flow cytometry of cell surface H antigens by transient expression in cultured cells, which is indirect but does not require a radioisotope¹⁶. Erythrogene v0.8 (27-Nov-2017) (http://www.erythrogene.com/¹⁷) extracted the data of blood group alleles from ISBT 018 H (FUT1FUT2) blood group alleles (older version) and the 1000 Genome Project (https://www.internationalgenome.org/¹⁸) and matched them against blood group reference lists. Seventy-nine alleles are listed for FUT1.

In the previous study of FUT2, we identified two nonfunctional alleles (se) and one weak-secretor allele (Se^w) by transient expression studies, but there were discrepancies between the results of transient expression studies and in silico analysis in assessing the functional impacts of each SNP on Se enzyme activity¹⁹.

In this study, with the aim of determining how many nonsynonymous substitutions affect the activity of the encoding enzyme and whether they could be responsible for the Bombay or para-Bombay phenotypes, we picked 22 FUT1 alleles from Erythrogene that were not registered in the ISBT database and analyzed their effects on enzyme activity. In addition, three DNA samples with causal substitution (c.725T>G, p.L242R) of the FUT1 (FUT1*01N.09) giving rise to the classical Indian Bombay phenotype^20,21 were also examined to better understand the genetic background of this phenotype.

Materials and methods

Ethics approval

All methods were carried out in accordance with relevant guidelines and regulations. The oral informed consent was obtained and the DNA samples were taken from participants (47 Bangladeshis in 1999 and 58 Sri Lankan Tamils, 54 Sinhalese in 2002). The statement for oral informed consent approved by ethical committee of Kurume University in 1999 and 2002. However, present study protocol was reviewed and approved by the ethical committee of Kurume University School of Medicine in 2022 using existing and already anonymized DNA (No. 22158, approved date: 31 October 2022).

DNA samples

Twenty-nine genomic DNAs (HG00118, HG01435, HG01440, HG01443, HG01456, HG01516, HG01577, HG01610, HG01776, HG02003, HG02733, HG02789, HG02870, HG03367, HG03919, HG04185, HG04189 of the 1000 Genomes Project, NA12155 of CEPH/Utah Pedigree 1408, NA18610, NA19019, NA19042, NA19095, NA20289, NA20341, NA20847, NA20887, NA21104, NA21128, NA21141 of International HapMap Project) were purchased from the Coriell Institute for Medical Research (Camden, NJ) (Table 1). Of these, HG02733, HG03919, and HG04189 were registered as having FUT1*01N.09, which is a nonfunctional FUT1 allele of the classical Indian Bombay phenotype^20,21. In addition, we used genomic DNA from 58 Sri Lankan Tamils, 54 Sinhalese, and 47 Bangladeshis whose FUT2 genotypes had already been determined^22,23.

Table 1 Summary of candidates for nonfunctional FUT1 alleles, their attribution, and evaluation by expression of cell surface H antigens or in silico analyses.

Full size table

Direct sequencing of coding region and haplotype determination of FUT1

The nucleotide sequence is numbered from the A residue of the translation initiation codon as position number 1⁴. The variants were described according to the ISBT guidelines. The coding region of FUT1 of each genomic DNA was amplified and directly sequenced. For amplification, FUT1-F (5ʹ-GTT CAG AAG CTT CAG TGC ATT TGC TAA TTC GCC TTT C-3ʹ, -39 to -14 bp of FUT1, the artificially introduced HindIII recognition site is underlined) and FUT1-R (5ʹ-CAG GCC TCT GAA GCC ACG TAC T -3ʹ, 1145 to 1166 bp of FUT1, the indigenous XbaI recognition site is underlined) were used. The 50 µL PCR reaction contained about approximately 7 ng of genomic DNA, 25 µL of PrimeSTAR Max Premix (Takara Bio, Shiga, Japan), and 250 nM of each primer. The PCR temperature conditions were 35 cycles of denaturation at 95 °C for 10 s, annealing at 60 °C for 5 s, and extension at 72 °C for 7 s. Direct Sanger sequencing of the PCR products was performed using each PCR primer as the sequencing primer as described previously¹⁹. To determine the haplotypes of individuals who were heterozygous at two sites, we cloned PCR products by use of restriction sites of HindIII and XbaI into a mammalian expression vector pcDNA3.1(+) and sequenced the clones. The coding sequence of the FUT2 of the individuals who were shown to have nonfunctional FUT1 was also amplified and directly sequenced as described previously¹⁹.

Transient expression study to evaluate the effect of each of nonsynonymous SNP of FUT1 on the enzyme activity.

To evaluate the significance of each of nonsynonymous SNP of FUT1, transient expression experiments followed by flow cytometry analysis were performed as done with the FUT2 using an anti-H 1E3 monoclonal antibody^19,24. In addition to the FUT1 alleles containing each SNP concerned, the effects of the wild-type allele (FUT1*01), c.725T>G (FUT1*01N.09) inserted into pcDNA3.1(+) vectors were determined. Two μg of each construct together with 60 ng of the pGL3 Promoter was transfected into 2 × 10⁵ COS-7 cells (African green monkey kidney fibroblast-like cell) by means of TransIT-X2 (Mirus Bio LLC, Madison, WI). After 2 days, the cells were immunostained by using a mouse monoclonal antibody to H type 1–4 (1E3)²⁴, followed by incubation with FITC-labeled goat anti-mouse IgM (Bethyl Laboratories, Montgomery, TX), and H antigen expression was examined using a BD Accuri C6 system (Becton Dickinson, Franklin Lakes, NJ) as described previously¹⁹. The experiments were repeated four times independently. The transfection efficiency in each experiment was determined by luciferase luminescence intensity and the similar transfection efficiency was confirmed by the intensity of luciferase light as described previously¹⁹.

In silico prediction of effects of nonsynonymous SNPs on H enzyme

The effects of each nonsynonymous SNP of FUT1 on the function of the enzyme were predicted using four free software programs, MutationTaster (http://www.mutationtaster.org/)²⁵, MutationAssessor (http://mutationassessor.org/r3/)²⁶, PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/)²⁷, and Sorting intolerant from tolerant (SIFT) (http://sift.jcvi.org/)²⁸.

Screening of c.503C>G, c.725T>G, and c.749G>C in South Asian populations

According to the Erythrogene database, the nonfunctional FUT1 alleles with c.503C>G c.725T>G, and c.749G>C identified in this study were found in South Asian populations. Therefore, we attempted to screen these substitutions by high resolution melting (HRM)²⁹. Primer3 (https://bioinfo.ut.ee/primer3-0.4.0/,³⁰) was used to design PCR primers. For detection of c.503C>G, a forward primer (5ʹ-GGA GTA CGC GGA CTT GAG AG-3ʹ, 456–475 bp of FUT1) and a reverse primer (5ʹ-CGG AGA TGG TGG AAG AAA GT-3ʹ, 514–533 bp of FUT1) were used (Fig. 1A). For detection of both c.725T>G and c.749G>C, a forward primer (5ʹ- CTG CAG GTT ATG CCT CAG C-3ʹ, 673–691 bp of FUT), and a reverse primer (5ʹ-TGC TGG TGA CCA CGA AAA C-3ʹ, 769–787 bp of FUT1) were used (Fig. 1B).

Real-time PCR and HRM analysis were performed using a LightCycler 480 Instrument II and gene scanning software (Roche Diagnostics, Tokyo, Japan). The 20 μL PCR reaction mixture contained 2–20 ng of genomic DNA, 10 µL of Premix Ex Taq (Probe qPCR) (Takara, Tokyo, Japan), 1 µL of LightCycler 480 High Resolution Melting Dye (Roche Diagnostics, Tokyo, Japan), and 125 nM of each primer. The PCR temperature conditions were 95 °C for 30 s, followed by 40 cycles of 95 °C for 5 s and 60 °C for 20 s. The products were then denatured at 95 °C for 1 min and rapidly cooled to 60 °C for 1 min allowing heteroduplex formation, and data were collected over the range from 74 to 90 °C for c.503C>G or 80–96 °C for c.725T>G and c.749G>C, increasing at 0.02 °C/s with 25 acquisitions/sec. The raw melting curve data were normalized by manual adjustment of linear regions of pre- and post-melt signals of all samples. Temperature shifting was then performed using a temperature shift threshold of default setting (5%) for detection for c.503C>G. On the other hand, temperature shifting was not performed for c.725T>G and c.749G>C to clearly separate heterozygotes of c.725T>G and c.749G>C.

Results

Sequence and haplotype determination of FUT1

First, we determined the DNA sequence of the total coding region of the FUT1 of 29 individuals to survey the nonfunctional alleles of FUT1 registered in Erythrogene¹⁷. We confirmed all of the indicated SNPs in respective DNA samples in the database by direct Sanger sequencing of the FUT1 coding region. As a result, in addition to c.725T>G, we encountered 22 uncharacterized nonsynonymous SNPs: c.20G>C, c.101A>G, c.181G>A, c.220C>T, c.229C>G, c.283C>G, c.468C>G, c.503C>G, c.530T>G, c.565G>C, c.607C>T, c.625G>A, c.649G>A, c.691C>T, c.749G>C, c.796G>C, c.799T>C, c.800G>C, c.1013T>A, c.1022C>T, c.1064A>G, and c.1096T>C.

We then determined haplotypes of four fragments of the FUT1 coding region with two substitutions, that is, c.35C>T and c.181G>A, c.35C>T and c.220C>T, c.35C>T and c.649G>A, c.822C>A and c.1064A>G, by subcloning them into plasmids. Sequencing of the clones revealed that only one haplotype of the four alleles differed from those listed in the database. That is, c.822C>A and c.1064A>G were on the same chromosome in the database, but each was on a different chromosome. On the other hand, consistent with the database, c.181G>A, c.220C>T, and c.649G>A were on the functional FUT1 allele with c.35C>T (FUT1*01.02). Because c.822C>A is a synonymous SNP, we performed functional analyses of the 23 alleles including c.725T>G (FUT1*01N.09) listed in Table 1.

Functional analyses of candidates of nonfunctional FUT1 alleles

For determination of whether each uncharacterized FUT1 allele encodes a functional H enzyme or not, the α(1,2)fucosyltransferase activity in transfectants of each of FUT1 expression vector was determined in previous studies^13,20. In this study, we tried flow cytometry for measurement of H antigens expressed on the surface of culture cells using anti-H monoclonal antibody (1E3)²⁴ because the phenotype of erythrocytes could not be demonstrated and antibody tests of serum could not be performed. The predicted amino acid change for each allele and the expression levels of H antigens on the cell surface are shown in Table 1. Nine representative flow cytometry results including positive and negative controls are shown in Fig. 2.

In this study, substitutions with significantly (p < 0.05) lower activity compared to the positive control (H antigen expression of wild-type FUT1 allele, FUT1*01, 28.7 ± 3.2%) were defined as activity-affecting substitutions. Among them, alleles with substitutions whose activity was less than 10% of the positive control (H antigen expression of wildtype FUT1 allele, FUT1*01) were considered weakly functional alleles. In addition, H-enzyme inactivating substitutions were defined as those with no difference in activity (p > 0.05) compared to the negative control (the expression level of pcDNA3.1(+) without insert, 0.7 ± 0.2%). The percentage of H antigen–positive cells transfected with pcDNA3.1 ligated with the FUT1 of c.20G>C (p.R7P), c.101A>G (p.H34R), c.35C>T (p.A12V) and c.181G>A (p.A61T), c.35C>T (p.A12V) and c.220C>T (p.P74S), c.229C>G (p.L77V), c.283C>G (p.Q95E), c.468C>G (p.D156E), c.35C>T (p.A12V) and c.530T>G (p.L177R), c.565G>C (p.D189H), c.607C>T (p.R203C), c.625G>A (p.D209N), c.35C>T (p.A12V) and c.649G>A (p.V217I), c.796G>C (p.E266Q), c.1064A>G (p.D355G), and c.1096T>C (plus10 amino acids) were not significantly different (p > 0.05) with those ligated with the positive control of the wild-type allele, FUT1*01 (28.7 ± 3.2%). On the other hand, the percentage of H antigen–positive cells transfected with pcDNA3.1 ligated with FUT1 of c.691C>T (p.R231C), c.1013T>A (p.I338N), and c.1022C>T (p.P341L) were two-third of the positive control and those of c.800G>C (p.W267S, 10.0 ± 1.5%, Table 1) were less than half of the positive control but much higher than that of c.799T>C (p.W267R, 2.4 ± 0.9%). In the same experimental condition, expression of the H antigen on cells transfected with pcDNA3.1-FUT1 of c.503C>G (p.P168R, 0.4 ± 0.2%) or c.749G>C (p.R250P, 0.5 ± 0.2%) was almost undetectable as was that of c.725T>G (p.L242R, 0.4 ± 0.3%) and pcDNA3.1 without FUT1 (0.7 ± 0.2%).

In silico analysis to estimate the significance of uncharacterized nonsynonymous SNPs

We also predicted the possible impacts of 22 amino acid substitutions on the structure and function of the encoded H enzyme using four software programs, while c.1096T>C was excluded from the analysis because it occurred on T of termination codon TGA and 10 amino acids were added to the C-terminus (Table 1).

The results of predictions were not always consistent with those of expression experiments. For example, it has already been reported that p.L242R (c.725T>G), the substitution responsible for the classical Indian Bombay phenotype, is an H-deficient allele but it was classified as medium by Mutation Assessor. Of 22 amino acids substitutions, the predicted effects were matched for all software and experiments for p.A61T (c.181G>A), p.D156E (c.468C>G), p.D209N (c.625G>A) as polymorphic, neutral, benign, and tolerated substitutions and p.P168R (c.503C>G), p.L242R (c.725T>G), p.R250P (c.749G>C), p.W267S (c.799T>C) and p.I338N (c.1013T>A) as disease-causing, medium or low, damaging, and affected substitutions. On the other hand, they were mismatched for the other 14 substitutions (Table 1). Estimated concordance rates between in vitro expression studies and in silico function predictions were 81.8%, 50.0%, 68.2%, and 59.1% for MutationTaster, MutationAssessor, PolyPhen-2, and SIFT, respectively. Like the FUT2-encodedenzyme (Se enzyme) analyses¹⁹, the software generally tended to overestimate the impacts of the nonsynonymous SNPs we tested here.

FUT2 alleles link to nonfunctional FUT1 alleles

Finally, we identified two newly characterized completely nonfunctional alleles (with c.503C>G and c.749G>C) and one weakly functional allele (with c.799T>C) in this study. According to the 1000 Genomes Project Database, the Database of Genomic Variants (http://dgv.tcag.ca/dgv/app/variant?id=esv3644597&ref=GRCh38/hg38)³¹, one heterozygote of FUT1 with c.503C>G (HG02789, Punjabi in Lahore, Pakistan) was a compound heterozygote of FUT2*01N.02 (nonfunctional FUT2 alleles with c.428G>A nonsense substitution) and FUT2*0N.01 (accession number: v3644597, approximately 10-kb deletion including the entire FUT2 coding region) and one heterozygote of FUT1 with c.749G>C (NA21128, Gujarati Indians in Houston, Texas) was a homozygote of FUT2*01N.02 (Table 2). Therefore, FUT1 with c.503C>G was estimated to link to FUT2*01N.02 or FUT2*0N.01 and FUT1 with c.749G>C to FUT2*01N.02. Accordingly, homozygotes of these alleles are expected to be the Bombay phenotype because both nonfunctional FUT1 alleles were linked to nonfunctional FUT2 allele. On the other hand, one heterozygote of FUT1 with c.799T>C (NA19095, Yoruba in Ibadan, Nigeria) was a functional FUT2 homozygote. Therefore, homozygotes for this allele are considered to be of the secretor phenotype. Unfortunately, we cannot be certain because the phenotype has not been confirmed, but homozygotes for this weak-functional allele are likely to be the para-Bombay phenotype, regardless of the secretor phenotype (Table 2).

Table 2 Linkage of nonfunctional or weakly functional alleles of FUT1 to alleles of FUT2.

Full size table

The FUT1*01N.09 is known to links to FUT2*0N.01 in the classical Indian Bombay phenotype^20,21. In fact, two FUT1*01N.09 heterozygotes (HG03919 and HG04189, both Bengalis in Bangladesh) were heterozygous for FUT2*0N.01 and these two FUT1*01N.09 alleles are presumed to link to FUT2*0N.01 (Table 2). However, one FUT1*01N.09 heterozygote (HG02733, Punjabi in Lahore, Pakistan) was functional FUT2/FUT2*01N.02 for the FUT2. Thus, in this one subject, FUT1*01N.09 does not link to FUT2*0N.01, and if it linked to functional FUT2, the homozygote for this allele would be a para-Bombay phenotype (Table 2).

Screening of c.503C>G, c.725T>G, and c.749G>C by HRM in South Asian populations

According to the Erythrogene database, three H-deficient alleles with c.503C>G, c.725T>G, and c.749G>C were present only in South Asian populations. Therefore, we attempted to screen these substitutions by HRM in South Asian populations. HRM clearly separated each heterozygote of c.503C>G (Fig. 3A), c.725T>G, and c.749G>C from the respective wild-type homozygote (Fig. 3B). Temperature shifting was then performed using a temperature shift threshold of default setting (5%) for detection for 503C>G. On the other hand, temperature shifting was not performed for c.725T>G and c.749G>C to clearly distinguish c.725T>G from c.749G>C heterozygotes although the melting curve pattern of the wild type allele is broader than that of the temperature-shifted pattern (Fig. 3C,D).

We then screened 54 Sinhalese (Fig. 3C,D), 58 Tamils (not shown) in Sri Lanka, and 47 Bangladeshis (not shown) and found one heterozygous c.725T>G in each population. And one (Tamil) was homozygous for FUT2*0N.01 and two (Sinhalese and Bangladeshi) were heterozygous for FUT2*0N.01 (Table 2).

Discussion

Haplotype determination of FUT1 was relatively easy due to the small number of SNPs and the low coexistence of multiple SNPs on a single allele. Therefore, only four subjects requiring cloning of the FUT1 coding region into a plasmid for haplotype identification. Sequencing the clones revealed that the haplotypes of only one of four alleles were different from that registered in the Erythrogene database. This result is different from FUT2 in that we recently examined the genomic DNA of 18 unidentified alleles of FUT2 in Erythrogene database and found that the combination of SNPs for some alleles differed from the database due to multiple SNPs on a single allele¹⁹.

In this study, we also performed transient expression studies of 22 uncharacterized FUT1 alleles available from the Erythrogene database and found two nonfunctional alleles, one weakly functional allele, and four alleles partially reduced encoded H enzyme activity. Fifteen alleles appeared to encode H enzymes equivalent to the wild-type. The H-deficient phenotype is known to be very rare compared to the nonsecretor phenotype, which is present in about 25% of many populations^1,3. Accordingly, the frequency of FUT1 substitutions is low (26 of 29 alleles were 0.1% or less in the1000 Genomes Project Database). In addition to c.725T>G, a causal substitution of the classical Indian Bombay phenotype, they were the only two substitutions here that completely inactivated enzyme activity. It is interesting to note that all of these were found in South Asian populations, but the reason for this is not clear at present. Classical Indian Bombay subjects with FUT1*01N.09 and FUT2*0N.01 have been reported not only in Indians, but also Bangladeshis, Pakistanis, Sri Lankans, and even in West Asian Iranians^20,21,32,33. Thus, the causal haplotype (FUT1*01N.09–FUT2*0N.01) of the classical Indian Bombay phenotype is presumed to be present with some frequency, albeit low, and to be widespread in a broad band of South Asian populations and certain West Asian populations, while the other two nonfunctional FUT1 alleles (with c.503C>G and c.749G>C) may be restricted to relatively specific populations in South Asia. To investigate distribution of these substitutions and estimation of prevalence of Bombay or para-Bombay phenotypes, a large-scale analysis of the South Asian population is needed, and the HRM analysis used in this study is expected to be a good tool for this purpose.

The allele with c.649G>A resulting in p.V217I appears to be functional in the present transient expression studies although all in silico analyses suggest that this substitution has significant impact of encoded protein. On the other hand, the allele with c.649G>T resulting in p.V217F is listed as a weakly functional allele with the name of FUT1*01W.24 by the ISBT 018 H (FUT1FUT2) blood group alleles v6.1 31-MAR-2023. In addition, the FUT1 with c.799T>C that produces p.W267R significantly reduced H enzyme activity, and the FUT1 with c.800G>C that produces p.W267S also reduced enzyme activity by more than half compared to the wild-type enzyme. Thus, 217 V and 267W appeared to be important for H enzyme activity.

As mentioned above, in the classical Indian Bombay phenotype, FUT1*01N.09 links to FUT2*0N.01 in the literature^12,21, and in at least two 1000 Genomes Project subjects and three Tamil, Sinhalese, and Bangladeshi subjects with FUT1*01N.09 also appear to be linked to FUT2*0N.01. However, we encountered here one FUT1*01N.09 not linked to FUT2*0N.01 because the genotype of FUT2 of this subject was functional FUT2/FUT2*01N.02. It is difficult to determine the exact haplotypes of FUT1 and FUT2 in this subject because FUT1 and FUT2 are 35 kb apart on chromosome 19q13.3⁶. Based on gene frequencies, it is likely that the c.725T>G substitution of FUT1 occurred on the chromosome with FUT2*0N.01 in South Asia. Therefore, although we cannot be certain, it is likely that FUT1*01N.09, which does not link to the FUT2*0N.01, arose by homologous recombination between chromosomes during meiosis. In any case, since only one allele has been analyzed so far, analysis of a large number of samples and family analyses will be necessary to estimate the mechanism of generation of this allele.

Conclusion

We identified two nonfunctional FUT1 alleles (with c.503C>G and c.749G>C) and one weak allele (with c.799T>C) in samples in the 1000 Genomes Project Database. To estimate the impact of each SNP, transient expression studies are desirable for analysis of FUT1 as well as FUT2.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Daniels, G. in Human Blood Groups (ed Geoff Daniels) 11–95 (Wiley-Blackwell, 2013).
Le Pendu, J., Lemieux, R. U., Lambert, F., Dalix, A. M. & Oriol, R. Distribution of H type 1 and H type 2 antigenic determinants in human sera and saliva. Am. J. Hum. Genet. 34, 402–415 (1982).
PubMed PubMed Central Google Scholar
Oriol, R., Candelier, J. J. & Mollicone, R. Molecular genetics of H. Vox Sang. 78(Suppl 2), 105–108 (2000).
CAS PubMed Google Scholar
Larsen, R. D., Ernst, L. K., Nair, R. P. & Lowe, J. B. Molecular cloning, sequence, and expression of a human GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferase cDNA that can form the H blood group antigen. Proc. Natl. Acad. Sci. USA 87, 6674–6678. https://doi.org/10.1073/pnas.87.17.6674 (1990).
Article ADS CAS PubMed PubMed Central Google Scholar
Kelly, R. J., Rouquier, S., Giorgi, D., Lennon, G. G. & Lowe, J. B. Sequence and expression of a candidate for the human Secretor blood group alpha(1,2)fucosyltransferase gene (FUT2). Homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the non-secretor phenotype. J. Biol. Chem. 270, 4640–4649. https://doi.org/10.1074/jbc.270.9.4640 (1995).
Rouquier, S., et al. Molecular cloning of a human genomic region containing the H blood group alpha(1,2)fucosyltransferase gene and two H locus-related DNA restriction fragments. Isolation of a candidate for the human Secretor blood group locus. J. Biol. Chem. 270, 4632–4639. https://doi.org/10.1074/jbc.270.9.4632 (1995).
Yu, L. C., Yang, Y. H., Broadberry, R. E., Chen, Y. H. & Lin, M. Heterogeneity of the human H blood group alpha(1,2)fucosyltransferase gene among para-Bombay individuals. Vox Sang. 72, 36–40. https://doi.org/10.1046/j.1423-0410.1997.00036.x (1997).
Article CAS PubMed Google Scholar
Storry, J. R. et al. Identification of six new alleles at the FUT1 and FUT2 loci in ethnically diverse individuals with Bombay and Para-Bombay phenotypes. Transfusion 46, 2149–2155. https://doi.org/10.1111/j.1537-2995.2006.01045.x (2006).
Article CAS PubMed Google Scholar
Scharberg, E. A., Olsen, C. & Bugert, P. The H blood group system. Immunohematology 32, 112–118 (2016).
Article PubMed Google Scholar
Bhende, Y. M. et al. A “new” blood group character related to the ABO system. Lancet 1, 903–904 (1952).
CAS PubMed Google Scholar
Davey, R. J., Tourault, M. A. & Holland, P. V. The clinical significance of anti-H in an individual with the Oh (Bombay) phenotype. Transfusion 18, 738–742. https://doi.org/10.1046/j.1537-2995.1978.18679077959.x (1978).
Article CAS PubMed Google Scholar
Koda, Y., Soejima, M. & Kimura, H. Structure and expression of H-type GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase gene (FUT1). Two transcription start sites and alternative splicing generate several forms of FUT1 mRNA. J. Biol. Chem. 272, 7501–7505. https://doi.org/10.1074/jbc.272.11.7501 (1997).
Kelly, R. J. et al. Molecular basis for H blood group deficiency in Bombay (Oh) and para-Bombay individuals. Proc. Natl. Acad. Sci. USA 91, 5843–5847. https://doi.org/10.1073/pnas.91.13.5843 (1994).
Article ADS CAS PubMed PubMed Central Google Scholar
Scharberg, E. A., Olsen, C. & Bugert, P. An update on the H blood group system. Immunohematology 35, 67–68 (2019).
Article PubMed Google Scholar
Lei, H. et al. A para-Bombay blood group case associated with a novel FUT1 mutation c.361G>A. Transfus. Med. Hemother. 48, 254–258. https://doi.org/10.1159/000513318 (2021).
Bureau, V. et al. Comparison of the three rat GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferases FTA FTB and FTC. Eur. J. Biochem. 268, 1006–1019. https://doi.org/10.1046/j.1432-1327.2001.01962.x (2001).
Article CAS PubMed Google Scholar
Moller, M., Joud, M., Storry, J. R. & Olsson, M. L. Erythrogene: a database for in-depth analysis of the extensive variation in 36 blood group systems in the 1000 Genomes Project. Blood Adv. 1, 240–249. https://doi.org/10.1182/bloodadvances.2016001867 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sudmant, P. H. et al. An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81. https://doi.org/10.1038/nature15394 (2015).
Article CAS PubMed PubMed Central Google Scholar
Soejima, M. & Koda, Y. Survey and characterization of nonfunctional alleles of FUT2 in a database. Sci. Rep. 11. https://doi.org/10.1038/s41598-021-82895-w (2021).
Koda, Y., Soejima, M., Johnson, P. H., Smart, E. & Kimura, H. Missense mutation of FUT1 and deletion of FUT2 are responsible for Indian Bombay phenotype of ABO blood group system. Biochem. Biophys. Res. Commun. 238, 21–25. https://doi.org/10.1006/bbrc.1997.7232 (1997).
Article CAS PubMed Google Scholar
Fernandez-Mateos, P. et al. Point mutations and deletion responsible for the Bombay H null and the Reunion H weak blood groups. Vox Sang. 75, 37–46 (1998).
Article CAS PubMed Google Scholar
Pang, H. et al. Two distinct Alu-mediated deletions of the human ABO-secretor (FUT2) locus in Samoan and Bangladeshi populations. Hum. Mutat. 16, 274. https://doi.org/10.1002/1098-1004(200009)16:3%3c274::AID-HUMU20%3e3.0.CO;2-I (2000).
Article CAS PubMed Google Scholar
Soejima, M. & Koda, Y. Denaturing high-performance liquid chromatography-based genotyping and genetic variation of FUT2 in Sri Lanka. Transfusion 45, 1934–1939. https://doi.org/10.1111/j.1537-2995.2005.00651.x (2005).
Article CAS PubMed Google Scholar
Nakajima, T., Yazawa, S., Miyazaki, S. & Furukawa, K. Immunochemical characterization of anti-H monoclonal antibodies obtained from a mouse immunized with human saliva. J. Immunol. Methods 159, 261–267. https://doi.org/10.1016/0022-1759(93)90165-4 (1993).
Article CAS PubMed Google Scholar
Schwarz, J. M., Rodelsperger, C., Schuelke, M. & Seelow, D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat. Methods 7, 575–576. https://doi.org/10.1038/nmeth0810-575 (2010).
Article CAS PubMed Google Scholar
Reva, B., Antipin, Y. & Sander, C. Predicting the functional impact of protein mutations: Application to cancer genomics. Nucleic Acids Res. 39, e118. https://doi.org/10.1093/nar/gkr407 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sunyaev, S. et al. Prediction of deleterious human alleles. Hum. Mol. Genet. 10, 591–597. https://doi.org/10.1093/hmg/10.6.591 (2001).
Article CAS PubMed Google Scholar
Sim, N. L. et al. SIFT web server: Predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 40, W452-457. https://doi.org/10.1093/nar/gks539 (2012).
Article CAS PubMed PubMed Central Google Scholar
Erali, M., Voelkerding, K. V. & Wittwer, C. T. High resolution melting applications for clinical laboratory medicine. Exp. Mol. Pathol. 85, 50–58. https://doi.org/10.1016/j.yexmp.2008.03.012 (2008).
Article CAS PubMed PubMed Central Google Scholar
Untergasser, A. et al. Primer3–new capabilities and interfaces. Nucleic Acids Res. 40, e115. https://doi.org/10.1093/nar/gks596 (2012).
Article CAS PubMed PubMed Central Google Scholar
MacDonald, J. R., Ziman, R., Yuen, R. K., Feuk, L. & Scherer, S. W. The Database of Genomic Variants: A curated collection of structural variation in the human genome. Nucleic Acids Res. 42, D986-992. https://doi.org/10.1093/nar/gkt958 (2014).
Article CAS PubMed Google Scholar
Ringressi, A., Cunsolo, V., Malentacchi, F. & Pozzessere, S. Erythrocyte phenotype in a pregnant woman of Sri Lanka: Description of the case and complications related to communication problems. Ann Ist Super Sanita 54, 35–39. https://doi.org/10.4415/ANN_18_01_08 (2018).
Article PubMed Google Scholar
Shahriyari, F., Oodi, A., Kenari, F. N. & Shahabi, M. Identification of two novel FUT1 mutations in people with Bombay phenotype from Iran. Transfus. Apher. Sci. 62, 103640. https://doi.org/10.1016/j.transci.2023.103640 (2023).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank Ms. Katherine Ono for the English editing of this manuscript.

Author information

Authors and Affiliations

Department of Forensic Medicine, Kurume University School of Medicine, Kurume, 830-0011, Japan
Mikiko Soejima & Yoshiro Koda

Authors

Mikiko Soejima
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiro Koda
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S. contributed to planning and conducting experiments, data analysis, and writing of the original draft. Y.K. contributed to supervision, data analysis, and review and editing the manuscript.

Corresponding author

Correspondence to Yoshiro Koda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Soejima, M., Koda, Y. FUT1 variants responsible for Bombay or para-Bombay phenotypes in a database. Sci Rep 13, 17447 (2023). https://doi.org/10.1038/s41598-023-44731-1

Download citation

Received: 27 July 2023
Accepted: 11 October 2023
Published: 14 October 2023
DOI: https://doi.org/10.1038/s41598-023-44731-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.