Abstract
The development of a stable human gut microbiota occurs within the first year of life. Many open questions remain about how microfloral species are influenced by the composition of milk, in particular its content of human milk oligosaccharides (HMOs). The objective is to investigate the effect of the human HMO glycome on bacterial symbiosis and competition, based on the glycoside hydrolase (GH) enzyme activities known to be present in microbial species. We extracted from UniProt a list of all bacterial species catalysing glycoside hydrolase activities (EC 3.2.1.-), cross-referencing with the BRENDA database, and obtained a set of taxonomic lineages and CAZy family data. A set of 13 documented enzyme activities was selected and modelled within an enzyme simulator according to a method described previously in the context of biosynthesis. A diverse population of experimentally observed HMOs was fed to the simulator, and the enzymes matching specific bacterial species were recorded, based on their appearance of individual enzymes in the UniProt dataset. Pairs of bacterial species were identified that possessed complementary enzyme profiles enabling the digestion of the HMO glycome, from which potential symbioses could be inferred. Conversely, bacterial species having similar GH enzyme profiles were considered likely to be in competition for the same set of dietary HMOs within the gut of the newborn. We generated a set of putative biodegradative networks from the simulator output, which provides a visualisation of the ability of organisms to digest HMO and mucin-type O-glycans. B. bifidum, B. longum and C. perfringens species were predicted to have the most diverse GH activity and therefore to excel in their ability to digest these substrates. The expected cooperative role of Bifidobacteriales contrasts with the surprising capacities of the pathogen. These findings indicate that potential pathogens may associate in human gut based on their shared glycoside hydrolase digestive apparatus, and which, in the event of colonisation, might result in dysbiosis. The methods described can readily be adapted to other enzyme categories and species as well as being easily fine-tuneable if new degrading enzymes are identified and require inclusion in the model.
Similar content being viewed by others
Introduction
Human milk oligosaccharides (HMOs) are a class complex unconjugated sugars derived from lactose, and which is the third-most abundant component of milk, after lactose itself, and lipids, with their concentration being at their highest in colostrum1. Although undigested by the neonate, HMOs are thought to provide protection from pathogenic microorganisms by masking the epithelial cells of the developing intestinal mucosa2,3, promotion of mucin expression4 and by promoting the growth of lactobacilli and bifidobacteria specialising in their digestion5. Aside from protecting against necrotising enterocolitis6 or Streptococcus group B organisms7, HMOs may also provide other benefits to the neonate, including in neurodevelopment8,9 and prevention of allergic diseases10,11,12. The development of a healthy gut microbiome, as a consequence of human milk oligosaccharides in the diet, has indications of longer-term benefit to the infant, including reduced incidence of inflammatory bowel diseases13, neuroprotective effects14 and small-molecule homeostasis15. In this context, understanding the principles of HMO synthesis as well as the conditions upon which HMOs are potentially digested by the infant gut16, is a step towards answering the broad range of open questions. Previously, we have modelled the synthesis of the HMO portion of human milk and ran simulations that could account for around 90% of all structures observed and reported in the literature17. The logical follow-on is to consider the HMO degradation by the gut microbiome.
As evidence of the prominent role of carbohydrate-active enzymes (cazymes) in digestion of dietary and host oligosaccharides, 95% of the genomes contained in CAZy, the database in which those enzymes have long been collected and grouped into families18, as well as 85% of its entries, are from the superkingdom of Bacteria. Novel enzyme activities are usually inferred, both from genomic and19 metagenomic studies20 of these organisms. The IUBMB Enzyme List, with its Enzyme Commission (EC) number hierarchy, provides a functional classification of enzymes based on experimental evidence of the reactions they catalyse21, a classification which is independent of the specific protein or organism. The manually curated association between EC numbers and CAZy families is a source of identification of carbohydrate synthesis/degradation effectors. Genome automatic annotation that assigns protein function based on a broad variety of criteria, provides an independent starting point for selecting these cazymes. In the past decade, numerous studies have covered the analysis of the cazyme content in the gut microbiome as reviewed in19,22,23 where the focus is mostly on cazymes involved in catabolism by Bacteria. These are generally confined to the sub-subclass of glycosidases (GH family), classified under EC 3.2.1.-, and occasionally as polysaccharide lyases (PL family), which are classified under EC 4.2.2.-.
In the present account, we report on a new simulator of 13 hydrolase enzymes known to be possessed by gut microflora24,25,26, which we have used to investigate the potential of microbes to digest either HMOs or the glycoconjugates of the intestinal mucosa, and the possibility of cross-feeding25,27 and syntrophy28. We queried the UniProt protein database for the available evidence for proteins involved in glycoside hydrolase and sulfatase activity, correlated these with their corresponding EC numbers, and identified the CAZy families into which they are organised. Through simulated digestions of populations of HMOs and mucin-type (GalNAc-linked) O-glycans, we estimated the likelihood of different taxa and individual bacteria to grow on these classes of substrate. Previous attempts to simulate HMO catabolism have been made29 but were mostly based on the occurrence of specific (orthologous) genes in bacterial genomes.
Within a similar framework as that presented in17 for HMO biosynthesis, we generated a set of putative networks from the output of simulating HMO degradation. Network visualisation was kept consistent in terms of colour coding of HMO categories. A closer look at species distribution and contribution expectedly revealed B. bifidum, B. longum as having the most diverse GH activity. More surprisingly, C. perfringens, in the present study, is associated with similar capacities as the Bifidobacter species, which might have implications for the involvement of this pathogen in the development of necrotising enterocolitis30.
Results
Bacterial proteins associated with activities in sub-subclasses EC 3.2.1 (glycoside hydrolases) and EC 3.1.6 (sulfuric-ester hydrolases) were extracted from UniProt (Release 2023_02) as described in the Methods. Similar data were extracted from CAZy (May 2023 update) for EC 3.2.1. Amongst these proteins 155 complete EC numbers were found, when cross-referenced with the BRENDA31 repository, of which 30 were unique to UniProt and 20 were unique to BRENDA (Fig. 1B). Twelve proteins associated with sulfuric-ester hydrolase (sulfatase) activity were found, all of which were in UniProt, and six of which were in BRENDA (Fig. 1A).
Enzymes and reactions
A Glycologue simulator of 12 glycoside hydrolases and 1 sulfatase was developed, based on their known involvement in HMO and mucin digestion by gut floral bacteria24,25,32. The enzymes of the model are shown in Table 1, numbered 1–13, with the EC number, where available, comprising one sialidase (enzyme 1), two galactosidases (2 and 3), two N-acetylgalactosaminidases (4, 8), two N-acetylglucosaminidases (5, 13), one β-N-acetylhexosaminidase (7), one sulfatase (9), three fucosidases (6, 10, 11) and the endo-glycosidase, lacto-N-biosidase (12). The corresponding activities were modelled in Glycologue33 notation, which uses a single-character encoding system for the monosaccharides. Reactions and glycan substrates were cross-checked with ChEBI34 and Rhea. Rhea accession numbers, where available, are included in Table 1.
CAZy families and organism taxonomic classification
From the UniProt dataset (Release 2023_02) 95 distinct CAZy glycoside hydrolase families were identified in Bacteria. The number of distinct simulated EC numbers available to individual species were counted. No organism was found to have all 11 fully characterised enzymes of the simulator, based on the assignment of a complete EC number in UniProt or BRENDA, the highest ranked being Bifidobacterium bifidum, with 10 full EC numbers associated with its GH-encoding proteins.
Of the 8368 bacterial species associated with glycoside hydrolase (GH) or sulfuric-ester hydrolase (SH) activities in UniProt/BRENDA, 448 were manually positively identified with human gut, the remaining 7920 being classed as non-gut species (often from environmental samples). A subset of 166 of the gut species have been identified as potential pathogens35.
From Table 1 an enzyme profile could be assigned to each species based on the presence or absence of a particular enzyme activity in the UniProt data. This enzyme profile value is an integer representing the state of enzyme activities (see Methods, Enzyme profiles), represented through this work as pX, with X an integer from 0 to \({\sum }_{i=1}^{13}{2}^{i-1}\)= 8191. From the gut bacteria, 40 unique enzyme profiles were identified, compared with 79 profiles in non-gut species. Interactive Krona36 SVG (Scalable Vector Graphics) diagrams of gut, non-gut and all species (combined) are shown in Supplementary Information. On comparing the gut and non-gut microbial subsets, 4 enzyme profiles unique to gut species were identified, and 12 were unique to non-gut bacteria. The median numbers of simulated characterised hydrolytic enzymes available to non-gut (n = 7920) and gut (n = 448) species were found to be 2 and 3, respectively.
Feeding HMOs to a simulated microbiome
A library of 226 experimentally determined HMOs17 was used as input to the Glycologue GH simulator, for every species within each subset of the species data. Every GH reaction was associated with one bit of information from the innate enzyme profile value. Once all substrates had been digested, as far as possible, within the simulator, the subset of the enzymes used on those substrates was stored as digestion profile value. Not all enzyme activities of the model were activated by the HMO set. For example, three species, Bacteroides caccae, Adhaeribacter pallidiroseus and Zobellia galactanivorans, express enzymes 1 through 8 (see Table 1), which is the enzyme profile value p7936. After simulated digestion, since none of the HMOs possessed an α-galactose or α-N-acetylglucosaminidase residue, the enzymes corresponding to the removal of these sugar units, namely α-galactosidase and α-N-acetylglucosaminidase were never activated, resulting in a further elimination of enzymes 2 and 5. The substrate-specific, or environment-specific, profile value for these bacteria was therefore calculated to be p7954. The mapping of innate profile values to HMO-trained profile values is summarised in Fig. 2. Six species were discovered to possess unique subsets of the modelled enzymes, including the probiotic species B. bifidum and B. longum, but also two potential pathogens, C. perfringens and Oribacterium sinus.
Simulated networks of the digestion of the HMOs are shown in Fig. 3. Nodes that were part of the experimentally determined HMO set are drawn at a larger node size than the intermediates and coloured according to the core type (Fig. 4) to be consistent with the representation previously used17 for HMO biosynthesis. Complete digestion of all HMOs to lactose was not possible with the current version of the GH enzyme simulator, owing to its inability to process 3-fucosylated lactose (3-FL). Nevertheless, both the size of the networks and the number of disjoint subnetworks could be matched with certain profile values. The number of reaction subnetworks is seen to increase, as the numbers of enzymes available to an organism decrease (Fig. 3A–D; see also Supplementary Information). The most complete digestion profile, with all GH enzymes active, is shown in Fig. 3A), with a single major network of hydrolase reactions, and several smaller islands. The network for the species B. bifidum and B. longum was partitioned into two large and five small subnetworks (Fig. 3B). (Fig. 3B), which was closely matched by the C. perfringens enzyme profile (Fig. 3C). A putative digestive network that is representative of several species is shown in Fig. 3D, including E. coli, but also 24 other species, of which 7 were associated with the gut microbia that include two other species from the same order of Enterobacterales, Enterobacter cloacae and Salmonella enterica, two from Bacteroidales (Parabacteroides distasonis and Bacteroides stercoris) and two within Eubacteriales (Eisenbergiella tayi and Blautia wexlerae). With 5 GH enzyme activities available, compared with 8 and 10 activities in panel B, the graph is noticeably more fragmented than those of Bifidobacteria, indicating only partial digestion of the HMO substrates. Other HMO-degradation networks are shown in Supplementary Information (Networks).
In Fig. 5, simulated networks of glycosyltransferase action during HMO biosynthesis17 are contrasted with the predicted networks of the glycoside hydrolases simulated in this study. The structure of TFS-LNO, a trifucosyl, monosialyl derivative of lacto-N-octaose37,38, was used as a reference. In both networks, several intermediates encountered have previously been characterised, and are highlighted in the figure, including F-LND III39, DF-LNH c40,41, TF-LNO II39, TF-LNH40,41,42 and DF-LNH II40,42. Not all paths through the networks are equally probable, and in the biosynthetic network, are regulated kinetically, through enzyme activities and their spatiotemporal localisations within Golgi43.
Differences in intermediates between biosynthetic and degradative pathways were also observed. When the computed intermediates of the HMO-Glycologue minimal biosynthetic networks, published previously17, were compared with those of the GH simulator acting on the same initial set of HMO substrates, the percentage of the intermediates that were common to both networks was found to be 59.1%, the differences being partly due to products terminating in 6-linked GlcNAc, as a result of the βHexNAcH activity (enzyme 7) acting to remove β3-linked GlcNAc from a branched oligosaccharide.
Potential energy scores
To quantify the effect of an innate glycoside hydrolase apparatus feeding on HMO substrates, and more readily to compare the ability of species to thrive on these dietary oligosaccharides, a scoring system was developed for each simulated digestion, as described in the Methods (Calculation of potential energy scores). First, a maximum score was determined for the digestion of 226 HMOs by the simulator with all enzymes active, and the scores of all unique enzyme profile values were assigned a potential energy (PE) score relative to this maximum, from 0 to 1.
The highest scoring species in the gut subclass were B. bifidum and B. longum, with relative PE scores of 0.9888 and 0.8210, respectively, while C. perfringens scored 0.8089. These were coloured blue or purple in Fig. 6B (grouped at the level of taxonomic Order), while the closest scoring species were from the 7 bacteria referenced above (Fig. 3D), with a corresponding PE score of 0.6983, a value that was also achieved by several other GH enzyme profiles, such as p8072, which matched Akkermansia muciniphila, Bacteroides reticulotermitis, Bacteroides xylanisolvens and Phocaeicola vulgatus. Among the lowest scoring organisms on these substrates (PE value 0.0012) included Enterococcus caccae and Salmonella typhi, for which only two enzymes, α-galactosidase and β-N-acetylhexosaminidase, were retrieved from UniProt/BRENDA. For the 79 profiles in the non-gut bacterial subclass, PE scores ranged from 0.0007 (Bacillus paranthracis) to 0.7688 (Adhaeribacter pallidiroseus and Zobellia galactanivorans). The non-gut bacteria are grouped according to taxonomic Class in Fig. 6A. In both HMO and O-glycan simulated digestions, a similar pattern of clustering of profiles is evident, but the highest-performing species appear only in the gut category. Frequency distributions of potential energy scores in gut and non-gut Bacterial species are shown in Fig. 6D, along with the number of simulated enzymes available within each division.
Simulated syntrophies
Bacteria expressing extracellular glycoside hydrolase enzymes can release substrates to the benefit of other species in a community, a process known as cross-feeding or syntrophy27,28,44. To investigate this phenomenon in silico, with the data in UniProt, we compared each enzyme profile in the simulator against the others, in pairwise fashion. Glycoside hydrolase enzyme profiles of the gut species were compared bitwise, as described in the Methods (Calculation of potential syntrophies) using Boolean exclusive-OR (XOR) and conjunction (AND). Where an enzyme activity was found to be lacking in one organism but present in the other, the score was incremented, as evidence of a potential collaboration, when supplied on substrates amenable to catabolism by that enzyme. Where the same enzyme activity was possessed by both profiles (each profile representing one or more species) the score was decremented, as an indication of the potential for competition on the same substrates. Enzymes absent from both profiles were treated neutrally. Thus, it was possible to assign an overall score for each pair, from the GH data in UniProt and BRENDA alone, without the need for simulated digestion. The potential syntrophic scores are illustrated in Fig. 7, where potential collaborations are represented as green pixels, competitors as red, and neutral interactions as white. Intraspecies competition45 is represented by the diagonal, in which identical profile values are compared.
Syntrophic scores (x) were found to range from − 6.0 to + 8.0. A general feature of the data in Fig. 7 is that the potential for collaboration between species exists for those expressing a greater variety of GH enzymes, in the region in which green pixels are dominant. A possible exception is the profile of B. longum, which tended to score neutrally against the profiles of other species. A potential for syntrophy between Clostridium butyricum (p8152) and Brevundimonas diminuta (p7991) is highlighted, where six enzymes from Table 1 are present in one and absent in the other. The potential for competition between Bacillus cereus (p8059) and Acinetobacter johnsonii (p8127), which share similar overall profiles, is also indicated.
With a score of 4.50, profiles p8072 and p7991 are likewise complementary, which suggests that gut species associated with these profiles, such as Akkermansia muciniphila and B. diminuta might collaborate within the host gut, as might C. butyricum and B. diminuta (p8072 and p8152), with x = 4.0. A. muciniphila and C. butyricum (p8072 and p8152) having a pairwise x value of -1.5 and are thus classed as competitive according to Fig. 7. The strength of the association, or the disassociation, based on enzyme complementarities, will be related to the potential energy extracted from a given substrate set.
Feeding mucin-type O-glycans to a simulated microbiome
For comparison with the HMO population, a set of 434 mucin-type O-glycans, characterised from various species, were used as input to the simulator. When all enzymes were available, the digestive network shown in Fig. 8 was obtained, which shows nearly complete digestion of all substrates. Despite the incompleteness of the simulated digestions, the potential energy scores were calculated for the non-gut and gut classes of bacteria, as shown in Fig. 6. With the large numbers of bacteria appearing in the former category, to maintain the same aspect ratios in the heatmaps, the non-gut data were clustered at the Class taxa level, and by Order in the case of gut species. Again, the Bifidobacteriales spp. scored highly, as did C. perfringens, although not as highly as when fed the HMO substrate population, perhaps because of a lack of LNB substrates for the Bifidobacteriales. In the upper quartile of these PE scores (PE > 0.6128), with values between 0.7 and 0.8, were profiles p8072, p7944, p7952 and p7936, which were associated, in the data extracted from UniProt, with A. muciniphila (and 3 other spp.), Parabacteroides faecis, Bacteroides thetaiotaomicron (and 3 other spp.) and Bacteroides caccae.
Verification with the CAZy database
The problem of EC number misassignment, either manually or through software, has been noted previously46,47,48. Owing to the possibility of the existence of false positives, or mis-annotations, within the UniProt-derived data, the methods described here were applied to the corresponding data obtained from CAZy, as a way to verify the results independently. In a parallel set of experiments, gut and non-gut bacterial species were ranked according to the number of simulated glycoside hydrolases each possessed, and an enzyme profile value was assigned. With the 3,009,944 Bacterial entries in CAZy, 6,814 unique species were found, of which 5,690 are assigned GH families. Compared with the UniProt/BRENDA, the number of EC number assignments in CAZy was much smaller, resulting in 134 selected species for the simulations, of which 44 were assigned to the gut microbiota and 94 to non-gut. The median number of simulated enzymes available was also lower than in the UniProt/BRENDA-derived dataset, at 1 enzyme per species, for both the gut and non-gut associated subsets.
Simulations were repeated with the CAZy dataset, including HMO (Extended Data Fig. E1) and O-glycan digestion networks, the calculation of relative potential-energy scores (Extended Data Fig. E2) and pairwise enzyme-profile comparisons (Extended Data Fig. E3). Despite the differences in annotation, a similar pattern emerged: gut species consistently ranked higher both in the number of enzymes and the resulting PE score, as summarised in Fig. 9, when compared to non-gut spp. The enzyme profile values for UniProt- and CAZy-sourced microbiota are distinct, as shown by the profile of B. bifidum, which in the CAZy data does not possess the enzyme sialidase, EC 3.2.1.18 (enzyme 1 in Table 1), whereas 10 protein entries within UniProtKB were associated with this activity. The distinction between gut and non-gut bacteria is clearly seen in the heatmaps of Extended Data Fig. E2. Despite the discrepancies in annotation, Clostridium perfringens ranks second highest in the number of glycoside hydrolase enzymes active towards HMOs, in both datasets, and, in CAZy, the third-highest PE scorer, jointly with A. muciniphila.
Discussion
Glycologue was originally developed as a simulator of biosynthetic pathways, based on a formal-language approach to describe the acceptor substrates of the glycosyltransferases and sulfotransferases17,33,49,50, and the present account is the first application of the method to the glycoside hydrolases. In the model reported here, therefore, we have deliberately focused on enzymatic as opposed to genomic/transcriptomic information associated with bacterial species, since our goal was to compare the potential rather than the observed ability of microbes to process human milk oligosaccharides.
Our analysis of the GH enzymes and their association with gut and non-gut bacteria supports the longstanding findings that Bifidobacteroides species are uniquely suited towards colonising human gut in early life, when fed a diet of complex lactose-based oligosaccharides such as HMOs. Our findings also confirm the report on A. muciniphila as being able to thrive in vitro when grown on human milk28. A. muciniphila-like species are observed to colonise human gut across all age groups, beginning in early life and increasing to adult-like levels within a year51. A mucin-degrading strain, MucT, which at lower abundances has been shown to correlate with diseases in humans and mice, can also use HMOs as an energy source (see52).
A surprising finding was the case of C. perfringens, which scored highly when fed on these substrates, because of its broad glycoside hydrolase apparatus, viz., the enzymes 1–8 and 11 of Table 1. One report found no growth of this species on 2′-FL53, while another demonstrated growth of the Clostridium perfringens ATCC 13124 variant on porcine gastric mucin54. Therefore, the induction of a general α-l-fucosidase activity (EC 3.2.1.51) might be possible in an environment where complex fucosylated glycans are in abundance relative to simpler trisaccharides such as 2′-FL. This would support the hypothesis that early colonisation of human gut by HMO-specialising microbes protects against species with a preference for mucin O-glycans. Low relative PE scores were obtained for both non-gut species Staphylococcus aureus (0.2087) and Staphylococcus epidermidis (0.1733), despite a report that both species had shown potential for growth on HMOs55. Given that HMO concentrations were not found to decrease during these experiments, however, a mechanism for the promotion of these bacteria in milk, other than catabolism, is likely involved, a conclusion that would be supported by our data.
Can Clostridium perfringens digest HMOs?
Necrotising enterocolitis (NEC) is an often fatal disease of early infancy, especially in preterm infants56. The causes and symptoms of NEC are multifarious, but bacterial agents are implicated in many cases, including members of the Clostridia, Enterococcus and Cronobacter genii30. HMOs are known to reduce the likelihood of NEC in many instances56,57,58,59 and have therefore been the focus of studies into their use as prebiotics60,61. Incidence of NEC has been associated with low HMO diversity in low birth-weight infants62. Dietary HMOs 2′-FL (2′-fucolactose), 6′-SL (6′-sialyllactose) and DS-LNT (disialyl lacto-N-tetraose)63 have shown, in animal models, to protect neonatal gut from NEC infection and are thus targets of therapeutic development in humans64. The mechanism of protection is likely immunomodulatory, promoting immune cell development and anti-inflammatory pathways56,65.
Clostridium perfringens is a Gram-positive spore-forming anaerobe found in both soil and human and animal gut, which is implicated in NEC56, gastroenteritis66 and histotoxic infections. As a food-borne pathogen, its prevalence in meat and dairy industries has been an ongoing concern67,68,69. C. perfringens types can produce a wide variety of toxins70,71, including alpha (CPA), beta (CPB), epsilon (ETX), iota (ITX) and enterotoxin (CPE). Perfringolysin O (PFO) is a pore-forming toxin that contributes to Clostridial myonecrosis (gas gangrene)72. A recent genomics study of C. perfringens having linked those with pfoA+ strains with C. perfringens-associated NEC in neonates73, the possibility that C. perfringens has a substantial number of the glycoside hydrolase enzymes necessary for HMO digestion may indicate that it has the potential for opportunistic colonisation of the infant gut and thereby contribute to clinical disease. A study of the coincidence of PFO toxins with GH enzyme activities may therefore be warranted. A resolution of the apparent paradox, which is that DS-LNT protects against NEC, while C. perfringens is predicted to be able to digest it, may be simply the competition that exists between Clostridial and Bifidobacter species for gut colonisation in early life. A more subtle effect, based on possible inhibition of the sialidases involved by the disialylated DS-LNT, might be elucidated, but confirmation would require a more detailed knowledge of the kinetics of exo-α-sialidases of these species, which is outside the scope of the present study.
The question also arises as to whether certain bacterial competitors in the gut might lower the average concentration of the more complex fucosylated HMOs, which would otherwise be capable of inducing the GH enzymes in C. perfringens, especially the sialidases74,75 and “generalist” α-l-fucosidase activity (EC 3.2.1.51, cf. Table 1)54,76,77. It is possible, therefore, that probiotic use would promote competition with C. perfringens and similar pathogenic organisms and species beneficial to the host.
Gut versus non-gut species
The observed contrast between gut and non-gut bacteria may be enhanced by the noisy nature of the non-gut set which contains bacteria of mixed origins (terrestrial, marine, etc.). Nonetheless, differences are evident from a comparison of the number of GH enzymes available and the potential-energy scores obtained with these organisms, and in the heatmaps of Fig. 6 and Extended Data Fig. E2. It is most evident in the latter, where the non-gut species scored poorly. The higher scoring species in the heatmaps, fed on both HMO and O-glycan substrates, correlates with the numbers of simulated GH enzymes found in each database, which show a cluster of outliers in the gut category (Fig. 6C). These outliers include the known Lactobacillus spp. and others summarised in Fig. 9. More remarkable, however, is that all others are possible pathogens: B. caccae78, B. fragilis79, C. perfringens and Streptococcus pneumoniae35. This could explain the ability for pathogens to interfere and trigger inflammation or infection. Our results suggest that the glycoside hydrolases of C. perfringens be examined in greater detail. For example, further characterisation of the Calx-beta domain protein CPF_2130 (UniProtKB Entry A0A0H2YQI3) might be needed, to confirm that this has the fucosidase activity of EC 3.2.1.51 that is ascribed to it (see above).
Limitations and future directions
Since our objective was to investigate possible GH activity on HMOs and mucin-type O-glycans, enzyme activities related fructofuranose (FOS) and galactooligosaccharide (GOS) hydrolysis were excluded from the simulator. In a future release, the enzymes β-fructofuranosidase (EC 3.2.1.80), β-fructosidase (EC 3.2.1.26) and endo-galactanase (EC 3.2.1.89), could be added, to simulate the activities of these bacterial enzymes on fructans and oligosaccharides from non-human milks. The present study has a number of limitations, including the possibility of noise in the data, referred to earlier, but also curation errors, such as misassignment of EC numbers in data repositories. The lack of EC numbers for the sulfatase (9) has already been noted, and entries in general have few Bacteria-associated examples in UniProt/BRENDA (Fig. 1A). A β6-linked GlcNAc hydrolase (13) exists but was inactive in our simulations owing to the lack of an EC number. It would be interesting to see how this latter enzyme influences the intermediates of the biodegradative HMO pathways, compared to those of biosynthesis. While these enzymes have been modelled in Glycologue, it will not be possible to integrate the simulator with the database-generated enzyme profiles until the enzymes have been officially classified within the EC system. Another possible limitation is that our results reflect a degree of bias in the underlying data, in the sense that certain organisms may have received more interest owing to their roles either in health or disease.
The glycoside hydrolase reactions modelled in this study convey less information than those of the corresponding glycosyltransferase simulators, which model enzymes possessing greater specificities. Accordingly, where the glycosyltransferase rules are readily reversible49, the corresponding reversed reactions of hydrolysis would result in many transglycosylated products, leading to increased uncertainty in the choice of the reversed degradation pathway, i.e., in the biosynthetic direction. In order to model, for example, the effect of sulfation on the exo-GH enzymes, additional information about known parent monosaccharide and modifier specificities would need to be added to the reaction rules.
Sugar modifiers such as sulfation or acetylation may block some GH activities, for example the 3S-Gal sulfatase of the bacterium B. thetaiotaomicron are inactive towards sulfated mucin O-glycans32. In the future, as sulfatases from different organisms are characterised, the generic sulfatase activity of enzyme 9 in the model that currently acts as a placeholder will be modified, as needed, and further activities added, such as 3S-GalNAc sulfatase32. A further complication to modelling these bacterial enzymes will be the presence of sulfoglycosidases in certain species, for example the 6-sulfo-β-d-N-acetylglucosaminidase of Bifidobacterium bifidum80,81, which has a preference for particular sulfation patterns of GlcNAc.
The potential-energy scores could be adapted to offset the current penalty applied to LNBase action, if an organism is found to possess a transport mechanism for uptake of LNB, as with the GNB/LNB transport system of B. longum subsp. infantis. In the future, bit masking could be used to incorporate transporter functionality, or to distinguish between intracellularly and extracellularly expressed glycosidase activities, by extracting subcellular location data from UniProtKB, and inverting the bits in the enzyme profile value of those enzymes that are only expressed internally by a given organism. Furthermore, as our results have shown, the pathway simulation for biosynthesis and degradation does not yield the same intermediary products. A more detailed study of these differences could reveal a bias in HMO composition depending on the presence/absence of specific bacterial species or strains. An extension to the simulator would be required to monitor this phenomenon. It is expected that the predictions of Fig. 7 would be the most affected by masking of activities through nutrient uptake and activation of intracellular digestion pathways. Since the scoring system in this work does not distinguish between intracellular or extracellular enzyme activities, the results are a preliminary observation only.
The methods described in this work can be adapted to other components of the human gut microbiome, such as archaea82,83, about which relatively little is known compared to bacteria, and to enzymes in additional EC subclasses. As more data on the glycoside hydrolase enzymes of the gut microbiome become available and are curated in UniProtKB, these could be included in the future, in a re-examination of the digestive abilities of these organisms.
Methods
Data extraction
Data were extracted from UniProt (https://uniprot.org) (Release 2023_02) using the RESTful Application Programming Interface (API)84. For bacterial proteins matching EC numbers in sub-subclass EC 3.2.1 (glycosidases) and EC 3.1.6 (sulfuric-ester hydrolases) the UniProtKB fields Entry, Entry Name, Reviewed, EC number, BRENDA, Protein names, Gene Names, CAZy, Organism, Length, Subcellular location [CC] and Taxonomic lineage were extracted using the stream method as tab-separated value (TSV) files for further offline processing. Queries matched EC numbers appearing in UniProt or BRENDA (Release 2023.1, February 1, 2023), using a Boolean OR, such as query = ((ec:3.2.1.*) OR (xref:brenda-3.2.1.*)). The method was sufficiently general that it could be adapted to other taxonomic domains, such as Archaea, and to other EC sub-subclasses.
Example: https://rest.uniprot.org/uniprotkb/stream?compressed=false&fields=accession,id,reviewed,ec,xref_brenda,protein_name,gene_names,xref_cazy,organism_name,length,lineage&format=tsv&query=((ec:3.2.1.*) OR (xref:brenda-3.2.1.*)) AND (taxonomy_id:2).
The CAZy database was queried for all EC 3.2.1.- entries appearing in the ExplorEnz database (Release 2023.01, January 2023) of IUBMB Enzyme Nomenclature85. Downloaded HTML files were parsed and saved as TSV files. Bacterial species names were cross-referenced with the taxonomic lineage data obtained from UniProtKB.
Glycoside hydrolase simulator
Glycoside hydrolase (GH) enzymes were selected from the ExplorEnz based on their activities towards glycoconjugates and milk oligosaccharides, and correlated with their corresponding Rhea86 identifiers (Table 1). A simulator of the GH enzyme reactions was constructed, using the Glycologue formalism described previously33,49.
Enzymes simulated
Thirteen activities were modelled, as shown in Table 1, with the index value 1–13, the EC number, a short name, a longer accepted name, CAZy families and Rhea accession numbers. An alternative version of this table with the Glycologue reaction patterns used in the hydrolase model is given in Supplementary Information (Tables). Reactions within the simulator could be limited by Boolean conditionals, using regular expressions.
Enzyme profile values
The Glycologue simulator represents enzyme profiles in terms of knockouts of individual activities, the combination of which is an integer ranging from 0 to 2n-1, where n is the total number of enzymes modelled, with bit value 0 representing an expressed enzyme activity, and 1 representing a complete knockout of that activity. The least significant bit of the profile value representing α-sialidase (1) in Table 1, and the most significant bit is that of β6GlcNAcH (13). For example, the bacterium Fusicatenibacter saccharivorans, which was found to possess only the enzyme activities 2, 3 and 7, has the profile value p8121 (8121 = 1 + 0 + 0 + 8 + 16 + 32 + 0 + 128 + 256 + 512 + 1024 + 2048 + 4096).
Matching EC numbers with species
The data extracted from UniProtKB were filtered into separate files according to records matching complete EC numbers in the Glycologue simulator, resulting in 11 datasets, and similarly for CAZy. For each set of records matching a single EC number, a set of organism names was extracted. The number of unique activities within each sub-subclass and taxonomic domain was recorded within the UniProtKB and CAZy databases. To visualise the overlap between repositories, Venn diagrams were produced in R using package venn (venn_1.11).
Species were separated into classes, those known to be associated with human gut, and all others, which were described as non-gut. Of the gut species, a subset were identified as potentially pathogenic based on published reports35. From the UniProtKB lineage data, species were ranked according to their position with the superkingdom, phylum, class, order, family, genus and species, with the species being stored as the assigned binomial name. For every binomial species in each subcategory, a set EC numbers associated with GH activities in UniProt/BRENDA was deduced and an enzyme profile value assigned to that species. Krona36 graphs were generated from profile value and taxonomic lineage data.
Glycoside hydrolase simulated digestions
Two sets of substrates, 226 human milk oligosaccharides based on lactose17, and 434 GalNAc-linked O-glycans, encoded in the Glycologue modelling language33,49, were obtained from published studies to use as input to the glycoside hydrolase simulator. For each species, each substrate in the set was passed to the enzymes capable of being expressed by the organism, as a supplied profile value, and degraded according to the exo- and endo-glycosidase reaction patterns. The process proceeded iteratively until no further degradation products were formed. The number of distinct simulated reactions was counted, along with the number of enzymes activities used during the degradation of the complete set of substrates. Each substrate-product reaction pair was recorded together with the enzyme catalyst. Reaction networks were drawn using Tulip87 software (https://tulip.labri.fr), using the Stress Majorization (OGDF) graph layout algorithm.
Calculation of potential energy scores
Every successful hydrolase reaction releasing a monosaccharide was assigned a score of 1, and the release of a disaccharide such as Gal-(β1,3)-GlcNAc from lacto-N-biose was counted as 0.5, to simulate the effect of a delayed entrance to energy metabolism. A theoretical maximum energy score was calculated from the enzyme profile p0, where all enzymes were available to act on the set of all substrates. The organism-related energy score was then calculated as the fraction of simulated energy score for an organism divided by this theoretical maximum. Heatmaps were generated in R, coloured by calculated energy score, relating profile value to species grouped at the level of phylum, class, order, or genus.
Calculation of potential syntrophies
Enzyme profile values were compared pairwise using a bitwise-XOR comparison, to obtain a profile of shared and unshared enzymes. The number of enzymes matching between profile values at the same bit position were scored positively if the value of the XOR. Since the Glycologue profile value encodes knockouts of activities, its bitwise inverse was calculated by a conjunctive AND operation with 2n-1, where n = 13. For each profile-value pair, (p1, p2), the result of (p1 AND p2 AND 2n-1) was recorded and scored according to the enzymes active in each profile. Where an enzyme was expressed in one organism but not the other, a collaboration might exist (syntrophy), while enzymes common to both were treated as competitive, and scored negatively. Enzymes lacking to both members of a pair were scored neutrally: p1 XOR p2 = 1, ∆x = 1; p1 AND p2 = 0, ∆x = 0; p1 AND p2 = 1, ∆x = –0.5. The final value of x was obtained as the sum of all the pairwise bit comparisons.
Web application
The glycoside hydrolase simulator is embedded within each member of the family of Glycologue glycosylation-related enzyme simulators (https://glycologue.org), where it can be used to predict digestion networks of the products of glycosyltransferase enzymes.
Data availability
The HMO and O-glycan structures used as substrates of the hydrolase simulator are available as CSV files (Supplementary Information). The enzyme model simulator is available at https://glycologue.org/m/. GlyConnect Compozitor is available at https://glyconnect.expasy.org/compozitor/.
References
Urashima, T. et al. The predominance of type I oligosaccharides is a feature specific to human breast milk. Adv. Nutr. 3, 473S-482S. https://doi.org/10.3945/an.111.001412 (2012).
Kuntz, S., Rudloff, S. & Kunz, C. Oligosaccharides from human milk influence growth-related characteristics of intestinally transformed and non-transformed intestinal cells. Br. J. Nutr. 99, 462–471. https://doi.org/10.1017/S0007114507824068 (2008).
Kuntz, S., Rudloff, S. & Kunz, C. Milk oligosaccharides from different cattle breeds influence growth-related characteristics of intestinal cells. Front. Nutr. 6, 31. https://doi.org/10.3389/fnut.2019.00031 (2019).
Wu, R. Y. et al. Human milk oligosaccharides increase mucin expression in experimental necrotizing enterocolitis. Mol. Nutr. Food Res. https://doi.org/10.1002/mnfr.201800658 (2018).
Thongaram, T., Hoeflinger, J. L., Chow, J. & Miller, M. J. Human milk oligosaccharide consumption by probiotic and human-associated bifidobacteria and lactobacilli. J. Dairy Sci. 100, 7825–7833. https://doi.org/10.3168/jds.2017-12753 (2017).
Altobelli, E., Angeletti, P. M., Verrotti, A. & Petrocelli, R. The impact of human milk on necrotizing enterocolitis: a systematic review and meta-analysis. Nutrients 12, 1322. https://doi.org/10.3390/nu12051322 (2020).
Lin, A. E. et al. Human milk oligosaccharides inhibit growth of group B Streptococcus. J. Biol. Chem. 292, 11243–11249. https://doi.org/10.1074/jbc.M117.789974 (2017).
Berger, P. K., Ong, M. L., Bode, L. & Belfort, M. B. Human milk oligosaccharides and infant neurodevelopment: a narrative review. Nutrients 15, 719. https://doi.org/10.3390/nu15030719 (2023).
Wei, X. et al. Genetic-phenotype analysis of Bifidobacterium bifidum and its glycoside hydrolase gene distribution at different age groups. Foods 12, 922. https://doi.org/10.3390/foods12050922 (2023).
Burris, A. D., Pizzarello, C. & Järvinen, K. M. Immunologic components in human milk and allergic diseases with focus on food allergy. Semin. Perinatol. 45, 151386. https://doi.org/10.1016/j.semperi.2020.151386 (2021).
Lodge, C. J. et al. Human milk oligosaccharide profiles and allergic disease up to 18 years. J. Allergy Clin. Immunol. 147, 1041–1048. https://doi.org/10.1016/j.jaci.2020.06.027 (2021).
Tarrant, I. & Finlay, B. B. Human milk oligosaccharides: potential therapeutic aids for allergic diseases. Trends Immunol https://doi.org/10.1016/j.it.2023.06.003 (2023).
Hidalgo-Cantabrana, C. et al. Bifidobacteria and their health-promoting effects. Microbiol. Spectr. https://doi.org/10.1128/microbiolspec.BAD-0010-2016 (2017).
Hung, T.-W. et al. Human milk oligosaccharide 2′-fucosyllactose induces neuroprotection from intracerebral hemorrhage stroke. Int. J. Mol. Sci. 22, 9881. https://doi.org/10.3390/ijms22189881 (2021).
Glorieux, G., Nigam, S. K., Vanholder, R. & Verbeke, F. Role of the microbiome in gut-heart-kidney cross talk. Circ. Res. 132, 1064–1083. https://doi.org/10.1161/CIRCRESAHA.123.321763 (2023).
La Rosa, S. L. et al. Glycan processing in gut microbiomes. Curr. Opin. Microbiol. 67, 102143. https://doi.org/10.1016/j.mib.2022.102143 (2022).
McDonald, A. G., Mariethoz, J., Davey, G. P. & Lisacek, F. In silico analysis of the human milk oligosaccharide glycome reveals key enzymes of their biosynthesis. Sci. Rep. 12, 10846. https://doi.org/10.1038/s41598-022-14260-4 (2022).
Drula, E. et al. The carbohydrate-active enzyme database: functions and literature. Nucl. Acids Res. 50, D571–D577. https://doi.org/10.1093/nar/gkab1045 (2022).
Luis, A. S. & Martens, E. C. Interrogating gut bacterial genomes for discovery of novel carbohydrate degrading enzymes. Curr. Opin. Chem. Biol. 47, 126–133. https://doi.org/10.1016/j.cbpa.2018.09.012 (2018).
Jia, B., Han, X., Kim, K. H. & Jeon, C. O. Discovery and mining of enzymes from the human gut microbiome. Trends Biotechnol. 40, 240–254. https://doi.org/10.1016/j.tibtech.2021.06.008 (2022).
McDonald, A. G. & Tipton, K. F. Enzyme nomenclature and classification: the state of the art. FEBS J https://doi.org/10.1111/febs.16274 (2022).
Kaoutari, A. E. et al. The abundance and variety of carbohydrate-active enzymes in the human gut microbiota. Nat. Rev. Microbiol. 11, 497–504. https://doi.org/10.1038/nrmicro3050 (2013).
Wardman, J. F., Bains, R. K., Rahfeld, P. & Withers, S. G. Carbohydrate-active enzymes (CAZymes) in the gut microbiome. Nat. Rev. Microbiol. https://doi.org/10.1038/s41579-022-00712-1 (2022).
Ravcheev, D. A. & Thiele, I. Comparative genomic analysis of the human gut microbiome reveals a broad distribution of metabolic pathways for the degradation of host-synthetized mucin glycans and utilization of mucin-derived monosaccharides. Front. Genet. 8, 111. https://doi.org/10.3389/fgene.2017.00111 (2017).
Ioannou, A., Knol, J. & Belzer, C. Microbial glycoside hydrolases in the first year of life: an analysis review on their presence and importance in infant gut. Front Microbiol 12, 631282. https://doi.org/10.3389/fmicb.2021.631282 (2021).
Masi, A. C. & Stewart, C. J. Untangling human milk oligosaccharides and infant gut microbiome. iScience 25, 103542. https://doi.org/10.1016/j.isci.2021.103542 (2022).
Berkhout, M. D., Plugge, C. M. & Belzer, C. How microbial glycosyl hydrolase activity in the gut mucosa initiates microbial cross-feeding. Glycobiology 32, 182–200. https://doi.org/10.1093/glycob/cwab105 (2022).
Kostopoulos, I. et al. Akkermansia muciniphila uses human milk oligosaccharides to thrive in the early life conditions in vitro. Sci. Rep. 10, 14330. https://doi.org/10.1038/s41598-020-71113-8 (2020).
Garrido, D. et al. Comparative transcriptomics reveals key differences in the response to milk oligosaccharides of infant gut-associated bifidobacteria. Sci. Rep. 5, 13517. https://doi.org/10.1038/srep13517 (2015).
Coggins, S. A., Wynn, J. L. & Weitkamp, J.-H. Infectious causes of necrotizing enterocolitis. Clin. Perinatol. 42, 133–154. https://doi.org/10.1016/j.clp.2014.10.012 (2015).
Chang, A. et al. BRENDA, the ELIXIR core data resource in 2021: new developments and updates. Nucl. Acids Res. 49, D498–D508. https://doi.org/10.1093/nar/gkaa1025 (2021).
Luis, A. S. et al. A single sulfatase is required to access colonic mucin by a gut bacterium. Nature 598, 332–337. https://doi.org/10.1038/s41586-021-03967-5 (2021).
McDonald, A. G. & Davey, G. P. O-Glycologue: a formal-language-based generator of O-glycosylation networks. In Glycosylation (ed. Davey, G. P.) 223–236 (Springer, 2022).
Hastings, J. et al. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucl. Acids Res. 44, D1214–D1219. https://doi.org/10.1093/nar/gkv1031 (2016).
Yang, J. et al. Species-level analysis of human gut microbiota with metataxonomics. Front. Microbiol. 11, 2029. https://doi.org/10.3389/fmicb.2020.02029 (2020).
Ondov, B. D., Bergman, N. H. & Phillippy, A. M. Interactive metagenomic visualization in a Web browser. BMC Bioinform. 12, 385. https://doi.org/10.1186/1471-2105-12-385 (2011).
Tachibana, Y., Yamashita, K. & Kobata, A. Oligosaccharides of human milk: structural studies of di- and trifucosyl derivatives of lacto-N-octaose and lacto-N-neooctaose. Arch. Biochem. Biophys. 188, 83–89. https://doi.org/10.1016/0003-9861(78)90359-4 (1978).
Wu, S., Grimm, R., German, J. B. & Lebrilla, C. B. Annotation and structural analysis of sialylated human milk oligosaccharides. J. Proteome Res. 10, 856–868. https://doi.org/10.1021/pr101006u (2011).
Urashima, T., Hirabayashi, J., Sato, S. & Kobata, A. Human milk oligosaccharides as essential tools for basic and application studies on galectins. Trends Glycosci. Glycotechnol. 30, 51–65. https://doi.org/10.4052/tigg.1734.1SE (2018).
Wu, S. et al. Development of an annotated library of neutral human milk oligosaccharides. J. Proteome Res. 9, 4138–4151. https://doi.org/10.1021/pr100362f (2010).
Remoroza, C. A. et al. Creating a mass spectral reference library for oligosaccharides in human milk. Anal. Biochem. 90, 8977–8988. https://doi.org/10.1021/acs.analchem.8b01176 (2018).
Thurl, S. et al. Systematic review of the concentrations of oligosaccharides in human milk. Nutr. Rev. 75, 920–933. https://doi.org/10.1093/nutrit/nux044 (2017).
Tu, L. & Banfield, D. K. Localization of Golgi-resident glycosyltransferases. Cell Mol. Life Sci. 67, 29–41. https://doi.org/10.1007/s00018-009-0126-z (2010).
Djemai, K., Drancourt, M. & Tidjani Alou, M. Bacteria and methanogens in the human microbiome: a review of syntrophic interactions. Microb. Ecol. 83, 536–554. https://doi.org/10.1007/s00248-021-01796-7 (2022).
Hibbing, M. E., Fuqua, C., Parsek, M. R. & Peterson, S. B. Bacterial competition: surviving and thriving in the microbial jungle. Nat. Rev. Microbiol. 8, 15–25. https://doi.org/10.1038/nrmicro2259 (2010).
von Grotthuss, M. et al. PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics. BMC Bioinform. 7, 53. https://doi.org/10.1186/1471-2105-7-53 (2006).
Žváček, C., Friedrichs, G., Heizinger, L. & Merkl, R. An assessment of catalytic residue 3D ensembles for the prediction of enzyme function. BMC Bioinform. 16, 359. https://doi.org/10.1186/s12859-015-0807-6 (2015).
Garapati, P. V., Zhang, J., Rey, A. J. & Marygold, S. J. Towards comprehensive annotation of Drosophila melanogaster enzymes in FlyBase. Database 2019, bay144. https://doi.org/10.1093/database/bay144 (2019).
McDonald, A. G., Tipton, K. F. & Davey, G. P. A knowledge-based system for display and prediction of O-glycosylation network behaviour in response to enzyme knockouts. PLoS Comput. Biol. 12, e1004844. https://doi.org/10.1371/journal.pcbi.1004844 (2016).
McDonald, A. G. & Davey, G. P. Simulating the enzymes of ganglioside biosynthesis with Glycologue. Beilstein J. Org. Chem. 17, 739–748. https://doi.org/10.3762/bjoc.17.64 (2021).
Collado, M. C. et al. Intestinal integrity and Akkermansia muciniphila, a mucin-degrading member of the intestinal microbiota present in infants, adults, and the elderly. Appl. Environ. Microbiol. 73, 7767–7770. https://doi.org/10.1128/AEM.01477-07 (2007).
Cani, P. D. et al. Akkermansia muciniphila: paradigm for next-generation beneficial microorganisms. Nat. Rev. Gastroenterol. Hepatol. 19, 625–637. https://doi.org/10.1038/s41575-022-00631-9 (2022).
Yu, Z.-T., Chen, C. & Newburg, D. S. Utilization of major fucosylated and sialylated human milk oligosaccharides by isolated human gut microbes. Glycobiology 23, 1281–1292. https://doi.org/10.1093/glycob/cwt065 (2013).
Fan, S. et al. Cloning, characterization, and production of three α-l-fucosidases from Clostridium perfringens ATCC 13124: three α-l-fucosidases of Clostridium perfringens. J. Basic Microbiol. 56, 347–357. https://doi.org/10.1002/jobm.201500582 (2016).
Hunt, K. M. et al. Human milk oligosaccharides promote the growth of Staphylococci. Appl. Environ. Microbiol. 78, 4763–4770. https://doi.org/10.1128/AEM.00477-12 (2012).
Bering, S. Human milk oligosaccharides to prevent gut dysfunction and necrotizing enterocolitis in preterm neonates. Nutrients 10, 1461. https://doi.org/10.3390/nu10101461 (2018).
AlFaleh, K. & Anabrees, J. Probiotics for prevention of necrotizing enterocolitis in preterm infants: probiotics for prevention of necrotizing enterocolitis in preterm infants. Evid-Based Child Health 9, 584–671. https://doi.org/10.1002/ebch.1976 (2014).
Bode, L. Human milk oligosaccharides in the prevention of necrotizing enterocolitis: a journey from in vitro and in vivo models to mother-infant cohort studies. Front. Pediatr. 6, 385. https://doi.org/10.3389/fped.2018.00385 (2018).
Nolan, L. S., Rimer, J. M. & Good, M. The role of human milk oligosaccharides and probiotics on the neonatal microbiome and risk of necrotizing enterocolitis: a narrative review. Nutrients 12, 3052. https://doi.org/10.3390/nu12103052 (2020).
Coppa, G. V., Zampini, L., Galeazzi, T. & Gabrielli, O. Prebiotics in human milk: a review. Dig. Liver Dis. 38, S291–S294. https://doi.org/10.1016/S1590-8658(07)60013-9 (2006).
Barile, D. & Rastall, R. A. Human milk and related oligosaccharides as prebiotics. Curr. Opin. Biotechnol. 24, 214–219. https://doi.org/10.1016/j.copbio.2013.01.008 (2013).
Wejryd, E. et al. Low diversity of human milk oligosaccharides is associated with necrotising enterocolitis in extremely low birth weight infants. Nutrients 10, 1556. https://doi.org/10.3390/nu10101556 (2018).
Jantscher-Krenn, E. et al. The human milk oligosaccharide disialyllacto-N-tetraose prevents necrotising enterocolitis in neonatal rats. Gut 61, 1417. https://doi.org/10.1136/gutjnl-2011-301404 (2012).
Autran, C. A. et al. Human milk oligosaccharide composition predicts risk of necrotising enterocolitis in preterm infants. Gut 67, 1064–1070. https://doi.org/10.1136/gutjnl-2016-312819 (2018).
He, Y., Liu, S., Leone, S. & Newburg, D. S. Human colostrum oligosaccharides modulate major immunologic pathways of immature human intestine. Mucosal. Immunol. 7, 1326–1339. https://doi.org/10.1038/mi.2014.20 (2014).
Freedman, J. C., Shrestha, A. & McClane, B. A. Clostridium perfringens enterotoxin: action, genetics, and translational applications. Toxins 8, 73. https://doi.org/10.3390/toxins8030073 (2016).
Troutt, H. F. & Osburn, B. I. Meat from dairy cows: possible microbiological hazards and risks. Rev. Sci. Tech. 16, 405–414. https://doi.org/10.20506/rst.16.2.1022 (1997).
Lebrun, M., Mainil, J. G. & Linden, A. Cattle enterotoxaemia and Clostridium perfringens : description, diagnosis and prophylaxis. Vet. Rec. 167, 13–22. https://doi.org/10.1136/vr.167.1.12 (2010).
Santos, R. A. N. D. et al. Clostridium perfringens associated with dairy farm systems show diverse genotypes. Int. J. Food Microbiol. 382, 109933. https://doi.org/10.1016/j.ijfoodmicro.2022.109933 (2022).
Uzal, F. A. et al. Towards an understanding of the role of Clostridium perfringens toxins in human and animal disease. Fut. Microbiol. 9, 361–377. https://doi.org/10.2217/fmb.13.168 (2014).
Kiu, R. & Hall, L. J. An update on the human and animal enteric pathogen Clostridium perfringens. Emerg. Microb. Infect. 7, 1–15. https://doi.org/10.1038/s41426-018-0144-8 (2018).
Van Asbroeck, E. et al. Clostridium myonecrosis—a rare and underdiagnosed condition in the elderly: a case with severe skipping lesions and an overview of treatment guidelines. Int. J. Emerg. Med. 15, 56. https://doi.org/10.1186/s12245-022-00458-w (2022).
Kiu, R. et al. Particular genomic and virulence traits associated with preterm infant-derived toxigenic Clostridium perfringens strains. Nat. Microbiol. 8, 1160–1175. https://doi.org/10.1038/s41564-023-01385-z (2023).
Li, J., Uzal, F. & McClane, B. Clostridium perfringens sialidases: potential contributors to intestinal pathogenesis and therapeutic targets. Toxins 8, 341. https://doi.org/10.3390/toxins8110341 (2016).
Wang, Y. Sialidases from Clostridium perfringens and their inhibitors. Front. Cell Infect. Microbiol. 9, 462. https://doi.org/10.3389/fcimb.2019.00462 (2020).
Moya-Gonzálvez, E. M. et al. Infant gut microbial metagenome mining of α-l-fucosidases with activity on fucosylated human milk oligosaccharides and glycoconjugates. Microbiol. Spectr. 10, e01775-e1822. https://doi.org/10.1128/spectrum.01775-22 (2022).
Wu, H., Owen, C. D. & Juge, N. Structure and function of microbial α-l-fucosidases: a mini review. Essays Biochem. 67, 399–414. https://doi.org/10.1042/EBC20220158 (2023).
Cheng, Z. et al. Bloodstream infection caused by Bacteroides caccae in a diabetic patient: a case report and review of the literature. Clin. Lab. https://doi.org/10.7754/Clin.Lab.2019.190534 (2019).
Patrick, S. A tale of two habitats: Bacteroides fragilis, a lethal pathogen and resident in the human gastrointestinal microbiome. Microbiology https://doi.org/10.1099/mic.0.001156 (2022).
Katoh, T. et al. Identification and characterization of a sulfoglycosidase from Bifidobacterium bifidum implicated in mucin glycan utilization. Biosci. Biotechnol. Biochem. 81, 2018–2027. https://doi.org/10.1080/09168451.2017.1361810 (2017).
Katoh, T. et al. A bacterial sulfoglycosidase highlights mucin O-glycan breakdown in the gut ecosystem. Nat. Chem. Biol. 19, 778–789. https://doi.org/10.1038/s41589-023-01272-y (2023).
Borrel, G. et al. The host-associated archaeome. Nat. Rev. Microbiol. 18, 622–636. https://doi.org/10.1038/s41579-020-0407-y (2020).
Chibani, C. M. et al. A catalogue of 1,167 genomes from the human gut archaeome. Nat. Microbiol. 7, 48–61. https://doi.org/10.1038/s41564-021-01020-9 (2022).
The UniProt Consortium et al. UniProt: the universal protein knowledgebase in 2023. Nucl. Acids Res 51, D523–D531. https://doi.org/10.1093/nar/gkac1052 (2023).
McDonald, A. G., Boyce, S. & Tipton, K. F. ExplorEnz: the primary source of the IUBMB enzyme list. Nucl. Acids Res. 37, D593–D597. https://doi.org/10.1093/nar/gkn582 (2009).
Bansal, P. et al. Rhea, the reaction knowledgebase in 2022. Nucl. Acids Res. 50, D693–D700. https://doi.org/10.1093/nar/gkab1016 (2022).
Auber, D. et al. Tulip 5. In Encyclopedia of Social Network Analysis and Mining (eds Alhajj, R. & Rokne, J.) 1–28 (Springer, 2017).
Acknowledgements
The authors thank Nathalie Juge (Quadram Institute Bioscience, UK) for helpful discussions and Francisco Lorenzo Escudero for contributing to the manual sorting of bacterial species (gut/non-gut). This work was partly supported by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI). The Expasy portal is maintained at the Swiss Institute of Bioinformatics and hosted at the Vital-IT Competency Centre. This work was part supported by SNSF grant #31003A_179249 (Frédérique Lisacek). AM gratefully acknowledges library access provided by Trinity College Dublin.
Author information
Authors and Affiliations
Contributions
A.M. and F.L. wrote the manuscript text and A.M. prepared the figures. Experiments were designed by A.M. and F.L. Software was developed by A.M. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
McDonald, A.G., Lisacek, F. Simulated digestions of free oligosaccharides and mucin-type O-glycans reveal a potential role for Clostridium perfringens. Sci Rep 14, 1649 (2024). https://doi.org/10.1038/s41598-023-51012-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-51012-4
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.