Abstract
Herbivorous insects employ an array of salivary proteins to aid feeding. However, the mechanisms behind the recruitment and evolution of these genes to mediate plant-insect interactions remain poorly understood. Here, we report a potential horizontal gene transfer (HGT) event from bacteria to an ancestral bug of Eutrichophora. The acquired genes subsequently underwent duplications and evolved through co-option. We annotated them as horizontal-transferred, Eutrichophora-specific salivary protein (HESPs) according to their origin and function. In Riptortus pedestris (Coreoidea), all nine HESPs are secreted into plants during feeding. The RpHESP4 to RpHESP8 are recently duplicated and found to be indispensable for salivary sheath formation. Silencing of RpHESP4-8 increases the difficulty of R. pedestris in probing the soybean, and the treated insects display a decreased survivability. Although silencing the other RpHESPs does not affect the salivary sheath formation, negative effects are also observed. In Pyrrhocoris apterus (Pyrrhocoroidea), five out of six PaHESPs are secretory salivary proteins, with PaHESP3 being critical for insect survival. The PaHESP5, while important for insects, no longer functions as a salivary protein. Our results provide insight into the potential origin of insect saliva and shed light on the evolution of salivary proteins.
Similar content being viewed by others
Introduction
Saliva is an oral secretion composed of a mixture of bioactive compounds that enable insects to feed and survive successfully. Analysis of salivary components from planthoppers, aphids, whiteflies, and true bugs has revealed abundant orphan genes that are restricted to a single species or a particular taxonomic group1,2,3. In the past decades, the functions of several orphan salivary proteins have been uncovered. For example, Te28 and Te84 from the spider mite were found to suppress salicylic acid (SA)-mediated plant defense4; Bsp9 from the whitefly was found to target host immunity regulators5; and LsSP1 from the planthopper was found to reduce elicitor-induced plant immunity6. Despite these findings, it remains unknown where these orphan salivary components originate and how they have evolved in the long-term coevolution of insects and plants.
The salivary sheath, formed by gel saliva, is critical for insect feeding. This unique structure is closely linked to the probing action of the stylet, serving multiple functions, from enhancing mechanical stability to lubricating stylets7. Disruption of salivary sheath formation negatively affects insect feeding from plant sieve tubes, while it does not impact insect feeding from artificial diets8,9. In planthoppers, salivary sheath-deficient insects fail to anchor their stylet at a fixed point, leading to a failure in penetration initiation and higher mortality8. The salivary sheath directly contacts host plants and plays important roles in herbivore–plant interactions. A few proteins in the salivary sheath were reported to exhibit a high evolutionary rate10. However, given the difficulty in collecting the salivary sheath and dissolving proteins, only a few salivary sheath proteins were identified, with the majority exhibiting species specificity11.
Horizontal gene transfer (HGT), the transmission of genetic material between distantly related species, is considered as the major driving force in life evolution12. In recent years, advancements in high-throughput sequencing have shown the prevalence and importance of HGTs, particularly in bacteria and protists, which have close associations with their metazoan recipients13,14. Some of these transferred genes have been found to confer direct benefits to their recipients, such as LOC105383139 gene for male courtship15, carotenoid biosynthesis gene for aphid body coloration16, and parasitoid killing factor for insect defense to parasitoids17. HGT also plays critical roles in shaping the plant-insect interaction. Accumulated evidence demonstrated that HGT from microbial species enhanced the enzymatic repertoire of herbivorous insects, which in turn facilitated their adaptation toward herbivory and novel host plants18. For example, the widespread of horizontally transferred polygalacturonases, cellulases, and xylanases in insect genome provided selective advantage to recipients by degrading plant cell wall components19,20, while the β-fructofuranosidase, cysteine synthase, and biotin synthase involved in assimilation of plant nutrients21,22,23. In addition, insects have evolved various counter adaptations to plant defenses by acquiring pre-evolved genes from bacteria and fungi18. In Bemisia tabaci, a fungal transferred salivary protein was reported to be migrated into plant cells and promoted insect feeding by suppressing ferredoxin-triggered plant immunity24. Generally, the majority of reported horizontally transferred genes that were associated with insect feeding code for enzymes, and function similarly between the donor and recipient organisms18. Our knowledge on their evolved function after co-opted by the insect recipient is still limited.
Pentatomomorpha, an infraorder of Heteroptera, encompasses five superfamilies. Among these, three superfamily of Coreoidea, Lygaeoidea, and Pyrrhocoroidea, grouped as Eutrichophora, comprise more than 8500 described species, and are paraphyletic to a significant extent25. Riptortus pedestris (Coreoidea), Pyrrhocoris apterus (Pyrrhocoroidea), and Oncopeltus fasciatus (Lygaeoidea) are important species in these three superfamilies with available genome information26,27. Our previous study revealed that R. pedestris contains abundant species-specific genes with unknown origins and functions26. In this study, we provide evidence for the potential horizontal transfer of a bacterial gene to the ancestral of Coreoidea, Lygaeoidea, and Pyrrhocoroidea. The acquired genes undergo duplications, gain introns, and evolve through co-option in recipient insects and serve as the main salivary components that enhance insect feeding.
Results
Identification of potential horizontally transferred salivary proteins in R. pedestris
The salivary proteins of R. pedestris were identified by comparing the untreated soybeans and soybeans infested with R. pedestris using shotgun LC − MS/MS analysis. Out of the 40 salivary proteins that were identified (Table 1), 9 displayed Eutrichophora-specificity and showed sequence similarity to bacterial proteins. We therefore annotated them as horizontal-transferred, Eutrichophora-specific salivary protein (HESPs) (Fig. 1a, Supplementary Table 1). Noteworthy, these nine salivary RpHESPs displayed 15.4% to 98.0% amino acid sequence similarity to each other, with a hit e-value ranging from 0 to 10−24 (Fig. 1b, Supplementary Fig. 1). To identify additional RpHESP homologs in R. pedestris, the nine RpHESPs were searched against the R. pedestris genome. In addition to these nine RpHESPs, we did not find other RpHESPs. These results suggest that all RpHESPs in R. pedestris are secreted into plants during insect feeding.
The nine RpHESPs were tandemly arranged on Chromosome 1, and we designate them as RpHESP1 to RpHESP9 based on their genomic position (Fig. 2a). The RpHESP2 to RpHESP9 genes were located in adjacent gene loci and shared the same transcription orientations, while RpHESP1 was independently located and had different transcription orientations from the other RpHESPs. Except for RpHESP2 that contained 13 exons, the other RpHESPs contained 3 exons (Fig. 2a). To validate the insect origin of these RpHESP sequences, we further analyzed the flanking genes of RpHESPs in the genome. The results showed that the flanking genes upstream and downstream of RpHESPs were belonged to insects (Supplementary Table 2). In addition, the flanking region between RpHESP1 and an insect gene was successfully amplified by PCR, indicating the accuracy of genome assembly. These results suggest that RpHESPs are integrated in the insect genome, but not derive from bacteria or other contamination.
All RpHESP genes have a signal peptide sequence, indicating their potential to be secreted (Fig. 1a). Then, the spatial-temporal expression patterns of RpHESPs in different tissues and developmental stages were analyzed. According to our results, all the RpHESP genes showed similar expression patterns in tissues and development stages, respectively (Fig. 1c, Supplementary Fig. 2). With regard to developmental analysis, the newly deposited eggs expressed the lowest levels of RpHESP transcripts. Since then, the transcript levels gradually increased along with egg development, and remain high in the nymph and adult stages (Supplementary Fig. 2). In tissue analysis, all RpHESP transcripts were found to be expressed exclusively in the salivary glands (Fig. 1c).
HESPs are potentially transferred from bacteria to an ancestral bug
BLASTing RpHESPs against the NCBI nr database revealed that the RpHESPs had no sequence similarity to any known proteins in eukaryotes, with the exception of an uncharacterized protein found in the milkweed bug O. fasciatus (Lygaeoidea). To gain a comprehensive understanding of HESP phylogeny, transcriptomic data from 60 insect species (representing 14 superfamilies within Hemiptera) were analyzed. With the exception in Anoplocnemis dalasi (Coreoidea), the HESP-associated contigs were found in almost all species in Eutrichophora (Supplementary Table 3). In contrast, no HESP homologs were detected in other insect species. An evolutionary analysis based on 34 single copy genes demonstrated that Eutrichophora species (Coreoidea, Lygaeoidea, and Pyrrhocoroidea superfamilies) clustered in the same clade and that Eutrichophora diverged from the Pentatomoidea superfamily at ~117–141 million years ago (Mya) (Fig. 3). Given that HESP distributed in nearly all Eutrichophora species, but not the Pentatomoidea species (Supplementary Table 3), we speculated that HESP might be evolved ~117–141 Mya, the time during the divergency of Eutrichophora and Pentatomoidea.
The genomes of R. pedestris, P. apterus, and O. fasciatus are currently available for analysis. In P. apterus, six PaHESPs were identified and were found to be tandemly arranged in adjacent gene loci on Scaffold 25, suggesting the possibility of duplication events (Fig. 2b). PaHESPs consisted of 2-4 exons. PaHESP5 exhibited different transcriptional orientation compared to the other PaHESPs (Fig. 2b). In O. fasciatus, four OfHESPs were identified. Two OfHESPs were arrayed in adjacent gene loci on Scaffold 5, while the other two were located on Scaffold 45 and Scaffold 607, respectively (Fig. 2c).
All HESPs in the three genome-available insects displayed sequence similarity to bacterial proteins. For example, RpHESP1, PaHESP1, and OfHESP1 were most closely related to genes from the Omnitrophica bacterium, with a hit e-value < 10−110. It is worth to mention that there are only 14 bacteria species contain HESP-associated genes, and except for these bacteria, no gene in fungi or other protists showed sequence similarity to insect HESPs. A maximum likelihood tree was then constructed based on the HESPs in the three insects and HESP-associated genes in bacteria (Fig. 4). The results showed that all insect HESPs were clustered in the same group, while the bacteria genes clustered together. RpHESP1, PaHESP1, and OfHESP1 showed high sequence similarity and were clustered in the same clade. RpHESP2 and PaHESP2 were clustered together in another clade, but no HESP gene in O. fasciatus was found in this clade. It is possible that RpHESP2 and PaHESP2 evolved after the divergence of Lygaeoidea from Pyrrhocoroidea and Coreoidea (Fig. 3), or because of the independent gene losses in O. fasciatus. Our study also identified relatively recent gene duplication events in each of the three species; for example, RpHESP4 to RpHESP8 had more than 90.2% amino acid similarity (Fig. 4; Supplementary Fig. 1). These results suggest that HESPs are duplicated and separated into different clades after acquirement.
Diversified roles of RpHESPs in R. pedestris
RNAi experiments were performed to evaluate the potential functions of RpHESPs. As RpHESP4 to RpHESP8 shared >92% nuclear acid sequence similarity (Supplementary Fig. 3), and it is difficult to silence these genes specifically. Therefore, RpHESP4 to RpHESP8 were collectively silenced using the conserved region (refer to RpHESP4-8). First, dsRNA targeting RpHESP1, RpHESP2, RpHESP3, RpHESP4-8, and RpHESP9 were individually injected into third instar nymphs. Four days after the injection, the RNAi efficacy was evaluated, and the results showed that the dsRNA specifically and effectively reduced the expression of the target genes, respectively (Supplementary Fig. 4). It is noteworthy that the expression of RpHESP4-8 and RpHESP9 were significantly induced when knocked down RpHESP1 or RpHESP2, indicating the potential compensatory effect. After dsRNA inhibition, decreased survival was observed in R. pedestris treated with dsRpHESP1, dsRpHESP3, dsRpHESP4-8, and dsRpHESP9, while no significant difference in survival was observed between R. pedestris treated with dsGFP and dsRpHESP2 (Fig. 5a). The most pronounced lethal effects were observed in insects treated with dsRpHESP9, with 56.4% died within 15 days, followed by dsRpHESP3 (39.5%) and dsRpHESP4-8 (29.7%).
During the feeding process, hemipteran insects usually secrete two types of saliva, gel saliva and watery saliva, into plant tissues. The gel saliva solidifies quickly and forms a salivary sheath that provides mechanical stability, lubrication, and protection against chemical defense7. Salivary sheath is an important indicator for insect feeding activity, as it always envelops stylets and is left in plant tissues after stylet withdrew28. We first dissected salivary sheath left in soybean seeds, and measured their length. The results showed that dsRpHESP4-8-treated R. pedestris secreted significant shorter salivary sheath than that of the dsGFP-treated control (Fig. 5b). In contrast, no significantly difference in length was observed in other dsRpHESPs treatments (Fig. 5b). The salivary sheath secreted by dsRNA-treated R. pedestris was also inspected by scanning electron microscopy. The salivary sheath secreted by dsGFP-treated controls were tubule-shaped with a hollow lumen that allowed stylet movement (Fig. 5c). Similar morphology was observed in salivary sheath secreted from dsRpHESP1-, dsRpHESP2-, dsRpHESP3-, and dsRpHESP9-treated R. pedestris. However, the tubular structure was collapsed in salivary sheath secreted by dsRpHESP4-8-treated R. pedestris (Fig. 5c), indicating the potential involvement of RpHESP4-8 in salivary sheath formation. To further investigate the relationship of RpHESPs and salivary sheath, the salivary sheath component was analyzed using LC-MS/MS analysis. The tubular structure outside the soybean seeds were cut and washed to remove potential watery saliva. As a result, all RpHESPs can be abundantly identified in salivary sheath (Supplementary Table 4). These results suggest that all RpHESPs are present in salivary sheath, but only RpHESP4-8 involves in maintaining the structure of the salivary sheath.
Thereafter, the insect probing frequency and probing time were monitored. The probing frequency was recorded by counting the salivary sheath left in soybean seeds, as R. pedestris secrete salivary sheath in each probing attempt. The results showed that the dsRpHESP4-8- and dsRpHESP9-treated R. pedestris probed less frequently than that of the dsGFP control (Fig. 5d). The probing time was recorded by measuring the time from probing initiation to successful penetration. The dsGFP-treated R. pedestris quickly arched their stylets in a fixed point and started penetration as fast as 3 min (Fig. 5e). Similar probing behaviors were observed in dsRpHESP1-, dsRpHESP2-, and dsRpHESP3-treated R. pedestris. In contrast, the dsRpHESP4-8- and dsRpHESP9-treated insects repeatedly attempted to probe the soybean but failed to start penetration (Fig. 5e). The mean time for dsRpHESP4-8- and dsRpHESP9-treated R. pedestris to initiate penetration was 12 min and 10 min, respectively, which were significantly longer than that of dsGFP-treated insects (Fig. 5e). These results suggested that dsRpHESP4-8- and dsRpHESP9-treatment have negative effect on insect probing process.
Diversified roles of PaHESPs in P. apterus
In this study, the function of HESP was further analyzed in the firebug P. apterus, a true bug mainly feeding on mallow, linden, and clover seeds. Although adjacently arrayed in the genome, these six PaHESP showed great variation in nucleic acid sequence length (from 972 bp to 4296 bp) and amino acid sequence similarity (from 19.8 to 83.8%) (Fig. 6a; Supplementary Fig. 5). The expression patterns of PaHESPs in different tissues were quantified by qRT-PCR. As a result, five out of six PaHESPs were specifically expressed in salivary glands, except for PaHESP5, which was prevalently expressed in salivary glands, carcass, testis, and ovaries (Fig. 6b). All of the PaHESPs were predicted to have a signal peptide, indicating a secretory nature (Fig. 6c). LC-MS/MS was used to determine if PaHESPs were secreted into clover seeds during feeding. Untreated clover seeds were used as a control. Five PaHESPs that specifically expressed in salivary glands were found to be secreted into the host in high amounts, with sequence coverage ranging from 47.4% to 62.3% and unique peptides ranging from 14 to 62 (Fig. 6c; Supplementary Table 5). By contrast, no PaHESP5-derived peptide was detected, suggesting that PaHESP5 no longer serves as a salivary protein after evolution.
RNAi experiments were performed to study the functions of PaHESPs. The dsRNA targeting PaHESP1, PaHESP2, PaHESP3, PaHESP4, PaHESP5, and PaHESP6 were individually injected into third instar nymphs, respectively. qRT-PCR results demonstrated that the dsRNA can specifically and effectively reduce the expression of the target genes, respectively (Supplementary Fig. 6). Following dsRNA inhibition, decreased survivability was observed in dsPaHESP1-, dsPaHESP3- and dsPaHESP5-treated P. apterus. On the contrary, no significant difference in survival was found between dsGFP and other dsPaHESP-treated P. apterus (Fig. 6d). The salivary sheath secreted by dsPaHESP-treated P. apterus was then examined under scanning electron microscopy. No noticeable change in the morphological structure of the salivary sheath was observed after dsPaHESP (Supplementary Fig. 7), indicating that no HESP is critical for maintaining the structure of the salivary sheath in P. apterus.
Discussion
Salivary secretion in herbivorous insects is abundant in orphan proteins of unknown origin and function. This study documented a potential HGT event that occurred at ~117–141 Mya. After being transferred to insects, HESPs underwent duplications, gained introns, and evolved through co-option, allowing them to perform various functions in insect feeding and other processes. Our findings shed new light on the potential origin of orphan proteins in insects and suggest the potential evolution of salivary proteins.
The horizontal transferred genes in insects usually display two unusual properties: an absence of homologies in the majority of other lineages within the Insect phylum, and a high degree of sequence similarity to distantly related species, especially those from the protists12,18. In this study, HESPs was confirmed to be integrated into insect genome (Fig. 2; Supplementary Table 2), strictly distributed in three insect superfamily (Fig. 3; Supplementary Table 3), and showed high sequence similarity to bacteria genes (Fig. 4), suggesting a potential HGT origin. However, compared with most reported HGT events that had a lot of homologues in donor relatives, only a few bacteria species contain HESP-associated genes. Moreover, in current phylogenic tree, insect HESPs did not cluster with bacterial genes in strongly supported bootstrap (Fig. 4). Therefore, we cannot rule out the possibility that HESP-associated genes in bacteria and insects were independently evolved, and their sequence similarity might be resulted from convergent evolution. Also, there is a possibility that the HGT event was originated from insects to bacteria, as Eutrichophora species were intimately associated with symbiotic microorganism29. While our current evidence suggests HGT as the most likely scenario, additional work is needed to definitively rule out alternative hypotheses. Perhaps a high-throughput sequencing of a wider range of organisms in the future might solve this issue.
Recent advancements in HGT detection have shown that many apparent gene duplications are actually the result of HGT instead of autochthonous gene duplication12,30. Our study identified 9, 6, and 4 HESP genes in R. pedestris, P. apterus, and O. fasciatus genomes, respectively. The PaHESPs were found to be located adjacent to each other in the genome of P. apterus (Fig. 2b), suggesting the possibility of a single HGT event, followed by gene duplications. In contrast, although all RpHESPs in R. pedestris located in the same chromosome, RpHESP1 was ~1.5 × 107-bp far away from the other RpHESPs and had different transcription orientations (Fig. 2a). Phylogeny analysis demonstrated that RpHESP1 was clustered with PaHESP1 and OfHESP1, but not the other RpHESPs (Fig. 4). Therefore, we cannot rule out the possibility that RpHESP1 and the other RpHESPs derived from two independent HGT events, which deserved further investigations.
Although most HESPs in R. pedestris and P. apterus likely evolved from autochthonous gene duplication, their functions varied greatly. For example, the newly duplicated RpHESP4 to RpHESP8 were crucial for maintaining the structure of the salivary sheath, while the other RpHESPs were not (Fig. 5). To sustainably survive in the recipient organism, a transferred gene typically needs to provide a selective advantage, either to itself or the recipient31. RNAi experiments demonstrated that multiple HESPs were important for insect survival (Figs. 5; 6), indicating that these genes underwent co-option evolution after being transferred to insect recipients. Despite being abundant in the secretion of RpHESP2, PaHESP2, PaHESP4, and PaHESP6 into plants, no significant phenotypic changes were observed in insects treated with corresponding dsRNA (Figs. 5; 6). R. pedestris and P. apterus are polyphagous insects that feed on a wide range of hosts or the same host at different developmental stages26,32. It is highly possible that some salivary proteins play a role in insects feeding on specific diets, which deserves further investigation. In addition, some horizontally transferred genes displayed functional redundancy after gene duplication in recipients33,34. There may have functional compensation in duplicated HESPs, which contribute to the genetic robustness and provided a selective advantage35.
Although the RpHESP2 to RpHESP9 and PaHESP1 to PaHESP6 were derived from gene duplication, a low level of amino acid sequence similarity was found in some comparison groups, such as a 15.4% similarity between RpHESP2 and RpHESP9, and a 19.8% similarity between PaHESP1 and PaHESP3 (Supplementary Figs. 1, 5). Insect saliva plays critical roles in herbivore–plant interactions, and a few salivary proteins have been found to evolve at a high rate10. While it is likely that there are other HGT-derived orphan salivary proteins in insects, their donor genes have not yet been identified. This could be due to a lack of information about potential donors or the fast evolution of transferred genes in recipients, causing them to show no sequence similarity to their donors12.
Overall, our study provides a comprehensive insight into the evolution of orphan salivary proteins (Supplementary Fig. 8). The HESPs were potentially transferred from bacteria to an ancestral bug of Coreoidea, Lygaeoidea, and Pyrrhocoroidea superfamily. These acquired genes underwent duplications, gained introns, and evolved through co-option, providing a selective advantage to the recipients by enhancing their feeding abilities or other undefined processes.
Methods
Insects
The R. pedestris strain originally collected from a soybean field (33.7° N, 117.0 °E) in Suzhou, China, was reared on soybean pods and soybean seeds for more than 20 generations. The P. apterus strain was originally collected from Xinjiang, China, and reared on clover seeds. Both strains were maintained at 26 ± 0.5 °C and 50 ± 5% relative humidity under a light/dark photoperiod of 16/8 h.
Collection of entire salivary secretion and salivary sheath
Before the experiment, both R. pedestris and P. apterus were subjected to a 24-h starvation period with only water being provided. Next, the insects were introduced into separate cages, one containing soybean seeds and the other clover seeds, and allowed to feed for 24 h. The presence of a salivary sheath indicated the feeding site of the insects, which can be easily observed under a SZ2-ILST Stereomicroscope (Olympus, Shinjuku, Japan). To analyze the component of entire salivary secretion, the salivary sheath and the surrounding area near the salivary sheath were carefully collected using a pair of forceps (Ideal-Tek, Balerna, Switzerland). To analyze the component of salivary sheath only, the tubular structure outside the soybean seeds were cut, and rinsed in 1× PBS to remove potential watery saliva remained in the salivary sheath. The insect salivary secretions were characterized by LC-MS/MS analysis. Two independent biological replicates were performed for each treatment. The none-infested seeds were used as a negative control.
LC-MS/MS analysis
The collected samples were homogenized in the lysis solution containing 1% SDS, 100 mM DTT, and 100 mM Tris-HCl (pH 6.8), and then boiled for 10 min. Later, shotgun-based proteomic analysis was performed via the custom service of Novogene Institute (Novogene, Beijing, China) as previously described2. Briefly, the samples were lysed with a lysis buffer containing 100 mM NH4HCO3, 8 M Urea, and 0.2% SDS, followed by 5 min of ultrasonication on ice. Afterward, the lysate was centrifuged at 12,000 × g for 15 min at 4 °C, and the resulting supernatant was transferred to a clean tube. The sample was then subjected to reduction using 10 mM DTT for 1 h at 56 °C, followed by alkylation with iodoacetamide in the dark at room temperature for 1 h. Subsequently, the sample was thoroughly mixed with 4 times its volume of pre-cooled acetone through vortexing and then incubated at −20 °C for at least 2 h. Following centrifugation, the precipitate was collected. After two washes with cold acetone, the pellet was dissolved in the dissolution buffer.
For trypsin digestion, 3 μL of 1 μg/μL trypsin and 500 μL of 50 mM TEAB buffer were added. The sample was mixed and allowed to digest at 37 °C overnight. Following digestion, formic acid was added to the sample mixture, which was then centrifuged at 12,000 × g for 5 min at room temperature. The resulting supernatant was gently loaded onto a C18 desalting column. The column was washed thrice with a washing buffer (0.1% formic acid, 3% acetonitrile) and then eluted three times with an elution buffer, which included elution buffer (0.1% formic acid, 70% acetonitrile). The eluted sample was collected and subsequently lyophilized.
LC-MS/MS analysis was carried out according to the methods described below. The lyophilized powder was dissolved in 10 μL of solution A (100% water, 0.1% formic acid), and then centrifuged at 14,000 × g for 20 min at 4 °C. Subsequently, 1 μg of the sample was injected into a home-made C18 Nano-Trap column (2 cm × 75 μm, 3 μm). The peptides were separated using a home-made analytical column (15 cm × 150 μm, 1.9 μm) through a linear gradient elution. The separated peptides were analyzed using the Q Exactive HF-X mass spectrometer (Thermo Fisher). The ion source used was Nanospray Flex™ (ESI), with a spray voltage of 2.3 kV and an ion transport capillary temperature of 320 °C. The top 40 precursor ions with the highest abundance in the full scan were selected for fragmentation by higher energy collisional dissociation (HCD), and subsequently analyzed in MS/MS.
The output raw data were preceded to identify potential salivary proteins using the software MaxQuant 1.6.5.0 with the following default parameters, tolerance of one missed cleavages of trypsin, carbamidomethyl (C), oxidation (M), and Acetyl (Protein N-term). Finally, the identifications were filtered to 1% false discovery rate (FDR) at the peptide-spectrum match (PSM) level. To mitigate common contaminants, the Common Repository of Adventitious Proteins database was employed.
Identification of HESPs
A total of nine RpHESPs were identified in the salivary samples of R. pedestris. To identify additional RpHESP homologs in R. pedestris, the RpHESPs were searched against the R. pedestris genome with a cut-off E-value of 10−10. No other RpHESPs were found. Furthermore, the nine RpHESPs were used as queries to search the NCBI nr database. The insect and bacterial HESPs homologues with a cut-off E-value of 10−10 were retrieved and used for subsequent analysis.
As there was limited information regarding insect HESP homologues in the NCBI nr database, transcriptomic datasets were retrieved from 60 species in the SRA repository (Supplementary Table 3). The raw data were de novo assembled using SPAdes (v3.13.0) with default parameters. Thereafter, the assembled contigs were searched against the RpHESPs to identify the potential HESP homologues in other insect species using BLASTX with a cut-off E-value of 10−10.
Bioinformatic analysis of HESPs
The SignalP 5.0 Server (https://services.healthtech.dtu.dk/service.php?SignalP-5.0) was employed to predict the presence of signal peptides and cleavage sites. In addition, the Splign Server (https://www.ncbi.nlm.nih.gov/sutils/splign/splign.cgi) was utilized to predict genomic structure of HESP genes, whereas MegAlign v7.1.0 was adopted to estimate the amino acid sequence similarity36.
Phylogenetic analysis
Currently, the genomes of three insects in Coreoidea, Pyrrhocoroidea, and Lygaeoidea, respectively, are available, including R. pedestris, P. apterus, and O. fasciatus26,27. In this study, the HESPs homologues in these species, together with bacterial genes were used for phylogenetic analysis. First of all, the amino acid sequences were aligned with MAFFT (v7.310) with default parameters (maxiterate:1000), and ambiguously aligned regions were trimmed by Gblock37. Secondly, the best-fit model of amino acid substitution (PROTGAMMAJTT) was evaluated by ModelTest-NG v0.1.6. Then, the maximum likelihood (ML) trees were constructed using RAxML v0.9.0 with 1000 bootstrap replications38. More details of all the reference sequences used in phylogenetic analysis are listed in Supplementary Table 6.
To analyze the species relationship, the coding regions of assembled contigs generated above were firstly predicted using TransDecoder v5.5.0. After filtering redundant alternative splicing events, the protein data set containing nonredundant transcripts was used to find the homologous pairs of sequences by using the all-versus-all BLASTp algorithm with a significant cutoff E-value of 10−5. Thereafter, the BLASTp result was converted into a normalized similarity matrix and processed using OrthoMCL v2.0.9 with default parameters39. In addition, protein families were identified by Markov chain clustering MCL-14–13740. The phylogenetic tree was then constructed using single copy orthologues in each species (1:1:1 genes identified by OrthoMCL analysis), and Drosophila melanogaster was utilized to root the tree. Moreover, sequence alignment was performed by MAFFT v7.1.0. Conserved amino-acid sites were identified by TrimAl v1.241. Furthermore, ModelTest-NG was employed to determine the best model (PROTGAMMAJTT). Then, a ML tree was constructed using RAxML under the LG + I + G4 + F model with 1,000 bootstraps replications. Bayesian MCMCTree v4.9i42 was used to perform the divergence time analysis and calibration time was based on four nodes: D. melanogaster-Bemisia tabaci (330–376 Mya)43, Rhodnius prolixus- Halyomorpha halys (108–227 Mya)44, and Diaphorina citri- Nilaparvata lugens (112–290 Mya)43.
RNA interference (RNAi)
HESPs from R. pedestris and P. apterus were silenced using RNAi. To minimize the non-target effect of RNAi, the target genes were searched against the transcriptomic and genomic database of R. pedestris or P. apterus, and the specific regions of the target genes were selected for dsRNA synthesis. The DNA sequences of target genes were amplified using the primers listed in Supplementary Table 7, and later cloned into pClone007 Vector (Tsingke, Beijing, China). Meanwhile, the recombinant plasmids were amplified using the primers containing T7 sequences, and the PCR-generated DNA templates were adopted to synthesize the double-stranded RNAs (dsRNAs) using a T7 High Yield RNA Transcription Kit (Vazyme, Nanjing, China).
RNAi experiments were performed as previously described45. Briefly, the dsRNA was loaded into a capillary tube using tips. The R. pedestris and P. apterus that were pre-anaesthetized with carbon dioxide were placed on the plate constantly releasing carbon dioxide. Thereafter, dsRNA was injected into the insect mesothorax using FemtoJet (Eppendorf-Netheler-Hinz, Hamburg, Germany). Then, the treated insects were reared on soybean seeds or clover seeds, respectively. The silencing efficiency was determined at four days post-injection using the qRT-PCR.
qRT-PCR
To determine the RNAi efficiency, the whole R. pedestris (4) and P. apterus (4) were used. To collect tissue samples, carcass (10), fat body (10), guts (20), and salivary glands (20) were carefully dissected from the 5th instar nymphs, whereas, testes (10) and ovaries (10) were dissected from male and female adults, respectively. The number of insects used in each sample was illustrated in brackets. Afterwards, total RNA was extracted using RNAiso plus (TaKaRa, Dalian, China). After determining the RNA integrity and quantity, the first-strand cDNA was reverse-transcribed using HiScript II Q RT SuperMix (Vazyme, Nanjing, China). Later, qPCR was run on a Roche Light Cycler® 480 Real-Time PCR System using the SYBR Green Supermix Kit (Yeasen, Shanghai, China). The following PCR conditions were used: denaturation for 5 min at 95 °C, followed by 40 cycles at 95 °C for 10 s and 60 °C for 30 s. Primer Premier v6.0 was used to design the qPCR primers (Supplementary Table 7). Actin and GAPDH were used as the house-keeping genes. The relative quantitative method (2−ΔΔCt) was adopted to evaluate the relative gene expression46. Three independent biological replicates were performed.
Scanning electron microscopy
The treated insects were allowed to feed on soybean seeds or clover seeds for 24 h. Scanning electron microscopy were performed as we previously described6. In detail, the infested seeds were attached to a stub and placed in a dryer at 65 °C for 2 h. Then, the seeds were further dried in a desiccator under vacuum condition. Gold-sputtering was performed when vacuum degree reached 6.5, and the gold-sputtering time was set as 15 s. Thereafter, the samples were visualized by SEM TM4000 II plus (Hitachi, Tokyo, Japan). More than 20 salivary sheaths in each treatment were observed.
Insect bioassays
For the survival assay, the live insects at 24 h after dsRNA treatment were selected to exclude the insects died with mechanical injury. Thereafter, the insects were transferred to a glass tube (diameter, 10 mm; height, 20 cm) containing soybean or clover seeds. The wet cotton was placed into two rearing devices, respectively, to provide the drinking water. Then, the survival rates under each treatment were recorded for 15 consecutive days. A group of 12 insects were included in each treatment, and three independent replicates were performed.
Feeding behavior analysis
Before experiment, the dsRNA-treated R. pedestris (at 4-6 days post injection) were collected and fed on a filter paper for 24 h with water provided only. Subsequently, the insects were transferred into a glass tube with soybean seeds placed at the center. The probing time, which was recorded by measuring the time from probing initiation to successful penetration. The probing initiation was regarded as the first time the stylets contact the soybean seed. Successful penetration was regarded as the stylets anchored at a fixed point and inserted into seeds (the stylet was moveless once penetrated successfully). When the insect stylet contacts the soybean seed, we start the timer. When the stylet successfully penetrates into seeds, we stop the timer.
For analysis of probing frequency, a group of 4 dsRNA-treated R. pedestris were allowed to feed on soybean seeds for 48 h. The probing frequency was recorded by counting the salivary sheath left in soybean seeds under a SZ2-ILST Stereomicroscope, as R. pedestris secrete salivary sheath in each probing attempt.
In addition, the salivary sheath morphology was observed. The salivary sheath left in soybean seeds was carefully disserted under a SZ2-ILST Stereomicroscope using a pair of forceps. The disserted samples were carefully washed in 1× PBS to remove attached soybean components. Thereafter, the salivary sheath was photographed under a SMZ225 Stereomicroscope (Nikon, Tokyo, Japan). The salivary sheath length was measured using ImageJ software v1.53e (https://imagej.nih.gov/).
Statistics and reproducibility
The log-rank test (SPSS Statistics 19, Chicago, IL, USA) was applied to determine the statistical significance of survival distributions (n = 36–47 biological replicates). Two-tailed unpaired Student’s t test was used to analyze the results of qRT-PCR (n = 3 biological replicates), salivary sheath length (n = 18–24 biological replicates), probing frequency (n = 8–9 biological replicates), and probing time (n = 11–14 biological replicates). The number of biological replicates and the exact p value of each statistical test was provided in Supplementary Data 1. Data were graphed in GraphPad Prism 9.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The mass spectra data generated in this study were submitted to the ProteomeXchange with accession numbers PXD039150 and PXD039269. Sequences of PaHESPs and RpHESPs have been deposited in NCBI GenBank with accession numbers OQ126862-OQ126876. Source data are provided in Supplementary Data 1.
References
van Bel, A. J. & Will, T. Functional evaluation of proteins in watery and gel saliva of aphids. Front. Plant Sci. 7, 1840 (2016).
Huang, H. J. et al. Identification of salivary proteins in the whitefly Bemisia tabaci by transcriptomic and LC-MS/MS analyses. Insect Sci. 28, 1369–1381 (2021).
Huang, H. J. et al. Identification of Riptortus pedestris salivary proteins and their roles in inducing plant defenses. Biology 10, 753 (2021).
Villarroel, C. A. et al. Salivary proteins of spider mites suppress defenses in Nicotiana benthamiana and promote mite reproduction. Plant J. 86, 119–131 (2016).
Wang, N. et al. A whitefly effector Bsp9 targets host immunity regulator WRKY33 to promote performance. Philos. Trans. R. Soc. B 374, 20180313 (2019).
Huang, H. J. et al. Planthopper salivary sheath protein LsSP1 contributes to manipulation of rice plant defenses. Nat. Commun. 14, 737 (2023).
Miles, P. W. Aphid saliva. Biol. Rev. Camb. Philos. Soc. 74, 41–85 (1999).
Huang, H. J. et al. A salivary sheath protein essential for the interaction of the brown planthopper with rice plants. Insect Biochem. Mol. Biol. 66, 77–87 (2015).
Will, T. & Vilcinskas, A. The structural sheath protein of aphids is required for phloem feeding. Insect Biochem. Mol. Biol. 57, 34–40 (2015).
Rao, W. et al. Secretome analysis and in planta expression of salivary proteins identify candidate effectors from the brown planthopper Nilaparvata lugens. Mol. Plant Microbe Interact. 32, 227–239 (2019).
Huang, H. J. et al. How does saliva function in planthopper-host interactions? Arch. Insect Biochem. Physiol. 100, e21537 (2019).
Soucy, S. M., Huang, J. & Gogarten, J. P. Horizontal gene transfer: building the web of life. Nat. Rev. Genet. 16, 472–482 (2015).
Sieber, K. B., Bromley, R. E. & Dunning Hotopp, J. C. Lateral gene transfer between prokaryotes and eukaryotes. Exp. Cell Res. 358, 421–426 (2017).
Di Lelio, I. et al. Evolution of an insect immune barrier through horizontal gene transfer mediated by a parasitic wasp. PLoS Genet. 15, e1007998 (2019).
Li, Y. et al. HGT is widespread in insects and contributes to male courtship in lepidopterans. Cell 185, 1–13 (2022).
Moran, N. A. & Jarvik, T. Lateral transfer of genes from fungi underlies carotenoid production in aphids. Science 328, 624–627 (2010).
Gasmi, L. Horizontally transmitted parasitoid killing factor shapes insect defense to parasitoids. Science 373, 535–541 (2021).
Wybouw, N. et al. Horizontal gene transfer contributes to the evolution of arthropod herbivory. Genome Biol. Evol. 8, 1785–1801 (2016).
de la Paz Celorio-Mancera, M. et al. Polygalacturonase causes lygus-like damage on plants: cloning and identification of western tarnished plant bug (Lygus hesperus) polygalacturonases secreted during feeding. Arthropod Plant Interact. 2, 215–225 (2008).
Pauchet, Y. et al. Identification and characterization of plant cell wall degrading enzymes from three glycoside hydrolase families in the cerambycid beetle Apriona japonica. Insect Biochem. Mol. Biol. 49, 1–13 (2014).
Zhao, C., Doucet, D. & Mittapalli, O. Characterization of horizontally transferred β-fructofuranosidase (ScrB) genes in Agrilus planipennis. Insect Mol. Biol. 23, 821–832 (2014).
Ohlen, M. et al. Cyanide detoxification in an insect herbivore: molecular identification of β-cyanoalanine synthases from Pieris rapae. Insect Biochem Mol. Biol. 70, 99–110 (2016).
Weyandt, N., Aghdam, S. A. & Brown, A. M. V. Discovery of early-branching Wolbachia reveals functional enrichment on horizontally transferred genes. Front. Microbiol. 13, 86739 (2022).
Wang, Y. Z. et al. Horizontally transferred salivary protein promotes insect feeding by suppressing ferredoxin-mediated plant defenses. Mol. Biol. Evol. 40, msad221 (2023).
Liu, Y. et al. Higher‐level phylogeny and evolutionary history of pentatomomorpha (Hemiptera: Heteroptera) inferred from mitochondrial genome sequences. Syst. Entomol. 44, 810–819 (2019).
Huang, H. J. et al. Chromosome-level genome assembly of the bean bug Riptortus pedestris. Mol. Ecol. Resour. 21, 2423–2436 (2021).
Panfilio, K. A. et al. Molecular evolutionary trends and feeding ecology diversification in the Hemiptera, anchored by the milkweed bug genome. Genome Biol. 20, 64 (2019).
Wang, Y. et al. Penetration into rice tissues by brown planthopper and fine structure of the salivary sheaths. Entomol. Exp. Appl. 129, 295–307 (2008).
Takeshita, K. & Kikuchi, Y. Riptortus pedestris and Burkholderia symbiont: an ideal model system for insect-microbe symbiotic associations. Res. Microbiol. 168, 175–187 (2017).
Todd, J. & Treangen, E. P. C. R. Horizontal transfer, not duplication, drives the expansion of protein families in prokaryotes. PLoS Genet. 7, e1001284 (2011).
Husnik, F. & McCutcheon, J. P. Functional horizontal gene transfer from bacteria to eukaryotes. Nat. Rev. Microbiol. 16, 67–79 (2018).
Kristenová, M., Exnerová, A. & Štys, P. Seed preferences of Pyrrhocoris apterus (Heteroptera: Pyrrhocoridae): are there specialized trophic populations? European. J. Endocrinol. 108, 581–586 (2011).
Kirsch, R. et al. Metabolic novelty originating from horizontal gene transfer is essential for leaf beetle survival. Proc. Natl Acad. Sci. USA 119, e2205857119 (2022).
Song, W. et al. Contribution of horizontal gene transfer to the functionality of microbial biofilm on a macroalgae. ISME J. 15, 807–817 (2021).
Láruson, Á. J., Yeaman, S. & Lotterhos, K. E. The importance of genetic redundancy in evolution. Trends Ecol. Evol. 35, 809–822 (2020).
Clewley, J. P. & Arnold, C. MEGALIGN. The multiple alignment module of LASERGENE. Methods Mol. Biol. 70, 119–129 (1997).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Kozlov, A. M. et al. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics 35, 4453–4455 (2019).
Li Li, C. J. S. J. & Roos, D. S. OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189 (2003).
Enright, A. J., Van Dongen, S. & Ouzounis, C. A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Yang, Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Misof, B. et al. Phylogenomics resolves the timing and pattern of insect evolution. Science 346, 763–767 (2014).
Johnson, K. P. et al. Phylogenomics and the evolution of hemipteroid insects. Proc. Natl Acad. Sci. USA 115, 12775–12780 (2018).
Xu, H. J. et al. Two insulin receptors determine alternative wing morphs in planthoppers. Nature 519, 464–467 (2015).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2− ΔΔCT method. Methods 25, 402–408 (2001).
Acknowledgements
We thank Hai-Jun Xu from Zhejiang University for kindly providing firebugs. This project has received funding from the National Natural Science Foundation of China (U20A2036: J.P.C., J.M.L.; 32001987: Y.H.L.), the Natural Science Foundation of Zhejiang Province (LDQ24C140001: H.J.H.), and the Young Elite Scientists Sponsorship Program by CAST (2021QNRC001: H.J.H.).
Author information
Authors and Affiliations
Contributions
H.-J.H., C.-X.Z., J.-M.L. and J.-P.C. planned and designed the research. H.-J.H., L.-L.L., Z.-X.Y., J.-B.L., Y.-H.L., and Z.-Y.W. performed experiments and analyzed data. L.-L.L. drew the model of HESP evolution. Z.-T.S. provided valuable suggestion for the research. H.-J.H. drafted the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Hannes Schuler and David Favero. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Huang, HJ., Li, LL., Ye, ZX. et al. Salivary proteins potentially derived from horizontal gene transfer are critical for salivary sheath formation and other feeding processes. Commun Biol 7, 257 (2024). https://doi.org/10.1038/s42003-024-05961-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-024-05961-9
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.