Abstract
Many important natural products are produced by multidomain non-ribosomal peptide synthetases (NRPSs)1,2,3,4. During synthesis, intermediates are covalently bound to integrated carrier domains and transported to neighbouring catalytic domains in an assembly line fashion5. Understanding the structural basis for catalysis with non-ribosomal peptide synthetases will facilitate bioengineering to create novel products. Here we describe the structures of two different holo-non-ribosomal peptide synthetase modules, each revealing a distinct step in the catalytic cycle. One structure depicts the carrier domain cofactor bound to the peptide bond-forming condensation domain, whereas a second structure captures the installation of the amino acid onto the cofactor within the adenylation domain. These structures demonstrate that a conformational change within the adenylation domain guides transfer of intermediates between domains. Furthermore, one structure shows that the condensation and adenylation domains simultaneously adopt their catalytic conformations, increasing the overall efficiency in a revised structural cycle. These structures and the single-particle electron microscopy analysis demonstrate a highly dynamic domain architecture and provide the foundation for understanding the structural mechanisms that could enable engineering of novel non-ribosomal peptide synthetases.
Similar content being viewed by others
Main
A non-ribosomal peptide synthetase (NRPS) module incorporates a single residue into a peptide natural product. Each module contains a peptidyl carrier protein (PCP) that is post-translationally modified with a phosphopantetheine cofactor6, an adenylation domain that loads the amino-acid substrate onto the PCP cofactor, and a condensation domain that catalyses peptide bond formation. NRPSs then use a carboxy (C)-terminal thioesterase or reductase domain to catalyse product release. Structures of individual domains1 provide insight into the NRPS structural mechanism. Interestingly, the adenylation domains have been shown to adopt two catalytic conformations7. First the adenylate-forming conformation activates the amino-acid substrate using ATP to form an aminoacyl adenylate and pyrophosphate. A C-terminal subdomain then rotates by ~140° to form the thioester-forming conformation that is used to install the amino acid onto the PCP7. These two functional states have been observed in structures of the phenylalanine activating adenylation domain of gramicidin synthetase8 and the complexes between adenylation and carrier proteins obtained with mechanism-based inhibitors9,10. Once loaded, both the pantetheine and loaded substrate have been shown to interact transiently with the core of the carrier protein11,12. The structure of SrfA-C, the terminal module from surfactin biosynthesis, contains a condensation–adenylation-PCP–thioesterase architecture and is to date the only structure of an intact NRPS module13. The condensation and adenylation domains share an extensive interface and were proposed to form the core of the module13. Lacking the pantetheine modification, this apo-structure shows the PCP domain directed towards the condensation domain. The other active sites are 40–60 Å from the pantetheinylation site, indicating that extensive domain rearrangements are required to complete the NRPS catalytic cycle. Movement of the PCP domain, potentially coupled to the adenylation C-terminal subdomain rotation7, is necessary for delivery of the peptide intermediates to the different catalytic domains.
We determined structures of two NRPSs with the same architecture as SrfA-C (Extended Data Fig. 1), but with holo-proteins that show functional interactions between the PCP and catalytic domains (Fig. 1). First we present two structures of AB3403 from the human pathogen Acinetobacter baumannii (protein annotation ABBFA_003403 in strain AB307-0294) that belongs to an uncharacterized biosynthetic pathway implicated in motility14, and biofilm15 and pellicle16 formation. We describe the structures of holo-AB3403 obtained without ligands and also upon crystallization in the presence of Mg⋅ATP and glycine, which among the proteinogenic amino acids serves as the best substrate (Extended Data Fig. 2). Second, we present the structure of EntF from Escherichia coli, showing the PCP cofactor covalently trapped with a mechanism-based inhibitor to model thioester formation within the adenylation domain. These results provide views of two distinct steps in the NRPS catalytic cycle and demonstrate how the domain rotation within the adenylation domain mediates the delivery of the PCP between the two catalytic domains.
The structures of AB3403 were determined at 2.7 and 2.9 Å resolution (Extended Data Table 1). No prior structure exists of an NRPS condensation domain bound to a ligand; the holo-AB3403 protein shows the pantetheine cofactor residing in the active site (Fig. 2 and Extended Data Fig. 3a). The two lobes of the condensation domain adopt the closed orientation seen recently in the CDA synthetase condensation domain17. Contacts are made between the pantetheine and the helix running from Glu20 to Leu30, in particular Tyr26 and Ile27, which forms one wall of the tunnel through which the pantetheine approaches the active site (Fig. 2b). Additionally, Tyr37 forms a hydrogen bond with the amide of the cysteamine moiety of the pantetheine cofactor. As the main chain carbonyl of Tyr37 hydrogen bonds to the main chain amide of the catalytic His145, this is a critical interaction to close the two lobes and bring the active histidine into proper position.
Holo-AB3403 therefore illustrates the conformation that is adopted to properly deliver the pantetheine of the PCP to the condensation domain. The PCP is rotated ~30° relative to the orientation of the PCP domain of SrfA-C (Extended Data Fig. 4). The AB3403 PCP interface with the condensation domain is composed of residues from helix α2, the helix that follows the pantetheinylation site at Ser1006, and the loops that precede and follow this helix. In particular, residues Phe999 to Tyr1032 face the condensation domain. Leu1007 and Val1010 form a hydrophobic interaction with Leu22 and Ile80 of the condensation domain. The side chain of Lys1011 forms a hydrogen bond with the main chain carbonyl of Gln78. Finally, Val1026, Ala1027, and Ala1030 on the PCP helix α3 form a hydrophobic interaction with Tyr26 and Leu30. Arg344 of the condensation domain, which is positioned on an insertion compared with SrfA-C, interacts with the phosphate from the cofactor.
The AB3403 adenylation domain (Fig. 2c) is precisely positioned in the adenylate-forming conformation, unlike the adenylation domain of SrfA-C, which is in an open conformation that may be used for substrate binding or release5. The lysine of the conserved catalytic A10 motif 7,18 interacts with a phosphate oxygen from AMP and a carboxylate oxygen with glycine and superimposes with the homologous lysine in the gramicidin synthetase domain. In SrfA-C, the homologous lysine is ~12 Å away.
The thioesterase domain of AB3403 is structurally similar to the homologous domains of both SrfA-C and EntF (Extended Data Fig. 5), the latter of which has been characterized by NMR and crystallography in complex with the upstream PCP domain19,20. Despite the similarities in domain structure, the thioesterase domain of AB3403 is in a markedly different location compared with SrfA-C (Fig. 3a). Interestingly, in this new position the thioesterase domain cradles the back face of the PCP domain. The thioesterase domains of SrfA-C or AB3403 do not make substantial contacts with the other catalytic domains.
We next examined the delivery of the holo-PCP to the adenylation domain in a different NRPS protein. We have previously used targeted mechanism-based inhibitors, harbouring a vinylsulfonamide moiety that traps the thioester-forming reaction21 to characterize functional adenylation-PCP di-domain interactions9,10. These inhibitors mimic the native aminoacyl adenylate, but contain a Michael acceptor positioned to react with the pantetheine thiol. EntF crystallized only in the presence of the serine adenosine vinylsulfonamide (Ser-AVS) inhibitor (Fig. 2d and Extended Data Fig. 6) that limits conformational flexibility to promote crystallization. Crystals of the EntF protein diffract to 2.8 Å (Extended Data Table 2). No electron density was observed for the thioesterase domain although the intact protein was present in the crystal lattice (Extended Data Fig. 7).
The condensation domain of EntF is similar to the closed AB3403 conformation (Fig. 2a). The adenylation domain adopts the catalytic thioester-forming conformation of prior adenylation-PCP proteins9,10, demonstrating that the conformation is compatible with a full NRPS module. The active site of the EntF adenylation domain identifies conserved residues (Fig. 2d) that have been shown to play important catalytic roles in other members of this enzyme superfamily7. Arg863 interacts with the cofactor phosphate, while Gly864 and Gln865 form one wall of the pantetheine tunnel. Interactions with the nucleotide occur between Asp840 and the ribose hydroxyls, and between Tyr746 and Tyr852 and the adenine ring. The inhibitor serine binds in the binding pocket formed by Asp648, Ser722, and Asp754 (Fig. 2d).
The lack of density for the thioesterase domain in EntF suggested multiple conformations in the crystal lattice. This is not surprising given the limited interactions in SrfA-C and AB3403 between the thioesterase domains and the other domains. To assess thioesterase conformational mobility, we examined EntF by negative-stain electron microscopy followed by classification and averaging of single-particle projections (Extended Data Fig. 8). The class averages revealed primarily a tri-lobed density with two neighbouring globular densities of similar size attributed to the condensation and adenylation domains and a smaller lobe attributed to the thioesterase domain (Fig. 3b). The positioning of the thioesterase domain assumes a surprisingly wide range of distances and angles relative to the other domains.
The large interface of the SrfA-C condensation and adenylation domains13 suggested they constitute a catalytic platform, upon which the other domains move. We therefore compared the interfaces of the three NRPS modules (Fig. 3c). The interface in AB3403 is 1,023 Å2, comparable in size to the 1,097 Å2 interface of SrfA-C. In contrast, the interface in EntF is only 780 Å2, resulting from the rotation of the adenylation C-terminal subdomain to the thioester-forming conformation.
Additionally, the conformation of the interface is not conserved between all three proteins. Alignment of the structures on the basis of the amino (N)-terminal subdomains of the adenylation domain shows that the condensation domain of both AB3403 and EntF differ slightly from each other and more significantly from SrfA-C. In AB3403 and EntF, the condensation domains are rotated by ~25° relative to the adenylation domains. Furthermore, the EntF condensation domain is shifted closer towards the adenylation domain. Structural comparisons suggest that this alternative conformation in EntF may not be compatible with the adenylate-forming conformation. The three different condensation–adenylation domain conformations, the adenylate-forming incompatibility seen in EntF, and the multiple extended and compact conformations seen in the electron microscopy data suggest that the condensation–adenylation domain platform may be more dynamic than previously proposed13.
The new structures confirm the hypothesis7 that the adenylation domain conformational change is a structural mechanism to guide the PCP between active sites in the context of complete NRPS modules. The rotation of the adenylation domain C-terminal subdomain from the adenylate-forming conformation in AB3403 to the thioester-forming conformation of EntF delivers the PCP into the adenylation domain for loading. The recent structure of loaded holo-PCP has shown the interaction of the substrate with the PCP core which may help to promote release of the substrate from the adenylation domain11. This interaction also alters the surface electrostatic potential of regions that interact with the neighbouring catalytic domains, including α2 and α3, and may influence the PCP delivery to neighbouring catalytic domains. Finally, this transfer is further assisted by the linker region that joins the adenylation C-terminal subdomain with the PCP domain, which includes important contacts that are preserved in the adenylate- and thioester-forming conformations22, as well as the open conformation of SrfA-C.
The basic NRPS catalytic cycle requires that the PCP visits three adjacent catalytic domains in a coordinated manner. The two catalytic conformations of the adenylation domain7 require that the full cycle has four catalytic structural states (Fig. 4). Specifically, (I) the adenylation domain catalyses amino-acid adenylation, (II) the PCP is delivered to the adenylation domain for thioester-formation to load the PCP, (III) the PCP is delivered to the condensation domain to receive the upstream peptide, and finally (IV) the peptide is delivered to a downstream condensation, thioesterase, or reductase domain for release.
Our results show that states I and III are identical and only three distinct conformations are required to accommodate the four catalytic states of the NRPS cycle (Fig. 4, yellow). The protein first adopts an adenylate-forming conformation, seen in AB3403, state III, to catalyse amino-acid adenylation. Through the domain rotation of the adenylation C-terminal subdomain, the PCP is delivered to the adenylation domain to load the pantetheine cofactor, as seen in the crystal structure of EntF, state II. Return of the PCP to the condensation domain delivers the loaded PCP for receipt of the upstream peptide, state III. Critically, as seen in AB3403, the adenylation domain can activate a second amino acid to prime the system for another cycle. The ability to simultaneously catalyse peptide bond formation and amino-acid adenylation at two active sites significantly increases the overall catalytic efficiency and throughput of the NRPS module. Finally, although no structure exists of a full NRPS module with the PCP directed into the thioesterase or other downstream domain in state IV, the structure of AB3403 also offers a new view of the thioesterase domain and suggests the peptide-loaded PCP could be delivered to the downstream thioesterase domain through a simple rotation.
The modular architecture of NRPSs as well as their capacity to catalyse unusual chemistry23,24 offer the potential for generating novel products through engineering enzyme activity and the combination of heterologous domains. These efforts have been limited by deficiencies in our understanding of the functional interactions between domains and within active sites. The new views of two essential catalytic states in the NRPS cycle, an appreciation of the greater dynamics of NRPS systems, and the structures of holo-NRPS proteins with relevant ligands will provide the necessary insights to guide these engineering efforts. In addition, these studies complement the recent visualization of modular polyketide synthases by cryo-electron microscopy25 to set the stage for investigations of the structural foundation of even larger, multi-modular biosynthetic proteins.
Methods
No statistical methods were used to predetermine sample size. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.
Expression, purification, and crystallization of AB3403
The human pathogen A. baumannii contains an uncharacterized NRPS cluster that has been implicated in motility and biofilm formation; the product of this operon is unknown. This operon contains eight genes. In strain AB307-0294 (ref. 26), from which the NRPS gene was cloned, this operon consists of genes ABBFA_003399 to ABBFA_003406. In the more commonly used ATCC17978 strain, the same genes are encoded by A1S_0119 to A1S_0112. The ABBFA_003403 (designated AB3403 herein) protein sequence is available at GenBank under accession number ACJ56070.1.
The gene encoding AB3403 was PCR-amplified from AB307-0294 genomic DNA26 (courtesy of T. A. Russo). The amplified fragment was cloned into the pET15b-TEV expression vector27 and confirmed by DNA sequencing. The vector provides a His5-tag, linker, and tobacco etch virus (TEV) protease recognition site that, upon treatment with TEV protease, yields a final recombinant product with glycine and histidine preceding the initial methionine residue.
The AB3403 pET15b-TEV construct was transformed into E. coli (BL21-DE3) cells. Transformed cells were grown in LB media to an absorbance at 600 nm (A600 nm) of 0.6 at 37 °C. Protein expression was induced by addition of 0.5 mM isopropyl-β-d-thiogalactoside (IPTG) and cells were incubated overnight at 16 °C. Cells were harvested by centrifugation, flash-frozen in liquid nitrogen, and stored at −80 °C. Selenomethionine-labelled protein was generated in M9 minimal media using a metabolic inhibition protocol28. All purification steps were identical to the native protein.
For purification, cells were resuspended in a buffer containing 50 mM HEPES (pH 7.5), 250 mM NaCl, 10 mM imidazole, 0.2 mM TCEP. Cells were lysed by mechanical disruption (Branson Sonifier) and the resulting lysate was clarified by centrifugation at 235,000g for 45 min. The cell lysate was passed over a His-trap (GE Healthcare) immobilized metal ion affinity column and washed with lysis buffer containing 50 mM imidazole. Bound proteins were eluted with the same buffer containing 300 mM imidazole. The protein was incubated with TEV protease and dialysed against a TEV cleavage buffer (50 mM HEPES (pH 8.0), 250 mM NaCl, 0.2 mM TCEP, and 0.5 mM EDTA) for 16 h at 4 °C. This partly purified protein was then phosphopantetheinylated by incubation with His6-tagged non-specific phosphopantetheinyl transferase Sfp (10 nM), 12.5 mM MgCl2, and 1 mM CoA for 60 min at 20 °C. The clarified protein was then passed over the His-trap column a second time to remove uncleaved protein, the TEV protease, Sfp, and other contaminating proteins. The holo-AB3403 protein in the column flow-through was pooled, dialysed against a size exclusion buffer containing 50 mM HEPES (pH 7.5), 150 mM NaCl, 0.2 mM TCEP, and further purified by gel filtration (Superdex200). Protein concentration was assessed after dialysis against a crystallization buffer (25 mM HEPES (pH 7.5), 50 mM NaCl, 0.2 mM TCEP) using an extinction coefficient at 280 nm of 157,570 M−1 cm−1.
Crystallization conditions for holo-AB3403 were initially identified from a sparse matrix screen at 20 °C. Final crystals for native and SeMet-labelled holo-AB3403 were grown at 14 °C by hanging-drop vapour diffusion against 0.75–0.95 M potassium citrate, 0.01–0.025 M glycine, and 0.05 M bis-tris propane (BTP) (pH 8.0). Highest-quality native crystals were obtained using a protein concentration of 5.5 mg ml−1 with a protein to cocktail ratio of 1.5:1. SeMet protein was crystallized in the same manner with a protein concentration of 7.5 mg ml−1 and 1:1 protein to cocktail ratio. To obtain crystals in the presence of ligands, the protein was pre-incubated for 45 min at 4 °C with 2 mM MgCl2, and 1.5-fold molar excess of ATP and glycine.
Structure determination of AB3403
Crystals of holo-AB3403 were cryoprotected by stages using either ethylene glycol or potassium citrate for native and SeMet protein, respectively. The native protein crystals were cryo-protected with cocktails containing 1.0 M potassium citrate, 0.3 M glycine, 0.05 M BTP (pH 8.0), and increasing (8, 16, and 24%) v/v ethylene glycol. The SeMet-labelled protein was cryo-protected with cocktails containing 0.3 M glycine, 0.05M BTP (pH 8.0) and increasing (1.0, 1.2, 1.4, and 1.6 M) potassium citrate. Crystals derived from protein co-crystallized with ligands included the same concentration of MgCl2, ATP, and glycine in the cryo-protectant cocktails.
Diffraction data were collected on APS beamline 23-IDB. The native data (2.7 Å) were collected using a multi-crystal, multi-data set strategy using two crystals. A complete low-resolution scan was taken for one crystal followed by a higher-resolution scan of the best diffracting crystal. A high-resolution region of the second crystal was combined with the two scans from the first crystal. The optimal regions were identified with the JBLU-ICE software at the GM/CA beamline. A single peak wavelength data set (3.35 Å) was collected for SeMet-labelled protein. The liganded protein data set was collected with a single crystal.
Diffraction data were indexed, merged, and scaled using iMOSFLM29 in space group P4x212. Structure determination was performed with PHENIX30 using a combination of experimental single-wavelength anomalous diffraction (SAD) phasing and phased molecular replacement. A partial molecular replacement solution was positioned through PHASER with a sculpted (PHENIX sculptor) model derived from PheA (PDB accession number 1AMU)8 and CytC1 (PDB 3VNR). Using this partial molecular replacement model, the selenium sites were identified with the SAD data from SeMet-labelled crystals. An initial model was produced with PHENIX Autobuild that contained ~65% of the protein molecule, spread across multiple symmetry related molecules. This model was combined into a single protein chain, built and refined iteratively against native data using ARP-WARP31, COOT32, and PHENIX refine.
The final refinements were performed with translation-libration-screw-rotation (TLS) parameterization33 with groups consisting of residues 1:191, 191:445, 446:480, 481:862, 863:959, 960:973, 974:1044, and 1054:1318, roughly defining the NRPS domain (or subdomain) boundaries. The protein is complete from residues Asn2 to Pro1319 with two small disordered loops in the adenylation domain at Asn500–Asp501 and Gly627–Gly630. The latter loop is part of the conserved serine/threonine- and glycine-rich P-loop that is involved in binding the triphosphate of the nucleotide7. Additionally, the condensation domain contains electron density for a diacylglycerol lipid molecule that co-purified with the protein and potentially derived from the bacterial membrane during cell disruption. Diffraction and refinement statistics are presented in Extended Data Table 1. Experimental electron densities of the ligands of both structures are presented in stereo format in Extended Data Fig. 3.
Purification of EntF
The enterobactin biosynthetic cluster of E. coli has been used as a model system in many studies. The full-length EntF, containing the condensation, adenylation, PCP, thioesterase domain architecture, loads serine onto the PCP domain. The condensation domain then recognizes the external carrier protein EntB that has been loaded with 2,3-dihydroxybenzoate (DHB) by the activity of the freestanding adenylation domain EntE. The DHB-serine amide is then transferred to the thioesterase domain while two additional cycles of synthesis complete the enterobactin trilactone.
The EntF protein used in this study (GenBank P11454) was described previously22,34. The entf gene was PCR amplified from E. coli JM109 and cloned into a pET15-TEV vector with a N-terminal 5× His-tag and a TEV protease cleavage site22. The entf vector was transformed into E. coli (BL21-DE3) cells for protein expression. Cells were grown in lysogeny broth (LB) media to A600 nm = 0.6 at 37 °C before protein induction with 1 mM IPTG. Cells were grown overnight at 16 °C and collected by centrifugation. The cell pellets were flash frozen in liquid nitrogen. Selenomethionine-labelled EntF was expressed in M9 minimal media as described28.
For purification both of native and of SeMet-labelled protein, cells were resuspended in lysis buffer containing 50 mM Tris-HCl pH 7.5, 400 mM NaCl, 0.2 mM TCEP, 10% glycerol, and 10 mM imidazole. Cells were lysed via sonication and centrifuged at 235,000g for 45 min. Initial purification was achieved with a His-trap immobilized metal ion affinity column. Protein was eluted using lysis buffer with 300 mM imidazole. EntF was incubated with TEV protease overnight at 4 °C in a cleavage buffer containing 50 mM Tris pH 7.5, 400 mM NaCl, 0.2 mM TCEP, 10% glycerol, and 0.5 mM EDTA. Although expressed in E. coli, phosphopantetheinylation was assured by the addition of 10 nM Sfp, 1 mM CoA, and 12.5 mM MgCl2. The reaction was incubated at room temperature (22 °C) for 1–2 h. The holo-EntF was run over an immobilized metal ion affinity column once more to remove uncleaved protein along with Sfp. A final polishing step was performed with a Superdex 200 16/600 column in a final dialysis buffer containing 50 mM EPPS pH 8.0, 150 mM NaCl, 0.2 mM TCEP, 1 mM MgCl2, and 10% glycerol. Before crystallization, the Ser-AVS inhibitor was added at a concentration four times that of EntF and allowed to incubate for 2–4 h at room temperature.
For electron microscopy, native EntF was purified as above with the exception that a minimal dialysis buffer was used, which contained 50 mM EPPS pH 8.0, 100 mM NaCl, and 0.2 mM TCEP. No inhibitor was added.
Crystal conditions for the Ser-AVS inhibited EntF were first identified using the Hauptman-Woodward high-throughput screen35. Large diffraction-quality native and SeMet crystals were grown using hanging drop vapour diffusion at 20 °C. A crystallization cocktail, consisting of 100 mM BTP pH 7.5, 125–150 mM MgCl2, and 22–28% PEG 4000, was diluted 1:1 with the final dialysis buffer. The hanging drops then combined protein at 30 mg ml−1 and the undiluted crystallization cocktail at a ratio of 1:2. This ‘batch mimic’ limited the differences between the drop and reservoir and has been successful with other protein samples in our laboratory36.
Structure determination of EntF
Native EntF crystals were cryoprotected by that addition of 2,3-butanediol directly to the crystallization drop to a final concentration of ~10%. SeMet crystals were cryoprotected similarly except with glycerol to a final concentration of ~20%. Diffraction data were collected on APS beamline 23-IDB using the rastering option to find the optimal spots on both the native the SeMet crystals. Diffraction data were indexed, merged, and scaled using iMOSFLM29 in space group P4x212. Structure determination for the SeMet inflection data was performed in PHENIX30 using a PhaserEP MR-SAD with a partial molecular replacement solution that was obtained using a sculpted model (generated with PHENIX sculptor) derived from the Pseudomonas aeruginosa bidomain adenylation-PCP protein PA1221 (PDB 4DG9)9. Automated model building with BUCCANEER was used to build ~65% of the structure37. This partial model from the SeMet data was used as a molecular replacement model for the native data, and the remaining portion of the protein was built by hand (excluding the thioesterase domain, which was unresolved and constitutes about 19%). This model was built and refined iteratively using COOT32 and PHENIX refine. TLS refinement33 was used in final stages with groups consisting of residues 5:186, 187:429, 430:444, 445:857, 858:964, 965:971, and 972:1045.
The final model showed density for the condensation, adenylation, and PCP domains of EntF; no density was observed for the thioesterase domain. Diffraction and refinement statistics are presented in Extended Data Table 2.
In general, the overall quality of the density was weaker for the N-terminal subdomain of the condensation domain, residues 1–186, probably reflecting the higher mobility of this region of the protein. The average B-factors for different regions of the protein (Extended Data Table 2) support this conclusion.
Negative-stain electron microscopy analysis of EntF
EntF, purified as described above, was prepared for electron microscopy using the conventional negative staining protocol38, and imaged at room temperature with a Tecnai T12 electron microscope operated at 120 kV using low-dose procedures. Images were recorded at a magnification of ×71,138 and a defocus value of ~1.5 μm on a Gatan US4000 CCD camera. All images were binned (2 pixels × 2 pixels) to obtain a pixel size of 4.16 Å on the specimen level. Particles were manually excised using e2boxer (part of the EMAN 2 software suite)39. Two-dimensional reference-free alignment and classification of particle projections was performed using ISAC40. A total of 17,431 projections of EntF were subjected to ISAC, producing 133 classes consistent over two-way matching and accounting for 5,344 particle projections (Extended Data Fig. 8B).
Synthesis of serine adenosine vinylsulfonamide
Ser-AVS was synthesized using the protocol summarized in (Extended Data Fig. 6). All reactions were performed under an inert atmosphere of dry Ar in oven-dried (150 °C) glassware. 1H and 13C NMR spectra were recorded on a Varian 600 MHz spectrometer. Proton chemical shifts are reported in parts per million from an internal standard of residual chloroform (7.26 p.p.m.) or methanol (3.31 p.p.m.), and carbon chemical shifts are reported using an internal standard of residual chloroform (77.3 p.p.m.) or methanol (49.1 p.p.m.). Proton chemical data are reported as follows: chemical shift, multiplicity (s, singlet; d, doublet; t, triplet; m, multiplet; br, broad), integration, coupling constant. High-resolution mass spectra were obtained on an Agilent TOF II time of flight/mass spectrometry (TOF/MS) instrument equipped with either an ESI or APCI interface. Thin-layer chromatography (TLC) analyses were performed on TLC silica gel 60F254 from EMD Chemical, and were visualized with ultraviolet light or 10% PMA solution. Purifications were performed by flash chromatography on silica gel (Dynamic Adsorbents, 60A).
Materials. Chemicals, reagents, and solvents were purchased from Sigma Aldrich, Chem-Impex, or Acros Organic Fischer, and were used as received. An anhydrous solvent-dispensing system (J. C. Meyer) using two packed columns of neutral alumina was used for drying tetrahydrofuran (THF), Et2O, while two packed columns of molecular sieves were used to dry DMF and the solvents were dispensed under argon. Compound 1 was purchased from Chem-Impex and used as received. Compounds 2 (ref. 41) and 4 (ref. 10) were synthesized according to the reported procedures.
tert-Butyl (R,E)-4-(2-(N-(tert-butoxycarbonyl)sulfamoyl)vinyl)-2,2-dimethyloxazolidine-3-carboxylate (3). To a solution of tert-butyl (2) (395 mg, 1.0 mmol, 2.0 equiv) in 1:3 DMF–THF (4 ml) at −78 °C, was added a 1 M solution of LiHMDS in THF (2.0 ml, 4.0 equiv) dropwise over 15 min and the solution was stirred at -78 °C for an additional 15 min. Next, Garner’s aldehyde (1) (115 mg, 0.5 mmol, 1.0 equiv) in THF (1 ml) was added to the reaction over 15 min. The solution was gradually warmed to 25 °C and stirred for 15 h. The solvent was removed in vacuo and the mixture was taken up in H2O (30 ml). The pH was adjusted to 3–4 with 1 N aqueous HCl, then was extracted with ethyl acetate (EtOAc) (3 × 20 ml). The combined organic layers were washed with H2O (30 ml), saturated aqueous NaCl (30 ml), dried (MgSO4), and concentrated. Purification by flash chromatography (10% EtOAc–hexane to 50% EtOAc–hexanes) afforded the title compound 3 as colourless oil (150 mg, 74%): retardation factor (Rf) = 0.50 (50:50 EtOAc–hexanes); [α] +0.9 (c 0.02, CH2Cl2); 1H NMR (600 MHz, CD3OD) 1H NMR (600 MHz, CD3OD) δ 1.45 (s, 3H), 1.48 (m, 9H), 1.51 (s, 9H), 1.60 (s, 3H), 3.83–3.85 (m, 1H), 4.15 (dd, J = 12.0, 6.0 Hz, 1H), 4.56–4.58 (m, 1H), 6.64 (d, J = 18 Hz, 1H), 6.77–6.81 (m, 1H); 13C NMR (150 MHz, CD3OD) δ 28.41, 28.47, 28.80, 28.81, 58.7, 68.3, 84.19, 84.22, 95.8, 130.6, 145.7, 152.2, 152.7; HRMS (ESI–) calculated for C17H29N2O7S [M − H]− 405.1701, found 405.1721 (error 4.9 p.p.m.).
Ser-AVS
To a solution of N6, N6-bis(tert-butoxycarbonyl)-2′,3′-O-isopropylideneadenosine (4) (73 mg, 0.14 mmol, 1.1 equiv), vinylsulfonamide (3) (52 mg, 0.13 mmol, 1.0 equiv) and PPh3 (56 mg, 0.21 mmol, 1.7 equiv) in THF (1 ml) at 0 °C, was added a solution of DIAD (42 μl, 0.21 mmol, 1.7 equiv) in THF (1 ml) over 1 h using a syringe pump. The solution was gradually warmed up to 23 °C and stirred overnight. The mixture was filtered over a short pad of silica gel, which was washed with 20% EtOAc-hexanes (100 ml). The filtrate was concentrated to afford crude 5 (Rf = 0.45, 50:50 EtOAc–hexanes), which was used in the next step without further purification. To a solution of crude 5 from the previous step was added 80% aqueous trifluoroacetic acid (1 ml) at 0 °C. The solution was stirred for 6 h at 0 °C then concentrated. Recrystallization from 1:20 MeOH–Et2O (5 ml) afforded the title compound (32 mg, 47%) as colourless film: [α] -10.3 (c 0.600, MeOH); 1H NMR (600 MHz, CD3OD) δ 3.30–3.39 (m, 2H), 3.67–3.70 (m, 1H), 3.83 (dd, J = 11.6, 4.1 Hz, 1H), 4.05–4.08 (m, 1H), 4.22–4.25 (m, 1H), 4.34–4.35 (m, 1H), 4.77–4.81 (m, 1H), 5.94 (d, J = 6.2 Hz, 1H), 6.70 (dd, J = 15.4, 6.5 Hz, 1H), 6.77 (d, J = 15.4 Hz, 1H), 8.27 (s, 1H), 8.29 (s, 1H); 13C NMR (150 MHz, CD3OD) δ 45.8, 54.1, 62.3, 72.9, 74.8, 85.8, 91.7, 121.3, 134.8, 137.0, 143.2, 149.9, 151.3, 156.1; HRMS (ESI+) calculated for C14H22N7O6S [M + H]+ 416.1347, found 416.1339 (error 1.9 p.p.m.).
Kinetic analysis of AB3403
Substrate preference for the adenylation domain of holo-AB3403 was established by the pyrophosphate exchange assay42 allowing radiolabelled PPi to be incorporated into ATP in the reverse reaction. One micromolar holo-AB3403 was added to 2 mM ATP, 0.2 mM NaPPi, 50 mM HEPES (pH 7.5), 100 mM NaCl, 10 mM MgCl2, 0.15 μCi [32P]PPi, and 5 mM substrate. Reactions (100 μl) were incubated for 10 min at 37 °C, then quenched with 0.5 ml 1.2% charcoal, 0.1 M unlabelled PPi, and 0.35 M perchloric acid. The charcoal was pelleted by centrifugation, washed twice with 1 ml H2O, and resuspended in 0.5 ml H2O for scintillation counting.
To determine the apparent kinetic constants for ATP and glycine for the holo-AB3403 adenylation domain, the NADH+ consumption assay monitored at A340 nm (refs 43, 44) with full-length AB3403. Hydroxylamine was used as a surrogate for the pantetheine in the second partial reaction to displace AMP for use in the coupled consumption assay45. Standard reactions contained 50 mM HEPES (pH 7.5), 15 mM MgCl2, 2 mM ATP, 3 mM phosphoenolpyruvate, 0.2 mM NADH+, 5 U myokinase, 5 U pyruvate kinase, 6.5 U lactate dehydrogenase, and 150 mM buffered hydroxylamine. Apparent kinetic constants were determined by varying concentrations of ATP or glycine with the one or the other in excess. Reactions were initiated by the addition of 0.001 mM enzyme. Calculations were done using PRISM software.
References
Koglin, A. & Walsh, C. T. Structural insights into nonribosomal peptide enzymatic assembly lines. Nat. Prod. Rep. 26, 987–1000 (2009)
Fischbach, M. A. & Walsh, C. T. Antibiotics for emerging pathogens. Science 325, 1089–1093 (2009)
Walsh, C. T. The chemical versatility of natural-product assembly lines. Acc. Chem. Res. 41, 4–10 (2008)
Walsh, C. T. & Fischbach, M. A. Natural products version 2.0: connecting genes to molecules. J. Am. Chem. Soc. 132, 2469–2493 (2010)
Strieker, M., Tanovic´, A. & Marahiel, M. A. Nonribosomal peptide synthetases: structures and dynamics. Curr. Opin. Struct. Biol. 20, 234–240 (2010)
Mercer, A. C. & Burkart, M. D. The ubiquitous carrier protein--a window to metabolite biosynthesis. Nat. Prod. Rep. 24, 750–773 (2007)
Gulick, A. M. Conformational dynamics in the Acyl-CoA synthetases, adenylation domains of non-ribosomal peptide synthetases, and firefly luciferase. ACS Chem. Biol. 4, 811–827 (2009)
Conti, E., Stachelhaus, T., Marahiel, M. A. & Brick, P. Structural basis for the activation of phenylalanine in the non-ribosomal biosynthesis of gramicidin S. EMBO J. 16, 4174–4183 (1997)
Mitchell, C. A., Shi, C., Aldrich, C. C. & Gulick, A. M. Structure of PA1221, a nonribosomal peptide synthetase containing adenylation and peptidyl carrier protein domains. Biochemistry 51, 3252–3263 (2012)
Sundlov, J. A., Shi, C., Wilson, D. J., Aldrich, C. C. & Gulick, A. M. Structural and functional investigation of the intermolecular interaction between NRPS adenylation and carrier protein domains. Chem. Biol. 19, 188–198 (2012)
Goodrich, A. C., Harden, B. J. & Frueh, D. P. Solution structure of a nonribosomal peptide synthetase carrier protein loaded with its substrate reveals transient, well-defined contacts. J. Am. Chem. Soc. 137, 12100–12109 (2015)
Jaremko, M. J., Lee, D. J., Opella, S. J. & Burkart, M. D. Structure and substrate sequestration in the pyoluteorin type II peptidyl carrier protein PltL. J. Am. Chem. Soc. 137, 11546–11549 (2015)
Tanovic, A., Samel, S. A., Essen, L. O. & Marahiel, M. A. Crystal structure of the termination module of a nonribosomal peptide synthetase. Science 321, 659–663 (2008)
Clemmer, K. M., Bonomo, R. A. & Rather, P. N. Genetic analysis of surface motility in Acinetobacter baumannii. Microbiology 157, 2534–2544 (2011)
Rumbo-Feal, S. et al. Whole transcriptome analysis of Acinetobacter baumannii assessed by RNA-sequencing reveals different mRNA expression profiles in biofilm compared to planktonic cells. PLoS One 8, e72968 (2013)
Giles, S. K., Stroeher, U. H., Eijkelkamp, B. A. & Brown, M. H. Identification of genes essential for pellicle formation in Acinetobacter baumannii. BMC Microbiol . 15, 116 (2015)
Bloudoff, K., Rodionov, D. & Schmeing, T. M. Crystal structures of the first condensation domain of CDA synthetase suggest conformational changes during the synthetic cycle of nonribosomal peptide synthetases. J. Mol. Biol. 425, 3137–3150 (2013)
Marahiel, M. A., Stachelhaus, T. & Mootz, H. D. Modular peptide synthetases involved in nonribosomal peptide synthesis. Chem. Rev. 97, 2651–2674 (1997)
Frueh, D. P. et al. Dynamic thiolation-thioesterase structure of a non-ribosomal peptide synthetase. Nature 454, 903–906 (2008)
Liu, Y., Zheng, T. & Bruner, S. D. Structural basis for phosphopantetheinyl carrier domain interactions in the terminal module of nonribosomal peptide synthetases. Chem. Biol. 18, 1482–1488 (2011)
Qiao, C., Wilson, D. J., Bennett, E. M. & Aldrich, C. C. A mechanism-based aryl carrier protein/thiolation domain affinity probe. J. Am. Chem. Soc. 129, 6350–6351 (2007)
Miller, B. R., Sundlov, J. A., Drake, E. J., Makin, T. A. & Gulick, A. M. Analysis of the linker region joining the adenylation and carrier protein domains of the modular nonribosomal peptide synthetases. Proteins 82, 2691–2702 (2014)
Gaudelli, N. M., Long, D. H. & Townsend, C. A. β-Lactam formation by a non-ribosomal peptide synthetase during antibiotic biosynthesis. Nature 520, 383–387 (2015)
Maruyama, C. et al. A stand-alone adenylation domain forms amide bonds in streptothricin biosynthesis. Nature Chem. Biol. 8, 791–797 (2012)
Dutta, S. et al. Structure of a modular polyketide synthase. Nature 510, 512–517 (2014)
Adams, M. D. et al. Comparative genome sequence analysis of multidrug-resistant Acinetobacter baumannii. J. Bacteriol. 190, 8053–8064 (2008)
Kapust, R. B. et al. Tobacco etch virus protease: mechanism of autolysis and rational design of stable mutants with wild-type catalytic proficiency. Protein Eng. 14, 993–1000 (2001)
Doublié, S. Preparation of selenomethionyl proteins for phase determination. Methods Enzymol. 276, 523–530 (1997)
Battye, T. G., Kontogiannis, L., Johnson, O., Powell, H. R. & Leslie, A. G. iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr. D 67, 271–281 (2011)
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D 66, 213–221 (2010)
Langer, G., Cohen, S. X., Lamzin, V. S. & Perrakis, A. Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nature Protocols 3, 1171–1179 (2008)
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004)
Urzhumtsev, A., Afonine, P. V. & Adams, P. D. TLS from fundamentals to practice. Crystallogr. Rev . 19, 230–270 (2013)
Roche, E. D. & Walsh, C. T. Dissection of the EntF condensation domain boundary and active site residues in nonribosomal peptide synthesis. Biochemistry 42, 1334–1344 (2003)
Luft, J. R. et al. A deliberate approach to screening for initial crystallization conditions of biological macromolecules. J. Struct. Biol. 142, 170–179 (2003)
Sundlov, J. A. & Gulick, A. M. Structure determination of the functional domain interaction of a chimeric nonribosomal peptide synthetase from a challenging crystal with noncrystallographic translational symmetry. Acta Crystallogr. D 69, 1482–1492 (2013)
Cowtan, K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta Crystallogr. D 62, 1002–1011 (2006)
Ohi, M., Li, Y., Cheng, Y. & Walz, T. Negative staining and image classification - powerful tools in modern electron microscopy. Biol. Proced. Online 6, 23–34 (2004)
Tang, G. et al. EMAN2: an extensible image processing suite for electron microscopy. J. Struct. Biol. 157, 38–46 (2007)
Yang, Z., Fang, J., Chittuluru, J., Asturias, F. J. & Penczek, P. A. Iterative stable alignment and clustering of 2D transmission electron microscope images. Structure 20, 237–247 (2012)
Ikeuchi, H., Meyer, M. E., Ding, Y., Hiratake, J. & Richards, N. G. A critical electrostatic interaction mediates inhibitor recognition by human asparagine synthetase. Bioorg. Med. Chem. 17, 6641–6650 (2009)
Rusnak, F., Faraci, W. S. & Walsh, C. T. Subcloning, expression, and purification of the enterobactin biosynthetic enzyme 2,3-dihydroxybenzoate-AMP ligase: demonstration of enzyme-bound (2,3-dihydroxybenzoyl)adenylate product. Biochemistry 28, 6827–6835 (1989)
Horswill, A. R. & Escalante-Semerena, J. C. Characterization of the propionyl-CoA synthetase (PrpE) enzyme of Salmonella enterica: residue Lys592 is required for propionyl-AMP synthesis. Biochemistry 41, 2379–2387 (2002)
Reger, A. S., Carney, J. M. & Gulick, A. M. Biochemical and crystallographic analysis of substrate binding and conformational changes in acetyl-CoA synthetase. Biochemistry 46, 6536–6546 (2007)
Wilson, D. J. & Aldrich, C. C. A continuous kinetic assay for adenylation enzyme activity and inhibition. Anal. Biochem. 404, 56–63 (2010)
Reuter, D. C., McIntosh, J. E., Guinn, A. C. & Madera, A. M. Synthesis of vinyl sulfonamides using the Horner reaction. Synthesis 2003, 2321–2324 (2003)
Acknowledgements
We thank R. Sanishvili for assistance with data collection. This work was funded in part by National Institutes of Health GM-068440 (to A.M.G.) and GM-115601 (to G.S.), and Award W81XWH-11-2-0218 from the Telemedicine and Advanced Technology Research Center of the US Army Medical Research and Materiel Command (A.M.G.). Data were collected at the GM/CA beamline of the Advanced Photon Source, which is funded by the National Cancer Institute (ACB-12002) and the National Institute of General Medical Sciences (AGM-12006) under Department of Energy contract number DE-AC02-06CH11357 to A.P.S. A Stafford Fellowship (to B.R.M.) and support from the Hauptman-Woodward Institute is acknowledged.
Author information
Authors and Affiliations
Contributions
C.L.A. characterized activity of and initially crystallized AB3403. J.A.S. initially crystallized EntF. E.J.D. and B.R.M. optimized crystal, and solved and refined the models of AB3403 and EntF, respectively. C.S. and C.C.A. designed and synthesized the mechanism-based inhibitor. J.T.T. and G.S. performed and analysed the single-particle electron microscopy. A.M.G., E.J.D., B.R.M., G.S., J.T.T., C.C.A., and C.S. analysed the results and wrote the manuscript. All authors saw and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Extended data figures and tables
Extended Data Figure 1 Structure-based alignment of EntF, AB3403, and SrfA-C.
Condensation, adenylation, PCP, and thioesterase domains are represented with bars in grey, pink, green–cyan, and blue. Conserved motifs and catalytically important residues are highlighted with the same colours, including the HHxxxD motif of the condensation domains, the aspartic acid hinge that separates the N- and C-terminal subdomains of the adenylation domain, the GGHS motif that is the site of pantetheinylation in the PCP, and the catalytic nucleophile of the thioesterase domain. The SrfA-C, AB3403, and EntF proteins share approximately 26% sequence identity. The adenylation and PCP domains are more well-conserved, sharing ~35% identity, whereas the condensation (21%) and thioesterase (25%) domains are less well conserved. Domain boundaries are described in the table below.
Extended Data Figure 2 Substrate specificity of full-length AB3403.
Amino-acid specificity of AB3403 was recorded for all 20 proteinogenic amino acids, as well as 4-chlorobenzoate (4CB) and 4-hydroxybenzoate (4HB). Average values and standard deviations are shown for three replicates with each substrate; results were recorded as micromoles of radiolabelled ATP incorporated per minute per milligram of enzyme. Apparent kinetic constants are also shown for ATP and glycine calculated from duplicate measurements for four to six substrate concentrations.
Extended Data Figure 3 Stereo representations of electron density figures shown in Fig. 2.
To better visualize the active sites and electron density quality, stereo figures are included in the extended data. In all panels, density is shown with coefficients of the form (Fo − Fc) calculated before inclusion of ligands and contoured at 3σ. a, Stereo representation of electron density of AB3403 condensation domain shows the phosphopantethine on Ser1006 approaching His145 within the condensation domain pocket. Inhibitor carbon atoms in green, carbons of residues within 5 Å of inhibitor in grey, nitrogen in blue, oxygen in red, sulphur in yellow, and water in light blue. b, Electron density of the nucleotide binding pocket of AB3403 bound to glycine and AMP. Stereo representation of electron density shows the AMP, glycine, and Mg+ present in the active site of the adenylation domain. Ligand carbon atoms are in green, carbons of residues within 5 Å of inhibitor in grey, nitrogen in blue, oxygen in red, phosphorus in orange, and the Mg+ cofactor in purple. c, Stereo representation of the electron density shows the phosphopantethine on Ser1006 covalently attached to the Ser-AVS inhibitor in the active site of the adenylation domain. Inhibitor carbon atoms in green, carbons of residues within 4 Å of inhibitor in grey, nitrogen in blue, oxygen in red, phosphorus in orange, sulphur in yellow, and water in light blue.
Extended Data Figure 4 Comparison of AB3403 and SrfA-C PCP-condensation domain interaction.
Stereo representation illustrating different orientations of the PCP domains of SrfA-C and AB3403 relative to the condensation domains with which they interact. AB3403 is shown with a white condensation domain and a green-cyan PCP. SrfA-C is shown with a yellow condensation domain and a pale blue PCP. The pantetheine of AB3403 is shown bound to Ser1006. The position of Ser1003, mutated to an alanine residue in SrfA-C, is also highlighted.
Extended Data Figure 5 Comparison of AB3403 thioesterase domain to the functional PCP–thioesterase interaction.
Stereo representation of the thioesterase (blue) domain of AB3403 interacts with the back face of the PCP domain in AB3403. The functional interaction between the EntF thioesterase domain and its holo-PCP, trapped crystallographically, illustrates that the same face of the thioesterase domain interacts functionally (PDB 3TEJ). A 28-residue insertion of AB3403 is coloured yellow.
Extended Data Figure 6 Synthesis of Ser-AVS.
The Ser-AVS probe was synthesized following similar protocols described elsewhere41,46. Garner’s aldehyde 1 was coupled with 2 using LiHMDS to exclusively furnish the (E)-vinylsulfonamide 3. Mitsunobu coupling of 3 with bis-Boc adenosine 4 afforded 5, which was globally deprotected using 80% aqueous trifluoroacetic acid to yield Ser-AVS.
Extended Data Figure 7 Electrophoretic mobility of EntF.
a, Native gel electrophoresis. Lane 1: EntF. Lane2: EntF incubated with fourfold molar excess of Ser-AVS inhibitor. Lane 3: EntF Crystals. Lane 4: novex NativeMark labelled in kilodaltons. b, Denaturing gel electrophoresis using loading buffer with SDS and β-mercaptoethanol. Gel lane 1: EntF. Lane 2: EntF incubated four times with Ser-AVS inhibitor. Lane 3: Life Technologies Mark12 labelled in kilodaltons. The native gel shows the inhibited EntF in a more compact conformation compared with EntF without the inhibitor.
Extended Data Figure 8 Negative-stain electron microscopy analysis of EntF.
a, Raw electron microscopy image of negative-stained EntF. b, Class averages of EntF particles.
Supplementary information
Supplementary Information
This file contains a Supplementary Discussion. (PDF 114 kb)
Rights and permissions
About this article
Cite this article
Drake, E., Miller, B., Shi, C. et al. Structures of two distinct conformations of holo-non-ribosomal peptide synthetases. Nature 529, 235–238 (2016). https://doi.org/10.1038/nature16163
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nature16163
This article is cited by
-
Subdomain dynamics enable chemical chain reactions in non-ribosomal peptide synthetases
Nature Chemistry (2024)
-
Advances in the adenylation domain: discovery of diverse non-ribosomal peptides
Applied Microbiology and Biotechnology (2023)
-
Structural insights into the substrate-bound condensation domains of non-ribosomal peptide synthetase AmbB
Scientific Reports (2022)
-
Bifurcation drives the evolution of assembly-line biosynthesis
Nature Communications (2022)
-
Rational inhibitor design for Pseudomonas aeruginosa salicylate adenylation enzyme PchD
JBIC Journal of Biological Inorganic Chemistry (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.