Introduction

The major limitation for using peptides as affinity reagents, probes and therapeutics is their inherent instability in biological environments. Although the amide backbone is chemically stable, peptides are readily broken down in a matter of seconds in the digestive tract, in blood, plasma, serum and inside cells due to the presence of proteases1. Because of this instability, many routes have been devised to chemically alter or modify natural peptides such as 1) including the addition of N-methylation to the backbone2, 2) insertion of β-amino acids3,4, 3) changing the location of the side chain (e.g., peptoids5,6), or 4) by covalent cyclization via insertion of chemical bridges7,8.

Left unanswered by these studies is the question of how much a functional peptide with natural amino acids could be stabilized by sequence optimization alone. Previously, we demonstrated a route to create high-diversity cyclic peptide libraries via mRNA display9,10 and used this approach to isolate a high affinity binder termed cycGiBP to the signaling protein Gαi1-GDP11. cycGiBP showed a very high affinity (Kd = 2.1 nM) and ~3-fold increase in protease resistance as compared with the corresponding linear sequence. One observation in that work was that only one of the three possible cleavage sites predominated when cycGiBP was subjected to chymotrypsin. Using the standard substrate notation for proteases (P3-P2-P1-P1′-P2′-P3′; P's represent the amino acid identity at a position and the scissile bond is located between P1 and P1′ residue), chymotrypsin has a strong P1 preference for W > Y > F L12. However, cycGiBP shows cleavage at P1 = Y5, but not at P1 = W4 or F711.

We wondered if there were other members in that library that had improved protease resistance while retaining binding function. To address this issue, we performed a dual selection for chymotrypsin resistance and binding function on a library previously only sieved for binding function. Those experiments resulted in a several highly protease resistant peptides and indicate that sequence optimization can improve hydrolytic stability by as much as 400-fold compared to peptides sequences isolated without this selective pressure.

Methods

E. coli expression of Gαi1-GDP and mRNA display

Gαi1-GDP with a C-terminal BirA tag was expressed and purified as previously described13. mRNA display selection targeting Gαi1-GDP was performed starting at round 7 of our previous work and performed as described11 with the modification that the cDNA fusions were digested with 2 mg of immobilized chymotrypsin (Sigma-Aldrich) per 106 cpm of fusions at room temperature for 15 minutes in 50 mM Sodium Phosphate buffer (pH = 8.0). The chymotrypsin beads were removed by centrifugation through 0.45 μm filters before the selection step.

Peptide synthesis

R6A (MSQTKRLDDQLYWWEYL), Biotin-labeled R6A (Bio-MSQTKRLDDQLYWWEYL), cycPRP-1 (MITWIDFISPSK), cycPRP-2 (MTWFEYLSGSK), cycPRP-3 (MTWFEFLSSTSK) and cycGiBP (MITWYEFVAGTK) were synthesized on Rink Amide AM Resin LL (Novabiochem) and cyclized using DSG as described by Millward et al11. After the reaction the cyclized peptides were purified via C18 HPLC and the mass confirmed by MALDI-TOF MS.

Binding constant determination

Binding constants were determined relative to the R6A peptide [Kd = 60 nM]14 by equilibrium competition using 35[S]-Met radiolabeled Gαi1-GDP and biotinylated R6A as previously described10 and the data analyzed using GraphPad Prism 5.0.

Protease resistance

Peptides (250 nmol of peptide in DMSO) were added to 50 mM sodium phosphate buffer (pH 8.0) with a final DMSO concentration of 2% (v/v). Sixty units of immobilized chymotrypsin (Sigma Aldrich) were added and allowed to incubate at room temperature. Aliquots were taken at various time points and subsequently filtered. The aliquots were then injected onto a C18 reverse phase column and separated by a gradient elution from 15 to 90% B in 25 minutes. Solvent A consisted of 0.1% (v/v) TFA in water and solvent B contained CH3CN with 0.1% (v/v) TFA. The area under the starting material peak was quantitated using the 32 KaratGold Software package (Beckman). The graph was generated by fitting the data to a one phase exponential decay equation (GraphPad Prism 5.0). The mean and the standard error are reported in table 1.

Table 1 Chymotrypsin resistance of linear and cyclic

Km and Vmax determination

The peptides were prepared and characterized as described above in the protease resistance experiment. Only a single 2 minute time point with varying concentrations of the peptides (0, 5, 11, 22, 65 and 260 μM) was analyzed using Michaelis-Menten enzyme kinetics regression equation (GraphPad Prism 5.0). The mean and the standard error are reported in 2table 3.

Table 2 Km and Vmax enhancements for GiBP and PRPs
Table 3 Stability of linear and cyclic peptides in human serum

Human serum digests

Lyophilized human serum (Thermo Scientific) was reconstituted by adding 2 mL ddH2O to each 5 mg vial from the manufacturer. Each reaction contained 250 nmol of peptide, 50 μL of sodium phosphate buffer (pH 8.0) and 10% DMSO (v/v). One milliliter of reconstituted serum was added to each sample and incubated at 37°C. For each time point, 100 μL aliquots were taken from the reaction and quenched in 300 μL of acetonitrile. These quenched samples were centrifuged to separate precipitate and the supernatant was diluted in water to 1.5 mL. Samples were then analyzed via HPLC as described above. The mean and the standard error are reported in table 3.

Circular dichroism spectroscopy (CD)

Far UV-CD spectra were obtained using a Jasco J810 spectropolarimeter (located at the USC NanoBiophysics Core Facility) equipped with a Peltier device. The peptides (25–100 μM) were prepared in 10 mM phosphate buffer at pH 7.4 and placed in a 1 mm path length cuvette. Thereafter, CD spectra were recorded in the range of 195–240 nm. Five spectra were acquired and averaged. Spectra were baseline corrected by subtracting blank spectra of the corresponding solutions without peptide and ellipticities were converted to mean residue molar ellipticities in degrees cm2 dmol−1.

Results and Discussion

Previously, we created a trillion-member mRNA display cyclic peptide library with the form MXXXXXXXXXK (termed MX10K) and used seven rounds of selection to isolate cyclic peptides that bind to the signaling protein Gαi1-GDP11. The best peptide from that selection was cyclic GiBP (cycGiBP), a specific, 12-residue cyclic peptide with a Kd = 2.1 nM. Additionally, we found that cycGiBP was ~3-fold more resistant to chymotrypsin as compared to a linear version of the peptide11.

In an effort to see if protease-resistant, natural-sequence peptides could be found, we took the Pool 7 library from our original experiment (Figure 1a) and subjected it to a two-step selection protocol (Figure 1b). After cyclization with DSG, the library of mRNA peptide fusions was first subjected to degradation by immobilized chymotrypsin for 15 minutes at room temperature and then selected for binding to the target of interest, here Gαi1-GDP. We chose the 15 minute digestion time because preliminary experiments showed that more than 90% of the library was degraded under these conditions (data not shown) and our previous results indicated that cycGiBP should have been similarly degraded11. We reasoned this incubation should thus provide sufficient selection pressure to reveal protease-resistant sequences. After three rounds of selection, representative clones from Pool 10 were sequenced (Figure 1c).

Figure 1
figure 1

mRNA Display selection for chymotrypsin resistance.

(a) Pool 7 of the cyclic peptide library (MX10K) targeting Gαi1-GDP was used as the starting point for the selection. (b) In rounds 8–10, the library was cyclized and subjected to chymotrypsin degradation and binding selection. (c) Representative clones from Pool 10 were sequenced (see supplemental information) and peptides cycPRP-1, cycPRP-2 and cycPRP-3 were further characterized.

We then chose three sequences from pool 10 for further characterization. We term these molecules cycPRPs for Cyclic Protease Resistant Peptides (cycPRP-1, cycPRP-2 and cycPRP-3). The cycPRP sequence consensus in Pool 10 is relatively similar to the consensus sequence seen in Pool 7 and also in other Gαi1-GDP binding peptides. The core sequence of the cycPRPs (TWIDFI, TWFEYL, TWFEFL) is very similar to cycGiBP (TWYEFV)10, R6A (YWWEYL)14, KB-752 (TWYDFL)15, the GSP peptide (TVWEFL)13 and the AR6-05 peptides (YWWEFL)16 selected by both our lab and others. The fact that the sequence consensus is similar suggests that the cycPRP peptides bind to Gαi1-GDP and proteolysis did not destroy binding. Additionally, all previously selected peptides bind near switch 2 of Gαi1-GDP and based on this observation, we hypothesize that the cycPRPs bind to the same site on Gαi1-GDP.

To verify that the cycPRPs bind to Gαi1-GDP as well as determine binding constants, the cycPRPs were initially assessed by in vitro pull-down with immobilized Gαi1-GDP. These data indicated that all three molecules were functional, with cycPRP-3 being the best binder followed by cycPRP-1 and cycPRP-2 (data not shown). Consistent with this view, we determined the Kd values of cycPRP-3 to be 9 ± 2 nM and cycPRP-1 to be 90 ± 20 nM for Gαi1-GDP. This binding is excellent compared to KB-752 (Kd = 3.9 uM) and on par with to cycGiBP (Kd = 2.1 nM), raising the possibility that the dual selection optimized the product of protease resistance and binding affinity, rather than one or the other.

The fact that the sequences retained a large fraction of aromatic residues in the core motif was somewhat surprising given the strong preference for chymotrypsin to cleave the backbone adjacent to tryptophan, tyrosine, or phenylalanine16,17. Indeed, our initial prediction was that the chymotrypsin selection would result in sequences lacking aromatic residues that still retained binding function, which clearly did not occur. Based on the primary sequence, cycPRP-1, cycPRP-2 and cycPRP-3, each have 2-3 chymotrypsin cleavage sites (Figure 2), essentially the same as peptides isolated originally without protease challenge11. Initial HPLC/MALDI analysis of chymotrypsin-cleaved peptides revealed that only one cut site was predominant in each of the cycPRP sequences, similar to the result observed originally for cycGiBP. Interestingly, the major cut site varies, depending on the sequence–occurring between Y-E in cycGiBP, W-I in cycPRP-1 and W-F in cycPRP-2 and cycPRP-3 (Figure 2).

Figure 2
figure 2

Theoretical (red) and observed (blue) chymotrypsin digest sites of peptides that bind Gαi1-GDP.

cycGIBP is from Pool 7 of the original selection for function only, while the three cycPRP peptides (cycPRP1-3) are from Pool 10 of the protease/binding selection. Red arrows denote theoretical digest sites and blue arrows show the actual digest sited as determined by MALDI-TOF.

We next determined the half-life of the cycPRPs as substrates for chymotrypsin, under conditions similar to the chymotrypsin digestion during the selection (Table 1). The linear GiBP peptide provides a reference point to gauge improvements in protease resistance in this series–and we can use this fixed point to address the effects of protease challenge, cyclization, as well as comparing variations in the primary sequence. Regarding the overall effect of the selection, we observed a 100–200 fold improvement in linear PRPs resistance and a 100–300 fold increase of cycPRPs resistance to chymotrypsin, as compared with linear GiBP. The improved half-life of the PRPs (t1/2 = 26 to 90 min) is also consistent with the peptides being able to withstand the 15 minute chymotrypsin digestion largely intact. Regarding cyclization, it is clear that cyclization improves chymotrypsin resistance for cycGiBP, but for the protease resistant peptides, these gains are either quite modest (2-fold for cycPRP-1) or non-existent (cycPRP-3). The data thus indicate that cyclization is not the primary driver of chymotrypsin resistance in peptides selected using this protease challenge.

There are no obvious variations in the primary sequence that would indicate the cycPRPs have improved chymotrypsin resistance as compared with cycGiBP. Put another way, there is no sequence model of which we are aware that predicts the cycPRPs should be more protease resistant than cycGiBP. Amongst the cycPRPs, the number of aromatic amino acids is almost identical to cycGiBP and there are no systematic alterations in the number of beta-branched amino acids, prolines, glycines, or other specific residues near or adjacent to the cut sites. Indeed, at least one statistical model predicts the W-F dipeptide seen at the P1-P1′ position in cycPRP-2 and cycPRP-3 would be particularly susceptible to cleavage16. Previous kinetic work with model substrates indicates that W at P1 is a markedly better substrate than Y or F (rank order P1; W > Y > F)12. In this view, cycGiBP could be seen as anomalous, since the primary cleavage site occurs between Y and E, even though there is a W-Y dipeptide step present.

Our work here differs from model kinetic studies in that each of our substrates contains multiple possible cleavage sites. On the other hand, exhaustive analysis and alignment of many model substrates might be expected to reveal the importance of residues at other positions in the chain to the overall cleavage rate. Schellenberger et al17., constructed exactly such a QSAR model based on available kcat/Km data and found that kcat/Km depends on the residue identity from P3 to P2′. Including residues outside this window did not improve the predictive power of the model and the primary determinant at the P2′ position was whether the residue was proline or not. While this model has several caveats (it treats the effect each position as independent and additive), the overall observations are relevant here. Most notable is the predicted variation in cleavage based on residue identity at each position. For P3 and P2, the model predicts variation in kcat/Km of 40-fold (A > R > G, D > P, K) and 10-fold (L,V > P > A > G) respectively. For P1, the model predicts a relatively small variation in Y, W and F (2-fold) and for P1′ predicts a 20-fold variation (F, A > L, V, G).

In order to quantitatively compare our peptides with previous model of chymotrypsin substrates, we determined kcat/Km values for cycGiBP, cycPRP-1, cycPRP-2, cycPRP-3 (Table 2). The kcat/Km value for cycGiBP is 1.1 × 105 sec−1 M−1 which is in line with model amide bonded substrates17,18, supporting the view that cycGiBP acts as normal substrate for the enzyme. Interestingly, the increased protease resistance of the PRP peptides originates from changes in both kcat and Km–with substrates showing 8–16 fold decreases in binding and 8–50 fold decreases in catalytic cleavage rates. cycPRP-2 and cycPRP-1 have the smallest kcat/Km values, reduced 420-fold and 350-fold as compared to cycGiBP. This reduction is remarkable because it substantially exceeds what would have been expected if all the aromatic residues had been converted to leucine16. cycPRP-3 shows a 110-fold reduction in kcat/Km compared to cycGiBP, but is also the highest affinity binder. This data also argues that the selection optimized the product of the binding affinity and the reduction in kcat/Km, such that survival represents a compromise between these selected traits.

Previous observations have demonstrated that, at least with proteins, protease resistance can result from differences in protein structure19,20. While peptides are generally unstructured in solution, a possible mechanism that could reduce the cleavage rate of the PRPs would be if the peptides had significant levels of secondary structure. In this model, only the unfolded form of the peptide would be available for binding and cleavage by the enzyme. This model would not account for changes in kcat, but could contribute to changes in Km by decreasing the fraction of free peptide available. To test this, we took the CD spectrum of both the linear and cyclic versions of GiBP, cycPRP-1, cycPRP-2 and cycPRP-3 in phosphate buffer at 10°C (Figure 3) and at 60°C (see supplemental information). The reduced temperature was used in order to maximize structure formation, as peptide folding transitions are often quite broad due to their modest enthalpies of formation21,22. The largest structure difference between linear and cyclic molecules is seen for cycGiBP (Figure 3a) and very small differences are seen for cycPRP-1, cycPRP-2 and cycPRP-3 (Figure 3b–d). These spectra do not conform simply to established model helix, sheet, or coil spectra23. The lack of a minima near λ = 222 nm indicates the peptides are not helical and the overall spectra look most similar to the anti-parallel beta sheet structure23. However, the spectra also are similar to unstructured peptides and show little changes upon heating the samples to 60°C (see supplemental information). The CD spectra and lack of temperature dependence are most consistent with both the linear and cyclic peptides lacking regular secondary structure, although they do not eliminate non-canonical folded forms.

Figure 3
figure 3

CD spectra of linear and cyclic versions of GiBP, PRP-1, PRP-2 and PRP-3.

All spectra were taken at 10°C in 10 mM phosphate buffer pH 7.4 with linear peptides are shown in blue and cyclic peptides in red. (a) GIBP. (b) PRP-1. (c) PRP-2. (d) PRP-3.

The lack of peptide structure and the relatively small importance of cyclization imply that the majority of the protease resistance results from sequence effects that alter the efficiency of these peptides as chymotrypsin substrates. This could be because the peptides are optimized to be poor chymotrypsin substrates or because of more general changes that make the peptides poor protease substrates. To test this, we incubated cycGiBP, cycPRP-1 and cycPRP-3 in human serum containing active proteases and peptidases and examined the half-life of the molecules (Table 3).

Serum generally degrades peptides very quickly24 due to the presence of multiple proteases with differing specificity (e.g., thrombin, plasmin and kallikrein). In line with that view, linear GiBP is undetectable at the first time point after incubation (t1/2 < 1 min) and cycGiBP shows a half-life of ~20 minutes. Surprisingly, cyclization of the PRPs gives similar dramatic improvements in stability against serum digestion with increases of 20- to 130-fold for cycPRP-1 and cycPRP-3 respectively. This difference is consistent with serum aminopeptidase, which can digest linear peptides, but whose activity is blocked when the peptides are cyclized25. The fact that the cyclic PRPs show dramatically improved stability against digestion by both chymotrypsin and the proteases present in serum, even though the peptides were never exposed to serum proteases during the selection, argues that the peptides have been optimized in a general way to resist proteolytic cleavage. This optimization is somewhat idiosyncratic, as the most stable serum peptide (cycPRP-3) is the least stable of the series against chymotrypsin alone.

Conclusions

Our two-step selection protocol (mRNA display using protease challenge and binding selection) demonstrates that it is possible to isolate highly functional natural peptides that have the ability to bind a target of interest and that are resistant to proteases. This procedure results in stability increases ranging from 100- to 400-fold. Mechanistically, the protease resistance results from both a lower rate of cleavage (kcat) and a weaker interaction with the enzyme (Km). Surprisingly, the protease resistant peptides do not contain dramatic sequence changes compared with non-protease resistant molecules. Only a few changes are needed to increase proteolytic stability, with 4–6 relatively conservative amino acid changes (out of 10 positions) needed to convert a peptide susceptible to protease into one resistant to protease. These changes are not predictable with existing heuristics/models, do not remove key residues that are substrates for the protease (here W, Y and F) and do not appear to increase the ordered structure of the molecules. Rather, the changes appear to affect endopeptidase cleavage generally, perhaps through local conformational biases. Nonetheless, these results are exciting because they provide a route to create protease-resistant homologs of biologically-active peptides for use as reagents, diagnostics and therapeutics.