Massively parallel measurement of protein–protein interactions by sequencing using MP3-seq

Baryshev, Alexandr; La Fleur, Alyssa; Groves, Benjamin; Michel, Cirstyn; Baker, David; Ljubetič, Ajasja; Seelig, Georg

doi:10.1038/s41589-024-01718-x

Download PDF

Article
Open access
Published: 27 August 2024

Massively parallel measurement of protein–protein interactions by sequencing using MP3-seq

Nature Chemical Biology (2024)Cite this article

6063 Accesses
28 Altmetric
Metrics details

Subjects

Abstract

Protein–protein interactions (PPIs) regulate many cellular processes and engineered PPIs have cell and gene therapy applications. Here, we introduce massively parallel PPI measurement by sequencing (MP3-seq), an easy-to-use and highly scalable yeast two-hybrid approach for measuring PPIs. In MP3-seq, DNA barcodes are associated with specific protein pairs and barcode enrichment can be read by sequencing to provide a direct measure of interaction strength. We show that MP3-seq is highly quantitative and scales to over 100,000 interactions. We apply MP3-seq to characterize interactions between families of rationally designed heterodimers and to investigate elements conferring specificity to coiled-coil interactions. Lastly, we predict coiled heterodimer structures using AlphaFold-Multimer (AF-M) and train linear models on physics-based energy terms to predict MP3-seq values. We find that AF-M-based models could be valuable for prescreening interactions but experimentally measuring interactions remains necessary to rank their strengths quantitatively.

A reference map of the human binary protein interactome

Article 08 April 2020

Transcriptome-wide high-throughput mapping of protein–RNA occupancy profiles using POP-seq

Article Open access 13 January 2021

Quantification of extracellular proteins, protein complexes and mRNAs in single cells by proximity sequencing

Article 01 December 2022

Main

Synthetic protein binders mediating interactions between other proteins or cells can potentially revolutionize fields ranging from cell therapy¹ to synthetic biology^2,3,4,5 and material science^6,7,8. Early work often relied on natural interaction domains such as SH3 and PDZ (ref. ⁹). However, such components provide a poor starting point for rationally designing large-scale assemblies because of crosstalk. Consequently, synthetic protein circuits remain much smaller and simpler than biological protein–protein interaction (PPI) networks. To scale up synthetic protein-based circuits, we need large-scale libraries of modular interaction domains. Ideally, these interaction domains would be orthogonal, whereby each domain interacts only with its designated binding partner.

Fulfilling this need through rational heterodimer design has yielded promising results but creating large, totally orthogonal sets of dimers remains challenging. Early rational design work focused on coiled-coil dimers (1×1s); 1×1 coil binding is primarily determined by complementary electrostatic and hydrophobic interactions at specific heptad positions. Because the biophysical rules guiding these interactions are relatively well understood and can be captured by predictive models^10,11, orthogonal sets of up to six 1×1 heterodimers have been generated^12,13. Recently, a set of orthogonal 1×1s (primarily homodimers) was identified in a high-throughput screen¹⁴. However, the restricted geometry required for 1×1 coil interactions limits the number of possible orthogonal interactions. The Baker lab expanded the coiled-coil toolbox by introducing helical bundle heterodimers, with protomers consisting of two helices connected by a hairpin loop, where interactions are determined by designed hydrogen-bond networks (HB-nets) between the bundles (2×2s)^15,16. This multihelix bundle approach could pave the way to generating larger orthogonal sets. However, reliably minimizing off-target interactions (for example, because of proteins associating in an unexpected register or orientation) remains challenging as these are not captured in the biophysical design objective to produce 2×2s. A practical alternative is to prepare diverse de novo 2×2 protomer libraries and measure an all-by-all matrix of interactions. Afterward, an orthogonal set can be extracted¹⁷.

Many PPI measuring methods have been developed that could be used to run all-by-all PPI screens with varying throughput levels. Mass-spectrometry-based and protein-array-based methods can measure PPIs but require laborious protein purification steps^18,19. Yeast-display or phage-display methods use next-generation sequencing technologies to increase throughput but are limited to ‘several-versus-many’ screening²⁰. Alpha-seq, a recent high-throughput method that takes advantage of the yeast mating pathway, overcomes this limitation and allows library-on-library screening²¹. However, throughput is still limited and not all proteins correctly fold when displayed on the yeast surface.

Yeast two-hybrid (Y2H) methods are a powerful alternative to surface display for characterizing PPIs²². In Y2H, one protein is fused to a DNA-binding domain (DBD) and the second to a transcriptional activation domain (AD). If the proteins interact, a functional transcription factor is reconstituted and drives the expression of a growth-essential enzyme. Early Y2H approaches tested small numbers of PPIs using plate-based selection but lab automation and pooling strategies^23,24 have enabled proteome-scale screens. To further address Y2H scaling issues, high-throughput Y2H (HT-Y2H) and enzyme complementation methods have been developed leveraging next-generation sequencing to read out interaction strength^{25,26,27,28,29,30,31,32,33,34}. Concomitantly with experimental methods, custom workflows for analyzing HT-Y2H data were developed^35,36. Most of the experimental methods require library construction in Escherichia coli or rely on yeast mating and, thus, need separate protein libraries to transform MATα and MATa yeast. High-throughput bacterial two-hybrid assays have been developed, which provide a noneukaryotic alternative for screening PPIs that avoid yeast mating and library transfer¹⁴. However, when testing protein interactions, it can be desirable to have an environment similar to their native context. Therefore, using Y2H-based assays to test interactions of naturally intracellular proteins with eukaryotic origins can be beneficial. Specifically, expressing such proteins in yeast allows for more accurate folding, the introduction of post-translational modifications and increased solubility when compared to bacterial expression^35,36,37.

Deep learning models such as AlphaFold2 (AF2)³⁸ and RoseTTAFold³⁹ have been shown to predict protein structures with near-experimental accuracy. Structure prediction for proteins with multiple chains is possible, either with the original AF2 model or with specialized models such as AF-Multimer (AF-M)³⁸. Given the speed at which these models operate compared to running Y2H assays, determining how these models can be used to prescreen PPIs or supplement PPI assays is attractive. Tools have been developed to run AF-M in an all-by-all manner to assist with PPI prescreening⁴⁰ and AF-M error metrics have been demonstrated to be state-of-the-art protein–peptide interaction predictors⁴¹. However, it remains unclear how well AF-M can predict protein binding orthogonality.

Here, we introduced MP3-seq, a massively parallel Y2H workflow for measuring PPIs using sequencing. In MP3-seq, the identity of each protein is encoded in a DNA barcode and the relative barcode pair abundance before and after a selection experiment serves as a proxy for interaction strength. Plasmids are assembled in yeast through homologous recombination encoding the protein pairs of interest, their associated barcodes and all other elements required for Y2H experiments. Our workflow bypasses the need for plasmid cloning in E. coli or yeast mating and proteins fold and interact inside the cell instead of on the surface. In contrast to bacterial assays, MP3-seq also works with glycosylated proteins. We validate MP3-seq using well-characterized coiled-coil heterodimer interactions and synthetic binders for different human B cell lymphoma 2 (Bcl-2) protein family members. Then, we apply MP3-seq to characterize interactions between rationally designed 2×2 and 1×2 heterodimers, demonstrating that MP3-seq can measure over 100,000 PPIs in a single experiment. We identified successful designs and used a greedy algorithm to find potentially orthogonal subsets. We delved into the elements expected to confer 2×2 interaction specificity by screening variations of a successful 2×2 pair. Lastly, we predict complexes for coiled-coil dimers with AF2 and AF-M and assess the ability of these models to predict interactions and orthogonality. We then use AF-M error values and physics-based structure energy terms from Rosetta to train simple models to predict MP3-seq results to investigate how complex predictors could supplement high-throughput measurements.

Results

MP3-seq workflow

In MP3-seq, all molecular components required for measuring interactions between two specific proteins are encoded on a single plasmid (Fig. 1a). A plasmid library is constructed directly through homologous recombination in yeast to measure all possible interactions for a set of proteins. Haploid MATa-type yeast is first transformed with a mixture of DNA fragments. One fragment is a backbone carrying a centromere (CEN) sequence, the selection marker, a DBD and an AD. We use the Cys(2)His(2) zinc-finger domain of the mouse transcription factor Zif268 (ref. ⁴²) as the DBD and its corresponding promoter to drive the expression of the growth-essential enzyme His3. The herpes simplex virus-derived protein domain VP16 is used as the AD. Additional fragments contain one protein of interest and its associated barcode separated by a terminator sequence. Because of their short length and distinct sequences, we used Tsynth23 and Tsynth27 as terminators in most experiments⁴³. A simplified map of the plasmid can be seen in Supplementary Fig. 1.

After transformation, we perform an extended outgrowth and selection step in medium without tryptophan to ensure plasmid maintenance. At this point, the CEN sequence ensures each yeast cell (and any subsequent daughter cells) will contain approximately one library plasmid, which is critical to link the growth of transfected yeast to barcode counts⁴⁴. An aliquot of the cells is frozen, while the remainder undergo selection in medium lacking histidine. Plasmid DNA is extracted from His preselection and postselection cells and the barcode-containing regions are amplified (Fig. 1a, right). The barcode–barcode amplicons are sequenced and their relative enrichment can be calculated from barcode counts to serve as a proxy for interaction strength (Fig. 1b). The expression or folding of some proteins may be impacted by fusion with either the AD or the DBD; therefore, we test all proteins in two fusion orders (DBD fused to P1 and AD fused to P2; DBD fused to P2 and AD fused to P1).

MP3-seq analysis pipeline

First, we calculate the enrichment between the His preselection and postselection stages for PPI fusion order (Fig. 1b). Next, we detect autoactivators and replace their His postselection values with values calculated from the nonautoactivating fusion order (see Autotune in Methods); autoactivation is an error mode in Y2H experiments where high enrichment is observed for a protein for all interaction partners, suggesting nonspecific activation of selection marker expression⁴⁵. Typically, this behavior is observed fused to either the AD or the DBD but not in both orientations. If an autoactivator is found, its homodimer is not recoverable with Autotune, as there is no reverse fusion order to use for correction.

Following autoactivator removal, enrichment values from replicate experiments can be averaged together to obtain interaction strength values. Alternatively, to correct for variation in the read count distribution between experimental replicates resulting from different sequencing depths, selection times and other experimental factors, we calculate the log₂ fold change (LFC) using the DESeq2 package⁴⁶. DESeq2 calculates differential enrichment across multiple replicates and provides Hochberg-adjusted Wald test P values (P_adj), identifying PPIs with LFCs significantly different from an LFC of zero.

In some cases, combining MP3-seq measurements for both fusion orders is desirable (for example, if comparing MP3-seq LFCs to K_d values collected for the pair of interest). We treat each fusion order as an independent set of measurements, turning them into two pseudoreplicates. These pseudoreplicates are combined with DESeq2 to calculate LFCs (Fig. 1b). We refer to these values as pseudoreplicate LFCs (P-LFCs).

MP3-seq benchmarking with orthogonal coiled-coil dimers

To validate MP3-seq, we screened 144 pairwise interactions between six orthogonal 1×1s in the National Institute of Chemistry Peptides (NICP) set¹³ (Fig. 2a). A pool of 24 DNA fragments with each of the 12 proteins fused to either the AD or the DBD was ordered. All interaction pairs were assembled in a pooled experiment and interaction strengths were quantified with MP3-seq. As expected, His preselection barcode counts remained steady over time (Supplementary Fig. 2a) and applying different levels of 3-amino-1,2,4 triazole (3-AT), a competitive inhibitor of His3, affected His postselection counts (Supplementary Fig. 2b–d). LFCs for both orientations were calculated from five experimental replicates, with interactions occurring almost exclusively between designed (on-target) partners (that is, P1:P2, P3:P4, etc.; Fig. 2b). Homodimer plasmid assembly was less efficient in MP3-seq and generally resulted in lower input barcode counts but coverage was sufficient for inclusion in the analysis (Supplementary Fig. 2e). This phenomenon is likely because of increased sequence homology interfering with plasmid assembly. For a more quantitative validation, we correlated MP3-seq LFCs with luciferase expression assay interaction scores in HEK293T cells¹³ and found good agreement (r² = 0.74; Fig. 2c). Figure 2d represents P-LFC MP3-seq data for these PPIs as a graph, where each protein is a vertex and edges are significant interactions weighted by P-LFC.

**Fig. 2: Validation of MP3-seq with coiled-coil heterodimers.**

In a separate all-by-all experiment with 28 proteins, we screened the NICP 1×1s and two sets of 1×1s derived from them (N1, N2, N5–N8 and P5A–P8A) but with increased thermodynamic stability of their on-target interactions⁴⁷. In this experiment, we expect related coils to have similar interaction patterns (for example, P5A can bind to P6A, N6 and P6), which agrees with the interaction graph created from significant MP3-seq P-LFCs in Fig. 2e. The full interaction graph (Supplementary Fig. 3a) showed minimal off-target crosstalk between NCIP proteins and their variants. We found low correlation with melting temperatures collected for the interactions but good correlations with split luciferase and split transcription factor interaction assays⁴⁷ (Supplementary Fig. 3b–d).

To assess whether MP3-seq values provide quantitative information about interaction strength, we performed three all-by-all replicates for interactions between varying-length 1×1s designed to span a wide range of K_d values⁴⁸ (Fig. 2f). These interactions are not expected to be orthogonal as weaker interactions were achieved by truncating two parent four-heptad binders (A_N⁴, B_N⁴) by a half or full heptad⁴⁸ (Fig. 2g). The MP3-seq P-LFCs correlated very well with previously measured dissociation constants over approximately three orders of magnitude (r² = 0.94; Fig. 2h).

We explored using minimum read count filtering at different thresholds and found that filtering brought minimal or no changes to correlation with the reported K_d values. However, for the NICP series, filtering did improve correlation (particularly for Spearman’s rho) (Supplementary Fig. 4a,b). Lastly, to examine the performance for weaker interactions, we compared the correlation of enrichment and P-LFC values with the K_d values for the five weakest interactions⁴⁸. Enrichments had a slightly better ability to order weak interactions than P-LFCs, suggesting that a non-DeSeq2 analysis may be better suited for the high-throughput study of weak PPIs (Supplementary Fig. 4c).

MP3-seq benchmarking with Bcl-2 family binders

To validate MP3-seq outside of 1×1 interactions, we tested a set of proteins previously characterized by biolayer interferometry⁴⁹ and Alpha-seq²¹ composed of six homologous proteins from the Bcl-2 family (Bcl-2, Bcl-X_L, Bcl-w, Mcl-1, Bfl-1 and Bcl-B) and nine de novo designed inhibitors of said homologs. A crystal structure of a synthetic binder bound to its target is shown in Fig. 3a, while Fig. 3b shows the domain organization of the six human proteins. All inhibitors were designed to interact with the BH3 domain-binding pocket. To better compare with the surface-display-based HT-Y2H method Alpha-seq, a truncated version of Mcl-1 was used.

**Fig. 3: Validation of MP3-seq with Bcl-2 proteins.**

A two-replicate MP3-seq screen of all Bcl-2 homologs against all inhibitors is shown in Fig. 3c. Version 1 of MP3-seq was used for the first replicate of this experiment. Unlike version 2, barcodes were inserted into the 3′ untranslated regions of the binders upstream of the terminators and could, thus, differentially impact mRNA stability and protein levels. However, the inter-replicate correlations suggested consistent interactions between versions (Supplementary Fig. 5a). Only binders beginning with α corresponded to the final, specific designs, while the others were intermediate or failed designs. This can be seen in Fig. 3c, with a divide between the largely orthogonal rightmost four columns of the heat map of MP3-seq P-LFCs and the leftmost less specific columns.

Our data agree well with dissociation constants obtained from biolayer interferometry⁴⁹ (r² = 0.61; Fig. 3d). We found good agreement with Alpha-seq percentage survival for their low-throughput pairwise and high-throughput batched assays on the same interactions (batched r² = 0.45, paired r² = 0.61, n = 43; Supplementary Fig. 5b–e). MP3-seq interactions are measured with proteins expressed in yeast, while biolayer interferometry uses purified proteins and Alpha-seq displays proteins on the yeast surface, partially explaining the variation between our results and those published earlier.

Some Bcl-2 inhibitors failed to produce K_d values when measured with biolayer interferometry, likely because the interactions were below detection limits. We examined the undetected (n = 11) versus detected interactions (n = 43) and found that the mean MP3-seq value of PPIs that biolayer interferometry detected was significantly greater than those that were undetected (one-tailed independent t-test, H₁: µ_detected > µ_undetected, t = 4.51, P = 1.858 × 10⁻⁵; Fig. 3e). Pairwise Alpha-seq also had a significantly greater mean of detected interactions than undetected ones (H₁: µ_detected > µ_undetected, t = 1.91, P = 0.0307). However, high-throughput batched Alpha-seq did not exhibit this behavior (H₁: µ_detected > µ_undetected, t = 1.36, P = 0.0904). MP3-seq correlation improved for batched but not paired Alpha-seq when including all data points (batched r² = 0.45, paired r² = 0.69, n = 54; Supplementary Fig. 5b–e) (Fig. 3f–g). Together, these results show that MP3-seq can work with globular proteins in addition to coils and that MP3-seq results agree with those obtained by Alpha-seq and biolayer interferometry.

Lastly, we screened a set of intrinsically disordered p53 upregulated modulator of apoptosis (PUMA) peptides that bind to Mcl-1 at the BH3 pocket (Fig. 3b). Mcl-1 binding results in the disordered PUMA BH3-binding motif transitioning to an ordered helix⁵⁰. Peptides of BH3 motif mutants were used to investigate the effects of helicity degree on this intrinsically disordered protein-dependent interaction. We found a good correlation between MP3-seq and earlier stopped-flow fluorescence measurements of the K_d values for PUMA peptides interacting with the full Mcl-1 protein (r² = 0.66; Fig. 3h).

Large-scale assay of designed heterodimers (DHDs)

To explore the feasibility of using MP3-seq for large-scale screening, we prepared a library of designed heterodimers (DHDs)¹⁶ (Supplementary Methods). On-target pairs were designed as three-helix or four-helix bundles with buried HB-nets connecting all helices and then split into a helix–turn–helix hairpin and a single helix (1×2s) or 2×2s. Initially, we chose 100 designs for testing (DHD1; Fig. 4a), adding 52 more designs in a second round of experiments (DHD2; Fig. 4a). As controls, we selected nine previously tested 2×2s¹⁶. We also included 35 pairs of binders derived from these controls by modifying the hairpin loops or truncating the helices (DHD0; Fig. 4a). Finally, we added a series derived from a common parent 2×2 through truncating heptads (mALb; Fig. 4a). Sequences for DHD0–DHD2 and mALb truncations can be found in Supplementary Table 1. For each on-target pair, one protomer is designated ‘A’ and the other is designated ‘B’. Interactions between and within these groups were tested in MP3-seq experiments of varying sizes. We performed two replicates of an all-by-all screen including DHD0, DHD1, DHD2 and mALb proteins, as well as the Bcl-2 homologs and their binders, resulting in a matrix of 337 × 337 = 113,569 interactions. We performed three replicates of a screen using only DHD1 with a subset of DHD0 and one with DHD0, DHD2 and mALb. MP3-seq version 1 was used for most experiments in this section. Nevertheless, the experiments highlight the large scales achievable with MP3-seq and the inter-replicate correlations suggested that the data were internally consistent (Supplementary Fig. 6).

**Fig. 4: High-throughput screening of DHDs.**

We applied autotune corrections and calculated P-LFCs for all possible pairs. On-target interactions, where the A and B protomers were designed to interact, were evaluated for success. In this case, we defined a ‘success’ as any PPI with P_adj ≤ 0.01 and P-LFC ≥ 4. The number of successes out of the total possible on-target PPIs in each design set can be seen in Fig. 4b. Every design set had an approximately 20% success rate in the largest screen, consistent with past 22% success rates for α-helical bundles⁵¹. The top five P-LFC on-target interactions for each design category are shown in Fig. 4c.

Finding orthogonal subsets

For all four sets of designs (DHD0–DHD2 and mALb), we detected a large fraction of strong off-target interactions (for example, A1 and B2 of DHD1 instead of A1 and B1). For example, in the all-by-all screen with all of DHD0–DHD2 and mALb, only 33 of 914 total PPIs with P_adj ≤ 0.01 and P-LFC ≥ 4 were on target.

We asked whether we could extract potential orthogonal subsets. We used significant (α = 0.01) positive P-LFC values to construct a weighted undirected graph as in Fig. 2d,e. This approach allowed us to rephrase the problem as finding a graph of degree one vertices without self-edges, which maximizes the sum of the remaining edges. To find a graph satisfying our constraints, we developed a simple scoring function that rewards graphs on the basis of existing orthogonal edges or those over a desired orthogonality gap and punishes graphs for nonorthogonal edges. The orthogonality gap is defined as the difference between the weakest on-target interaction and the strongest off-target interaction¹⁴. A larger gap generally results in smaller sets. This scoring function was used in a greedy graph reduction method, Deleting Undirected Edges Thoughtfully (DUET), which removes a vertex and its associated edges each iteration until a one-regular graph remains (Supplementary Methods) (the number of degree one vertices and score of total graphs per algorithm step can be seen in Supplementary Fig 7a,b). For the DHD1 and DHD0 subset, we went from 2,001 edges between 202 vertices to 18 DUET pairs (Fig. 4d); for the DHD0, DHD2 and mALb data, we went from 279 edges between 85 vertices to 11 DUET pairs (Fig. 4e); for all designs (DHD0–DHD2 and mALb), we went from 1,562 edges between 270 vertices to 36 DUET pairs (Fig. 4f). Of these, two, two and four DUET pairs were on-target for DHD0 + DHD1, DHD0 + DHD2 + mALb and all designs, respectively.

The DUET final results are only orthogonal if all non-highly significant interactions are considered noninteracting. As this may not be the case, we used all P-LFC > 0 between DUET pair protomers regardless of significance for a more conservative analysis. First, we removed interactions with protomers for which the DUET pair P-LFC was lower than the highest non-DUET pair P-LFC. Then, we reduced the remaining DUET pairs by removing whichever pair had the largest non-DUET P-LFC one by one (Supplementary Methods and Fig. 4g). When reduced to half the starting DUET pairs (squares in Fig. 4g), the orthogonal sets shown in Fig. 4h were left, which all had orthogonality gaps comparable to the NICP set in Fig. 2 and the Bcl-2 inhibitor pairs in Fig. 3. An additional undesirable behavior for these potentially orthogonal sets would be if DUET is biased toward selected proteins with missing interaction data (and, therefore, fewer edges in the graph). To determine whether this was the case, we ran permutation tests where we counted the number of missing interactions between the proteins in the initial DUET results and randomly sampled protein sets of the same size. We did not find that DUET results had a significantly higher number of missing interactions (Supplementary Fig 7c).

Elucidating the rules of specific helix–loop–helix binding

High-throughput, high-quality data can be used to probe novel design rules to improve protein design. We used MP3-seq to understand binding specificity better in designed 2×2 pairs by testing variants of different lengths and with a different number of buried HB-nets. Our target pair, mALb8 (the common parent of the mALb set), is a high-affinity 2×2 heterodimer with three HB-nets between its A and B protomers (Fig. 5a). Two mALb protomer truncations were designed by removing one or two turns from the end of each α-helix (Fig. 5b and Supplementary Fig. 8a). The original binders and the truncations were screened with MP3-seq and LFCs were calculated from four replicates. The relationship between length and binding affinity was highly nonlinear, as truncating the binders by two turns eliminated binding (Fig. 5c). We could use these data to infer the shortest possible length (3.5 heptads) for 2×2 binders containing two buried HB-nets. This pattern was similar to that seen for the A_N and B_N single-helix-truncation experiments in Fig. 1d,e.

**Fig. 5: Systematic exploration of length and HB-net on helix–loop–helix binding.**

We wanted to find the minimum number of buried HB-nets needed for orthogonal binding. Therefore, we removed the first (R1), second (R2) or both (R12) buried HB-nets from the mALb8 dimer and replaced them with either large (L) or small (S) hydrophobic residues (Supplementary Methods). The original HB-nets are shown in Fig. 5d and the replacement sequences are shown in Supplementary Fig. 8b. We observed that two HB-net mismatches were needed to prevent binding (Fig. 5e, boxed areas). For example, A_R1(s) and B_R2(s) had two HB-net mismatches (half of the second HB-net on protomer A and half of the first HB-net on protomer B) and showed no binding. A_R12(S) and B_R2(s) had one HB-net mismatch (first HB-net on chain B) and still bound. B_R12(S) weakly bound to both A_R1(S) and A_R2(S) with one HB-net mismatch. Additionally, hydrophobic residue size alone was not sufficient to confer orthogonality. For example, B_R1(S) and B_R1(L) bound to both A_R1(L) and A_R1(S). Using these simple rules and MP3-seq, we created two new orthogonal pairs (A_R1:B_R1 and A_R2:B_R2). We also note that the designs largely did not homodimerize (Fig. 5e, unboxed areas). The all-by-all screen of original, truncated and HB-net-removed mALb proteins can be found in Supplementary Fig. 8c, which showed that mALb proteins with one truncation had similar binding patterns with HB-net variants to the originals but that the second truncation eliminated binding.

Predicting orthogonal binding with linear models

To assess AF’s ability to predict orthogonal interactions, we used AF2 and three published versions of AF-M (v1–v3) to predict complexes for all six 1×1 NICP pairs in Fig. 2b (Supplementary Methods). We compared computed error metrics (predicted local distance difference test, predicted aligned error, etc.) with MP3-seq LFC values and on-target and off-target classification. We noted that AF-M v2 and v3 performed better than AF2 and AF-M v1 (Supplementary Fig. 9a,b). The best-performing metric was the interface predicted TM (iPTM) score averaged across our predicted complexes (Fig. 6a). We wanted to see how AF-M metrics compare to a state-of-the-art specialized model for 1×1 binding prediction, such as iCipa¹⁴ (Supplementary Fig. 9c). We found that iCipa correlated better with LFC values (Fig. 6b), particularly in the classification task (Fig. 6c). iCipa also better predicted orthogonality (Supplementary Fig. 9d,e).

**Fig. 6: Predicted complex error for coiled dimers and simple model performance.**

Next, we wanted to test the generalizability of AF2 on more complex protein architectures; thus, we predicted structures for the mALb8 (2×2) complexes. There was no apparent increase in correlation between AF-M v2 and v3 (Supplementary Fig. 9f,g). The AF-M v3 average iPTM for the mALb8 interactions is shown in Fig. 6d. Heterodimeric interactions had higher confidence than homodimeric complexes but the detailed effect of the HB-net mismatches was only partially captured.

Encouraged by the generalization abilities of AF-M, we set out to determine whether a combination of AF-M error metrics and structural metrics can be used to train a better predictor of LFC values. Rosetta was used to collect physics-based metrics (energy of interaction, surface of the interface, shape complementarity, etc.) for each simulated dimer complex. Agglomerative hierarchical clustering was used to reduce the number of multicollinear features between the collected energy terms and AF error metrics (Supplementary Fig. 10). Linear least square ridge regression models and logistic ridge regression classifiers were trained on feature sets of decreasing sizes (Methods). We used two train–test approaches. For the first, we ranked all data points by their P_adj values and partitioned the dataset into high-quality test set interactions and a mix of high-quality and low-quality training set interactions (see Supplementary Methods for modeling details). This approach was chosen to assess the ability of models using AF-M complex features to fill in missing interactions. In the other approach, a subset of proteins was selected and all interactions involving those proteins were assigned to the test set. The protein-based split was used to evaluate how AF-M complex-based models could predict interactions involving new proteins instead of filling in missing interactions between known proteins. Multiple train–test sets were created for this split by holding out different protein sets to get a distribution of test performances (Methods). Examples of the two test sets can be seen in Supplementary Fig. 11a.

The regression models performed better than using only iPTM on both protein architectures (Fig. 6b) and reached performances similar to iCipa on the 1×1 set. As expected, models using more Rosetta features yielded better results (Supplementary Figs. 11c–g and 12a,b). All regression models dropped drastically in performance in the held-out protein task. All models did well on the held-out interaction test set but the iPTM area under the precision recall curve (AUPRC) dropped drastically when considering the full dataset, while iCipa performed at the same level (Fig. 6c and Supplementary Fig. 11e). The classification task’s performance stayed relatively high between the two tasks even when the training set size was reduced (Supplementary Fig. 13a,b).

Similarly, models trained on the more complicated mALb8 interactions had drops in performance between the held-out interaction and held-out protein tasks (Supplementary Figs. 11c–g and 14). Although there was a smaller gain in performance when more features were included in modeling compared to the NICP predictors, the model with the most features still outperformed using iPTM alone in LFC prediction (Fig. 6e). Interestingly, reducing the training set size led to gains in model performance, likely because of the held-out interaction test set initially consisting of only high LFC interactions (Supplementary Fig. 14).

Lastly, to evaluate whether simple models trained on AF-M features to predict MP3-seq LFC values could generalize to new protein families, we used models trained with only AF error metrics to try to predict across protein families. That is, we attempted to predict the mALb8 interactions with models trained on NCIP proteins and vice versa (Supplementary Figs. 12c–f and 14b). We found in both cases that the models performed worse than iPTM when applied to protein families on which they were not trained.

Discussion

Here, we introduce MP3-seq, an easy-to-use HT-Y2H method that can measure pairwise PPIs in a single yeast strain without surface display. We also developed a data analysis workflow based on DESeq2 that makes it easy to merge replicates, remove autoactivators and identify statistically significant interactions. The MP3-seq workflow could be further generalized in future work by adjusting the selection scheme to (experimentally) eliminate autoactivators⁴⁵ or by integrating it with protein stability and expression assays such as Stable-seq⁵² or high-throughput protease assays⁵³ to confirm that the tested proteins are folded correctly and that an apparent negative interaction measurement is not because of reduced protein levels.

We validated MP3-seq using several sets of proteins for which interactions were previously characterized: a family of human Bcl-2 proteins and their de novo designed inhibitors, a set of peptides binding to Mcl-1 and three sets of coiled-coil peptides. We found quantitative agreement between our results and those reported previously. We then applied our method to characterize interactions in a pool of de novo 2×2 heterodimers containing buried HB-net and showed that it could scale to measure over 100,000 interactions in a single experiment. Our computational workflow enabled us to identify potential orthogonal subsets at various orthogonality gaps from these data.

By screening interactions between protomers with truncations and modified HB-nets, we probed design rules for 2×2 binders. We showed that the minimum length for strong binding is 3.5 heptads, with binding affinity dropping sharply for shorter designs. Moreover, we found that at least two HB-net mismatches are needed for orthogonality, thereby setting minimum requirements for future sets of orthogonal 2×2 binders with buried HB-nets.

Lastly, we assessed the ability of AF2 and AF-M to predict orthogonal binding interactions. On the set of 1×1 dimers, we found that a sequence-based predictor (iCipa) correlated better with experimental LFC values than any single AF metric. However, using a combination of AF metrics and Rosetta physics-based metrics, accurate regression models (LFC predictor) could be trained on minimal data, capable of matching the best available family-specific models such as iCipa. While iCipa was trained on over 8,000 pairs and uses energy terms refined by hand, the LFC predictor can be generated automatically for virtually any protein family with one hundredfold fewer experimental data points.

While the LFC predictors excelled at test–train splits designed to mimic filling in missing interactions in an all-by-all screen, they struggled when test sets were selected to mirror the task of predicting interactions of ‘new’ proteins not present in training sets. This examination of AF-M error metrics and models trained on AF-M metrics and Rosetta simulation values showcases that, while complex predictors are a powerful tool, high-throughput experimental assays remain necessary for tasks such as orthogonality confirmation and determining mutation effects on interactions.

Looking forward, we believe the increased scale and streamlined workflow of MP3-seq will further accelerate the adoption of HT-Y2H methods. These benefits will facilitate use in applications ranging from training predictive models of PPIs for generative protein design to characterizing interactions between human protein variants at high throughput.

Methods

Experimental workflow

Gene fragments were ordered from Twist Biosciences for the DBD and AD fusions of protein binders of interest. Each fragment consisted of a linker coding sequence (CDS), the binder CDS, a stop codon, a synthetic terminator, a PCR handle, a barcode and an insulation sequence. The linker and insulation sequences serve as homology sequences during plasmid assembly in yeast. The plasmid vector contained linker homologies, the AD, the DBD, a CEN autonomously replicating sequence for yeast replication and TRP1 for tryptophan selection. A plasmid can be seen in Supplementary Fig. 1. These fragments and vectors are then transformed into electrocompetent Y777 yeast cells for combinatorial assembly through homologous recombination. Transformed cells were resuspended in synthetic complete medium lacking tryptophan, SC-TRP medium, and grown to allow redundant plasmid dropping. Postpassage cells were inoculated in SC-HIS-TRP medium for His selection or reserved and frozen for the His⁺ sample. After selection in His⁻ medium, cells were frozen for the His⁻ sample. Samples were thawed, plasmids were extracted and barcode regions were prepped for sequencing with qPCR. Detailed information about the experimental workflow can be found in the Supplementary Methods.

Analysis workflow

FASTQ files were obtained using bcl2fastq, cutadapt⁵⁴ was used to trim barcode regions and Starcode⁵⁵ was used to cluster and count barcode–barcode pairs for the His⁺ and His⁻ files. At this point, experiments were screened for autoactivators and Autotune was used to infill His⁻ barcode counts as needed (Supplementary Equations 1 and 2). Proteins classified as autoactivators can be found in Supplementary Table 2. LFC and P-LFC values were then calculated with DESeq2 (ref. ⁴⁶). Detailed information about the analysis workflow can be found in the Supplementary Methods.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data that support the findings of this work are available in this article and the Supplementary Information. Barcode count data (both raw barcode counts and processed barcode counts) and de novo designed sequences are available on GitHub (https://github.com/Seeliglab/MP3-DUET-AUTOTUNE). Raw sequencing data are available from the Gene Expression Omnibus under accession code GSE271790. Source data are provided with this paper.

Code availability

All scripts necessary to recreate the MP3-seq pipelines, analysis and models can be found on GitHub (https://github.com/Seeliglab/MP3-DUET-AUTOTUNE).

References

Cho, J. H., Collins, J. J. & Wong, W. W. Universal chimeric antigen receptors for multiplexed and logical control of T cell responses. Cell 173, 1426–1438 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chen, Z. et al. De novo design of protein logic gates. Science 368, 78–84 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, Z. & Elowitz, M. B. Programmable protein circuit design. Cell 184, 2284–2301 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gao, X. J., Chong, L. S., Kim, M. S. & Elowitz, M. B. Programmable protein circuits in living cells. Science 361, 1252–1258 (2018).
Article CAS PubMed PubMed Central Google Scholar
Groves, B., Khakhar, A., Nadel, C. M., Gardner, R. G. & Seelig, G. Rewiring MAP kinases in Saccharomyces cerevisiae to regulate novel targets through ubiquitination. eLife 5, e15200 (2016).
Article PubMed PubMed Central Google Scholar
Ljubetič, A. et al. Design of coiled-coil protein-origami cages that self-assemble in vitro and in vivo. Nat. Biotechnol. 35, 1094–1101 (2017).
Article PubMed Google Scholar
Ben-Sasson, A. J. et al. Design of biologically active binary protein 2D materials. Nature 589, 468–473 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gonen, S., DiMaio, F., Gonen, T. & Baker, D. Design of ordered two-dimensional arrays mediated by noncovalent protein–protein interfaces. Science 348, 1365–1368 (2015).
Article CAS PubMed Google Scholar
Kuriyan, J. & Cowburn, D. Modular peptide recognition domains in eukaryotic signaling. Annu. Rev. Biophys. Biomol. Struct. 26, 259–288 (1997).
Article CAS PubMed Google Scholar
Fong, J. H., Keating, A. E. & Singh, M. Predicting specificity in bZIP coiled-coil interactions. Genome Biol. 5, R11 (2004).
Article PubMed PubMed Central Google Scholar
Potapov, V., Kaplan, J. B. & Keating, A. E. Data-driven prediction and design of bZIP coiled-coil interactions. PLoS Comput. Biol. 11, e1004046 (2015).
Article PubMed PubMed Central Google Scholar
Thompson, K. E., Bashor, C. J., Lim, W. A. & Keating, A. E. SYNZIP protein interaction toolbox: in vitro and in vivo specifications of heterospecific coiled-coil interaction domains. ACS Synth. Biol. 1, 118–129 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lebar, T., Lainšček, D., Merljak, E., Aupič, J. & Jerala, R. A tunable orthogonal coiled-coil interaction toolbox for engineering mammalian cells. Nat. Chem. Biol. 16, 513–519 (2020).
Article CAS PubMed PubMed Central Google Scholar
Boldridge, W. C. et al. A multiplexed bacterial two-hybrid for rapid characterization of protein–protein interactions and iterative protein design. Nat. Commun. 14, 4636 (2023).
Article CAS PubMed PubMed Central Google Scholar
Boyken, S. E. et al. De novo design of protein homo-oligomers with modular hydrogen-bond network-mediated specificity. Science 352, 680–687 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, Z. et al. Programmable design of orthogonal protein heterodimers. Nature 565, 106–111 (2019).
Article CAS PubMed Google Scholar
Brodnik, A., Jovičić, V., Palangetić, M. & Silađi, D. Construction of orthogonal CC-sets. Informatica 43, 19–22 (2019).
Article Google Scholar
Rao, V. S., Srinivas, K., Sujini, G. N. & Kumar, G. N. S. Protein–protein interaction detection: methods and analysis. Int. J. Proteom. https://doi.org/10.1155/2014/147648 (2014).
Smits, A. H. & Vermeulen, M. Characterizing protein–protein interactions using mass spectrometry: challenges and opportunities. Trends Biotechnol. 34, 825–834 (2016).
Article CAS PubMed Google Scholar
Rouet, R., Jackson, K. J. L., Langley, D. B. & Christ, D. Next-generation sequencing of antibody display repertoires. Front. Immunol. 9, 118 (2018).
Article PubMed PubMed Central Google Scholar
Younger, D., Berger, S., Baker, D. & Klavins, E. High-throughput characterization of protein–protein interactions by reprogramming yeast mating. Proc. Natl Acad. Sci. USA 114, 12166–12171 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fields, S. & Song, O. A novel genetic system to detect protein–protein interactions. Nature 340, 245–246 (1989).
Article CAS PubMed Google Scholar
Weile, J. et al. A framework for exhaustively mapping functional missense variants. Mol. Syst. Biol. 13, 957 (2017).
Article PubMed PubMed Central Google Scholar
Luck, K. et al. A reference map of the human binary protein interactome. Nature 580, 402–408 (2020).
Article CAS PubMed PubMed Central Google Scholar
Diss, G. & Lehner, B. The genetic landscape of a physical interaction. eLife 7, e32472 (2018).
Article PubMed PubMed Central Google Scholar
Erffelinck, M.-L. et al. A user-friendly platform for yeast two-hybrid library screening using next generation sequencing. PLoS ONE 13, e0201270 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jin, F., Avramova, L., Huang, J. & Hazbun, T. A yeast two-hybrid smart-pool-array system for protein-interaction mapping. Nat. Methods 4, 405–407 (2007).
Article CAS PubMed PubMed Central Google Scholar
Rajagopala, S. V. & Uetz, P. Analysis of protein–protein interactions using array-based yeast two-hybrid screens. In Yeast Functional Genomics and Proteomics (ed. Stagljar, I.) Vol. 548 (Humana Press, 2009).
Trigg, S. A. et al. CrY2H-seq: a massively multiplexed assay for deep-coverage interactome mapping. Nat. Methods 14, 819–825 (2017).
Article CAS Google Scholar
Weimann, M. et al. A Y2H-seq approach defines the human protein methyltransferase interactome. Nat. Methods 10, 339–342 (2013).
Article CAS PubMed Google Scholar
Yachie, N. et al. Pooled-matrix protein interaction screens using barcode fusion genetics. Mol. Syst. Biol. 12, 863 (2016).
Article PubMed PubMed Central Google Scholar
Yang, F. et al. Development and application of a recombination-based library versus library high-throughput yeast two-hybrid (RLL-Y2H) screening system. Nucleic Acids Res. 46, e17 (2018).
Article PubMed Google Scholar
Yang, J.-S. et al. rec-YnH enables simultaneous many-by-many detection of direct protein–protein and protein–RNA interactions. Nat. Commun. 9, 3747 (2018).
Article PubMed PubMed Central Google Scholar
Yu, H. et al. Next-generation sequencing to generate interactome datasets. Nat. Methods 8, 478–480 (2011).
Article CAS PubMed PubMed Central Google Scholar
Banerjee, S., Velásquez-Zapata, V., Fuerst, G., Elmore, J. M. & Wise, R. P. NGPINT: a next-generation protein–protein interaction software. Brief. Bioinform. 22, bbaa351 (2021).
Article PubMed Google Scholar
Velásquez-Zapata, V., Elmore, J. M., Banerjee, S., Dorman, K. S. & Wise, R. P. Next-generation yeast-two-hybrid analysis with Y2H-SCORES identifies novel interactors of the MLA immune receptor. PLoS Comput. Biol. 17, e1008890 (2021).
Article PubMed PubMed Central Google Scholar
Quartley, E. et al. Heterologous expression of L. major proteins in S. cerevisiae: a test of solubility, purity and gene recoding. J. Struct. Funct. Genomics 10, 233–247 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article CAS PubMed PubMed Central Google Scholar
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021).
Article CAS PubMed PubMed Central Google Scholar
Yu, D., Chojnowski, G., Rosenthal, M. & Kosinski, J. AlphaPulldown—a Python package for protein–protein interaction screens using AlphaFold-Multimer. Bioinformatics 39, btac749 (2023).
Article CAS PubMed Google Scholar
Johansson-Åkhe, I. & Wallner, B. Improving peptide–protein docking with AlphaFold-Multimer using forced sampling. Front. Bioinform. 2, 959160 (2022).
Article PubMed PubMed Central Google Scholar
McIsaac, R. S. et al. Synthetic gene expression perturbation systems with rapid, tunable, single-gene specificity in yeast. Nucleic Acids Res. 41, e57 (2013).
Article CAS PubMed Google Scholar
Curran, K. A. et al. Short synthetic terminators for improved heterologous gene expression in yeast. ACS Synth. Biol. 4, 824–832 (2015).
Article CAS PubMed Google Scholar
Scanlon, T. C., Gray, E. C. & Griswold, K. E. Quantifying and resolving multiple vector transformants in S. cerevisiae plasmid libraries. BMC Biotechnol. 9, 95 (2009).
Article PubMed PubMed Central Google Scholar
Shivhare, D., Musialak-Lange, M., Julca, I., Gluza, P. & Mutwil, M. Removing auto-activators from yeast-two-hybrid assays by conditional negative selection. Sci. Rep. 11, 5477 (2021).
Article CAS PubMed PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central Google Scholar
Plaper, T. et al. Coiled-coil heterodimers with increased stability for cellular regulation and sensing SARS-CoV-2 spike protein-mediated cell fusion. Sci. Rep. 11, 9136 (2021).
Article CAS PubMed PubMed Central Google Scholar
Thomas, F., Boyle, A. L., Burton, A. J. & Woolfson, D. N. A set of de novo designed parallel heterodimeric coiled coils with quantified dissociation constants in the micromolar to sub-nanomolar regime. J. Am. Chem. Soc. 135, 5161–5166 (2013).
Article CAS PubMed Google Scholar
Berger, S. et al. Computationally designed high specificity inhibitors delineate the roles of Bcl2 family proteins in cancer. eLife 5, e20352 (2016).
Article PubMed PubMed Central Google Scholar
Rogers, J. M., Wong, C. T. & Clarke, J. Coupled folding and binding of the disordered protein PUMA does not require particular residual structure. J. Am. Chem. Soc. 136, 5197–5200 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ljubetič, A., Gradišar, H. & Jerala, R. Advances in design of protein folds and assemblies. Curr. Opin. Chem. Biol. 40, 65–71 (2017).
Article PubMed Google Scholar
Kim, I., Miller, C. R., Young, D. L. & Fields, S. High-throughput analysis of in vivo protein stability. Mol. Cell. Proteom. 12, 3370–3378 (2013).
Article CAS Google Scholar
Rocklin, G. J. et al. Global analysis of protein folding using massively parallel design, synthesis and testing. Science 357, 168–175 (2017).
Article CAS PubMed PubMed Central Google Scholar
Benatuil, L., Perez, J. M., Belk, J. & Hsieh, C.-M. An improved yeast transformation method for the generation of very large human antibody libraries. Protein Eng. Des. Sel. 23, 155–159 (2010).
Article CAS PubMed Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10 (2011).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Institutes of Health (award R01GM120379), National Science Foundation (award 2312398) and Office of Naval Research (award N00014-16-1-3189) to G.S. A.L. would like to acknowledge funding of the European Commission Marie Skłodowska-Curie Actions program (CC-LEGO 792305), Slovenian research agency projects CC-TRIGGER J1-4406, N1-0323 and P4-0176 and FET open project Virofight (899619).

Author information

These authors contributed equally: Alexandr Baryshev, Alyssa La Fleur.

Authors and Affiliations

Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, USA
Alexandr Baryshev, Benjamin Groves & Georg Seelig
Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
Alyssa La Fleur & Georg Seelig
Department of Bioengineering, University of Washington, Seattle, WA, USA
Cirstyn Michel & David Baker
Department of Biochemistry, University of Washington, Seattle, WA, USA
David Baker & Ajasja Ljubetič
Institute for Protein Design, University of Washington, Seattle, WA, USA
David Baker & Ajasja Ljubetič
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
David Baker
Department for Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
Ajasja Ljubetič

Authors

Alexandr Baryshev
View author publications
You can also search for this author in PubMed Google Scholar
Alyssa La Fleur
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Groves
View author publications
You can also search for this author in PubMed Google Scholar
Cirstyn Michel
View author publications
You can also search for this author in PubMed Google Scholar
David Baker
View author publications
You can also search for this author in PubMed Google Scholar
Ajasja Ljubetič
View author publications
You can also search for this author in PubMed Google Scholar
Georg Seelig
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.B. optimized the MP3-seq workflow, performed most experiments and analyzed the data. A.L.F. designed the analytical pipeline and handled writing the paper, figure creation and modeling. B.G. conceptualized the MP3-seq approach and performed proof-of-concept experiments. C.M. performed experiments, created figures and wrote the paper. D.B. proposed conceptual ideas and provided project advice. A.L. designed the mALb8 and other de novo proteins, performed simulations, wrote the paper and provided funding. G.S. supervised experiments, provided funding and wrote the paper. All authors read and contributed to the final paper.

Corresponding authors

Correspondence to Ajasja Ljubetič or Georg Seelig.

Ethics declarations

Competing interests

G.S. is a cofounder and shareholder of Parse Biosciences. The other authors declare no competing interests.

Peer review

Peer review information

Nature Chemical Biology thanks Gabriel Rocklin and the other, anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary methods, structure visualization, statistics and reproducibility, Figs. 1–14 and Tables 2–5.

Reporting Summary

Supplementary Table 1

Table of nucleotide sequences of the designed binders.

Supplementary Table 6

Table of primers used in this study.

Source data

Source Data Fig. 2

Source data for heat maps, scatter plots and undirected graphs.

Source Data Fig. 3

Source data for heat maps, scatter plots and box-and-whisker plots.

Source Data Fig. 4

Source data for heat maps, undirected graphs and line plots.

Source Data Fig. 5

Source data for heat maps.

Source Data Fig. 6

Source data for heat maps and bar plots.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Baryshev, A., La Fleur, A., Groves, B. et al. Massively parallel measurement of protein–protein interactions by sequencing using MP3-seq. Nat Chem Biol (2024). https://doi.org/10.1038/s41589-024-01718-x

Download citation

Received: 27 September 2023
Accepted: 01 August 2024
Published: 27 August 2024
DOI: https://doi.org/10.1038/s41589-024-01718-x