Understanding the onset of hot streaks across artistic, cultural, and scientific careers

Liu, Lu; Dehmamy, Nima; Chown, Jillian; Giles, C. Lee; Wang, Dashun

doi:10.1038/s41467-021-25477-8

Download PDF

Article
Open access
Published: 13 September 2021

Understanding the onset of hot streaks across artistic, cultural, and scientific careers

Nature Communications volume 12, Article number: 5392 (2021) Cite this article

42k Accesses
22 Citations
790 Altmetric
Metrics details

Subjects

Abstract

Across a range of creative domains, individual careers are characterized by hot streaks, which are bursts of high-impact works clustered together in close succession. Yet it remains unclear if there are any regularities underlying the beginning of hot streaks. Here, we analyze career histories of artists, film directors, and scientists, and develop deep learning and network science methods to build high-dimensional representations of their creative outputs. We find that across all three domains, individuals tend to explore diverse styles or topics before their hot streak, but become notably more focused after the hot streak begins. Crucially, hot streaks appear to be associated with neither exploration nor exploitation behavior in isolation, but a particular sequence of exploration followed by exploitation, where the transition from exploration to exploitation closely traces the onset of a hot streak. Overall, these results may have implications for identifying and nurturing talents across a wide range of creative domains.

Elites, communities and the limited benefits of mentorship in electronic music

Article Open access 21 February 2020

Milán Janosov, Federico Musciotto, … Gerardo Iñiguez

A dataset of publication records for Nobel laureates

Article Open access 18 April 2019

Jichao Li, Yian Yin, … Dashun Wang

Scientific prizes and the extraordinary growth of scientific topics

Article Open access 05 October 2021

Ching Jin, Yifang Ma & Brian Uzzi

Introduction

A remarkable feature of creative careers is the existence of hot streaks^1,2,3. Despite the ubiquitous nature of hot streaks across artistic, cultural, and scientific domains, it remains unclear if there are any regularities underlying the beginning of a hot streak. Understanding the origin of hot streaks is not only crucial for our quantitative understanding of patterns governing creative life cycles but it also has implications for the identification and development of talent across a wide range of settings^4,5. Deciphering what predicts hot streaks, however, remains a challenge, partly due to the complex nature of creative careers^{1,6,7,8,9,10,11,12,13,14,15,16,17}. The lack of systematic explanations for hot streaks, combined with the randomness of when they occur within a career¹, paints an unpredictable, if incomplete, view of creativity across a diverse range of domains.

Of the myriad forces that might affect career progression and success, the strategies of exploration and exploitation have attracted enduring interests from a broad set of disciplines^{14,15,16,18,19,20,21,22}, prompting us to examine their potential relationship with hot streaks. Indeed, according to the literature, exploitation allows individuals to build knowledge in a particular area and to refine their capabilities in that area over time. This could be relevant for understanding hot streaks since exploitation allows individuals to “go deep” in a focal area to both establish expertise in that area and foster a reputation related to that expertise^18,19. Exploration, on the other hand, engages individuals in experimentation and search beyond their existing or prior areas of competency. Although exploration is more risky and consequently associated with larger variance in outcomes²³, it may also increase one’s likelihood of stumbling upon a groundbreaking idea through unanticipated combinations of disparate sources²⁴. In contrast, exploitation, as a conservative strategy, may stifle originality and, may over time, limit an individual’s ability to consistently produce high-impact work¹⁴. Taken together, the benefits and downsides to these contrasting approaches raise a fundamental question: Are career hot streaks reflective of exploration or exploitation behavior, or some combination of the two?

To answer this question, we develop computational methods using deep learning^25,26 and network science^27,28 and apply them to large-scale datasets tracing the career outputs of artists, film directors, and scientists. Specifically, we build high-dimensional representations of the artworks, films, and scientific publications they produce (Supplementary Note 1), which capture abstract concepts, styles, and topics represented therein, allowing us to trace an individual’s career trajectory on the underlying creative space (Supplementary Note 1). We further quantify the hot streak within each career by the impact of works one produced¹, measured by auction price^1,29, IMDB ratings^1,30, and paper citations in 10 years^1,12, respectively. We then correlate the timing of hot streaks with the creative trajectories for each individual, allowing us to examine changes in the characteristics of the work one produces around the beginning of a hot streak.

Results

To examine the art styles of each artist and their exploration and exploitation dynamics, we collected over 800 K images of visual arts from museum and gallery collections, covering the career histories of 2128 artists^31,32. Building on recent advances in computer vision^33,34, we use a transfer-learning approach³⁵ to construct an embedding for artworks using deep neural networks (Fig. 1a–c). We generate a 200-dimensional embedding of each artwork (see “Methods” and Supplementary Note 1.1), and identify art styles through clusters on the 200-dimensional embedding space, allowing us to trace the evolution of art styles over the course of their careers (Fig. 2a–d).

**Fig. 1: Quantifying individual creative trajectories using high-dimensional representation techniques.**

**Fig. 2: Creative trajectories and hot-streak dynamics: three exemplary careers.**

To examine the career histories of film directors, we collected our second dataset capturing plot description and cast information for each film recorded in the IMDB database (79 K films by 4337 directors; see Supplementary Note 1.2 for more detail). We build a 200-dimensional representation of each film by combining its plot and cast information (Fig. 1d, e, see “Methods,” and Supplementary Note 1.2), and identify the style of each film based on clusters in the obtained embedding space, allowing us to investigate the dynamics of styles for film directors (Fig. 2e–h).

In the third setting, we analyze the career histories of 20,040 scientists by combining publication and citation datasets from the Web of Science and Google Scholar^1,12, tracing the dynamics of research topics as reflected in the publication history of each career. We use a method developed recently by Zeng et al.¹⁶, which identifies research topics within a career by finding communities in a weighted co-citing network of all publications by the individual (Figs. 1f and 2i–l). To ensure that the results obtained for scientific careers are consistent with the embedding methods used to analyze the careers of artists and directors, we also applied a node embedding method to the co-citing network to identify research topics, and repeated our analyses, finding that the conclusions remain the same (Supplementary Note 1.3).

To quantify the exploration and exploitation behaviors reflected in each individual’s career across the three domains, we measure the style or topic entropy for the work one produces, defined as \(\widetilde{H}=-\mathop{\sum }\nolimits_{i=1}^{m}{p}_{i}\,{{\log }}\,{p}_{i}\), where \({p}_{i}\) is the frequency in which one devotes to an art style or topic \(i\) and \(m\) is the number of unique styles or topics. On one extreme, a pure exploitation strategy means that an individual’s work is contained within only one style or topic (\(\widetilde{H}=0\)); on the other extreme, \(\widetilde{H}={{\log }}\,n\) corresponds to the case of pure exploration, where \(n\) is the number of works one produced in the period, indicating that an individual’s attention is evenly divided across a distribution of styles or topics (\({p}_{i}=1/n\)). For convenience, we normalize the entropy measure to obtain the rescaled entropy \(H=\widetilde{H}/{{\log }}\,n\). Figure 2 illustrates three notable careers as examples for identifying art styles, topics, and their entropies calculated using the methodologies described above as well as in “Methods.”

To test whether hot streaks are associated with exploration or exploitation, we measure the distribution of entropy \(P(H)\) for works produced before and during a hot streak (Fig. 3a–c). To gauge the expected magnitude of \(H\) around a hot streak, we further construct a null model for each career by randomly designating the time at which the hot streak begins¹. We calculate the average entropy \(\langle H\rangle\) measured in real careers before (Fig. 3d–f) and after the onset of the hot streak (Fig. 3g–i), and compare them with random careers, measured by the distribution of entropy, \(P\left(\langle H\rangle \right)\), for 1000 realizations of the randomized careers. Figure 3d–i shows three primary findings. First, before a hot streak, \(\langle H\rangle\) is systematically larger than expected (z-scores >2), indicating that individuals tend to diversify the topics they work on before a hot streak begins, consistent with an exploration strategy in the period leading up to hot streak. Second, following the onset of the hot streak, \(\langle H\rangle\) measured in real careers becomes significantly smaller than expected (z-score <−2), suggesting that individuals become substantially more focused on what they work on, reflecting an exploitation strategy during hot streak. Third, despite the differences in the three types of careers we study and the methodologies to examine their career outputs, the observed associations between exploration, exploitation, and hot streaks appear universal across all three domains we studied.

**Fig. 3: Exploration, exploitation and career hot streaks.**

To systematically examine the temporal changes in entropy, we align careers based on when their hot streak begins and measure the dynamics of \(H\) around the hot streak (Fig. 3j–l). We find that compared with randomized careers, \(H\) measured in real careers is systematically elevated before a hot streak begins, but drops precipitously below expectation during the hot streak. We further compare directly the entropy distribution \(P(H)\) before and after the hot streak begins, finding that, across all three domains, \(H\) during a hot streak is systematically smaller than before (Fig. 3m–o, Kolmogorov–Smirnov (KS) test, p value <0.001); this pattern is absent when we repeat the same measurement for randomized careers (Fig. 3p–r).

The exploitation behavior during hot streaks appears consistent with several famous examples, including painter Jackson Pollock’s “drip period” (1946–1950) (Fig. 2d), director Peter Jackson’s “The Lord of the Rings trilogy” (Fig. 2h), and the career of scientist John Fenn, whose hot streak arrived late in his career, but the work he produced during that period on electrospray ionization eventually won him the chemistry Nobel in 2002 (Fig. 2l). These examples raise an intriguing question: can the exploitation behavior by itself predict career hot streaks? To test this, we identify episodes of exploitation in each career by tracing the dynamics of \(H\) across our three domains. We calculate the probability of initiating a hot streak with the onset of an exploitation episode, and compare it with the baseline probability measured in randomized careers (Fig. 3s–u). We find that when exploitation occurs by itself, not preceded by exploration, the chance that such episodes coincide with a hot streak is significantly lower than expected, not higher, across all three domains. These results indicate that exploitation by itself may not guarantee hot streaks, further suggesting the importance of prior exploration. Indeed, reexaminations of the careers of Jackson Pollock, Peter Jackson, and John Fenn reveal a phase of unusual exploration of new and diverse art styles, types of films, and research topics, respectively, for the period leading up to their hot streaks (Fig. 2c, g, k). This observation raises the question of whether exploration that precedes a hot streak is instead the crucial ingredient, prompting us to calculate the probability of initiating a hot streak following an exploration episode alone. However, we find that when the episode of exploration is not followed by exploitation, the chance for such exploration to coincide with a hot streak again reduces significantly. By contrast, exploration followed by exploitation appears consistently associated with a significant lift in the probability of initiating a hot streak: this configuration consistently outperforms the baseline across all three domains (20.5%, 13.8%, and 19.2% over the baseline for artists, directors, and scientists, respectively), and represents the only positive lift among all combinations of the two creative strategies (Fig. 3s–u). Figure S46 further examines the exploration, exploitation, and normal phases, and explores all potential sequences of any two of the three phases (nine in total), reaching the same conclusions.

Taken together, these results suggest that neither exploration nor exploitation alone is associated with the hot streak dynamics; rather, it is the shift from exploration to exploitation that closely traces the onset of a hot streak. One plausible explanation is that exploration, as a risky, variance-enhancing strategy, increases one’s chances to stumble upon new, potentially groundbreaking ideas; the subsequent exploitation behavior allows the individual to focus, develop knowledge and capabilities in that focal area, and build out their discoveries further. Importantly, our findings suggest that both ingredients of exploration and exploitation seem necessary. This supports the notion that not all explorations are fruitful, and that exploitation in the absence of promising new ideas may not be as productive. On the other hand, the sequence of exploration followed by exploitation may facilitate the emergence of high-impact work by incorporating new insights into a focused agenda. The positioning of exploration before exploitation may therefore serve to expand an individual’s creative possibilities.

We test the robustness of our results across several dimensions. We split our samples of artists, directors, and scientists based on the timing of their hot streaks (Supplementary Note 3.1), the individual’s level of impact (Supplementary Note 3.2), and different fields of studies (Supplementary Note 3.3), and repeat our analyses in each subsample, arriving at consistent conclusions. We further control for individual fixed effects in their exploration–exploitation dynamics (Supplementary Note 3.4), and find that artists, directors, and scientists predictably deviate from their typical creative behaviors around the beginning of a hot streak: individuals who tend to exploit become more exploratory before a hot streak begins, whereas individuals who tend to explore become particularly focused during their hot streak (Supplementary Note 3.4). We further use regression analysis to fit the relationship between hot streaks and the exploration–exploitation transition by controlling for the impact of an individual’s work, their career stage, and other individual characteristics, and find that our conclusions remain the same (Supplementary Note 3.5). For scientists who experience two hot streaks, we perform our measurements for the first and second hot streak separately (Supplementary Note 3.6), and find that the exploration–exploitation dynamics hold true in both cases. For those having hot streaks at the beginning of their careers, while by construction we cannot observe their prior behaviors, we find that they consistently engage in exploitation during their hot streaks (Supplementary Note 3.7). We further verify that these results are robust to using different community detection algorithms such as Infomap³⁶ (Supplementary Note 3.8) and different ways of aggregating data over time (Supplementary Note 3.9). We also replaced our entropy measure to quantify the exploration–exploitation dynamics by the Simpson diversity measure \((1-{\Sigma }_{i}{p}_{i}^{2})\) (Supplementary Note 3.10), the number of styles or topics (Supplementary Note 3.11), the fraction of works in the most popular style or topic (Supplementary Note 3.12), and probability of switching topics (Supplementary Note 3.13), and repeat all our analyses, finding again the same conclusions.

To understand the potential forces that might facilitate the shift from exploration to exploitation, we further examine the organization of innovative activity. Motivated by the literature on science teams^8,37,38, here we focus on scientific careers only, asking whether there are detectable changes in collaboration patterns around the exploration–exploitation transition. We find that scientists are more likely to explore with small teams before a hot streak, but exploit with large teams after a hot streak begins. Indeed, we quantify the change in team size through two measures. We trace the dynamics of team size around the beginning of a hot streak (Fig. 4a). We also calculate the team size distribution observed in real careers normalized by the randomized careers (\({R}\left({{{{{\rm{team}}}}}}\; {{{{{\rm{size}}}}}}\right)\); Fig. 4b). Both results show that team size drops significantly before the hot streak yet becomes substantially larger than expected during the hot streak (Fig. 4a, b). We further find that the onset of hot streaks appears to mark an increase in new collaborators (Supplementary Note 4), consistent with the advantages of fresh teams³⁸. Note that the role and definition of teams vary substantially across the three domains, hence this analysis is applicable to scientific careers only. Given the observational nature of our study, we cannot rule out potential omitted variables that might mediate these patterns. Nevertheless, these results are in line with the findings that small and large teams are differentially positioned for innovation³⁷: large teams tend to excel at furthering existing ideas and design, whereas small teams tend to disrupt current ways of thinking with new ideas and opportunities. We further test the robustness of these results across different disciplines, adjusting for self-citations, and controlling for the publication year, research field, and career stage using regression analysis, all arriving at the same conclusions (Supplementary Note 4).

**Fig. 4: Authorship structure and hot streaks in science.**

Our next analysis probes potential connections between phases of exploration and exploitation surrounding a scientist’s hot streak. We examine properties of the topics that are explored during the period leading up to hot streak, ranging from recency to citation impact to popularity, asking which topics tend to be chosen for subsequent exploitation. We find that the topic that was eventually exploited is less likely to be the one explored the most recently, or the highest cited, or the most popular among the topics explored before (see Supplementary Note 5). These findings imply that, more than simply chasing after discovery through exploration, individuals appear to seek out new opportunities by deliberating over different possibilities, and then harvesting promising directions through exploitation. To test if these potential connections can help us better understand which direction to exploit following exploration, we set up a simple prediction task to predict which topic to exploit using the features discussed above that characterize the exploration phase, including team size and topic properties (Supplementary Note 5); this exercise yielded substantial predictive power (accuracy of 0.89 and area under the curve of 0.83). Overall, these results suggest intriguing connections between phases of exploration and exploitation surrounding a hot streak, which may have implications for science funding, especially given hot streaks and research grants tend to last for a similar duration.

Finally, we consider career trajectories following the end of a hot streak. We measure the average entropy \(\langle H\rangle\) after the end of a hot streak and compare the measurements in real careers with the distribution of entropy \(P\left(\left\langle H\right\rangle \right)\) from the randomized careers (Fig. 5a–c). We find that, after the hot-streak period, \(\langle H\rangle\) becomes statistically indistinguishable from the randomized careers (−1 ≤ z-score ≤ 1). We further examine the temporal changes in entropy at the end of a hot streak by aligning careers based on when their hot streaks end (Fig. 5d–f). We find again a lack of difference between data and the null model. Together, these analyses suggest that individuals return to “normal” after their hot streak ends, showing an absence of exploration or exploitation patterns.

Discussions

Taken together, these results unveil identifiable regularities underlying the onset of career hot streaks, which appear to apply universally across a wide range of creative domains. Overall, our results highlight the important role of both exploration and exploitation in individual careers. Curiously, across all three domains we studied, a major turning point for individual careers appears most closely linked with neither exploration nor exploitation behavior in isolation, but rather with the particular sequence of exploration followed by exploitation. Indeed, extant literature has documented the fundamental role of exploration and exploitation in creativity (Supplementary Note 2.2 and Supplementary Table 1). Yet as creative behaviors, they have traditionally been considered either in isolation or in combination but rarely in succession^14,22; this is especially the case for career-level analysis. Our results suggest a sequential view of creative strategies that balance experimentation and implementation may be particularly powerful for producing long-lasting contributions. These findings may hold broad relevance for identifying, training, and nurturing creative talents, especially given the various forces that sometimes appear in tension with the exploration–exploitation dynamics, ranging from the intensifying pressure to publish^39,40 to the increasing trend of exploration over a career¹⁶, from the specialization of individual expertise¹⁰ to how such specialization is favored in personnel evaluations^41,42.

It is important to note that while our results demonstrate significant and consistent relationships across domains, the overall effect size seems modest. On the one hand, this suggests that additional controls might further tighten the relationship. For example, after we control for authorship and the effect of collaborations, the effect size seems to magnify (Supplementary Note 4.5). On the other hand, it also suggests opportunities to examine other potential processes that may also underlie the onset of hot streaks. Indeed, real careers are complex, with heterogeneous influences operating across domains as well as a multitude of individual and institutional factors. Hence, it is plausible that additional factors may also be at work. In this study, we also tested several alternative explanations for the onset of hot streaks (Supplementary Note 6). Although each of these hypotheses we tested appears plausible by itself, we find that none of them shows consistent associations, indicating that none of these alternative hypotheses alone can account for the hot-streak dynamics we studied. It is also likely that on an individual basis, the exploration–exploitation transition is further influenced by other external factors, such as shifting market conditions⁴³, social network structure^38,44, and disciplinary culture^18,19. Individuals may also receive short-term feedback (e.g., art critiques or peer reviews) that may offer additional signals shaping their career focus. As such, the patterns of exploration and exploitation may reflect personal initiatives as well as responses to external forces. Nevertheless, our results suggest that, despite the obvious heterogeneity in the settings we examined and the myriad factors that may affect career progression and success, the exploration–exploitation dynamics appears consistently associated with the onset of hot streaks across rather diverse domains.

The data-driven nature of our study indicates that it is not immune to two limitations common in this type of analysis. First, while the datasets we assembled in this paper represent large collections of career histories and outputs across a variety of domains, they are limited to individuals who have had sufficiently long careers providing enough data points for statistical analyses (Supplementary Note 1). Second, this paper presents correlational evidence, whose primary goal is to investigate empirical regularities associated with the onset of hot streaks. Future work using causal research designs may improve causative interpretations of the regularities reported here.

Furthermore, while this work mainly focuses on universal patterns related to the onset of hot streaks, there could be important domain-specific differences in the role of exploration, exploitation, and success that are worth investigating further. For example, our preliminary analysis suggests that the level of exploration and exploitation in science appears much stronger than in art or film directing (Fig. 3). The number of styles/topics within each career also varies substantially across domains (Supplementary Fig. 30). While these cross-domain differences could flow from inherent differences in data and methods, assessing domain-specific patterns is an important direction for future work.

Notably, the sequence of exploration followed by exploitation closely resembles strategies observed in a wide range of natural and socio-technical settings, from animal foraging⁴⁵ to human cognitive search⁴⁶, from multi-armed bandits and reinforcement learning⁴⁷ to role oscillation between brokerage and closure in social network⁴⁸ to changing innovation strategies over business cycles⁴⁹. It thus suggests that the sequential strategies of exploration followed by exploitation uncovered in this study may have broad relevance that goes beyond individuals’ careers. Lastly, the representation techniques used in this paper could open up promising avenues for research on creativity^50,51,52, offering a quantitative framework to probe the characteristics of the creative products themselves. Future advances in deep learning may enable researchers to incorporate more creative dimensions, and hence more fruitfully contribute to a computationally enhanced understanding of creativity.

Methods

High-dimensional representation of artworks

We apply a pre-trained VGGNet algorithm³³, one of the best-known algorithms for image recognition, to images of artworks, and connect it with an additional neural network with fully connected layers to classify the art style labels recorded in our dataset (Fig. 1a). The convolutional layers in the pre-trained VGGNet use 3 × 3 filters to detect local patterns from the artwork (Fig. 1b). The filters in the first layer capture spatial patterns such as line orientations and brushstrokes (Fig. 1b), whereas those in higher layers combine outputs of filters from lower layers to capture more complex features, such as shapes and objects (Fig. 1c). To leverage VGGNet’s image recognition capabilities, here we do not train the VGGNet layers, but instead train the fully connected layers to repurpose VGGNet to identify art styles (Fig. 1a), helping the first two fully connected layers to find an abstract representation of concepts and themes by grouping together related outputs of the VGGNet layers. Prior research shows that art style may be decoded from both brush strokes and the overall concepts, subjects, and themes^34,53, suggesting that both low- and high-level features are important for capturing art styles. We combine the outputs from the first and third convolutional layers in VGGNet with the fully connected layer before the final classification layer (see Supplementary Note 1.1 for several case studies showing how art styles are interpreted by our deep learning framework). We apply our deep neural network to the career outputs of each artist in the dataset, and then use principal component analysis for dimensionality reduction to generate a 200-dimensional embedding of each artwork.

High-dimensional representation of films

We build high-dimensional representations of films by combing the plot and casting information of each film. We first train word embeddings⁵¹ in the description of the plot to learn a 100-dimensional text representation of a film from the co-occurrence of words (Fig. 1d and Supplementary Note 1.2). To incorporate casting information, we construct a weighted co-casting network among all actors and apply a node embedding method DeepWalk⁵⁴ to obtain a 100-dimensional casting vector for each film (Fig. 1e and Supplementary Note 1.2). We then concatenate the vectors for plot and cast, allowing us to develop a 200-dimensional embedding space to represent all films. Despite the myriad factors that may affect the artistic and financial success of a film⁵⁵, ranging from the screenplay to acting, we find that the learned high-dimensional representation can successfully predict film genre with an accuracy of 0.948 (Supplementary Note 1.2).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data used in this study have been deposited in the GitHub repository https://kellogg-cssi.github.io/onsethotstreaks.

Code availability

The code used in this study has been deposited in the GitHub repository https://kellogg-cssi.github.io/onsethotstreaks.

References

Liu, L. et al. Hot streaks in artistic, cultural, and scientific careers. Nature 559, 396 (2018).
Article ADS CAS Google Scholar
Williams, O. E., Lacasa, L. & Latora, V. Quantifying and predicting success in show business. Nat. Commun. 10, https://doi.org/10.1038/s41467-019-10213-0 (2019).
Garimella, K. & West, R. In Proc. International Conference on Web and Social Media, Vol. 13, 170–180 (2019).
Fortunato, S. et al. Science of science. Science 359, eaao0185 (2018).
Azoulay, P. et al. Toward a more scientific science. Science 361, 1194–1197 (2018).
Article ADS Google Scholar
Merton, R. K. Matthew effect in science. Science 159, 56–5 (1968).
Article ADS CAS Google Scholar
Simonton, D. K. Origins of Genius: Darwinian Perspectives on Creativity (Oxford Univ. Press, 1999).
Wuchty, S., Jones, B. F. & Uzzi, B. The increasing dominance of teams in production of knowledge. Science 316, 1036–1039 (2007).
Article ADS CAS Google Scholar
Malmgren, R. D., Ottino, J. M. & Amaral, L. A. N. The role of mentorship in protégé performance. Nature 465, 622–626 (2010).
Article ADS CAS Google Scholar
Jones, B. F. The burden of knowledge and the “death of the renaissance man”: Is innovation getting harder? Rev. Econ. Stud. 76, 283–317 (2009).
Article Google Scholar
Petersen, A. M. et al. Reputation and impact in academic careers. Proc. Natl Acad. Sci. USA 111, 15316–15321 (2014).
Article ADS CAS Google Scholar
Sinatra, R., Wang, D., Deville, P., Song, C. & Barabási, A.-L. Quantifying the evolution of individual scientific impact. Science 354, aaf5239 (2016).
Way, S. F., Morgan, A. C., Clauset, A. & Larremore, D. B. The misleading narrative of the canonical faculty productivity trajectory. Proc. Natl Acad. Sci. USA 114, E9216–E9223 (2017).
Article CAS Google Scholar
Foster, J. G., Rzhetsky, A. & Evans, J. A. Tradition and innovation in scientists’ research strategies. Am. Sociol. Rev. 80, 875–908 (2015).
Article Google Scholar
Jia, T., Wang, D. & Szymanski, B. K. Quantifying patterns of research-interest evolution. Nat. Hum. Behav. 1, 1–7 (2017).
Article Google Scholar
Zeng, A. et al. Increasing trend of scientists to switch between topics. Nat. Commun. 10, 1–11 (2019).
Article ADS Google Scholar
Wang, D. & Barabási, A.-L. The Science of Science (Cambridge Univ. Press, 2021).
Bourdieu, P. The specificity of the scientific field and the social conditions of the progress of reason. Soc. Sci. Info. 14, 19–47 (1975).
Article Google Scholar
Kuhn, T. S. The Essential Tension: Selected Studies in Scientific Tradition and Change (University of Chicago Press, 1977).
Polanyi, M. The republic of science. Minerva 1, 54–73 (1962).
Article Google Scholar
March, J. G. Exploraion and exploitation in organizational learning. Organ. Sci. 2, 71–87 (1991).
Article ADS Google Scholar
Lavie, D., Stettner, U. & Tushman, M. L. Exploration and exploitation within and across organizations. Acad. Manag. Ann. 4, 109–155 (2010).
Article Google Scholar
Fleming, L. Recombinant uncertainty in technological search. Manag. Sci. 47, 117–132 (2001).
Article Google Scholar
Uzzi, B., Mukherjee, S., Stringer, M. & Jones, B. Atypical combinations and scientific impact. Science 342, 468–472 (2013).
Article ADS CAS Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016).
Newman, M. Networks (Oxford Univ. Press, 2018).
Barabási, A.-L. Network Science (Cambridge Univ. Press, 2016).
Galenson, D. W. Old Masters and Young Geniuses: The Two Life Cycles of Artistic Creativity (Princeton Univ. Press, 2011).
Wasserman, M., Zeng, X. H. T. & Amaral, L. A. N. Cross-evaluation of metrics to estimate the significance of creative works. Proc. Natl Acad. Sci. USA 112, 1281–1286 (2015).
Article ADS MathSciNet CAS Google Scholar
Schich, M. et al. A network framework of cultural history. Science 345, 558–562 (2014).
Article ADS CAS Google Scholar
Lee, B. et al. Dissecting landscape art history with information theory. Proc. Natl Acad. Sci. USA 117, 26580–26590 (2020).
Article ADS CAS Google Scholar
Simonyan, K. & Zisserman, A. In The 3rd International Conference on Learning Representations (ICLR, 2015).
Gatys, L. A., Ecker, A. S. & Bethge, M. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2414–2423 (CVPR, 2016).
Elgammal, A., Liu, B., Kim, D., Elhoseiny, M. & Mazzone, M. In Thirty-Second AAAI Conference on Artificial Intelligence, Vol. 32 (2018).
Rosvall, M. & Bergstrom, C. T. Maps of random walks on complex networks reveal community structure. Proc. Natl Acad. Sci. USA 105, 1118–1123 (2008).
Article ADS CAS Google Scholar
Wu, L., Wang, D. & Evans, J. A. Large teams develop and small teams disrupt science and technology. Nature 566, 378–382 (2019).
Article ADS CAS Google Scholar
Zeng, A., Fan, Y., Di, Z., Wang, Y. & Havlin, S. Fresh teams are associated with original and multidisciplinary research. Nat. Hum. Behav. https://doi.org/10.1038/s41562-021-01084-x (2021).
De Rond, M. & Miller, A. N. Publish or perish: bane or boon of academic life? J. Manag. Inq. 14, 321–329 (2005).
Article Google Scholar
Fanelli, D. Do pressures to publish increase scientists’ bias? An empirical support from US States Data. PLoS ONE 5, e10271 (2010).
Article ADS Google Scholar
Youn, T. I. & Price, T. M. Learning from the experience of others: the evolution of faculty tenure and promotion rules in comprehensive institutions. J. High. Educ. 80, 204–237 (2009).
Article Google Scholar
Alperin, J. P. et al. Meta-Research: How significant are the public dimensions of faculty work in review, promotion and tenure documents? ELife 8, e42254 (2019).
Article Google Scholar
Balietti, S., Goldstone, R. L. & Helbing, D. Peer review and competition in the Art Exhibition Game. Proc. Natl Acad. Sci. USA 113, 8414–8419 (2016).
Article CAS Google Scholar
Lazer, D. & Friedman, A. The network structure of exploration and exploitation. Adm. Sci. Q. 52, 667–694 (2007).
Article Google Scholar
Viswanathan, G. M. et al. Optimizing the success of random searches. Nature 401, 911–914 (1999).
Article ADS CAS Google Scholar
Cohen, J. D., McClure, S. M. & Yu, A. J. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos. Trans. R. Soc. Ser. B 362, 933–942 (2007).
Article Google Scholar
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction (MIT Press, 2018).
Burt, R. S. & Merluzzi, J. Network oscillation. Acad. Manag. Discov. 2, 368–391 (2016).
Article Google Scholar
Manso, G., Balsmeier, B. & Fleming, L. Heterogeneous innovation over the business cycle. Working Paper (University of California at Berkeley, 2017).
Tshitoyan, V. et al. Unsupervised word embeddings capture latent knowledge from materials science literature. Nature 571, 95–98 (2019).
Article ADS CAS Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S. & Dean, J. In Advances in Neural Information Processing Systems, 3111–3119 (NIPS, 2013).
Peng, H., Ke, Q., Budak, C., Romero, D. M. & Ahn, Y.-Y. Neural embeddings of scholarly periodicals reveal complex disciplinary organizations. Sci. Adv. 7, eabb9004 (2021).
Mao, H., Cheung, M. & She, J. In Proc. 25th ACM International Conference on Multimedia, 1183–1191 (ACM, 2015).
Perozzi, B., Al-Rfou, R. & Skiena, S. In Proc. 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 701–710 (ACM, 2014).
Simonton, D. K. Cinematic success criteria and their predictors: the art and business of the film industry. Psychol. Mark. 26, 400–420 (2009).
Article Google Scholar

Download references

Acknowledgements

We thank A.-L. Barabási, W. Ocasio, B. Uzzi, J. Evans, K. Rao, C. Candia, S. Medya, G. Tripodi, and all members of the Center for Science of Science and Innovation (CSSI) for invaluable comments. This work is supported by the Air Force Office of Scientific Research under award numbers FA9550-15-1-0162, FA9550-17-1-0089, and FA9550-19-1-0354.

Author information

Authors and Affiliations

Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA
Lu Liu, Nima Dehmamy, Jillian Chown & Dashun Wang
Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
Lu Liu, Nima Dehmamy & Dashun Wang
Kellogg School of Management, Northwestern University, Evanston, IL, USA
Lu Liu, Nima Dehmamy, Jillian Chown & Dashun Wang
College of Information Sciences and Technology, Pennsylvania State University, University Park, PA, USA
Lu Liu & C. Lee Giles
Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA
C. Lee Giles
McCormick School of Engineering, Northwestern University, Evanston, IL, USA
Dashun Wang

Authors

Lu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Nima Dehmamy
View author publications
You can also search for this author in PubMed Google Scholar
Jillian Chown
View author publications
You can also search for this author in PubMed Google Scholar
C. Lee Giles
View author publications
You can also search for this author in PubMed Google Scholar
Dashun Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.W. conceived the project and designed the experiments; L.L. and N.D. collected data and performed empirical analyses with help from J.C., C.L.G. and D.W.; all authors discussed and interpreted results; D.W., L.L. and N.D. wrote the manuscript; all authors edited the manuscript.

Corresponding author

Correspondence to Dashun Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Steven Skiena and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, L., Dehmamy, N., Chown, J. et al. Understanding the onset of hot streaks across artistic, cultural, and scientific careers. Nat Commun 12, 5392 (2021). https://doi.org/10.1038/s41467-021-25477-8

Download citation

Received: 22 February 2021
Accepted: 04 August 2021
Published: 13 September 2021
DOI: https://doi.org/10.1038/s41467-021-25477-8

This article is cited by

Data, measurement and empirical methods in the science of science
- Lu Liu
- Benjamin F. Jones
- Dashun Wang
Nature Human Behaviour (2023)
Surprising combinations of research contents and contexts are related to impact and emerge with scientific outsiders from distant disciplines
- Feng Shi
- James Evans
Nature Communications (2023)
Quantifying hierarchy and prestige in US ballet academies as social predictors of career success
- Yessica Herrera-Guzmán
- Alexander J. Gates
- Albert-László Barabási
Scientific Reports (2023)
Patterns of interest change in stack overflow
- Chenbo Fu
- Xinchen Yue
- Yong Min
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.