Main

Despite substantial international conservation efforts, diversity and abundance continue to decline within many of the world’s ecosystems1,7. Most conservation approaches aim to identify and protect places of high ecological integrity under minimal threat8. Yet, with escalating social and environmental drivers of change, conservation actions are also needed where people and nature coexist, especially where human effects are already severe9. Here, we highlight an approach for implementing conservation in coupled human–natural systems focused on identifying and learning from outliers—places that are performing substantially better than expected, given the socioeconomic and environmental conditions they are exposed to. By their very nature, outliers deviate from expectations, and consequently can provide novel insights into confronting complex problems where conventional solutions have failed. This type of positive deviance, or bright spot analysis has been used in fields such as business, health, and human development to uncover local actions and governance systems that work in the context of widespread failure10,11, and holds much promise in informing conservation.

To demonstrate this approach, we compiled data from 2,514 coral reefs in 46 countries, states, and territories (hereafter ‘nations/states’) and developed a Bayesian hierarchical model to generate expected conditions of how standing reef fish biomass (a key indicator of resource availability and ecosystem functions12) was related to 18 key environmental variables and socioeconomic drivers (Fig. 1; Extended Data Tables 1,2,3,4; Extended Data Figs 1,2,3; Methods). Drawing on a broad body of theoretical and empirical research in the social sciences13,14,15 and ecology2,6,16 on coupled human–natural systems, we quantified how reef fish biomass (Fig. 1a) was related to distal social drivers such as markets, affluence, governance, and population (Fig. 1b, c), while controlling for well-known environmental conditions such as depth, habitat, and productivity (Fig. 1d) (Extended Data Table 1; Methods). In contrast to many global studies of reef systems that are focused on demonstrating the severity of human effects6, our examination seeks to uncover potential policy levers by highlighting the relative role of specific social drivers. A key finding from our global analysis is that our metric of potential interactions with urban centres, called market gravity17 (Methods), more so than local or national population pressure, management, environmental conditions, or national socioeconomic context, had the strongest relationship with reef fish biomass (Fig. 1). Specifically, we found that reef fish biomass decreased as the size and accessibility of markets increased (Extended Data Fig. 1b). Somewhat counter-intuitively, fish biomass was higher in places with high local human population growth rates, probably reflecting human migration to areas of better environmental quality18—a phenomenon that could result in increased degradation at these sites over time. We found a strong positive, but less certain relationship (that is, a high standardized effect size, but only >75% of the posterior distribution above zero) with the Human Development Index, meaning that reefs tended to be in better condition in wealthier nations/states (Fig. 1c). Our analysis also confirmed the role that marine reserves can play in sustaining biomass on coral reefs, but only when compliance is high (Fig. 1b), reinforcing the importance of fostering compliance for reserves to be successful.

Figure 1: Global patterns and drivers of reef fish biomass.
figure 1

a, Reef fish biomass among 918 study sites. Points vary in size and colour proportional to the amount of fish biomass. bd, Standardized effect size of local-scale social drivers, nation/state-scale social drivers, and environmental covariates, respectively. Parameter estimates are Bayesian posterior median values, 95% uncertainty intervals (UI; thin lines), and 50% UI (thick lines). Black dots indicate that the 95% UI does not overlap 0; grey closed circles indicates that 75% of the posterior distribution lies to one side of 0; and grey open circles indicate that the 50% UI overlaps 0.

PowerPoint slide

Next, we identified 15 bright spots and 35 dark spots among the world’s coral reefs, defined as sites with biomass levels more than two standard deviations higher or lower than expectations from our global model, respectively (Fig. 2; Methods; Extended Data Table 5). Rather than simply identifying places in the best or worst condition, our bright spots approach reveals the places that most strongly defy expectations. Using them to inform the conservation discourse will certainly challenge established ideas of where and how conservation efforts should be focused. For example, remote places far from human impacts are conventionally considered near-pristine areas of high conservation value6, yet most of the bright spots we identified occur in fished, populated areas (Extended Data Table 5), some with biomass values below the global average. Alternatively, some remote places such as parts of the northwest Hawaiian Islands underperform (that is, were identified as dark spots).

Figure 2: Bright and dark spots among the world’s coral reefs.
figure 2

a, Each site’s deviation from expected biomass (y axis) along a gradient of nation/state mean biomass (x axis). The 50 sites with biomass values >2 standard deviations above or below expected values were considered bright (yellow) and dark (black) spots, respectively. Each grey vertical line represents a nation/state; those with bright or dark spots are labelled and numbered. There can be multiple bright or dark spots in each nation/state. b, Map highlighting bright and dark spots with large circles, and other sites in small circles. Numbers correspond to panel a.

PowerPoint slide

Detailed analysis of why bright spots can evade the fate of similar areas facing equivalent stresses will require a new research agenda gathering detailed site-level information on social and institutional conditions, technological innovations, external influences, and ecological processes19 that are simply not available in a global-scale analysis. As a hypothesis-generating exploration to begin uncovering why bright and dark spots may diverge from expectations, we surveyed data providers who sampled the sites and other experts with first-hand knowledge about the presence or absence of ten key social and environmental conditions at the 15 bright spots, 35 dark spots, and 14 average sites with biomass values closest to model expectations (see Methods and Supplementary Information for details). Our initial exploration revealed that bright spots were more likely to have high levels of local engagement in the management process, high dependence on coastal resources, and the presence of sociocultural governance institutions such as customary tenure or taboos (Fig. 3; Methods). For example, in one bright spot, Karkar Island, Papua New Guinea, resource use is restricted through an adaptive rotational harvest system based on ecological feedbacks, marine tenure that allows for the exclusion of fishers from outside the local village, and initiation rights that limit individuals’ entry into certain fisheries20. Bright spots were also generally proximate to deep water, which may help provide a refuge from disturbance for corals and fish21 (Fig. 3; Extended Data Fig. 4). Conversely, dark spots were distinguished by having fishing technologies allowing for more intensive exploitation, such as fish freezers and potentially destructive netting, as well as a recent history of environmental shocks (for example, coral bleaching or cyclone; Fig. 3). The latter is particularly worrisome in the context of climate change, which is likely to lead to increased coral bleaching and more intense cyclones22.

Figure 3: Differences in key social and environmental conditions between bright spots, dark spots, and ‘average’ sites. a, Social and institutional conditions; b, external- or donor-driven projects; c, technologies; d, environmental conditions.
figure 3

P values are determined using Fisher’s exact test. Intensive netting includes beach seine nets, surround gill nets, and muro-ami.

PowerPoint slide

Our global analyses highlight two novel opportunities to inform coral reef governance. The first is to use bright spots as agents of change to expand the conservation discourse from the current focus on protecting places under minimal threat8, towards harnessing lessons from places that have successfully confronted numerous or severe stressors.

Our bright spots approach can be used to inform the types of investments and governance structures that may help to create more sustainable pathways for impacted coral reefs. Specifically, our initial investigation highlights how investments that strengthen fisheries governance, particularly issues such as participation and property rights, could help communities to innovate in ways that allow them to defy expectations. Conversely, the more typical efforts to provide capture and storage infrastructure, particularly where there are environmental shocks and local-scale governance is weak, may lead to social–ecological traps23 that reinforce resource degradation beyond expectations. Effectively harnessing the potential to learn from both bright and dark spots will require scientists to increase research efforts in these places, NGOs to catalyse lessons from other areas, donors to start investing in novel solutions, and policy makers to ensure that governance structures foster flexible learning and experimentation. Indeed, bright spots may have much to offer in terms of how to creatively confront drivers of change and prioritize conservation actions. Likewise, dark spots can help identify development strategies to avoid. Critically, the bright spots we identified span the development spectrum from low to high income (for example, Solomon Islands and territories of the USA, respectively; Fig. 2), showing that lessons about effective reef management can emerge from diverse places.

A second opportunity stems from a renewed focus on managing the socioeconomic drivers that shape reef conditions. Many social drivers are amenable to governance interventions, and our comprehensive analysis (Fig. 1) suggests that an increased policy focus on social drivers such as markets and development could result in improvements to reef fish biomass. For example, given the important influence of markets in our analysis, reef managers, donor organizations, conservation groups, and coastal communities could improve sustainability by developing interventions that dampen the negative influence of markets on reef systems. A portfolio of market interventions, including eco-labelling and sustainable harvesting certifications, fisheries improvement projects, and value chain interventions have been developed within large-scale industrial fisheries to condition access to markets based on sustainable harvesting24,25. Although there is considerable scope for adapting these interventions to artisanal coral reef fisheries in both local and regional markets, effectively dampening the negative influence of markets may also require developing novel interventions that address the range of ways in which markets can lead to overexploitation. Existing research suggests that markets create incentives for overexploitation not only by affecting price and price variability for reef products26, but also by influencing people’s behaviour27,28, including their willingness to cooperate in the collective management of natural resources29.

The long-term viability of coral reefs will ultimately depend on international action to reduce carbon emissions22. However, fisheries remain a pervasive source of reef degradation, and effective local-level fisheries governance is crucial to sustaining ecological processes that give reefs the best chance of coping with global environmental change30. Seeking out and learning from bright spots is a novel approach to conservation that may offer insights into confronting the complex governance problems facing coupled human–natural systems such as coral reefs.

Methods

No statistical methods were used to predetermine sample size.

Scales of data

Our data were organized at three spatial scales:

  1. i

    Reef (n = 2,514). The smallest scale, which had an average of 2.4 surveys/transects.

  2. ii

    Site (a cluster of reefs; n = 918). We clustered reefs together that were within 4 km of each other, and used the centroid of these clusters to estimate site-level social and site-level environmental covariates (Extended Data Table 1). To make these clusters, we first estimated the linear distance between all reefs, then used a hierarchical analysis with the complete-linkage clustering technique based on the maximum distance between reefs. We set the cut-off at 4 km to select mutually exclusive sites where reefs cannot be more distant than 4 km. The choice of 4 km was informed by a 3-year study of the spatial movement patterns of artisanal coral reef fishers, corresponding to the highest density of fishing activities on reefs based on GPS-derived effort density maps of artisanal coral reef fishing activities31. This clustering analysis was carried out using the R functions hclust and cutree, resulting in an average of 2.7 reefs per site.

  3. iii

    Nation/state (nation, state, or territory; n = 46). A larger scale in our analysis was nation/state, which are jurisdictions that generally correspond to individual nations (but could also include states, territories, overseas regions, or extremely remote areas within a state such as the northwest Hawaiian Islands; Extended Data Table 2), within which sites and reefs were nested for analysis.

Estimating biomass

Reef fish biomass can reflect a broad selection of reef fish functioning and benthic conditions12,32,33,34, and is a key metric of resource availability for reef fisheries. Reef fish biomass estimates were based on instantaneous visual counts from 6,088 surveys collected from 2,514 reefs. All surveys used standard belt-transects, distance sampling, or point-counts, and were conducted between 2004 and 2013. Where data from multiple years were available from a single reef, we included only data from the year closest to 2010. Within each survey area, reef associated fishes were identified to species level, abundance counted, and total length (TL) estimated, with the exception of one data provider who measured biomass at the family level. To make estimates of biomass from these transect-level data comparable among studies, we:

  1. i

    Retained families that were consistently studied and were above a minimum size cut-off. Thus, we retained counts of >10-cm diurnally active, non-cryptic reef fish that are resident on the reef (20 families, 774 species), excluding sharks and semi-pelagic species. We also excluded three groups of fishes that are strongly associated with coral habitat conditions and are rarely targets for fisheries (Anthiinae, Chaetodontidae, and Cirrhitidae). Families included are: Acanthuridae, Balistidae, Diodontidae, Ephippidae, Haemulidae, Kyphosidae, Labridae, Lethrinidae, Lutjanidae, Monacanthidae, Mullidae, Nemipteridae, Pinguipedidae, Pomacanthidae, Serranidae, Siganidae, Sparidae, Synodontidae, Tetraodontidae and Zanclidae. We calculated the total biomass of fish on each reef using standard published species-level length–weight relationship parameters or those available on FishBase35. When length–weight relationship parameters were not available for a species, we used the parameters for a closely related species or genus.

  2. ii

    Directly accounted for depth and habitat as covariates in the model (see Environmental conditions section below).

  3. iii

    Accounted for any potential bias among data providers (capturing information on both inter-observer differences, and census methods) by including each data provider as a random effect in our model.

Biomass means, medians, and standard deviations were calculated at the reef-scale. All reported log values are the natural log.

Social drivers

Local population growth. We created a 100 km buffer around each site and used this to calculate human population within the buffer in 2000 and 2010 based on the Socioeconomic Data and Application Centre (SEDAC) gridded population of the world database36. Population growth was the proportional difference between the population in 2000 and 2010. We chose a 100 km buffer as a reasonable range at which many key human impacts from population (for example, land-use and nutrients) might affect reefs37.

Management. For each site, we determined if it was unfished, that is, whether it fell within the borders of a no-take marine reserve (we asked data providers to further classify whether the reserve had high or low levels of compliance); restricted—whether there were active restrictions on gears (for example, bans on the use of nets, spear guns, or traps) or fishing effort (which could have included areas inside marine parks that were not necessarily no take); or fished, that is, regularly fished without effective restrictions. To determine these classifications, we used the expert opinion of the data providers, and triangulated this with a global database of marine reserve boundaries38.

Gravity. We adapted the economic geography concept of ‘gravity’17,39,40,41, also called interactance42, to examine potential interactions between reefs and: (i) major urban centres/markets (defined as provincial capital cities, major population centres, landmark cities, national capitals, and ports); and (ii) the nearest human settlements. This application of the gravity concept infers that potential interactions increase with population size, but decay exponentially with the effective distance between two points. Thus, we gathered data on both population estimates and a surrogate for distance: travel time.

Population estimations. We gathered population estimates for: (i) the nearest major markets (which includes national capitals, provincial capitals, major population centres, ports, and landmark cities) using the World Cities base map from ESRI; and (ii) the nearest human settlement within a 500 km radius using LandScan 2011 database. The different data sets were required because the latter is available in raster format while the former is available as point data. We chose a 500 km radius from the nearest settlement as the maximum distance any non-market fishing activities for fresh reef fish are likely to occur.

Travel time calculation. Travel time was computed using a cost–distance algorithm that computes the least ‘cost’ (in minutes) of travelling between two locations on a regular raster grid. In our case, the two locations were either the centroid of the site (that is, reef cluster) and the nearest settlement, or the centroid of the site and the major market. The cost (that is, time) of travelling between the two locations was determined by using a raster grid of land cover and road networks with the cells containing values that represent the time required to travel across them43:

  • Tree cover, broadleaved, deciduous and evergreen, closed; regularly flooded tree cover, shrub, or herbaceous cover (fresh, saline, & brackish water) = speed of 1 km h−1

  • Tree cover, broadleaved, deciduous, open (open = 15–40% tree cover) = speed of 1.25 km h−1

  • Tree cover, needle-leaved, deciduous and evergreen, mixed leaf type; shrub cover, closed-open, deciduous and evergreen; herbaceous cover, closed-open; cultivated and managed areas; mosaic: cropland/tree cover/other natural vegetation, cropland/shrub or grass cover = speed of 1.5 km h−1

  • Mosaic: tree cover/other natural vegetation; tree cover, burnt = speed of 1.25 km h−1

  • Sparse herbaceous or sparse shrub cover = speed of 2.5 km h−1

  • Water = speed of 20 km h−1

  • Roads = speed of 60 km h−1

  • Track = speed of 30 km h−1

  • Artificial surfaces and associated areas = speed of 30 km h−1

  • Missing values = speed of 1.4 km h−1

We termed this raster grid a friction-surface (with the time required to travel across different types of surfaces analogous to different levels of friction). To develop the friction-surface, we used global data sets of road networks, land cover, and shorelines:

  • Road network data was extracted from the Vector Map Level 0 (VMap0) from the National Imagery and Mapping Agency’s (NIMA) Digital Chart of the World (DCW). We converted vector data from VMap0 to 1 km resolution raster.

  • Land cover data were extracted from the Global Land Cover 2000 (ref. 44).

  • To define the shorelines, we used the GSHHS (Global Self-consistent, Hierarchical, High-resolution Shoreline) database version 2.2.2.

These three friction components (road networks, land cover, and water bodies) were combined into a single friction surface with a Behrmann map projection. We calculated our cost-distance models in R45 using the accCost function of the gdistance package. The function uses Dijkstra’s algorithm to calculate least-cost distance between two cells on the grid and the associated distance taking into account obstacles and the local friction of the landscape46. Travel time estimates over a particular surface could be affected by the infrastructure (for example, road quality) and types of technology used (for example, types of boats). These types of data were not available at a global scale but could be important modifications in more localized studies.

Gravity computation. To compute the gravity to the nearest market, we calculated the population of the nearest major market and divided that by the squared travel time between the market and the site. Although other exponents can be used47, we used the squared distance (or in our case, travel time), which is relatively common in geography and economics. This decay function could be influenced by local considerations, such as infrastructure quality (for example, roads), the types of transport technology (that is, vessels being used), and fuel prices, which were not available in a comparable format for this global analysis, but could be important considerations in more localized adaptations of this study.

To determine the gravity of the nearest settlement, we located the nearest populated pixel within 500 km, determined the population of that pixel, and divided that by the squared travel time between that cell and the reef site.

As is standard practice in many agricultural economics studies48, an assumption in our study is that the nearest major capital or landmark city represents a market. Ideally we would have used a global database of all local and regional markets for coral reef fish, but this type of database is not available at a global scale. As a sensitivity analysis to help justify our assumption that capital and landmark cities were a reasonable proxy for reef fish markets, we tested a series of candidate models that predicted biomass based on: (1) cumulative gravity of all cities within 500 km; (2) gravity of the nearest city; (3) travel time to the nearest city; (4) population of the nearest city; (5) gravity to the nearest human population above 40 people km−2 (assumed to be a small peri-urban area and potential local market); (6) the travel time between the reef and a small peri-urban area; (7) the population size of the small peri-urban population; (8) gravity to the nearest human population above 75 people km−2 (assumed to be a large peri-urban area and potential market); (9) the travel time between the reef and this large peri-urban population; (10) the population size of this large peri-urban population; and (11) the total population size within a 500 km radius. Model selection revealed that the best two models were gravity of the nearest city and gravity of all cities within 500 km (with a 3 AIC value difference between them; Extended Data Table 3). Importantly, when looking at the individual components of gravity models, the travel time components all had a much lower AIC value than the population components, which is broadly consistent with previous systematic review studies49. Similarly, travel time to the nearest city had a lower AIC score than any aspect of either the peri-urban or urban measures. This suggests our use of capital and landmark cities is likely to better capture exploitation drivers from markets rather than simple population pressures. This may be because market dynamics are difficult to capture by population threshold estimates; for example some small provincial capitals where fish markets are located have very low population densities, while some larger population centres may not have a market. Downscaled regional or local analyses could attempt to use more detailed knowledge about fish markets, but we used the best proxy available at a global scale.

Human Development Index (HDI). HDI is a summary measure of human development encompassing: a long and healthy life, being knowledgeable, and having a decent standard of living. In cases where HDI values were not available specific to the State (for example, Florida and Hawaii), we used the national (for example, USA) HDI value.

Population size. For each nation/state, we determined the size of the human population. Data were derived mainly from census reports, the CIA fact book, and Wikipedia.

Tourism. We examined tourist arrivals relative to the nation/state population size (above). Tourism arrivals were gathered primarily from the World Tourism Organization’s Compendium of Tourism Statistics.

National reef fish landings. Catch data were obtained from the Sea Around Us Project (SAUP) catch database (http://www.seaaroundus.org), except for Florida, which was not reported separately in the database. We identified 200 reef fish species and taxon groups in the SAUP catch database50. Note that reef-associated pelagics such as scombrids and carangids normally form part of reef fish catches. However, we chose not to include these species because they are also targeted and caught in large amounts by large-scale, non-reef operations.

Voice and accountability. This metric, from the World Bank survey on governance, reflects the perceptions of the extent to which a country’s citizens are able to participate in selecting their government, as well as freedom of expression, freedom of association, and a free media. In cases where governance values were not available specific to the nation/state (for example, Florida and Hawaii), we used national (for example, USA) values.

Environmental drivers

Depth. The depth of reef surveys were grouped into the following categories: <4 m, 4–10 m, >10 m to account for broad differences in reef fish community structure attributable to a number of inter-linked depth-related factors. Categories were necessary to standardise methods used by data providers and were determined by pre-existing categories used by several data providers.

Habitat. We included the following habitat categories:

  1. i

    Slope. The reef slope habitat is typically on the ocean side of a reef, where the reef slopes down into deeper water.

  2. ii

    Crest. The reef crest habitat is the section that joins a reef slope to the reef flat. The zone is typified by high wave energy (that is, where the waves break). It is also typified by a change in the angle of the reef from an inclined slope to a horizontal reef flat.

  3. iii

    Flat. The reef flat habitat is typically horizontal and extends back from the reef crest for 10’s to 100’s of metres;

  4. iv

    Lagoon/back reef. Lagoon reef habitats are where the continuous reef flat breaks up into more patchy reef environments sheltered from wave energy. These habitats can be behind barrier/fringing reefs or within atolls. Back reef habitats are similar broken habitats where the wave energy does not typically reach the reefs and thus forms a less continuous ‘lagoon style’ reef habitat. Due to minimal representation among our sample, we excluded other less prevalent habitat types, such as channels and banks. To verify the sites’ habitat information, we used the Millennium Coral Reef Mapping Project (MCRMP) hierarchical data51, Google Earth, and site depth information.

Productivity. We examined ocean productivity for each of our sites in mg of C per m2 per day (http://www.science.oregonstate.edu/ocean.productivity/). Using the monthly data for years 2005 to 2010 (in hdf format), we imported and converted those data into ArcGIS. We then calculated yearly average and finally an average for all these years. We used a 100 km buffer around each of our sites and examined the average productivity within that radius. Note that ocean productivity estimates are less accurate for near-shore environments, but we used the best available data.

Analyses

We first looked for collinearity among our covariates using bivariate correlations and variance inflation factor estimates (Extended Data Fig. 2 and Extended Data Table 4). This led to the exclusion of several covariates (not described above): (i) geographic basin (tropical Atlantic, western Indo-Pacific, central Indo-Pacific, or eastern Indo-Pacific); (ii) gross domestic product (purchasing power parity); (iii) rule of law (World Bank governance index); (iv) control of corruption (World Bank governance index); and (v) sedimentation. Additionally, we removed an index of climate stress, developed by Maina et al.52, which incorporated 11 different environmental conditions, such as the mean and variability of sea surface temperature due to repeated lack of convergence for this parameter in the model, likely indicative of unidentified multicollinearity. All other covariates had correlation coefficients 0.7 or less and variance inflation factor scores less than 5 (indicating multicollinearity was not a serious concern). Care must be taken in causal attribution of covariates that were significant in our model, but demonstrated collinearity with candidate covariates that were removed during the aforementioned process. Importantly, the covariate that exhibited the largest effect size in our model, market gravity, was not strongly collinear with other candidate covariates.

To quantify the multi-scale social, environmental, and economic factors affecting reef fish biomass we adopted a Bayesian hierarchical modelling approach that explicitly recognized the three scales of spatial organization: reef (j), site (k), and nation/state (s).

In adopting the Bayesian approach we developed two models for inference: a null model, consisting only of the hierarchical units of observation (that is, intercepts-only) and a full model that included all of our covariates (drivers) of interest. Covariates were entered into the model at the relevant scale, leading to a hierarchical model whereby lower-level intercepts (averages) were placed in the context of higher-level covariates in which they were nested. We used the null model as a baseline against which we could ensure that our full model performed better than a model with no covariate information. We did not remove ‘non-significant’ covariates from the model because each covariate was carefully considered for inclusion and could therefore reasonably be considered as having an effect, even if small or uncertain; removing factors from the model is equivalent to fixing parameter estimates at exactly zero—a highly-subjective modelling decision after covariates have already been selected as potentially important53.

The full model assumed the observed, reef-scale observations of fish biomass (yijks) were modelled using a non-central t distribution, allowing for fatter tails than typical log-normal models of reef fish biomass32. We chose the non-central t after having initially used a log-normal model because our model diagnostics suggested that several model parameters had not converged. We ran a supplementary analysis to support our use of the non-central t distribution with 3.5 degrees of freedom (see Supplementary Information). Therefore our model was:

with Xreef representing the matrix of observed reef-scale covariates and βreef array of estimated reef-scale parameters. The τreef (and all subsequent τ values) were assumed common across observations in the final model and were minimally informative53. Using a similar structure, the reef-scale intercepts (β0jks) were structured as a function of site-scale covariates (Xsit):

with γsit representing an array of site-scale parameters. Building upon the hierarchy, the site-scale intercepts (γ0ks) were structured as a function of state-scale covariates (Xsta):

Finally, at the top scale of the analysis we allowed for a global (overall) estimate of average log-biomass (γ0):

The relationships between fish biomass and reef, site, and state-scale drivers was carried out using the PyMC package54 for the Python programming language, using a Metropolis-Hastings (MH) sampler run for 106 iterations, with a 900,000 iteration burn-in thinned by 10, leaving 10,000 samples in the posterior distribution of each parameter; these long burn-in times are often required with a complex model using the MH algorithm. Convergence was monitored by examining posterior chains and distributions for stability and by running multiple chains from different starting points and checking for convergence using Gelman–Rubin statistics55 for parameters across multiple chains; all were at or close to 1, indicating good convergence of parameters across multiple chains.

Overall model fit

We conducted posterior predictive checks for goodness of fit (GoF) using Bayesian P values43 (BpV), whereby fit was assessed by the discrepancy between observed or simulated data and their expected values. To do this we simulated new data (yinew) by sampling from the joint posterior of our model (θ) and calculated the Freeman–Tukey measure of discrepancy for the observed (yiobs) or simulated data, given their expected values (μi):

yielding two arrays of median discrepancies D(yobs/θ) and D(ynew/θ) that were then used to calculate a BpV for our model by recording the proportion of times D(yobs/θ) was greater than D(ynew/θ) (Extended Data Fig. 3a). A BpV above 0.975 or under 0.025 provides substantial evidence for lack of model fit. Evaluated by the deviance information criterion (DIC), the full model greatly outperformed a null model that included no covariates (ΔDIC = 472).

To examine homoscedasticity, we checked residuals against fitted values. We also checked the residuals against all covariates included in the model, and several covariates that were not included in the model (primarily due to collinearity), including: (i) Atoll, a binary metric of whether the reef was on an atoll or not; (ii) control of corruption, perceptions of the extent to which public power is exercised for private gain, including both petty and grand forms of corruption, as well as ‘capture’ of the state by elites and private interests, derived from the World Bank survey on governance; (iii) geographic basin, whether the site was in the tropical Atlantic, western Indo-Pacific, central Indo-Pacific, or eastern Indo-Pacific; (iv) connectivity, we examined three measures based on the area of coral reef within a 30 km, 100 km, and 600 km radius of the site; (v) sedimentation; (vi) coral cover (which was only available for a subset of the sites); (vii) climate stress52; and (viii) census method. The model residuals showed no patterns with these eight additional covariates, suggesting they would not explain additional information in our model.

Bright and dark spot estimates

Because the performance of site scale locations are of substantial interest in uncovering novel solutions for reef conservation, we defined bright and dark spots at the site scale. To this end, we defined bright (or dark) spots as locations where expected site-scale intercepts (γ0ks) differed by more than two standard deviations from their nation/state-scale expected value (μks), given all the covariates present in the full hierarchical model:

This, in effect, probabilistically identified the most deviant sites, given the model, while shrinking sites towards their group-level means, thereby allowing us to overcome potential bias due to low and varying sample sizes that can lead to extreme values from chance alone. After an initial log-normal model formulation, where we were not confident in model convergence, we employed a non-central t distribution at the observation scale, which facilitated model convergence and dampened any effects of potentially extreme reef-scale observations on the bright and dark spot estimates. Further, we did not consider a site a bright or dark spot if the group-level (that is, nation/state) mean included fewer than five sites.

Analysing conditions at bright spots

For our preliminary exploration into why bright and dark spots may diverge from expectations, we surveyed data providers and other experts about key social, institutional, and environmental conditions at the 15 bright spots, 35 dark spots, and 14 sites that performed most closely to model specifications. Specifically, we developed an online survey (SI) using Survey Monkey (http://www.surveymonkey.com) software, which we asked data providers who sampled those sites to complete with input from local experts, where necessary. Data providers generally filled in the survey in consultation with nationally based field team members who had detailed local knowledge of the socioeconomic and environmental conditions at each of the sites. Research on bright spots in agricultural development19 highlights several types of social and environmental conditions that may lead to bright spots, which we adapted and developed proxies for as the basis of our survey into why our bright and dark spots may diverge from expectations. These include:

  1. i

    Social and institutional conditions. We examined the presence of customary management institutions such as taboos and marine tenure institutions, whether there was substantial engagement by local people in management (specifically defined as there being active engagement by local people in reef management decisions; token involvement and consultation were not considered substantial engagement), and whether there were high levels of dependence on marine resources (specifically, whether a majority of local residents depend on reef fish as a primary source of food or income). All social and institutional conditions were converted to presence/absence data. Dependence on resources and engagement were limited to sites that had adjacent human populations. All other conditions were recorded regardless of whether there is an adjacent community.

  2. ii

    Technological use/innovation. We examined the presence of motorized vessels, intensive capture equipment (such as beach seine nets, surround gill nets, and muro-ami nets), and storage capacity (that is, freezers).

  3. iii

    External influences (such as donor-driven projects). We examined the presence of NGOs, fishery development projects, development initiatives (such as alternative livelihoods), and fisheries improvement projects. All external influences were recorded as present/absent then summarized into a single index of whether external projects were occurring at the site.

  4. iv

    Environmental/ecological processes (for example, recruitment and connectivity). We examined whether sites were within 5 km of mangroves and deep-water refuges, and whether there had been any major environmental disturbances such as coral bleaching, tsunami, and cyclones within the past 5 years. All environmental conditions were recorded as present/absent.

As an exploratory analysis of associations between these conditions and whether sites diverged more or less from expectations, we used two complementary approaches. The link between the presence/absence of the aforementioned conditions and whether a site was bright, average, or dark was assessed using a Fisher’s exact test. Then we tested whether the mean deviation in fish biomass from expected was similar between sites with presence or absence of the mechanisms in question (that is, the presence or absence of marine tenure/taboos) using an ANOVA assuming unequal variance. The two tests yielded similar results, but provide slightly different ways to conceptualize the issue, the former is correlative while the latter explains deviation from expectations based on conditions, so we provide both (Fig. 3 and Extended Data Fig. 4). It is important to note that some of these social and environmental conditions were significantly associated (that is, Fisher’s exact probabilities <0.05), and further research is required to uncover how these and other conditions may make sites bright or dark.