Main

To further our understanding of this galaxy cluster, particularly the properties of its member galaxies, we undertook a series of observations of XLSSC 122 with the Hubble Space telescope (HST) Wide Field Camera 3 (WFC3). We obtained images of the cluster in two wavebands, F105W and F140W, and performed low-spectral-resolution slitless spectroscopy using the G141 grism (see Methods). These observations cover the observed frame wavelength interval 1.0–1.7 μm, corresponding to an interval of 0.33 μm to 0.57 μm in the rest frame of a galaxy at redshift z = 2. Figure 1 displays the F140W image of XLSSC 122 and shows a compact cluster of galaxies associated with the extended X-ray-emitting region.

Fig. 1: HST image of the galaxy cluster XLSSC 122.
figure 1

The greyscale is the F140W image. Contours display X-ray emission corresponding to the 100-ks XMM-Newton image presented in ref. 4. The dashed circle is drawn with a radius equal to the measured value of r500 (the radius within which the average matter density is 500 times the critical density of the Universe). Spectroscopic ‘gold’ and ‘silver’ members (see Methods) of the z = 1.98 cluster are indicated by red and green circles, respectively. Members of the z = 1.93 foreground structure are indicated by blue circles. See text for further details.

We extracted one-dimensional spectra of all galaxies identified within the dispersed G141 grism image of the field (see Methods) and computed redshifts using a galaxy template-fitting algorithm with redshift as a free parameter. Figure 2 displays the histogram of galaxy redshifts in the field of XLSSC 122 over the restricted interval 1.9 < z < 2.05. Inspection of this interval reveals a primary peak at z = 1.98 associated with the central, red galaxies closest to the X-ray peak, and a secondary redshift peak at z = 1.93 associated with a mixture of red and blue galaxies, located at larger projected cluster-centric distances (see Fig. 1). The line-of-sight separation between z = 1.93 and z = 1.98 is 76 co-moving megaparsecs, far larger than the size of the XLSSC 122 cluster, and the two structures are therefore physically distinct. As outlined in the Methods, we identify 37 galaxies as being members of the cluster and a further 13 galaxies identified as members of the foreground structure.

Fig. 2: Redshift histogram of all galaxies along the line of sight to XLSSC 122.
figure 2

The histogram considers galaxies satisfying the magnitude measurement F140WKron < 24. Galaxies classified as ‘gold’ members of the z = 1.98 cluster are shown in red, ‘silver’ members are shown in green and members of the z = 1.93 structure are shown in blue. Galaxies not classified as a member of either the z = 1.98 cluster or the z = 1.93 structure are shown in grey. The vertical dashed lines show the unweighted mean redshift of both the cluster and the foreground structure (see text for further details).

We performed photometry of all galaxies within the HST field of view in both the F105W and F140W images (see Methods) and summarize this information in Fig. 3. The galaxies identified at 1.9 < z < 2.05 form a clear bimodal distribution in colour with a well populated sequence of red galaxies (corresponding to larger values of F105W − F140W) clearly separated from a broader distribution of blue galaxies.

Fig. 3: Colour–magnitude diagram of all galaxies within the HST/WFC3 field of view.
figure 3

Spectroscopically confirmed z = 1.98 ‘gold’ and ‘silver’ cluster members are indicated as red squares and green triangles, respectively. Members of the z = 1.93 structure are indicated as blue squares. Galaxies at z ≈ 2 yet which are not formal cluster members are shown as solid black squares, whereas potentially contaminated or confused spectroscopic sources are shown as open black squares. Galaxies with visually classified emission lines are marked using black circles (only z = 1.98 and z = 1.93 are marked in this manner). All other galaxies in the field are indicated by grey squares. Error bars indicate the 1-sigma measurement uncertainty. The spectroscopic completeness limits of F140WKron = 24 and 24.5 are indicated by the vertical dashed and dotted lines, respectively. The horizontal dot-dashed line shows the lower colour limit for a source to be considered on the cluster’s red sequence. The angled solid line indicates a simple least-squares fit to the colour–magnitude relation for red-sequence cluster members. Subscript ‘ap’ indicates that the magnitudes of these objects are measured within an aperture of fixed angular size as opposed to a flexible aperture, indicated by subscript ‘Kron’.

Interpreting this red sequence as representing a restricted locus of star-formation histories, Bower, Lucey and Ellis5 employed B − V (the astronomical magnitude difference between a blue and a visual filter) photometry of red-sequence galaxies in the Virgo and Coma clusters to constrain the dispersion of stellar ages in their member galaxies. The F105W and F140W photometry obtained for XLSSC 122 at z = 2 spans almost exactly the age-sensitive break feature at wavelength 4,000 Å in the member galaxy rest-frame spectral energy distributions (SEDs) and permits a similar analysis.

We therefore employed the F105W and F140W photometry of red-sequence galaxies to constrain the posterior distributions of luminosity-weighted stellar age and stellar mass for a set of synthetic stellar population models (see Methods). Computing the product of these posterior distributions generates a mean posterior on the luminosity-weighted stellar age of the red-sequence cluster members (Fig. 4), the details of which are presented in Table 1.

Fig. 4: The luminosity-weighted age distribution of stars within red-sequence cluster galaxies.
figure 4

The lines depict mean tw posteriors for 19 ‘gold’ z = 1.98 cluster red-sequence members for each of the three SED models characterized by AV = 0.0 (solid), AV = 0.3 (dashed) and AV = 0.5 (dotted). The vertical dashed line indicates the age of the Universe at z = 1.98 for the assumed cosmological model.

Table 1 The mean luminosity-weighted stellar ages of red-sequence cluster galaxies

Our analysis assumes that the tight correlation of colour on the red sequence arises from scatter in age at fixed metallicity and internal dust absorption. Assuming no dust absorption (AV = 0.0) we determine a mean red-sequence luminosity-weighted stellar age of 2.98 billion years (Gyr), corresponding to a redshift marking the onset of star formation of 12. This value is consistent with the inferred formation redshifts of the earliest observations of star formation in the Universe6. We also consider dust absorption characterized by AV = 0.3 and AV = 0.5, which generate lower mean stellar ages and greater dispersion of the mean age. Despite the uncertainties that govern a number of the assumptions in such stellar population analyses, the main conclusion of this analysis is that red-sequence galaxies in XLSSC 122 are composed of stars of uniformly old age. When combined with the already large look-back time to this cluster, it is clear that star formation occurred in these galaxies in a coordinated manner at early times.

Analysis of the X-ray-emitting gas in XLSSC 122 provides an alternative perspective on the formation history of the galaxy cluster as a whole. Using standard theory7, one may combine the X-ray gas temperature (kBT = 5 keV, where kB is the Boltzmann constant) and an estimate of the virial radius of the cluster (r200 = 440 kpc, ref. 4) to obtain a sound-crossing time for XLCCS 122 of 3.3 × 108 yr. Hydrodynamical simulations of the gas physics in a forming cluster indicate that structures typically achieve virial equilibrium following a minimum of 2 to 3 sound-crossing timescales8. This indicates that XLSSC 122 is unlikely to have assembled earlier than 1 Gyr before the epoch of observation, equivalent to a redshift of 2.8. Although this argument does not place an upper limit on the elapsed time between assembly and virialization, the assembly of a cluster of mass equal to XLSSC 122 at redshifts greater than 3 appears unlikely9,10. It is clear, therefore, even allowing for the uncertainty present in these estimates, that coordinated star formation in the member galaxies located in XLSSC 122 preceded the assembly of the cluster environment.

Computer simulations of the accretion history of massive, gravitationally bound halos in an expanding Universe indicate that it is likely that XLSSC 122 will evolve with time into a present-day galaxy cluster comparable in mass to that of Coma, that is, about 1 × 1015 solar masses9,10,11. Although caution is required in both the interpretation of the scatter in the accretion histories of halos of fixed total mass and the more subtle point of whether XLSSC 122 represents a typical galaxy cluster at z = 2 or is perhaps an extreme case, the conclusion remains robust that this system will continue to grow in mass until it becomes a massive galaxy cluster in the present-day Universe.

The recent discovery of SPT2349-56, a massive proto-cluster of galaxies at a redshift of 4.3 (ref. 12), provides a further, tantalising, glimpse of the kind of structure from which XLSSC 122 may have evolved. The same structure growth simulations that predict the future evolution of massive halos can also be used to infer their likely past accretion histories. Even taking into account the caveats expressed above, such simulations indicate that structures such as SPT2349, XLSSC 122 and Coma may represent similar clusters viewed at very different cosmic epochs. From such studies we are beginning to achieve a coherent view of the formation and evolution of the largest gravitationally bound structures in the Universe.

Methods

We assume a Lambda cold dark matter (CDM) cosmological model described by the parameters ΩM = 0.286, ΩΛ = 0.714, H0 = 69.6 km s−1 Mpc−1 (ref. 13). The present-day age of the Universe in this model is 13.72 Gyr. All magnitude information is presented using the AB system.

The HST observations were obtained between 4 November 2017 and 13 January 2018 and comprised one orbit in F105W and 12 orbits in the F140W+G141 filter and grism combination. The 12 F140W+G141 orbits were split into three orbits at each of four orientations using an ABB BBA pattern for exposures in each orbit. The total exposure times in F105W, F140W and G141 were, respectively, 2,612 s, 5,171 s and 26,541 s.

The imaging and spectroscopic observations were reduced with Grizli version 0.3.0 (ref. 14). Raw HST data products were processed by applying standard image-calibration techniques with additional corrections applied for variable backgrounds (the HST reduction pipeline calwf3 assumes a constant background not appropriate for WFC3 infrared observations) and to mask artefacts such as satellite trail features15,16. Relative and absolute astrometric registration was achieved by aligning to reference sources in the Sloan Digital Sky Survey. The final steps included flat fielding and master background subtraction for both the direct and grism images, and drizzling of the individual data frames to produce stacked images.

Reduced data were processed with SExtractor (version 2.5.0; www.astromatic.net/software/sextractor) to generate photometric catalogues. The F140W image was processed using standard WFC3 zero point information with the gain parameter set to the image exposure time. Source detection used a pixel-based inverse variance weighting (pipeline IVM file in SExtractor), whereas source photometry employed a root mean square (pipeline RMS file in SExtractor) variation per pixel weight. The F105W image was processed employing the SExtractor two-image mode with the F140W image used as the detection image. Source fluxes and AB magnitudes were computed within two apertures: a 0.8-arcsecond circular aperture17,18 and an elliptical aperture based upon the Kron radius (a statistical moment computed from the surface brightness distribution in each object) with the Kron factor set to k = 0.8 to avoid excessive source blending in the central cluster regions. Sources with a half-light radius of <0.22 arcseconds were classified as stellar. In the following analysis we consider sources brighter than F140W = 25.5, corresponding to an image signal-to-noise ratio (SNR) >10.

Spectral extraction from the G141 images employed the F140W segmentation map produced by SExtractor (see above) to identify undispersed source positions. These source positions were then employed to construct a full field contamination model of each G141 image. The contamination model initially assumes a spectrally flat continuum for all sources brighter than 25th magnitude in F140W. This provides a first-pass estimate of those pixels contaminated by spectra from more than one source. Spectral traces, represented by 2nd-order polynomial functions, were fitted to all of the above bright sources in each exposure at each orientation. Extracted spectra for these sources were then employed to compute a second-pass contamination model. The model was further refined for 26 bright objects, which were identified as contaminating the spectra of bright red-sequence galaxies. In these cases synthetic stellar population models were fitted to the contaminating spectra and these updated spectral models were propagated to the global contamination model. Employing the above procedures, and with the G141 observations split into four orientations, we were able to obtain a satisfactory contamination model for most sources even in such a densely packed field.

Two-dimensional spectra were extracted separately for each G141 exposure and resulted in a maximum of 48 spectral extractions per source. These spectra were optimally extracted19 and simultaneously fitted with a suite of galaxy templates20. The templates were stepped in redshift over a coarse (∆z = 0.01) grid from z = 0.2–4.0 and subsequently refitted over a fine grid (∆z = 0.0004) in redshift around peaks in the probability distribution function.

We define the SNR of each spectrum as the average spectral flux per pixel divided by the pipeline-computed noise per pixel integrated over the wavelength interval 1.3–1.55 μm. A galaxy of brightness F140WKron = 24 typically generates a spectrum of SNR = 5 with a scatter consistent with random noise. We inspected visually all spectra displaying a spectral SNR >2 to assess the reliability of the fitted redshift and template model. We concluded that all spectra displaying SNR ≥ 5 possess a visually reliable redshift measurement and consequently we employ F140WKron = 24 as the galaxy brightness corresponding to our spectroscopic completeness limit. Furthermore, we determined that sources with visually identified emission lines possess a reliable redshift to a limit of SNR ≥ 3, corresponding to a brightness F140WKron = 24.5, which we adopt for our spectroscopic completeness limit for emission line sources. Extended Data Fig. 1 shows two examples of extracted grism spectra.

Galaxy membership of the z = 1.98 cluster was defined according to a number of criteria that we describe below. We define ‘gold’ members as those displaying F140WKron = 24 (24.5 for emission line sources) and Pmem > 0.5 where Pmem is defined as the integral of the redshift probability distribution function for each galaxy over the interval 1.96 < z < 2.00. This interval corresponds to zcluster ± 3σz where σz is the observed frame velocity dispersion of a 5-keV galaxy cluster expressed in redshift space21. There are 33 galaxies in this class (of which four are emission line sources with 24 < F140WKron ≤ 24.5). We compute the redshift of XLSSC 122 as the unweighted mean of the ‘gold’ cluster member redshifts. The redshift is z = 1.978 ± 0.010. We define ‘silver’ members as those displaying 0.1 < Pmem < 0.5—a change that adds four new members (one of which is on the red sequence)—for a total of 37. Finally, we create an additional class to identify members of the z = 1.93 foreground structure as those displaying Pmem > 0.5, where Pmem is defined as the integral of the redshift probability distribution function for each galaxy over the interval 1.91 < z < 1.95 with the same brightness limits as before. There are 13 galaxies in this class. We compute the redshift of this structure as the unweighted mean redshift of these 13 galaxies. The resulting structure redshift is z = 1.934 ± 0.007. This analysis therefore identifies a total of 50 galaxies that are members of either XLSSC 122 or the z = 1.93 structure (see Extended Data Table 1 and Fig. 1 for these members plotted on the greyscale HST/WFC3 image).

Figure 3 shows the colour–magnitude diagram for all galaxies identified within the HST field. We identify a total of 30 red-sequence members according to 1.15 < F105Wap − F140Wap < 1.65 and F140WKron < 24. Of these, 19 are defined as ‘gold’ cluster members as described above. Of the remaining 11 galaxies, one is a silver cluster member, one is located at z ≈ 2 with a relatively broad redshift probability distribution function, five are located within the z = 1.93 structure and four are located at z > 2 yet display spectra affected by source confusion and contamination. We restrict our subsequent red-sequence analysis to the 19 ‘gold’ cluster members. An unweighted, linear, least-squares fit to the red-sequence members generates the angled dotted line shown in Fig. 3. The root-mean-square deviation in colour about this line normalized by the photometric error is 1.72, that is, the observed scatter is 72% larger than expected from the computed colour errors.

At redshifts z < 1, the dominant populations of evolved, red galaxies are interpreted to be the result of the prompt suppression of star formation within galaxies accreting into the cluster environment22. The details of this process, euphemistically referred to as ‘quenching’, remain uncertain, with likely physical scenarios including the ram pressure stripping of gas from galaxies falling through the hot, X-ray-emitting, intra-cluster medium23,24. The exact mass scale at which quenching occurs is also debatable, with uncertainty as to whether the suppression of star formation occurs as galaxies are accreted into less massive groups before encountering more massive clusters25. With a quenched fraction of 0.51 ± 0.14 at a look-back time of 10.4 Gyr for XLSSC 122, it is clear that the physical processes involved in quenching were established at an even earlier cosmic epoch.

We employ the F105W and F140W photometry of red-sequence galaxies to constrain the stellar age, star formation rate and stellar mass of a set of synthetic stellar population models. The analysis presented in this paper intentionally follows that performed by Andreon et al.26 and Newman et al.27 of galaxies within the cluster JKCS 041 at z = 1.8. This was done in order to allow as direct a comparison as possible between galaxy populations in two high redshift clusters, albeit observed in different photometric filters.

We employ a grid of simple stellar population models28 to generate synthetic F105W and F140W photometry. The likelihood of the model photometry given the data is expressed as L = exp(−χ2/2), where:

$${\chi }^{2}=\sum _{i}{\left(\frac{{D}_{i}-{M}_{i}}{{\sigma }_{i}}\right)}^{2}$$

and Di represents the measured apparent magnitude in the ith filter, Mi is the apparent magnitude computed from the stellar population model and σi is the uncertainty in the measured magnitude. The stellar population models are characterized by an exponentially declining burst of star formation where the star formation rate SFR exp(−t/τ). The variable t denotes the time since the burst commenced and τ is the e-folding time. Models are further characterized by a Salpeter29 initial mass function and solar metallicity. Gas lost during stellar evolution is not recycled and the effects of dust are included by applying a Calzetti attenuation law30 parameterized by the extinction parameter AV at 5,500 Å. The stellar population model photometry is normalized per unit stellar mass and is scaled by a total stellar mass variable, Mstar.

The stellar population model grid spans 8 < log[t (yr)] < 9.7 and 8 < log[τ (yr)] < 9.7. We compute posterior distributions in logt, logτ and logMstar, employing a Markov chain Monte Carlo algorithm and assuming flat priors. We do not explore the AV posterior explicitly at this stage. Instead we compute the posterior distributions of the above variables at three explicit values of AV (0.0, 0.3 and 0.5). Finally, we compute the average luminosity-weighted stellar population age, following refs. 26,31, as:

$${t}_{{\rm{w}}}=\frac{t}{1-{{\rm{e}}}^{-t/\tau }}-\tau $$

The posterior distributions of tw and Mstar are displayed in Extended Data Fig. 2 for the 19 ‘gold’ cluster members. Extended Data Fig. 3 displays the one-dimensional posterior distribution in tw for each cluster member having marginalized over Mstar. In addition, Fig. 4 compares the average tw posterior for all cluster members, computed as the product of individual posteriors, for each of the three dust models, AV = 0.0, 0.3 and 0.5. Values of mean luminosity-weighted stellar age and standard deviation are listed in Table 1. For the canonical model employing zero dust absorption the mean stellar age of 2.98 Gyr at z = 1.98 corresponds to a mean star formation redshift of z = 12.0.

The spread of stellar age values in XLSSC 122 overlaps with those determined for the galaxy cluster JKCS 041 at z = 1.803 (ref. 26), yet the mean stellar age in XLSSC 122 is older, even though the Universe is 0.32 Gyr younger at z = 1.98 compared to z = 1.8. Although this comparison employs the same analysis methodology, the two clusters are observed using different photometric filters, while the clusters themselves may represent very different structures. It is instructive therefore to further compare our results to the study of Strazzullo et al.32 who also analysed HST WFC3 F105W and F140W photometry for the z = 2 cluster CL1449+085633. Applying a stellar population model characterized by a short (0.25 Gyr) burst of metal-rich (150% the solar value) star formation, they obtain a typical formation redshift of 3 to 5 for galaxies of similar colour and redshift to those analysed in XLSSC 122. Applying a similar model to the data for XLSSC 122, we obtain a typical stellar population age of 1.4 Gyr, corresponding to formation redshift of 3.3, in agreement with ref. 32. Ultimately, we consider the assumption of a short, metal-rich burst of star formation to be unnecessarily restrictive given the considerable uncertainty regarding the exact physical state of these high-redshift stellar populations and adopt a more flexible approach as outlined in this paper. Overall however, the comparison is instructive because it highlights the key influence of the assumptions governing the stellar population model upon the inferred formation redshift of the luminosity weighted stellar content of the cluster member galaxies. The acquisition of further data, in particular concerning the dust and metal content of the member galaxies of these high-redshift clusters provides a clear observational route to resolving such issues. We therefore emphasize in conclusion that the results of such stellar population modelling, when based upon broad-band photometry, are most conservatively interpreted as indicating the range of physically reasonable input parameters and not as indicating a definitive physical state of the stellar population.