Main

Gaia is a European Space Agency (ESA) mission that was designed primarily to investigate the origin, evolution and structure of the Milky Way, and has recently delivered the largest and most precise census of positions, velocities and other stellar properties of more than a billion stars. By exploring the phase space (positions and velocities) of more than six million stars within a few kiloparsecs of the Sun in the Galactic disk from Gaia data release 2 (DR2; see Methods)19, we find that certain phase-space projections (Figs. 1a, 2) have many substructures that had not been predicted by existing models. These substructures had remained blurred until now, owing to the limitations on the number of stars in and the precision of previously available datasets.

Fig. 1: Vertical positions and velocities of the stars.
figure 1

The plots show the distribution of stars in the vertical position–velocity (ZVZ) plane from our sample of Gaia data for stars with Galactocentric radii of 8.24 kpc < R < 8.44 kpc. a, Two-dimensional histogram in bins of ΔZ = 0.02 kpc and ΔVZ = 1 km s−1, with the darkness of the colour scale proportional to the number of stars. b, ZVZ plane coloured as a function of median radial velocity VR in bins of ΔZ = 0.02 kpc and ΔVZ = 1 km s−1. c, Same as b, but for the azimuthal velocity Vϕ. VR and Vϕ are positive towards the Galactic anticentre and the direction of Galactic rotation, respectively.

Fig. 2: Positions and velocities of the stars in the disk plane.
figure 2

Distribution of azimuthal velocities Vϕ as a function of Galactocentric radius R for all stars in our sample with six-dimensional phase space-coordinates from Gaia DR2, shown as a two-dimensional histogram in bins of ΔVϕ = 1 km s−1 and ΔR = 0.01 kpc.

In Fig. 1a we show the projection of phase space in vertical position and velocity, ZVZ. The stars follow a curled, spiral-shaped distribution, the density of which increases towards the leading edge of the spiral. Figure 1b, c demonstrates that this ‘snail shell’ pattern is still present when the stars are colour-coded according to their radial and azimuthal velocities, VR and Vϕ, which implies a strong correlation between the vertical and in-plane motions of the stars. The pattern is particularly pronounced in the case of Vϕ (Fig. 1c), even up to VZ ≈ 40 km s−1. Furthermore, we see a gradient of different values of Vϕ across the spiral shape, which follows the density variations. Details of the relationship between the snail shell and the other velocity features observed in the solar neighbourhood are shown in Extended Data Fig. 1.

The spiral shape in Fig. 1a is clearly reminiscent of the effects of phase mixing in two dimensions that have been discussed in several areas of astrophysics20,21,22 and in quantum physics23, but never in the context of dynamical models of the Galactic disk. This process can be better understood by using a toy model. Consider a Galaxy model in which the vertical potential of the Galaxy can be approximated by an anharmonic oscillator (see equation (1) in Methods). In this approximation, the vertical frequencies of oscillation ν depend on the amplitude of the oscillation A and the Galactocentric radius R, to first order22 (see equation (2) in Methods). By assuming that stars follow a simple harmonic oscillation with these frequencies, their movement over time t is described by Z = Acos[ϕ(t)] and VZ = −sin[ϕ(t)] (where ϕ(t) is the orbital phase), which traces an oval shape in the clockwise direction in the ZVZ projection. However, stars revolve at different angular speeds depending on their frequency. Thus, an ensemble of stars will stretch out in phase space, with the range of frequencies causing a spiral shape in this projection. The detailed time evolution of stars in this toy model is described in Methods and shown in Extended Data Fig. 3. As time goes by, the spiral gets more tightly wound and eventually this process of phase mixing leads to a spiral that is so tightly wound that the coarse-grained distribution appears to be smooth. The clarity of the spiral shape in the ZVZ plane revealed by Gaia DR2 implies that this time has not yet been reached in the Milky Way, providing evidence that phase mixing is currently taking place in the Galactic disk.

This interpretation also implies that the shape of the spiral can be used to obtain information about: (i) the shape of the potential, which determines the vertical frequencies; (ii) the starting time of phase mixing; and (iii) the type of perturbation that brought the disk into a non-equilibrium state, which sets the initial conditions for the phase-mixing event that we are witnessing. For instance, we can estimate the time t of the event from the separation between two consecutive spiral turns, because these have a phase separation of \(({\nu }_{2}t+{\phi }_{{0}_{2}})-({\nu }_{1}t+{\phi }_{{0}_{1}})=2{\rm{\pi }}\), where the subscripts ‘1’ and ‘2’ indicate two consecutive turns of the spiral. Therefore, assuming that the initial phase \({\phi }_{{0}_{i}}\) is the same for turns 1 and 2, we have t = 2π/(ν2 − ν1). Using several potentials for the Milky Way (see Methods), we estimate that the vertical phase-mixing event started between 300 Myr and 900 Myr ago. The toy model that illustrates this process is shown in Fig. 3a, which depicts a snail shell that formed after 500 Myr from an ensemble of stars with a starting distribution that is out of equilibrium; this modelled snail shell is similar to the one seen in the data.

Fig. 3: Models of the phase-space distribution of the Galaxy disk.
figure 3

a, Modelled spiral shape created in the vertical position–velocity (ZVZ) plane as a result of phase mixing in the evolution of an ensemble of particles for 500 Myr in a Galactic potential, starting from a distribution that is out of equilibrium, presumably after a perturbation. b, Modelled diagonal ridges created in the distribution of azimuthal velocities Vϕ as a function of Galactocentric radius R as a result of the phase mixing in the evolution of an ensemble of particles for 1,000 Myr in a Galactic potential, starting from a distribution that is out of equilibrium. c, Same as b, but for diagonal ridges created as a result of the effects of the barred potential and its resonant structure. See Methods for more details.

A possible perturbation that might have initiated the on-going vertical phase mixing that we observe is the influence of a satellite galaxy. In particular, the last pericentre of the orbit of the Sagittarius dwarf galaxy has been shown to have strong effects on the stellar disk4,8,9. In addition, most models place this pericentric passage between 200 Myr and 1,000 Myr ago9,24,25, which is consistent with our findings. Nevertheless, other processes that may induce snail shell patterns include the formation of the central bar and of the transient spiral structure, provided that these processes can induce vertical asymmetries, other global changes in the potential, and the dissolution of a massive stellar system such as a cluster or accreted satellite.

Another phase-space projection in which the Gaia data have a markedly different appearance is that in azimuthal velocity and cylindrical radius, VϕR (Fig. 2). Although this phase-space projection has been explored previously with other data26, the spatial coverage, high sampling and unprecedented precision of the Gaia data reveal many thin diagonal ridges that were not previously evident. The arches in the velocity-space projection VRVϕ in the solar neighbourhood that were discovered recently in Gaia data12 (see Extended Data Fig. 1a) are projections of these diagonal ridges, but at a fixed Galactic position. Therefore, Fig. 2 reveals that arched velocity structures must be present at many different radii, but have thus far been unexplored, and that their characteristics vary with distance from the Galactic centre, diminishing their velocity towards the outskirts of the Galaxy in a continuous way.

These diagonal ridges could be signatures of phase mixing in the horizontal direction, as has been predicted for the arches in velocity space6,7,8. Alternatively, the bar and the spiral arms could induce diagonal ridges through their resonant orbital structure, creating regions in phase space of stable and unstable orbits27, and hence with over-densities and gaps. The toy model of phase mixing (Fig. 3b) and a disk simulation with a Galactic potential that contains a bar (Fig. 3c) both show several diagonal ridges. The Vϕ separation of consecutive ridges in the data is about 10 km s−1. Comparing this separation to that of our toy model (see Methods) indicates that, if these ridges are caused by phase mixing from a single perturbation, it should have taken place longer ago than the perturbation that gave rise to the vertical mixing. This is consistent with the timing derived using the separation between arches in the local velocity plane7 and with the existence of a group of co-moving stars that appear not to be fully phase mixed vertically18, which suggests that another perturbation occurred about 2 Gyr ago. The relationship between the various features is not clear, and it is not unlikely that there are or were several perturbations creating superposed features.

Our interpretation of the features that we found is based on toy models, the main limitations of which are their lack of self-consistency, the choice of initial conditions not necessarily reflecting those stemming from the impact of a satellite galaxy, and the fact that we study separately both the effects of resonances and phase mixing, and the different phase-space dimensions (horizontal and vertical) involved. A challenging task for the future will be to model our findings taking into account collective effects, such as in the perturbative regime28, and with self-consistent N-body models25.

Methods

Data and samples selection

We used Gaia DR2 sources for which the six-dimensional phase-space coordinates can be computed, that is all stars for which a five-parameters astrometric solution (sky positions, parallax and proper motions) and a line-of-sight velocity are available. We selected only stars with positive parallaxes ϖ with relative uncertainty less than 20%, that is, satisfying ϖ/σϖ > 5. This selection ensures that 1/ϖ is a reasonably good estimator of the distance to the stars29; alternatively, we also used Bayesian distances (see below). This sample has 6,376,803 stars and has been well studied and characterized previously12. The data were obtained directly through the following query in the public Gaia archive (https://gea.esac.esa.int/archive):

SELECT G.source_id, G.radial_velocity, G.radial_velocity_error, G.ra, G.ra_error, G.dec, G.dec_error, G.parallax, G.parallax_error, G.pmra, G.pmra_error, G.pmdec, G.pmdec_error, G.ra_dec_corr, G.ra_parallax_corr, G.ra_pmra_corr, G.ra_pmdec_corr, G.dec_parallax_corr, G.dec_pmra_corr, G.dec_pmdec_corr, G.parallax_pmra_corr, G.parallax_pmdec_corr, G.pmra_pmdec_corr

FROM gaiadr2.gaia_source G

WHERE G.radial_velocity IS NOT Null AND G.parallax_over_error>5.

From the five-parameter astrometric solution and line-of-sight velocities (α, δ, ϖ, \({\mu }_{\alpha }^{\ast }\), μδ, Vlos) of these stars, we derived distances (as 1/ϖ), positions and velocities in the cylindrical Galactic reference frame (R, ϕ, Z, VR, Vϕ, VZ). For convenience, we took ϕ to be positive in the direction of Galactic rotation and with the origin at the Sun–Galactic centre line. For these transformations, we adopted a vertical distance of the Sun above the plane of30 27 pc, a distance of the Sun to the Galactic centre31 of Rʘ = 8.34 kpc and a circular velocity at the Sun radius of31 VC(Rʘ) = 240 km s−1. We assumed a peculiar velocity of the Sun with respect to the local standard of rest of32 (Uʘ, Vʘ, Wʘ) = (11.1, 12.24, 7.25) km s−1. Our choice of values gives (VC(Rʘ) + Vʘ)/Rʘ = 30.2 km s−1 kpc−1, which is compatible with the reflex motion of Sgr A*33. To derive the uncertainties in these coordinates, we propagate the full covariance matrix. The median uncertainties in VR, Vϕ and VZ are 1.4 km s−1, 1.5 km s−1 and 1.0 km s−1, respectively, and 80% of stars have uncertainty smaller than 3.3 km s−1, 3.7 km s−1 and 2.2 km s−1, respectively, in these velocities. The positions in the Cartesian coordinates XY and XZ of the sample are shown in Extended Data Fig. 2.

For part of our study, we selected from our sample the 935,590 stars located in the solar Galactic cylindrical ring, with Galactocentric radius 8.24 kpc < R < 8.44 kpc (dotted lines in Extended Data Fig. 2). For this selection, the median uncertainties in VR, Vϕ and VZ are 0.5 km s−1, 0.8 km s−1 and 0.6 km s−1, respectively, and 80% of stars have uncertainty smaller than 1.1 km s−1, 2.0 km s−1 and 1.3 km s−1, respectively, in these velocities.

The velocity uncertainties are significantly smaller than the sizes of the substructures detected; together with the large number of stars in our samples, this is what made their detection possible. Although there are some correlations between the astrometric Gaia observables34, these are not responsible for the correlations and substructures seen in our phase-space plots. This is because the stars in our sample are distributed across all sky directions, and the phase-space coordinates come from combinations of astrometric measurements and radial velocities in different contributions depending on the direction on the sky. Besides, the astrometric correlations for our sample are small (less than 0.2 in their absolute value for more than 50% of stars); this, combined with the small errors, makes their contribution insignificant.

Alternatively, we used distances determined through a Bayesian inference method using the existing implementation in TOPCAT35, taking the mode of the posterior distribution and a prior of an exponentially decreasing density of stars with scale length of36 1.35 kpc. We found that the differences between this distance determination and the inverse of the parallax are between −2% and 0.6% for 90% of the 6,376,803 stars with ϖ/σϖ > 5, which is expected for small relative errors in parallax. Consequently, the phase-space diagrams presented here vary only at the pixel level. The diagrams do not vary even when using the set of 7,183,262 of stars with available radial velocities, which includes stars with larger parallax errors and with negative parallaxes, for which the estimator of the inverse of the parallax would yield unphysical distances. When using another alternative set of Bayesian distances specifically derived for stars from Gaia DR2 with radial velocities using a different prior37, we found the differences between these distances and the inverse of the parallax to be between −9% and 5% for 90% of the stars—slightly larger than before, but again with no noticeable effects on the phase-space projections examined here.

Models for vertical phase mixing

We first reproduced the spiral shape observed in the ZVZ plane with Gaia data by using a toy model. The classic harmonic oscillator is often used to describe the vertical movement of stars in galactic disks under the epicyclic theory38. However, in this approximation, which is valid only for very small-amplitude orbits for which the potential changes little vertically, stars have the same vertical oscillatory frequency ν and there is no phase mixing, unless orbits at different guiding radius, and therefore with different frequencies, are considered. Instead, we use an anharmonic oscillator with potential

$$\Phi (Z)\propto -{\alpha }_{0}+\frac{1}{2}{\alpha }_{1}{Z}^{2}-\frac{1}{4}{\alpha }_{2}{Z}^{4}$$
(1)

We adopt coefficients α0, α1 and α2 that correspond to the expansion for small Z (derived elsewhere22) of a Miyamoto–Nagai potential39 with a = 6.5 kpc, b = 0.26 kpc and M = 1011Mʘ. These coefficients depend on Galactocentric radius R because the vertical pull depends on the distance to the Galactic centre. In this anharmonic potential, the frequencies of oscillation are described by

$$\nu (A,R)={\alpha }_{1}{(R)}^{1/2}\left[1-\frac{3{\alpha }_{2}(R){A}^{2}}{8{\alpha }_{1}(R)}\right]$$
(2)

where \({\nu }_{0}={\alpha }_{1}^{1/2}\) is the vertical frequency in the epicyclic approximation.

Given an initial distribution of stars with Z(t = 0) and VZ(t = 0), the vertical amplitudes of the orbits can be derived through the conservation of energy and using the fact that at the vertical turn-around point of the orbit (VZ = 0) the (vertical) kinetic energy is null22. Assuming that stars follow a simple harmonic oscillation (but with different frequencies), the movement of the stars with time is described by

$$\begin{array}{c}Z=A{\rm{\cos }}\left[\nu (A,R)t+{\phi }_{0}\right]\\ {V}_{Z}=-A\nu (A,R)sin\left[\nu (A,R)t+{\phi }_{0}\right]\end{array}$$
(3)

where the initial phase of the stars ϕ0 = ϕ(t = 0) is obtained from the initial distribution of Z and VZ and the corresponding amplitudes.

The phase-space evolution described above is shown in Extended Data Fig. 3a–c. Initially, the particles follow a Gaussian distribution in Z(t = 0) and VZ(t = 0) with mean and dispersion of −0.1 kpc and 0.04 kpc, and −2 km s−1 and 1 km s−1, respectively. We located all particles at the same Galactocentric radius of R = 8.5 kpc; therefore, they all move under the same functional form of the vertical potential. The initial conditions are shown in Extended Data Fig. 3a, in which we have colour-coded the particles according to their period. Following equation (3), each star follows a clockwise rotation in the ZVZ plane. However, they do so at different angular speeds: stars with smaller period located at closer distances from the mid-plane (Z = 0) revolve faster than those located at largest distances from the mid-plane. The range of frequencies is therefore what creates the spiral shape. Extended Data Fig. 3b shows the evolution of the system for three initial phases of the time evolution when the spiral shape begins to form. Extended Data Fig. 3c shows the spiral shape after 1,000 Myr of evolution.

In the Gaia data (Fig. 1), we do not see a thin spiral but a thick one, with many of the stars in the volume participating in it. A similar effect was observed with our toy model when we included particles at different radius for which the vertical potential and the range of amplitudes and frequencies changes. In Extended Data Fig. 3d–f we let a similar system evolve as in Extended Data Fig. 3a–c but starting with the initial radius following a skewed normal distribution, which creates a density that decreases with radius, as in galaxy disks, with a skewness of 10, location parameter of 8.4 kpc and scale parameter of 0.2 kpc. The spiral structure is now thickened, similarly to the data, with a higher density of stars at the leading edge of the spiral.

To estimate the time of the phase-mixing event from the spiral seen in the Gaia data (Fig. 1) using

$$t=\frac{2{\rm{\pi }}}{{\nu }_{2}-{\nu }_{1}}$$
(4)

we need to locate two consecutive turns of the spiral and estimate their vertical frequencies from their amplitudes and mean radius. For this, we used Extended Data Fig. 5, which is colour-coded as a function of median guiding radius. The guiding radius is approximated as Rg ≈ VϕRʘ/[Vc(Rʘ)], under the hypothesis of a flat rotation curve, where we used the values of Rʘ = 8.34 kpc and Vc(Rʘ) = 240 km s−1 assumed in the coordinate transformation of the data. From Extended Data Fig. 5 we see that the density gradient across the spiral shape is created by stars with different guiding radius that arrive at the solar neighbourhood owing to their different amplitudes of (horizontal) radial oscillation. To determine two consecutive turns of the spiral, we focused on stars at the turn-around points (VZ = 0) near the leading edges of the spiral. By visual inspection, we determined an approximate range of Z in which the turn-around points are located in Extended Data Fig. 5, concentrating on red colours, for which the spiral is well defined. The ranges of the turn-around points are marked with vertical lines and listed, together with the middle value, in Extended Data Table 1. For these turn-around points, the amplitudes are simply A = Z and from the colour bar we note that the median Rg is around 8.2 kpc. Small changes in this value do not change substantially our final determination of the time of the perturbation.

To estimate the vertical orbital frequencies of these turn-around points, we could not use the toy model presented above because it is valid only for oscillations with small amplitude A—in particular, smaller than the vertical scale b of the potential \(\left(A\ll 0.26\hspace{2.22144pt}{\rm{kpc}}\right)\). Therefore, we took an existing model40 with updated parameters that fit current estimates such as for the Sun Galactocentric radius and the circular velocity curve41. We computed the vertical frequency numerically in a grid of different radii and vertical amplitudes by integrating orbits and measuring their vertical periods (Extended Data Fig. 4a). The vertical frequency can change along the orbits for stars with large eccentricities in the horizontal direction, but here for simplicity we put all particles on nearly circular orbits. We estimated the vertical frequency at each turn-around position directly by interpolating the calculations of Extended Data Fig. 4a using the estimated values for the amplitude and radius.

Finally, taking each pair of turning points, we obtained an estimate of the time since the perturbation using equation (4). As an example, the two turning points (amplitudes) of the left part of the spiral are located at −0.59 ± 5 kpc and −0.23 ± 5 kpc, respectively. These correspond to vertical frequencies of 0.058 ± 0.002 rad Myr−1 and 0.072 ± 0.002 rad Myr−1 for Rg = 8.2 kpc, which gives a time of \(46{1}_{-105}^{+183}\hspace{2.22144pt}{\rm{Myr}}\). We repeated the same procedure for the second pair of consecutive turning points, and also for the edges of the spiral at Z = 0 (mid-plane points, which have VZ = ), estimating the frequencies by interpolating in Extended Data Fig. 4b. The mid-plane positions are marked as horizontal lines in Extended Data Fig. 5. All results are summarized in Extended Data Tables 1 and 2. The mean of the three time estimates is 510 Myr and the minimum and maximum times from the uncertainty ranges are 356 Myr and 856 Myr.

We tested the dependence of our time determination on the potential model used by using a different model42. Compared to our previous model, this one has different shapes for the halo, disks and bulge, a different total mass, and includes thin and thick disks and two gas disks. The frequencies in this model are smaller by 4% on average and smaller than 6% for 90% of the points in the grid of Extended Data Fig. 4a. By repeating the whole process to determine the perturbation time, we obtained 528 Myr with minimum and maximum times from the uncertainty ranges of 361 Myr and 899 Myr, very similar to our previous determination.

Our determination is subject to several approximations: (1) we used the vertical frequencies of orbits with conditions of circularity on the plane and for certain assumed Galactic potential models; (2) we considered a unique guiding radius; and (3) we took equal initial phases for the turn-around and mid-plane points.

We finally ran a simulation (Fig. 3a) by integrating 100,000 test-particle orbits in the updated model41 with initial vertical positions and velocities following Gaussian distributions centred at Z = −0.4 kpc and VZ = −5 km s−1 and with dispersions of 0.15 kpc and 2 km s−1, respectively. Horizontally in the disk plane, the test particles were distributed following a skewed normal distribution in radius R, with a scale parameter of 0.8 kpc, location parameter of 8, skewness of 10, and all particles at an azimuthal angle ϕ = 0. The horizontal velocities were set to 0 for the radial component and to the circular velocity at the particle’s radius and Z = 0 for the azimuthal component. These are initial conditions of circularity on the Galactic plane but not necessarily for orbits with large excursions in Z. The particle’s orbits were integrated forwards in time for 500 Myr as estimated from the data. This is not meant to be a fit to the data because we have not explored all possible initial configurations that could lead to a similar spiral shape. However, we see that an initial distribution that is asymmetric in Z, with most particles located at positive or negative Z, is required to obtain a single spiral instead of a symmetrical double one.

Model for the horizontal phase mixing

We used a toy model to reproduce the diagonal ridges observed in the VϕR plane (Fig. 3b). This model is built by integrating orbits in the Galactic potential of ref. 40 with updated parameters that fit current estimates such as for the Sun Galactocentric radius or the circular velocity curve41. We used as initial conditions a set of test particles distributed in Galactocentric radius according to a skewed normal distribution with a skewness of 10, location parameter of 4 kpc and scale parameter of 6 kpc. The azimuthal angle was fixed between 0° and 50° to mimic a localized perturbation in the disk. For simplicity, all particles were put at the mid-plane with null vertical velocities. The radial and azimuthal velocities were initialized, respectively, following Gaussian distributions centred at 0 with a dispersion of 40 km s−1 and centred at the circular velocity at the particular radius with a dispersion of 30 km s−1. The particle orbits were computed for 1 Gyr. To these particles we added particles in the disk that had been integrated for a much longer timescale and are therefore well mixed, to simulate the stellar populations that have not been perturbed. Of all the particles in the simulation, we used only the ones located in a range of 10° in azimuthal angle at the end of the integration, similarly to the data.

Model for the horizontal resonances

The model of Fig. 3c is from a test-particle simulation of orbits integrated in a Galactic potential model including a bar43. The axisymmetric part of the potential was from an existing model40. The Galactic bar potential was built using Ferrers ellipsoids44 oriented with semi-major axes at 20° from the Sun–Galactic centre line, and with pattern speed set to 50 km s−1 kpc−1, which corresponds to a period of about 120 Myr. The simulation consisted of 68 million test particles with an initial radial velocity dispersion of 30 km s−1 at the solar radius. Their orbits were first integrated in the axisymmetric potential model for 10 Gyr until they were approximately fully phase-mixed. Next, the bar potential was grown in Tgrow = 4 rotations of the bar. More details on the bar potential, initial conditions and integration procedure are specified elsewhere43. Here we used the final conditions after Tgrow (about 500 Myr) and eight additional bar rotations (about 1,000 Myr). From all of the particles in the simulation, we used only the ones located in a range of 10° in azimuthal angle centred on the Sun, similarly to the data.

Code availability

We have made use of standard data analysis tools in Python. The codes used to generate the toy models and simulations and to compute the orbital frequencies are available from the corresponding author on reasonable request. The code used to compute the orbits for the potential from ref. 42 is available at https://github.com/PaulMcMillan-Astro/GalPot. The code used to compute Bayesian distances from parallaxes is available in the TOPCAT platform35.