Similarity and economy of scale in urban transportation networks and optimal transport-based infrastructures

Leite, Daniela; De Bacco, Caterina

doi:10.1038/s41467-024-52313-6

Download PDF

Article
Open access
Published: 12 September 2024

Similarity and economy of scale in urban transportation networks and optimal transport-based infrastructures

Nature Communications volume 15, Article number: 7981 (2024) Cite this article

1 Altmetric
Metrics details

Subjects

Abstract

Designing and optimizing the structure of urban transportation networks is a challenging task. In this study, we propose a method inspired by optimal transport theory and the principle of economy of scale that uses little information in input to generate structures that are similar to those of public transportation networks. Contrarily to standard approaches, it does not assume any initial backbone network infrastructure but rather extracts this directly from a continuous space using only a few origin and destination points, generating networks from scratch. Analyzing a set of urban train, tram and subway networks, we find a noteworthy degree of similarity in several of the studied cases between simulated and real infrastructures. By tuning one parameter, our method can simulate a range of different subway, tram and train networks that can be further used to suggest possible improvements in terms of relevant transportation properties. Outputs of our algorithm provide naturally a principled quantitative measure of similarity between two networks that can be used to automatize the selection of similar simulated networks.

Understanding traffic capacity of urban networks

Article Open access 08 November 2019

A global empirical study on how street networks facilitate driving longer distances

Article Open access 24 October 2023

Topological assessment of recoverability in public transport networks

Article Open access 26 March 2024

Introduction

Transportation networks are a fundamental part of a city’s infrastructure. Their design impacts the efficiency with which the system is operated, hence, they should follow optimal principles while being constrained by limitations like budget or physical obstacles. Existing approaches for studying the quality of network design often rely on the analysis of the topological network properties, and relate them to optimal features like transportation cost, efficiency or robustness. These analyses are usually made a posteriori, only once the network has been constructed, and thus only resulting properties can be analyzed^1,2. A different approach is that of posing a priori a principled optimization setup, where one defines a cost function that a network should minimize under a set of constraints, and then searches for optimal solutions in terms of network topologies. Numerous studies have explored this approach in biological networks, transportation networks, etc^3,4. However, most of these methods rely on an existing backbone of a network infrastructure that can be optimized in terms of traffic distribution^5,6 but do not consider the possibility of building the network from scratch, starting from a limited set of nodes. Alternatively, as optimizing over all possible topologies is difficult, one can investigate only various simple shapes from a predetermined set of possible geometries^7,8,9 or rely on heuristics^10,11. In fact, proposed solutions for the network design problem (also known as transit design problem) often rely on heuristics that do not necessarily generate equilibrium solutions^12,13. Alternatively, bilevel formulations account for the needs of both passengers and network manager in a hierarchical way, framing the passengers’ objective as a constraint on the network manager’s one^14,15, but solving them is NP-hard¹⁶. Another approach is that of using models that mimic biological networks. For instance, ref. ¹⁷ presented a two-step agent-based model that replicates biologically-grown networks and proposes them as a template for urban design. Nevertheless, the lack of a principled metric to measure the similarities between an observed network and a simulated one poses a challenge for making this evaluation effective. In addition, most of these approaches extract optimal networks starting from an initial backbone infrastructure, which impacts the resulting topology. Instead, we take a different approach and generate a network from scratch, starting only from a few origins and destination points in a continuous space, identifying where nodes and edges should be located.

In this work, we show that urban transportation systems can exhibit underlying network topologies similar to those that follow optimality principles as defined in optimal transport theory. Specifically, we propose a model to characterize real transportation networks based on a simple optimal transport framework, similar to what is observed in biological systems like the slime mold Physarum polycephalum, which adapts its network structure to reach food patches in an optimal way. A previous study by ref. ¹⁸ shows how this mold forms networks with comparable optimal transportation properties, e.g., efficiency and cost, to those of the Tokyo rail system, but provided no rigorous quantitative definition of network similarity beyond measuring these properties. We empirically validate our approach with a systematic characterization of the structure of several urban transportation networks and propose a rigorous definition of similarity in terms of optimal transport theory.

Urban transportation networks often exhibit different network structures based on the goals of network designers. For instance, some networks focus on connecting people living in the outer layers of the city to the city core, while others prefer to develop a robust infrastructure servicing the core¹⁹. Several studies have focused on analyzing properties like scaling laws and network connectivity^20,21,22, which are indications of an underlying optimality mechanism that these networks might follow to make a city efficient, both in reduced infrastructure costs per capita and in increased productivity. However, our understanding of what optimality principles are captured in real transportation networks is incomplete. In fact, studying network properties could only partially explain the underlying mechanisms regulating network design, as each property captures a different aspect. Here, we take a different approach and build the network from scratch while comparing it with the real ones observed from data, starting with only a few shared nodes in input. Specifically, we model network structures observed in urban transportation networks by adapting a classical optimal transport framework to simulate a network-design problem dependent on realistic travel demand settings and using little information in input. We then compare the resulting networks with those observed from real data and assess their similarity. Importantly, the model can simulate different optimal strategies by tuning a parameter β, which interpolates between minimizing infrastructural and operating costs, in a similar fashion as in the principle of economy of scale, a fundamental concept in economy that establishes the relationship between growth and production costs. This principle affirms that, as the quantity of produced units rises, the average cost per unit of production declines²³. This impacts the balance between the costs of producing and that of maintaining and operating the network. On one hand, this allows to simulate optimal networks that resemble those observed in real transportation systems more closely, as we tune β. On the other hand, by comparing the networks resulting for various values of β with those observed from real data, we can also assess the impact of the two types of cost in the design of various urban infrastructures.

We use this model to analyze several transportation networks from 18 cities and a national rail network^24,25,26. Despite the complex nature of the mechanisms driving the design of transportation networks, we observe that multiple of the studied urban transportation networks follow a surprisingly similar topological pattern, as noticed in biological systems. We observed that in several cases, the optimal networks obtained with our approach have similar cost and performance to those observed in real ones.

Results

Modeling network design in transportation networks

Consider an urban area where a set of points of interest (POI) are located in certain positions in space. These may correspond to a combination of central and peripheral points where people work and live. The goal is to connect them by building a transportation network under the perspectives of an optimality criterion, based on the minimization of a cost-based energy functional. At this point, we do not observe any network but are rather free to use the whole space where the urban area is located, i.e., a 2D surface. From this, we need to select a set of points and edges connecting them, in other words, a network. In Fig. 1a, b we illustrate the problem setup for the subway network in Rome, where green and red markers denote an example set of such reference points and the lines denote edges in the observed metro network infrastructure. In the same figure, we show how existing stations are placed across urban areas with different population densities, as the evolution of subway networks often reflects the evolution of population and activity densities²⁷.

**Fig. 1: Problem setup for the subway network of Rome.**

In general, there are many choices for designing the network. For instance, in Fig. 1c–e we show three examples of intuitive shortest-path-like minimization solutions for the settings shown in Fig. 1a. These are however quite different from the observed network in Fig. 1b. The question we address is what network design principle is producing simulated networks that are more similar to those observed in real urban networks. Optimality could be defined in various ways depending on the network engineers’ and designers’ goals, but generally, it is not known what principles they used when building the network. Instead, we want to assess this by observing real data of transportation networks and fitting them with a flexible and computationally efficient optimization setup guided by optimal transport theory. In this context, a well-defined cost-based functional combines aspects that are critical for a transportation network: the cost of building the infrastructure and that of operating the network, in terms of power dissipation. This is relevant in scenarios where we expect infrastructures to be regulated by energy-saving requirements and the principle of economy of scale, where it is more convenient to consolidate traffic into fewer and larger edges. We expect this to be a reasonable assumption in urban transportation networks. As with any other natural or urban system, we do not know a priori what (if any) is the functional being optimized in the network under study. In fact, many of these systems (e.g., metro and tram) are built in phases¹⁹, where the design of an initial backbone structure is followed by several expansion steps, which may lead to suboptimal structures. However, our model allows considering, among the many possible choices, a simple but yet principled mechanism of optimality. By measuring the degree of similarity of networks that follow these principles with existing urban transportation networks, we can assess if the observed ones can be explained by this simple mechanism. And if not, we can point out alternative infrastructures that can be better in terms of certain relevant network properties, e.g., total path length. Our setting is simple because we consider only a limited input (few nodes that need to exist, e.g., main origin and destination stations), but otherwise do not consider any other constraint, besides main physical laws such as conservation of mass, and let the model select nodes and edges from a two-dimensional space where it can be optimal to drive passengers, tuning only one parameter. For this, we adopt the formalism recently developed by ref. ^28,29,30 that generalizes to a continuous space the original idea of ref. ³¹. In particular, this allows starting with only a set of relatively few origin and destination nodes in input and then designing a network by exploring the 2D surface, i.e., without the need of an initial existing backbone. The idea is inspired by the behavior of the slime mold P. polycephalum, which dynamically builds a network-like body shape when foraging. One can thus consider a dynamics for the two main quantities involved, flows and conductivities, that implements this mechanism at any point in space. The stationary solution of this dynamics corresponds to the minimizer of a Lyapunov cost in a standard optimization setup, which has a nice interpretation in terms of infrastructure and operating transportation costs. From these solutions, one can then automatically extract optimal network structures using the approach presented in ref. ³². From now onwards, we refer to the algorithmic implementation of this approach as “Nextrout”.

Having introduced the main problem and ideas, we now briefly describe the model. Consider a surface in 2D and a set of points on it. Specifically, we denote a set of origins and destinations as f ⁺ and f ⁻, respectively. These contain the reference points where people enter and exit the transportation network. By defining f = f ⁺−f ⁻, mass conservation can be enforced with the constraint ∫ fdx = 0. The two main quantities of interest are denoted with μ(x, t), the transport density (or conductivity), and u(x, t) the transport potential. The former can be seen as a quantity proportional to the size of a network edge, while the latter determines the fluxes traveling along them. The dynamical equations in this continuous setting are

$$-\nabla \cdot (\mu (t,x)\nabla u(t,x))=f,$$

(1)

$$\frac{\partial \mu (t,x)}{\partial t}={\left(\mu (t,x)\nabla u(t,x)\right)}^{\beta }-\mu (t,x),$$

(2)

$$\mu (0,x)={\mu }_{0}(x) \, > \, 0.$$

(3)

Equation (1) determines the spatial balance of the flux, assumed to be governed by the Fick–Poiseuille flux as q = −μ ∇ u; Eq. (2) enforces optimal solutions, and represents the P. polycephalum dynamics in the continuous domain; Eq. (3) is the initial condition. The parameter β captures different optimization mechanisms: β < 1 enforces congested transportation, β = 1 is the shortest path-like and β > 1 is branched transportation. In Fig. 1f–h we show examples of different optimal configurations, with β = 1, β = 1.5 and β = 2.0. Here, we consider the cases 1 < β ≤ 2, where the approximate support of the conductivity μ displays a network-like structure. Under the lenses of a network, the conductivities can be viewed as the traffic capacities on the edges, hence Eq. (3) defines how the initial traffic capacities are distributed along the network, while Eq. (2) describes how these capacities evolve in response to the fluxes. As time evolves (i.e., ${\lim }_{t\to \infty }$), the equilibrium solution pair (μ^*, u^*) is reached. In refs. ^30,33 the authors show that under certain assumptions, this equilibrium solution is a minimizer of the functional

$${{{\mathscr{L}}}}(\mu,u)=\frac{1}{2}\int\,\mu | \nabla u{| }^{2}dx+\int\,\frac{\beta }{2-\beta }{\mu }^{\frac{2-\beta }{\beta }}.$$

(4)

This can be interpreted as the network transportation cost, where the first term is a network operating cost (or power dissipation, it is the Dirichlet energy to the solution of the first partial differential equation), while the second is a non-linear cost to build the infrastructure. When β > 1, this second term corresponds to a principle of economy of scale, where it is more convenient to consolidate traffic into fewer (but larger) edges. This is the scenario we consider here. By changing β, one can tune their relative contribution to the total transportation cost, thus tuning the impact of the principle of economy of scale and how much concentrated path trajectories are. Besides being relevant for urban transportation, this strategy seems to be a fundamental mechanism in various natural systems, e.g., tree branches and roots, blood vessels or river networks^34,35.

Alternative approaches can be considered to design a network infrastructure from simple mechanisms. Examples are cost-benefit analysis³⁶, maximizing for efficiency³⁷ accounting for paths and flows of passengers, or minimizing the total length, as in the Euclidean minimum spanning tree problem. In discrete settings, when an initial network backbone is given, the cost in Eq. (4) has been shown to be implicitly related to the total path length minimization accounting for the passengers’ trajectories³⁸. One main difference between ours and these types of approaches is that we focus on a continuous space (as opposed to discrete settings) where the only necessary input is a set of origins and destinations, but otherwise no initial backbone network is given. This enables the design of a network from scratch, simulating where nodes and edges should be located in space to minimize the cost.

Once the optimal (μ^*, u^*) are obtained, one can then use the model described in ref. ³² to extract a final network structure, i.e., a set of nodes, a set of edges connecting them, and their weights proportional to the conductivities. This can then be compared with the one observed from real data and repeated for various values of β. It is important to remark that in our setting, besides the parameter β, the other input quantities that need to be specified are origins and destinations via the function f. By imposing non-zero entries to this function, a user automatically selects a set of nodes that will be necessarily present in the output network. Otherwise, no other set of nodes or edges needs to be given but is rather automatically learned by solving the optimization problem described above. This implies that similarity between simulated and observed networks trivially increases as we add more non-zero terms in f. Here we consider the non-trivial scenario where we fix only a small number of origins and destinations, as described in more detail below.

Selecting origin and destination points

As we aim at extracting a network, our problem starts by defining a set of origins and destinations (O-D) points in the space with coordinates (x, y), where passengers might enter or exit. This choice necessarily impacts the output network, as the optimization problem depends on it. Ideally, one could reframe it by including O-D pairs as variables to be optimized along with conductivities and fluxes. But this becomes a different and more complex problem and it is not clear how to solve it. Here instead, we focus on the optimization setting introduced above and treat O-D pairs as fixed in input. While we limit selection to a small number of points, specifically one destination and few origins, it is important to decide where to place these input nodes in space. There are no universal criteria to define what are the most relevant points where city planners should add a stop to accommodate traffic when designing a network. However, existing infrastructures often evolve reflecting needs such as population increase or land usage³⁹. With this in mind, we can assume that at least some of the existing nodes have already been placed in positions relevant to transportation needs. Hence, we select O-D pairs from important nodes as observed in existing urban networks. Specifically, we use centrality measures obtained from the original network: nodes with the smallest and highest centrality are assigned as origin and destination nodes, respectively. These measures might reflect the choice on where to place new stations that are often made by urban planners or transportation engineers, usually based on a variety of factors, such as population density, land use patterns or available funding⁴⁰. For instance, in Fig. 2a we observe higher population density in peripheral regions, where the stations with lower centrality are located, whereas those with higher centrality are located towards the center, with lower population density. In the same figure, we show node sizes as proportional to the annual traffic in each station, as measured in 2019⁴¹. In this example, the highest traffic corresponds to the station with the highest degree centrality, thus reinforcing the choice of that node as a destination.

**Fig. 2: Comparing criteria to select origin and destination nodes.**

Another possibility is to incorporate urban features by using some measure of attractiveness, which takes into account the densities of POI and population in a given urban area, an approach also used in other works³⁷. However, this strategy may not be scalable, as it requires the integration of several datasets, whereas using network centralities can be done automatically from the observed network data at no additional cost. We show an example of this for the metro network of Rome, to assess the extent to which these two strategies align. Here we categorize POIs into various types such as tourism, economy, culture, utilities, history, education, food, etc... as also done in previous studies⁴². We notice that the high centrality nodes found by the two criteria are located nearby geographically and yield similar simulated networks, see Fig. 2. Hence, we adopt the centrality criteria as a good approximation for attractiveness to select origins and destinations in all the networks investigated here.

Investigating the similarity of optimal simulated networks and the observed transportation systems

We apply the proposed dynamics to empirical data collected from 18 different cities in multiple geographical regions around the world. For each city, we selected various available types of public transportation systems, such as rail, subway and tram, keeping the largest connected component. The networks considered in this manuscript have a few loops, as the dynamics can only retrieve loopless structures in the regime where network extraction is meaningful³². These networks could be seen as phase I in the classification of ref. ¹⁹, i.e., the initial phase where a backbone infrastructure is built, before a later evolution where further additional links are added through time. We expect these to be more likely to follow a global optimization criteria as the one formulated in our model (as opposed to other greedy heuristics for later extension phases). We thus measure the loop ratio L_ratio = n_L/E as the number of loops divided by the number of edges and select networks with a low ratio, i.e., with L_ratio < 0.2 (see Methods for more details). One could in principle recover loopy structures by employing numerical schemes, e.g., superposition of different outputs⁴³, but this is not the main focus of this work. Instead, we point towards directions on the loop recover perspective, presented later in this manuscript, by exploring an example of a more complex network structure as the New York subway.

The applied Optimal Transport (OT) dynamics successfully describe the transportation network structures observed in different cities at a macroscopic level. While the selected transportation networks have different topologies and include multiple transportation modes, the networks reconstructed by Nextrout with only little information in input show a significant degree of similarity with the real ones (see Fig. 3a–c) for several of the studied cases, as evidenced by different similarity measures that we calculated to compare the topology of the simulated networks against the real ones, see Fig. 4 and next sections for details. This suggests the existence of simple universal optimality rules captured by the so-called Dynamic Monge-Kantorovich (DMK) dynamics for the modeling of urban transportation rail networks, similar to what has been observed for the behavior of the slime mold P. polycephalum.

**Fig. 3: Example of different network topologies generated by Nextrout (yellow), plotted against the corresponding real network (blue).**

**Fig. 4: Performance measures for real and simulated networks.**

An example of the different simulated networks is shown in Fig. 5a–d, where, as we increase the values of β, one can notice how the network infrastructure evolves from a shortest path-like (β = 1) to a branching topology, where two branches (β = 1.5) are created to lower the cost of building the infrastructure. In particular, the southern branch in Fig. 5b resembles an analogous one observed in the real subway network of Rome. As β increases to 2, this branch disappears to build a unique path that connects two destination points in the southeast side of the city, further lowering the cost to build the infrastructure. However, at this extreme value, the network is now less similar to the real one. Notice that in principle one can increase this similarity further, by simply adding more information in input in terms of origin and destinations (see Fig. 5d for an example with multiple origins and destinations). However, here we are interested in recovering macroscopic structure in a more challenging scenario where input information is strictly limited to one central destination and few peripheral origins.

**Fig. 5: Wasserstein similarity measure between graphs for automatic selection of β.**

Beyond a qualitative visual comparison, we explore how our simulated networks score in terms of core network properties relevant for transportation compared to the real networks. For this, we consider the cost, the total path length l, the distribution of traffic, and the density of branching points (or bifurcations; here we use the two terms interchangeably), for both extracted and original networks. Similar to ref. ¹⁸, we define the cost as the total length of the network (TL), i.e., the total number of edges. Passengers may not always take the shortest path, but may rather consolidate on fewer main arteries (e.g., to minimize the number of stops or connections)⁴⁴, a behavior that can be captured by a DMK discrete dynamics (built-in the filtering step of Nextrout) by varying β and extracting the flows u_e on edges, quantities proportional to the number of passengers using an edge. Hence, we consider an alternative measure of total path length as $l: \!\!={\sum }_{e\in {E}_{i}}{{{{\rm{l}}}}}_{{{{\rm{e}}}}}| {u}_{e}|$, where l_e is the Euclidean distance. This takes into account u_e, the flow of passengers on an edge e, and its absolute value ∣u_e∣ is proportional to the number of passengers traveling on an edge e, i.e., how traffic is distributed, assuming that passengers follow optimality principles to consolidate paths. This is a reasonable assumption in rail networks as the ones studied here, where the cost to build the infrastructure can be high (and thus should be minimized) and minimizing traffic congestion is not as relevant as in, e.g., road networks. In our experiments, we extract optimal flows u_e by running the discrete DMK dynamics on the extracted and real networks, using the same sets of origins and destinations as used in the original network extraction problem, setting β = 1.5. This information can also be used to measure the macroscopic behavior of traffic on edges, which can be measured using the Gini coefficient⁴⁵ (Gini(T_e)) on the traffic T_e = ∣u_e∣. This coefficient ranges from [0, 1], where the closer to 1, the more unequal is the traffic distribution on the network. Finally, we calculate the percentage of bifurcations (D_BP) as the fraction of nodes with degree equal to 3. In several cases, the simulated networks display similar properties as those observed on the real ones, as shown in Fig. 4. While similarity differs depending on the property and datasets vary in their range values, we notice that most of the datasets have at least a pair of properties that have a close value between simulated and observed networks. For instance, in Fig. 3a we notice that Adelaide rail has an intuitively similar path length, which is confirmed in Fig. 4a. Furthermore, the same network shows comparable results for traffic and cost. As for the tram networks of Bordeaux and Nantes in Fig. 3b, c, we observe similar behavior for cost and traffic.

Automatic selection of similar simulated networks

Our method allows extracting various simulated networks by varying the parameter β. One can select the one that more closely resembles the observed one in terms of a particular metric of interest, as shown in the previous section. However, different metrics may lead to different most similar simulated networks (i.e., different β), which may not be ideal for a practitioner willing to consider an individual simulated network that resembles well the observed one in terms of all metrics. Hence, the need for a principle automatic selection criteria for choosing the value of β.

The formalism introduced in the previous section suggests a natural way to tackle this problem by considering the Wasserstein similarity measure, a main quantity in optimal transport theory⁴⁶. Given two graphs that need to be compared, intuitively, this measure captures the minimum “effort" required to move a certain distribution of mass from one to the other. Similar ideas based on optimal transport to measure similarity between graphs have also been proposed in recent works^47,48. Here, we describe our proposal for a similarity measure and thus automatic selection of β in detail. Denote the observed network with G₁(V₁, E₁) and the one obtained from the model introduced in previous sections with G₂(V₂, E₂), where V_i, E_i denote the set of nodes and edges, respectively, i = 1, 2. We consider the union graph G_U(V_U, E_U), with sets of nodes V_U = V₁ ∪ V₂ and edges E_U = E₁ ∪ E₂. One can further assign weights w_e ∈ W_U to the edges e ∈ E, for instance using the Euclidean distance ℓ_e between the nodes i, j ∈ V_U, where e = (i, j), or simply binary values {0, 1}. Notice that the observed network G₁ may contain nodes that do not correspond exactly to nodes in G₂, because in this continuous setting the model uses all the 2D space where the original network is embedded. Only the input origin and destination nodes are guaranteed to be present in both graphs, as they are given in input to the model.

Working on this union network, we then exploit a similar setting as the one already introduced with the model to obtain a Wasserstein-based similarity measure between G₁ and G₂. Specifically, we denote with B the unsigned incidence matrix of G_U with entries B_ie = +1 if node i is a start or end point of the edge e and 0 otherwise. Defining q_i as an indicator vector for the edges in G_U that are also in G_i, i.e., q_ie = 1 if e ∈ E_i, and q_ie = 0 otherwise, ∀ e ∈ E_U and i = 1, 2, we can set the origin and destination vectors f = f ⁺−f ⁻, such that f ⁺ = B q₁ and f ⁻ = B q₂, so that G₁ contributes to f⁺ and G₂ to f⁻. By running a discrete dynamics analogous to the continuous one described in Eqs. (1) to (3), which can be done using Nextrout³², one naturally obtains our Wasserstein similarity measure defined as:

$${W}_{1}({G}_{1},{G}_{2})={\sum}_{e\in {E}_{U}}{w}_{e}\,{\mu }_{e}\,,$$

(5)

where μ_e are the optimal solutions for the conductivities on G_U and w_e is the weight of edge e = (i, j). Here, we fix this to be the Euclidean distance between nodes i and j. Examples of how the Wasserstein measure changes depending on the different output networks are shown in Fig. 5i, where we show simulated networks and report their Wasserstein measure from the observed network of the Grenoble tram (N = 80 nodes). Intuitively, the Wasserstein similarity captures how much “cost” is “paid” to move information between G₁ and G₂. This means that the more similar these networks are, the lower the Wasserstein is, i.e., when G₁ = G₂, W₁ = 0, and if there are no nodes connecting them, W₁ → ∞. In Fig. 5b we show an example where W₁ is higher simply because there is a disruption in one of the branches of the network, thus increasing the cost to move from the real network to G₂ = G_β=1.6. Notice that if similarity is chosen to be defined in terms of the cost, the closest network to the real one would be that with β = 1.4, as shown in Fig. 5e.

We further validate this measure by comparing it with other selection criteria based on the topological properties described above and found that the simulated graph selected with the Wasserstein measure has, on average, higher similarity with the real networks compared to the other selection criteria, across various properties. In other words, it shows transportation properties that are consistently more aligned to those behold by the observed network, see Supplementary Information S1.

The New York subway system: a look into more complex structures

The New York subway is one of the largest and busiest transportation systems in the world. Due to its size and complexity, navigating through such network might be difficult for humans⁴⁹, but understanding its properties and structure could be indicative of improvement to city planners and urban designers.

In the scope of recovering such a complex structure, the strategy of selecting only a small number of origins and destinations, as done for the studied networks so far, might produce networks that are far from similar to the real one, especially given the high complexity of this particular system (see Fig. S7). We thus use a different approach that could be a pointer towards recovering structures from major urban transportation systems. Each line that comprises the subway infrastructure of New York could be seen as an independent network itself. With this assumption, we selected four major lines (red, green, orange and yellow), extracting the nodes with lower degree as origins and one common destination for all of them, corresponding to the point with higher density of POI (see Fig. 6).

**Fig. 6: Comparison for real and simulated networks for major lines of the subway system in New York, with distribution of POI.**

We notice that in terms of cost (TL), our simulated networks have similar values in all cases, with equal performance for the red line, and a small difference for all the other lines. For the density of branching points (D_BP)—besides an intuitive visual similarity—we observe comparable results. For instance, the green line (456) has similar branches both on the north and south sides, and similar considerations apply to the red line (123) and the north side of the yellow line (NRQW). One can also investigate how main differences are distributed in the orange line (BDFM), where there is not much similarity between observed and simulated infrastructures. This is because two destinations on the east side are traversed by a unique branch in our simulated network, while the real one splits them into two branches. Furthermore, the southern part of the network also shows a distinct behavior, where our simulated network has two branches, the real one splits into a more complex pattern that cannot be explained by our optimality principles. While it is not clear whether these differences are due to different underlying optimization rules or a lack of optimality in the observed network, our method enables practitioners to identify key insights on principled alternative designs where optimality is clearly defined in terms of network operating and infrastructural costs.

Initial network development: the French Railway in the 1850s

Our model builds a network backbone from scratch, with a global optimization that follows a principle of economy of scale. This could be particularly suited to study the initial development of a rail network infrastructure, as opposed to later stages where the network is gradually extended. We thus study a real scenario of the French railway system where we have access to historical information about network development in time²⁵, focusing on an initial phase around the year 1850. At that stage, the network topology contains multiple connected components but no loops - which only appear in a later stage, around several years later (see Supplementary Information S6, Fig. S9).

We focus on the four biggest components with more interesting topologies, as the remaining components are either too small or simple straight lines, and select the node with the highest degree as a destination. In the biggest component, for instance, this corresponds to Paris. In Fig. 7 we show examples of real and simulated networks highlighting the D_BP and the total length (L), the latter measured given the longitude and latitude coordinates mapped into a [0, 1] system of coordinates. We note various degrees of similarity in the different components between observed and simulated network. For instance, the biggest component has similar D_BP but the observed network has a larger total length, mainly due to branches taking slightly longer detours to reach the sinks and the two small branches south-west of Paris being split from Paris onward into two in the observed network, while they are only later split in the simulated one. A similar behavior is observed in component 3 (north-est France), where we see the real network splitting earlier on than what simulated, thus causing a longer total length. The other two components have higher similarity in terms of both metrics, in particular, component 2 (south France) has a main branching point est of Nîmes similarly located in the observed and simulated network. The higher path length in this case is due to a longer detour of the southern branch. These types of detours could be caused by geographical obstacles that are not included in our more coarse-grained model.

**Fig. 7: Simulating the initial French Railways in the year 1850.**

Understanding the underlying optimization principles that drive how transportation networks changed and evolved over time might point towards creating better transportation systems. Our approach is a pointer towards comprehending the initial stages of such evolution, particularly suited for systems that follow the principle of economy of scale.

Improving network properties

Simulating networks that follow optimality principles and resemble well those observed in real datasets can be used to assess how urban transportation networks perform in terms of main transportation properties. This can guide network managers towards potential measures directed at improving certain properties. This possibility is conveniently enabled by our approach, as by continuously tuning the parameter β we can simulate various transportation scenarios, thus assessing how a network can increase or decrease a certain property. In Fig. 8 we show the main properties in all the networks simulated with our model and compare with those observed in real data, aiming to compare their trade-offs between various performance metrics. In general, we notice how simulated networks cover a wider range of values than the real ones for the four transportation properties investigated in this work. This allows obtaining, for instance, networks that have shorter total path length ℓ with a comparable cost, as shown in Fig. 8a where many simulated networks are located in the regime 0 < TL < 200 with ℓ sharply dropping towards 0.1, while many real networks have ℓ > 0.1. A similar behavior is observed also for the traffic against TL in Fig. 8c, where simulated networks cover areas of the plot where traffic is less congested (smaller Gini(T_e)), in contrast to several real networks. Among the simulated networks, those selected according to the best Wasserstein measure tend to have lower cost and a smaller percentage of bifurcations D_BL, indicating that this measure encourages not only the usage of a lower amount of edges but also nodes with low degree.

**Fig. 8: Comparison of simulated and observed networks.**

Discussion

Planning transportation systems in a city is a challenging task. This study has shown that some urban transportation systems with a small number of loops can be simulated by simple principles based on optimal transport theory and economy of scale. Using empirical data from one national railway and multiple rail transportation types across 18 cities, our model provides simulated networks obtained with little information in input and no a priori backbone network structure, exhibiting properties that resemble those observed in real transportation systems to various extents. Our model interpolates between various transportation regimes by tuning a single parameter, while allowing for a natural definition of a similarity measure to compare the simulated networks with those observed in real systems. We observed how the selected networks with this criterion can exhibit transportation properties that on average resemble the corresponding real networks well or point towards alternative infrastructures that improve relevant topological properties.

A limitation of this study is that our OT-based model does not capture infrastructures with loops, thus limiting its applicability to rail networks, or subway and rail networks with a very small density of loops. Possible extensions of the formalism presented in this work to account for loops are an interesting direction for future work. Besides simple heuristics and beyond the example we presented for the New York subway, one can make other interesting modeling choices to effectively tackle this problem. For instance, one could generalize our approach to situations where travel demands are treated stochastically^50,51,52 or change in time⁵³, scenarios where in certain regimes an OT-based approach can naturally lead to the formation of loops. Similarly, loops could emerge in multicommodity settings where fluxes of passengers are distinguished by their origin and destination stations, using an OT-based multicommodity framework as the one proposed in refs. ^38,54,55. Both directions, provided they could be generalized to a continuous setting as the one studied here, can potentially result in optimal simulated network infrastructures capturing properties that differ from the ones analyzed here, e.g., robustness to disruptions.

Another limitation is the assumption that networks are static, i.e., do not change in time. It would be interesting to compare how differences between simulated and observed networks may arise because different network branches may have been developed in different time periods. This could be used to study the evolution of transportation properties in time, as done in refs. ^27,56. Similarly, the networks studied in this work are often one layer of a multimode network. Integrating other transportation modes into a multilayer formalism and suitably adapting our OT-based approach, e.g., borrowing ideas from⁵⁷, could give us a deeper understanding of optimal network design in interconnected urban systems.

Our model takes a few inputs (origins and destinations), but it does not consider any geographical obstacle. This can result in fine-grained mismatches between simulated and observed topologies due to longer detours in the real infrastructures to avoid these physical obstacles. In principle, this could be incorporated into our model by properly adding extra terms in the dynamical equations that drive flows and conductivities differently based on the location in space. However, this would require fine-grain details to be specified in input, an information that may not be easily available.

In our work, we focus on designing a network from scratch. This is relevant in cases where infrastructures are relatively new or did not change considerably compared to their initial design, as in the cases we investigated here. However, this may be limited to studying infrastructures that have evolved over time. For these scenarios, it would be appropriate to consider how our model can be adapted to study network growth, where an initial backbone is further extended with new branches. This problem is related to the interplay between an urban transportation network and the distribution of its underlying population, as there could be a co-evolution between the two that should be taken into account⁵⁸.

In summary, there are many factors contributing to the development of urban transportation networks. Our simple optimization scheme provides a principled and computationally efficient benchmark for comparison with real-world networks. By interpolating between different transportation regimes, we can vary the degree of similarity between the networks simulated by optimal transport principles and those observed in real systems. In particular, measuring relevant topological properties on simulated network resulting from different parameters’ values against those observed in real networks can give us indications on how to improve transportation performance when taking into account principles of optimal transport and economy of scale.

Methods

Data collection and analysis

We collected network data from various public transportation networks from 18 different cities²⁴ and one national railway²⁵. Network statistics are detailed in Table 1. Each city had one or multiple transportation modes available. Node ids were associated with the longitude and latitude coordinates of real stations for multiple means of transportation, as well as possible connections between them. Our main goal was to analyze each network individually, therefore we did not address the multilayer case where connections among the different means of transportation exist. For instance, if rail and subway stops have the same coordinates, they are treated as distinct in each network.

Table 1 Description of real networks considered

Full size table

To avoid possible redundancies or inconsistencies in the data, such as duplicated nodes or edges that looked too long, we performed a preprocessing step. Specifically, we considered a threshold τ that corresponds to the minimum distance in kilometers between the pairs of nodes that had the same node ids and no connections between them, matching features stored in the node metadata such as names of the real stations. If the Euclidean distance d(i, j) between nodes i and j was smaller than this threshold, we collapsed the two nodes into one, i.e., i = j. This was used to avoid those entries and exits of each station would be counted as two distinct nodes in the same network, and possibly affecting the selection of origins and destinations.

To match the latitude and longitude coordinates of the datasets with those in the 2-dimensional plane that Nextrout uses to solve the continuous problem, we re-scaled every pair (lon, lat) to a (0, 1) system of coordinates. Starting with a total of 64 data points (networks), we extracted the individual disconnected components and the number of loops for each of them. Network extraction was performed on the biggest components only.

We extract networks using Nextrout³², selecting 1 < β ≤ 2 such that for every pair (origins, destinations) we simulate 10 different networks. Since the extracted networks may contain redundancies, we remove them using the graph filtering step from Nextrout. Outputs of this step have less redundant structures and are closer to the optimal topologies.

Selecting origins and destinations with points of interest

For the 16 studied networks, we used origins and destinations based on the degree and betweenness centrality measures. The degree centrality d_i of a given node is defined as the number of edges connected to it. The betweenness centrality is defined as the frequency with which a node is on the shortest path between all other nodes,

$${B}_{i}={\sum}_{i\ne j\ne k}\frac{{\sigma }_{ik}(\;j)}{{\sigma }_{ik}},$$

where σ_ik is the total number of shortest paths from node i to node k and σ_ik(j) those shortest paths passing through j.

Terminals were chosen as follows: nodes with d_i≤ 1 are assigned as origins, while those with ${d}_{i}={\max }_{n}\left\{{d}_{n}\right\}$ or ${B}_{i}={\max }_{n}\left\{{B}_{n}\right\}$ as destinations. We selected this set {origins,destinations} to be small, in order to use the least amount of information in input. In multiple cases the set of origins was equivalent in both centralities, with the difference being on the location of the destinations, thus the output networks were different. In terms of final networks properties, the results are comparable for both studied centralities (see SI for more details).

In order to account for other measures of attractiveness for the destinations and investigate the impact of including realistic information about urban areas, we explored a different approach for the cities of Rome and New York, by looking at a measure that accounts for both population distribution and land usage. To do that, we collected both population and distribution of POI from Open Street Map (OSM) data and mapped them to an H3 tiling discretization of the space. We then defined the density of POI as ρ_POI = P_ijW_ij, where P_ij is the population density and W_ij is the number of POI for the corresponding ij cell. Our hypothesis is that stations with higher centralities correspond to higher density cells. The destinationis then assigned based on the center of this H3 cell with highest ρ_POI. In Fig. 2b we show how this new criterion generates a configuration that leads to a different optimal network but still preserves similarity compared with the real network. We also notice that the highest centrality node is connected to regions with higher densities compared to the peripheral areas, where the origins are placed.

Data availability

Due to its high volume, the synthetic data generated in this study can be obtained from the corresponding author upon request. An example for the generated networks of the Rome subway is available at ref. ⁵⁹.

Code availability

The open source codes and executables are available at ref. ⁵⁹.

References

Zhang, J., Zhao, M., Liu, H. & Xu, X. Networked characteristics of the urban rail transit networks. Phys. A Stat. Mech. Appl. 392, 1538–1546 (2013).
Article Google Scholar
Ding, R. et al. Application of complex networks theory in urban traffic network researches. Netw. Spatial Econ. 19, 1281–1317 (2019).
Article Google Scholar
Navlakha, S. & Bar-Joseph, Z. Algorithms in nature: the convergence of systems biology and computational thinking. Mol. Syst. Biol. 7, 546 (2011).
Article PubMed PubMed Central Google Scholar
Easley, D. et al. Networks, Crowds, and Markets, Vol. 8 (Cambridge University Press Cambridge, 2010).
Watanabe, S., Tero, A., Takamatsu, A. & Nakagaki, T. Traffic optimization in railroad networks using an algorithm mimicking an amoeba-like organism, physarum plasmodium. Biosystems 105, 225–232 (2011).
Article PubMed Google Scholar
Liu, L., Song, Y., Zhang, H., Ma, H. & Vasilakos, A. V. Physarum optimization: a biology-inspired algorithm for the steiner tree problem in networks. IEEE Trans. Comput. 64, 818–831 (2013).
Article MathSciNet Google Scholar
Aldous, D. & Barthelemy, M. Optimal geometry of transportation networks. Phys. Rev. E 99, 052303 (2019).
Article ADS CAS PubMed Google Scholar
Mc Gettrick, M. The role of city geometry in determining the utility of a small urban light rail/tram system. Public Transp. 12, 233–259 (2020).
Article Google Scholar
Barthelemy, M. Optimal transportation networks and network design. In Spatial Networks (Springer, 2022).
Cantarella, G. E., Pavone, G. & Vitetta, A. Heuristics for urban road network design: lane layout and signal settings. Eur. J. Oper. Res. 175, 1682–1695 (2006).
Article Google Scholar
Laporte, G., Mesa, J., Ortega, F. & Perea, F. Planning rapid transit networks. Socio-Econ. Plan. Sci. 45, 95–104 (2011).
Article Google Scholar
Durán-Micco, J. & Vansteenwegen, P. A survey on the transit network design and frequency setting problem. Public Transp. 14, 155–190 (2022).
Article Google Scholar
Farahani, R. Z., Miandoabchi, E., Szeto, W. Y. & Rashidi, H. A review of urban transportation network design problems. Eur. J. Oper. Res. 229, 281–302 (2013).
Article MathSciNet Google Scholar
Lonardi, A. & De Bacco, C. Bilevel optimization for traffic mitigation in optimal transport networks. Phys. Rev. Lett. 131, 267401 (2023).
Article ADS MathSciNet CAS PubMed Google Scholar
Li, B., Saad, D. & Yeung, C. H. Bilevel optimization in flow networks:a message-passing approach. Phys. Rev. E 106, L042301 (2022).
Article ADS CAS PubMed Google Scholar
Ben-Ayed, O., Boyce, D. E. & Blair III, C. E. A general bilevel linear programming formulation of the network design problem. Transp. Res. Part B Methodol. 22, 311–318 (1988).
Article MathSciNet Google Scholar
Kay, R., Mattacchione, A., Katrycz, C. & Hatton, B. D. Stepwise slime mould growth as a template for urban design. Sci. Rep. 12, 1–15 (2022).
Article ADS Google Scholar
Tero, A. et al. Rules for biologically inspired adaptive network design. Science 327, 439–442 (2010).
Article ADS MathSciNet CAS PubMed Google Scholar
Derrible, S. & Kennedy, C. Characterizing metro networks: state, form, and structure. Transportation 37, 275–297 (2010).
Article Google Scholar
Levinson, D. Network structure and city size. PLoS ONE 7, e29721 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Derrible, S. & Kennedy, C. The complexity and robustness of metro networks. Phys. A Stat. Mech. Appl. 389, 3678–3691 (2010).
Article CAS Google Scholar
Lin, J. & Ban, Y. Complex network topology of transportation systems. Transp. Rev. 33, 658–685 (2013).
Article Google Scholar
Silberston, A. Economies of scale in theory and practice. Econ. J. 82, 369–391 (1972).
Article Google Scholar
Kujala, R., Weckström, C., Darst, R. K., Mladenović, M. N. & Saramäki, J. A collection of public transport network data sets for 25 cities. Sci. Data 5, 1–14 (2018).
Article Google Scholar
Litvine, A. et al. French historical GIS, 1700-2020. Administrative units, populations, transports, economy. zenodo. https://doi.org/10.5281/zenodo.3727274 (2024).
New York City Open Data. Subway lines and subway stations dataset. https://data.cityofnewyork.us/Transportation/Subway-Lines/3qz8-muuu (2018).
Pei, A., Xiao, F., Yu, S. & Li, L. Efficiency in the evolution of metro networks. Sci. Rep. 12, 1–10 (2022).
Article ADS Google Scholar
Facca, E., Cardin, F. & Putti, M. Towards a stationary Monge–Kantorovich dynamics: the Physarum polycephalum experience. SIAM J. Appl. Math. 78, 651–676 (2018).
Article MathSciNet Google Scholar
Facca, E., Daneri, S., Cardin, F. & Putti, M. Numerical solution of Monge–Kantorovich equations via a dynamic formulation. J. Sci. Comput. 82, 1–26 (2020).
Article MathSciNet Google Scholar
Facca, E., Cardin, F. & Putti, M. Branching structures emerging from a continuous optimal transport model. J. Comput. Phys. 447, 110700 (2021).
Article MathSciNet Google Scholar
Tero, A., Kobayashi, R. & Nakagaki, T. A mathematical model for adaptive transport network in path finding by true slime mold. J. Theor. Biol. 244, 553–564 (2007).
Article ADS MathSciNet PubMed Google Scholar
Baptista, D., Leite, D., Facca, E., Putti, M. & De Bacco, C. Network extraction by routing optimization. Sci. Rep. 10, 1–13 (2020).
Article Google Scholar
Facca, E., Daneri, S., Cardin, F. & Putti, M. Numerical solution of monge–kantorovich equations via a dynamic formulation. J. Sci. Comput. 82, 68 (2020).
Article MathSciNet Google Scholar
Banavar, J. R., Maritan, A. & Rinaldo, A. Size and form in efficient transportation networks. Nature 399, 130–132 (1999).
Article ADS CAS PubMed Google Scholar
Banavar, J. R., Cooke, T. J., Rinaldo, A. & Maritan, A. Form, function, and evolution of living organisms. Proc. Natl Acad. Sci. 111, 3332–3337 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Louf, R., Jensen, P. & Barthelemy, M. Emergence of hierarchy in cost-driven growth of spatial networks. Proc. Natl Acad. Sci. 110, 8824–8829 (2013).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Bontorin, S., Cencetti, G., Gallotti, R., Lepri, B. & De Domenico, M. Emergence of complex network topologies from flow-weighted optimization of network efficiency. Phys. Rev. X 14, 021050 (2024).
Google Scholar
Lonardi, A., Facca, E., Putti, M. & De Bacco, C. Designing optimal networks for multicommodity transport problem. Phys. Rev. Res. 3, 043010 (2021).
Article CAS Google Scholar
Yang, L. et al. Integrated design of transport infrastructure and public spaces considering human behavior: A review of state-of-the-art methods and tools. Front. Archit. Res. 8, 429–453 (2019).
Article CAS Google Scholar
Louf, R., Roth, C. & Barthelemy, M. Scaling in transportation networks. PLoS ONE 9, e102007 (2014).
Article ADS PubMed PubMed Central Google Scholar
City of Rome. Il Trasporto Pubblico locale a Roma, Anno 2019. https://www.comune.roma.it/web-resources/cms/documents/TPL_Roma_2019_rev.pdf (2019).
Xu, Y., Olmos, L. E., Abbar, S. & González, M. C. Deconstructing laws of accessibility and facility distribution in cities. Sci. Adv. 6, eabb4112 (2020).
Article ADS PubMed PubMed Central Google Scholar
Baptista, D. & De Bacco, C. Principled network extraction from images. R. Soc. Open Sci. 8, 210025 (2021).
Article ADS PubMed PubMed Central Google Scholar
Ibrahim, A. A., Leite, D. & De Bacco, C. Sustainable optimal transport in multilayer networks. Phys. Rev. E 105, 064302 (2022).
Article ADS MathSciNet CAS PubMed Google Scholar
Dixon, P. M., Weiner, J., Mitchell-Olds, T. & Woodley, R. Bootstrapping the gini coefficient of inequality. Ecology 68, 1548–1551 (1987).
Article Google Scholar
Santambrogio, F. Optimal transport for applied mathematicians. Birkäuser 55, 94 (2015).
Google Scholar
Petric Maretic, H., El Gheche, M., Chierchia, G. & Frossard, P. GOT: an optimal transport framework for graph comparison. In Advances in Neural Information Processing Systems 32 (Curran Associates, Inc., 2019).
Xu, H., Luo, D., Zha, H. & Duke, L. C. Gromov-Wasserstein learning for graph matching and node embedding. In International Conference on Machine Learning 6932–6941 (PMLR, 2019).
Gallotti, R., Porter, M. A. & Barthelemy, M. Lost in transportation: Information measures and cognitive limits in multilayer navigation. Sci. Adv. 2, e1500445 (2016).
Article ADS PubMed PubMed Central Google Scholar
Corson, F. Fluctuations and redundancy in optimal transport networks. Phys. Rev. Lett. 104, 048703 (2010).
Article ADS PubMed Google Scholar
Hu, D. & Cai, D. Adaptation and optimization of biological transport networks. Phys. Rev. Lett. 111, 138701 (2013).
Article ADS PubMed Google Scholar
Katifori, E., Szöllősi, G. J. & Magnasco, M. O. Damage and fluctuations induce loops in optimal transport networks. Phys. Rev. Lett. 104, 048704 (2010).
Article ADS PubMed Google Scholar
Lonardi, A., Facca, E., Putti, M. & De Bacco, C. Infrastructure adaptation and emergence of loops in network routing with time-dependent loads. Phys. Rev. E 107, 024302 (2023).
Article ADS CAS PubMed Google Scholar
Lonardi, A., Putti, M. & De Bacco, C. Multicommodity routing optimization for engineering networks. Sci. Rep. 12, 1–11 (2022).
Article Google Scholar
Bonifaci, V. et al. Physarum-inspired multi-commodity flow dynamics. Theor. Comput. Sci. 920, 1–20 (2022).
Cats, O. Topological evolution of a metropolitan rail transport network: the case of Stockholm. J. Transp. Geogr. 62, 172–183 (2017).
Article Google Scholar
Ibrahim, A. A., Lonardi, A. & Bacco, C. D. Optimal transport in multilayer networks for traffic flow optimization. Algorithms 14, 189 (2021).
Article Google Scholar
Brelsford, C., Martin, T., Hand, J. & Bettencourt, L. M. Toward cities without slums: topology and the spatial evolution of neighborhoods. Sci. Adv. 4, eaar4644 (2018).
Article ADS PubMed PubMed Central Google Scholar
Leite, D. Danielaleite/opt-urban-nextrout: v1.0-alpha. https://doi.org/10.5281/zenodo.12723673 (2024).
Dipartimento Trasformazione Digitale U.O. Statistica - Open Data. La popolazione di Roma. Struttura e dinamica demografica Anno 2019. https://www.comune.roma.it/web-resources/cms/documents/La_popolazione_a_Roma2019.pdf (2020).

Download references

Acknowledgements

The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting Daniela Leite.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Max Planck Institute for Intelligent Systems, Cyber Valley, Tübingen, Germany
Daniela Leite & Caterina De Bacco

Authors

Daniela Leite
View author publications
You can also search for this author in PubMed Google Scholar
Caterina De Bacco
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.L. and C.D.B contributed to writing the manuscript, evaluating and interpreting the results.

Corresponding author

Correspondence to Caterina De Bacco.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Luis Nunes Amaral, Marc Barthelemy and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Leite, D., De Bacco, C. Similarity and economy of scale in urban transportation networks and optimal transport-based infrastructures. Nat Commun 15, 7981 (2024). https://doi.org/10.1038/s41467-024-52313-6

Download citation

Received: 14 September 2022
Accepted: 01 September 2024
Published: 12 September 2024
DOI: https://doi.org/10.1038/s41467-024-52313-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.