Identification of key regulators in prostate cancer from gene expression datasets of patients

Mangangcha, Irengbam Rocky; Malik, Md. Zubbair; Küçük, Ömer; Ali, Shakir; Singh, R. K. Brojen

doi:10.1038/s41598-019-52896-x

Article
Open access
Published: 11 November 2019

Irengbam Rocky Mangangcha^1,2,3,4^na1,
Md. Zubbair Malik⁴^na1,
Ömer Küçük⁵,
Shakir Ali ORCID: orcid.org/0000-0002-4002-1231^1,2 &
R. K. Brojen Singh⁴

Scientific Reports volume 9, Article number: 16420 (2019)

Abstract

Identification of key regulators and regulatory pathways is an important step in the discovery of genes involved in cancer. Here, we propose a method to identify key regulators in prostate cancer (PCa) from a network constructed from gene expression datasets of PCa patients. Overexpressed genes having a mutational status according to COSMIC were identified using BioXpress, followed by the construction of a PCa interactome network using the curated genes. The topological parameters of the network exhibited power-law behaviour, indicating hierarchical scale-free properties and five levels of organization. The highest-degree hubs (k ≥ 65) were selected from the PCa network and traced, and 19 of them were identified as novel key regulators, as they participated at all network levels, serving as its backbone. Of the 19 hubs, some have been reported in the literature to be associated with PCa and other cancers. Based on participation coefficient values, most of these are connector or kinless hubs, suggesting significant roles in modular linkage. The observation of non-monotonicity in rich-club formation suggested the importance of intermediate hubs in network integration, and they may play crucial roles in network stabilization. The network was self-organized, as evident from the fractal nature of its topological parameters, and lacked a central control mechanism.

Introduction

The prostate is a gland of the male reproductive system which secretes seminal fluid in the human adult¹. According to the World Cancer Report 2014, prostate cancer (PCa) is the second most common cancer in men, after lung cancer, and is the fifth leading cause of cancer death in males worldwide². PCa, based on the type of origin in the prostate, can be classified into five types: (i) acinar adenocarcinoma, (ii) ductal adenocarcinoma, (iii) transitional cell (or urothelial) cancer, (iv) squamous cell cancer and (v) small cell prostate cancer, with adenocarcinomas being the most common, even though metastasis is much quicker in the other types^3,4.

In recent years, gene expression studies using high-throughput techniques, namely next-generation sequencing, microarrays and proteomics, have led to the identification of new genes and pathways in PCa. The identification of novel key regulators is important because the current therapeutic modalities against PCa, including the use of antiandrogens, blocking of the androgen synthesis pathway⁵ and the use of luteinizing hormone-releasing hormone (LHRH) agonists and antagonists along with cytotoxic anticancer drugs, cause notable side effects^6,7. Moreover, PCa diagnosis, which is largely dependent on prostate-specific antigen (PSA) and digital rectal examination (DRE), has its own limitations^8,9. PSA is also elevated in benign prostatic hyperplasia and other noncancerous conditions⁹. This necessitates the discovery of more reliable biomarkers for better and earlier diagnosis, as well as the identification of new targets, other than the genes involved in androgen metabolism, for the discovery and development of new and more potent drugs with less toxicity and fewer side effects.

Genes are regulated in a coordinated way, and the expression of one gene usually depends on the presence or absence of another gene (gene interaction). Network theory, which studies the relations between discrete objects through graphs as their representations, can be used to study complex gene regulatory networks, which can be of different types (random, scale-free, small-world and hierarchical networks). The development of algorithms to study these networks can provide an important tool to identify disease-associated genes in complex diseases such as cancer. Earlier, network theory-based methods have been used to predict disease genes from networks generated using curated lists of genes reported to be associated with the disease and mapping them to the human gene interaction network (HPRD database)¹⁰. In such approaches, the studies have been limited to the curated gene list forming the network, which does not completely represent the system, and patient-specific information is not considered. Moreover, current studies on complex networks in human disease models to discover key disease genes rely mostly on clustering and identifying the high-degree hubs and/or motif discovery from the networks^11,12. Therefore, the application of network theoretical methods to protein–protein interaction (PPI) networks of cancer-associated genes, constructed by analyzing high-throughput gene expression datasets of human cancer patients, may provide better sensitivity and predictive power in understanding the key regulating genes of the corresponding disease. Using patients’ gene expression data rather than gene expression data from cancer cell lines will also give a systematic, clinically relevant insight in predicting key regulator genes expressed in cancer and understanding their roles in disease manifestation and progression. In this study, we have used the gene expression data (RNAseq) of PCa patients to construct a complex PPI network and analyze it. The study gives equal importance to the hubs, motifs and modules of the network to identify the key regulators and regulatory pathways, not restricting itself to the identification of overrepresented motifs or hubs, and establishes a relationship between them in gene-disease association studies using network theory. The method used in this study is new and takes a holistic approach for predicting key disease genes and their pathways within a network theoretical framework using datasets of PCa patients.

Materials and Methods

Identification and selection of PCa-associated genes

BioXpress v3.0 (https://hive.biochemistry.gwu.edu/bioxpress), which uses TCGA (https://tcga-data.nci.nih.gov/) RNA sequencing datasets derived from human cancer patients¹³, was used to identify the deregulated genes in cancer. The cancer browser tool of COSMIC (https://cancer.sanger.ac.uk/cosmic)¹⁴ was used to determine the mutational status and, accordingly, non-redundant genes overexpressed in human PCa were identified. A systematic flow chart of the methodology is given in Fig. 1.

Construction of protein-protein interaction (PPI) network

After removing redundant entries, out of 4,890 genes found to be significantly overexpressed (FC > 1, adjusted p < 0.05) in PCa patients from BioXpress, 3,871 genes, which had a mutational status in PCa according to COSMIC, were used to construct an interactome network using the GeneMANIA app¹⁵ in Cytoscape 3.6.0¹⁶. From this network, only the physical interaction network, which represented the protein-protein interaction network of PCa-associated genes, was extracted. After curation of the network (removal of isolated nodes), a protein-protein interaction network of 2,960 nodes and 20,372 edges was finally constructed as the primary network, representing a graph denoted by G(N, E), where N is the set of nodes with N = {n_i}; i = 1, 2, …, N and E is the set of edges with E = {e_ij}; i, j = 1, 2, 3, …, N.

Method for detection of levels of organization

Considering the size of the network and its sensitivity, the Louvain method of modularity (Q) maximization was used for community detection¹⁷. The first level of organization was established by the communities constructed from the primary PPI network. The sub-communities constructed from all communities in the first level of organization constituted the second level of organization. In the same way, successive levels were constructed down to the level of motifs, such that each smaller community had a minimum of one triangular motif defined by the sub-graph G(3,3). Since triangular motifs are overrepresented in PPI networks and serve as controlling units in a network¹⁸, we used the motif G(3,3) as the qualifying criterion for a community/sub-community to be a constituting member at a certain level of organization. Consequently, different communities and sub-communities terminated at different levels of organization.
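
The level-wise procedure described above can be illustrated with a minimal Python sketch. It is not the pipeline used in the study: the community detector is passed in as a hypothetical `split` function (here a fixed lookup standing in for one Louvain pass), and the helper names `build_adj`, `has_triangle` and `organize_levels` are assumptions introduced only for illustration. A part is kept at a level only if it contains at least one triangular motif G(3,3).

```python
from itertools import combinations

def build_adj(edges):
    """Undirected adjacency map {node: set(neighbours)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def has_triangle(adj, nodes):
    """True if the subgraph induced by `nodes` contains a G(3,3) motif."""
    nodes = set(nodes)
    for u in nodes:
        for v, w in combinations(adj[u] & nodes, 2):
            if w in adj[v]:
                return True
    return False

def organize_levels(adj, nodes, split, level=1, out=None):
    """Recursively split `nodes`, keeping only parts that hold a triangle."""
    out = {} if out is None else out
    parts = [set(p) for p in split(adj, nodes)]
    if len(parts) <= 1:            # the detector found nothing to split
        return out
    for part in parts:
        if has_triangle(adj, part):
            out.setdefault(level, []).append(frozenset(part))
            organize_levels(adj, part, split, level + 1, out)
    return out

# Toy graph: two triangles bridged by c-d, plus a pendant chain f-g-h.
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d"),
         ("d", "e"), ("e", "f"), ("d", "f"), ("f", "g"), ("g", "h")]
adj = build_adj(edges)

# Hypothetical stand-in for one Louvain pass: a fixed lookup of partitions.
PARTITIONS = {frozenset("abcdefgh"): ["abc", "def", "gh"]}

def toy_split(adj, nodes):
    return PARTITIONS.get(frozenset(nodes), [nodes])

levels = organize_levels(adj, set(adj), toy_split)
print(levels)
```

In this toy case the part {g, h} is discarded because it holds no triangle, and the two triangles cannot be split further, so they terminate at level 1.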

Topological analyses of the networks

The Cytoscape plugins NetworkAnalyzer¹⁹ and CytoNCA²⁰ were used to analyse the topological properties of the network for centralities, degree distribution, clustering coefficients and neighbourhood connectivity. The highest-degree nodes were identified as hubs of the PCa network. The top 103 hub proteins having degree k ≥ 65 were considered for tracing the key regulators of the network. Other topological parameters, viz., rich-club coefficients (Φ), participation coefficients (P_i) and within-module degree (Z_i score), were calculated using the igraph-based package “brainGraph” (https://github.com/cwatson/brainGraph) in R. Another parameter, subgraph centrality, was also calculated using igraph functions.

Degree (k)

In network analysis, the degree k indicates the total number of links established by a node and is used to measure the local significance of a node in regulating the network. In a graph represented by G = (N, E), where N denotes the nodes and E the edges, the degree of the i^th node (k_i) is expressed as ${k}_{i}=\sum _{j=1}^{N}{A}_{ij}$, where A_ij denotes the adjacency matrix elements of the graph.

Probability of degree distribution (P(k))

It is the probability of a random node to have a degree k out of the total number of nodes in the network and is represented as fraction of nodes having degree (k), as shown in Eq. (1), where N_k is the total number of nodes with degree k and N, total nodes in the network.

$$P(k)=\frac{{N}_{k}}{N}$$

(1)

P(k) of random and small-world networks follows a Poisson distribution against degree, but most real-world networks, including scale-free and hierarchical networks, follow a power-law distribution P(k) ~ k^−γ, where 4 ≥ γ ≥ 2. In hierarchical networks, γ ~ 2.26 (mean-field value), indicating a modular organization at different topological levels²¹. Therefore, the pattern of P(k) defines the characteristic topology of a network.
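
As a small worked illustration of the degree and of Eq. (1), the following Python sketch computes k_i and P(k) for a four-node toy graph. The graph and the function name `degree_distribution` are assumptions made only for this example.

```python
from collections import Counter

def degree_distribution(adj):
    """Return ({node: k_i}, {k: P(k)}) with P(k) = N_k / N as in Eq. (1)."""
    degrees = {node: len(nbrs) for node, nbrs in adj.items()}
    counts = Counter(degrees.values())      # N_k for each degree k
    n = len(adj)                            # N, total number of nodes
    return degrees, {k: n_k / n for k, n_k in counts.items()}

# Toy graph: hub h linked to a, b, c, plus one edge a-b.
toy = {"h": {"a", "b", "c"}, "a": {"h", "b"}, "b": {"h", "a"}, "c": {"h"}}

degrees, p_k = degree_distribution(toy)
print(degrees)
print(p_k)
```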

Clustering coefficients C(k)

The strength of internal connectivity within a node's neighbourhood, which quantifies the inherent clustering tendency of the nodes in the network, is characterised by the clustering coefficient C(k), which is the ratio between the number of triangular motifs formed by a node with its nearest neighbours and the maximum possible number of such motifs for that node. For any node i having degree k_i in an undirected graph, C(k) can be expressed as Eq. (2), where m_i is the total number of edges among its nearest neighbours. In scale-free networks C(k) ~ constant, but in hierarchical networks it exhibits a power law against degree, C(k) ~ k^−α, with α ~ 1²¹.

$$C(k)=\frac{2{m}_{i}}{{k}_{i}\,({k}_{i}-1)}$$

(2)
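
Eq. (2) can be checked by hand on a small graph. The sketch below is an illustration only; the toy graph and the function name `clustering` are assumptions.

```python
from itertools import combinations

def clustering(adj, i):
    """Local clustering coefficient 2*m_i / (k_i * (k_i - 1)), Eq. (2)."""
    nbrs = adj[i]
    k = len(nbrs)
    if k < 2:                       # no pair of neighbours, no triangle
        return 0.0
    # m_i: number of edges among the nearest neighbours of node i
    m = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2 * m / (k * (k - 1))

# Toy graph: hub h linked to a, b, c, plus one edge a-b.
toy = {"h": {"a", "b", "c"}, "a": {"h", "b"}, "b": {"h", "a"}, "c": {"h"}}

for node in toy:
    print(node, clustering(toy, node))
```

Here the hub h has three neighbours with a single edge among them, giving C = 2·1/(3·2) = 1/3, whereas a and b each close their only possible triangle and give C = 1.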

Neighbourhood connectivity C_N(k)

The node neighbourhood connectivity is the average connectivity established by the nearest neighbours of a node with degree k. It is represented by C_N(k) and can be expressed as shown in Eq. (3), where P(q|k) is the conditional probability that a link of a node with k connections leads to another node having q connections.

$${C}_{N}(k)=\sum _{q}\,qP(q|k)$$

(3)

In hierarchical network topology, C_N(k) exhibits a power law against degree k, that is, C_N(k) ~ k^−β, where β ~ 0.5²². Further, an overall positive or negative exponent of k (C_N(k) increasing or decreasing with degree) indicates, respectively, the assortative or disassortative nature of a network topology²³.
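
A minimal Python sketch of Eq. (3) follows, assuming an undirected toy graph; the function name `neighbourhood_connectivity` is hypothetical. P(q|k) is estimated empirically by pooling the degrees q found at the far end of every link leaving a node of degree k.

```python
from collections import defaultdict

def neighbourhood_connectivity(adj):
    """C_N(k) = sum_q q * P(q|k), estimated from the links of degree-k nodes."""
    deg = {n: len(nbrs) for n, nbrs in adj.items()}
    reached = defaultdict(list)         # k -> degrees q of linked neighbours
    for n, nbrs in adj.items():
        for m in nbrs:
            reached[deg[n]].append(deg[m])
    return {k: sum(qs) / len(qs) for k, qs in reached.items()}

# Toy graph: hub h linked to a, b, c, plus one edge a-b.
toy = {"h": {"a", "b", "c"}, "a": {"h", "b"}, "b": {"h", "a"}, "c": {"h"}}

c_n = neighbourhood_connectivity(toy)
print(sorted(c_n.items()))
```

In this toy graph C_N(k) falls from 3 at k = 1 to 5/3 at k = 3, the decreasing trend that the text associates with a disassortative topology.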

Centrality measures

A node’s global functional significance in regulating a network through information processing is estimated by the basic Centrality measures - Closeness centrality C_C, Betweenness centrality C_B and Eigenvector centrality C_E²⁴. Another centrality measure, Subgraph centrality C_S is also used to describe the participation of nodes in other subgraphs in the network²⁵. These centrality measures collectively determine the cost effectiveness and efficiency of information processing in a network.

The closeness centrality C_C is based on the total geodesic distance from a given node to all other nodes connected to it: the smaller this total distance, the higher the closeness. It represents the speed of spreading of information in a network from a node to other connected nodes²⁶. C_C of a node i is calculated by dividing the total number of nodes in the network, n, by the sum of geodesic path lengths between nodes i and j, represented by d_ij in Eq. (4).

$${C}_{C}(i)=\frac{n}{{\sum }_{j}\,{d}_{ij}}$$

(4)
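
The following sketch evaluates Eq. (4) with a breadth-first search on an unweighted toy path graph. It follows the normalization written in Eq. (4) (numerator n); other tools may use n − 1 instead, so absolute values can differ between implementations. The function names are assumptions.

```python
from collections import deque

def bfs_distances(adj, src):
    """Geodesic (shortest-path) distances from src in an unweighted graph."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def closeness(adj, i):
    """C_C(i) = n / sum_j d_ij, following Eq. (4)."""
    total = sum(bfs_distances(adj, i).values())
    return len(adj) / total if total else 0.0

# Toy path graph a - b - c.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}

print({node: closeness(path, node) for node in path})
```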

Betweenness centrality C_B of a node is its share of all shortest-path traffic between every pair of other nodes i and j. Thus, it characterizes a node’s ability to benefit from the information flow in the network²⁷ and its ability to control signal processing over other nodes in the network²⁸. If d_ij(v) denotes the number of geodesic paths from node i to node j passing through node v, and d_ij the total number of geodesic paths from node i to node j, then the unnormalized betweenness C_b(v) of node v can be obtained by Eq. (5).

$${C}_{b}(v)=\sum _{i,j;i\ne j\ne v}\,\frac{{d}_{ij}(v)}{{d}_{ij}}$$

(5)

If M denotes the number of node pairs, excluding v, then the normalized betweenness centrality is given by Eq. (6).

$${C}_{B}(v)=\frac{1}{M}{C}_{b}(v)$$

(6)
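
Eqs. (5) and (6) can be evaluated with Brandes' accumulation scheme for unweighted graphs. The sketch below is one possible implementation, not the one used by the Cytoscape plugins; it counts unordered node pairs, so M = (n − 1)(n − 2)/2, which is an assumption about how M is defined.

```python
from collections import deque

def betweenness(adj):
    """Normalized betweenness C_B(v) = C_b(v) / M for an undirected graph."""
    cb = dict.fromkeys(adj, 0.0)
    for s in adj:
        order = []                           # nodes in order of BFS discovery
        preds = {v: [] for v in adj}         # shortest-path predecessors
        sigma = dict.fromkeys(adj, 0)        # number of geodesics from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            order.append(u)
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
                if dist[v] == dist[u] + 1:   # u precedes v on a geodesic
                    sigma[v] += sigma[u]
                    preds[v].append(u)
        delta = dict.fromkeys(adj, 0.0)
        while order:                         # back-propagate dependencies
            w = order.pop()
            for u in preds[w]:
                delta[u] += sigma[u] / sigma[w] * (1 + delta[w])
            if w != s:
                cb[w] += delta[w]
    n = len(adj)
    m_pairs = (n - 1) * (n - 2) / 2          # unordered pairs excluding v
    # Every unordered pair was visited from both ends, hence the factor 1/2.
    return {v: (c / 2) / m_pairs if m_pairs else 0.0 for v, c in cb.items()}

# Toy 4-cycle a - b - c - d - a.
square = {"a": {"b", "d"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"a", "c"}}

print(betweenness(square))
```

In the 4-cycle, each node carries one of the two geodesics between its two neighbours, so C_b = 1/2 and, with M = 3, C_B = 1/6 for every node.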

Eigenvector centrality C_E of a node is proportional to the sum of the centralities of all its neighbours, and it reflects the intensity with which the most prominent nodes influence signal processing in the network²⁹. If the nearest neighbours of node i in the network are denoted by nn(i), and λ and v are an eigenvalue and the corresponding eigenvector of the eigenvalue equation Av = λv, where A is the network adjacency matrix, C_E is given by Eq. (7),

$${C}_{E}(i)=\frac{1}{\lambda }\sum _{j=nn(i)}\,{v}_{j}$$

(7)

The C_E score corresponds to the principal eigenvector of A, that is, the eigenvector associated with the maximum positive eigenvalue λ_max²⁹. Since a node’s C_E depends on the centralities of its neighbours, it varies across different networks; the association of a node with high-C_E nodes within a closely connected locality reduces its chances of isolation²⁹. Thus, C_E becomes a powerful indicator of the information transmission power of a node in the network.
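
The principal eigenvector in Eq. (7) can be approximated by power iteration. The sketch below is an illustration under stated assumptions: it iterates with A + I rather than A (a common shift that leaves the eigenvectors unchanged but avoids oscillation on bipartite graphs), scales the largest score to 1, and uses a hypothetical function name and toy graph.

```python
def eigenvector_centrality(adj, iters=200):
    """Approximate the principal eigenvector of A by power iteration on A + I."""
    x = dict.fromkeys(adj, 1.0)
    for _ in range(iters):
        new = {i: x[i] + sum(x[j] for j in adj[i]) for i in adj}
        top = max(new.values())
        x = {i: value / top for i, value in new.items()}   # rescale, max = 1
    return x

# Toy graph: hub h linked to a, b, c, plus one edge a-b.
toy = {"h": {"a", "b", "c"}, "a": {"h", "b"}, "b": {"h", "a"}, "c": {"h"}}

ce = eigenvector_centrality(toy)
# Recover lambda from Eq. (7) applied to node h, then check the other nodes.
lam = sum(ce[j] for j in toy["h"]) / ce["h"]
print(ce, lam)
```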

The subgraph centrality C_S of a node measures the weighted number of subgraphs (closed walks) in which the node participates in a network. It can be calculated using the eigenvalues and eigenvectors of the adjacency matrix of the graph, as shown in Eq. (8), where λ_j is the j^th eigenvalue and v_j(i) the i^th element of the associated eigenvector. The weights are higher for smaller subgraphs. Higher subgraph centrality of a node corresponds to better efficiency of information transmission and an increase in the essentiality of the node in the network²⁵.

$${C}_{S}(i)=\mathop{\sum }\limits_{j=1}^{N}{v}_{j}{(i)}^{2}{e}^{{\lambda }_{j}}$$

(8)
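
Eq. (8) equals the i^th diagonal element of the matrix exponential e^A, which can also be written as the series Σ_k (A^k)_ii / k!. The sketch below uses this truncated series instead of an eigendecomposition so that it runs with the Python standard library alone; the truncation at 30 terms and the function name are assumptions suitable only for small graphs.

```python
import math

def subgraph_centrality(adj, terms=30):
    """C_S(i) = (e^A)_ii, via the truncated series sum_k (A^k)_ii / k!."""
    nodes = list(adj)
    n = len(nodes)
    a = [[1.0 if nodes[c] in adj[nodes[r]] else 0.0 for c in range(n)]
         for r in range(n)]
    power = [[1.0 if r == c else 0.0 for c in range(n)] for r in range(n)]
    diag = [1.0] * n                         # k = 0 term: identity matrix
    for k in range(1, terms):
        power = [[sum(power[r][t] * a[t][c] for t in range(n))
                  for c in range(n)] for r in range(n)]   # now A^k
        for r in range(n):
            diag[r] += power[r][r] / math.factorial(k)
    return dict(zip(nodes, diag))

# Toy triangle: eigenvalues 2, -1, -1, so C_S = (e^2 + 2/e) / 3 for each node.
triangle = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b"}}

cs = subgraph_centrality(triangle)
print(cs)
```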

Within-module degree and Participation coefficients of the hubs

In complex networks, the characterization of hubs as high-degree nodes with higher centrality values is incomplete without exploring the role of nodes at the modular level³⁰. This role is determined by the participation of a node in establishing links within its own module as well as outside it, and by its within-module degree. The within-module degree or Z-score, Z_i, quantifies the connections of a node i within its module: a high value (Z_i ≥ 2.5) categorizes the node as a modular hub, with more intra-module connectivity than the other nodes of the module, whereas a lower value (Z_i < 2.5) categorizes it as a non-hub node with less intra-module connectivity³⁰. The Z-score can be calculated as shown in Eq. (9), where k_i represents the number of links of node i to other nodes in its module S_i, ${\bar{k}}_{{s}_{i}}$ is the average of k over all nodes in S_i, and ${\sigma }_{{k}_{{s}_{i}}}$ is the standard deviation of k in S_i.

$${Z}_{i}=\frac{({k}_{i}-{\bar{k}}_{{s}_{i}})}{{\sigma }_{{k}_{{s}_{i}}}}$$

(9)

The participation coefficient, P_i determines the participation of the node i in linking the nodes inside and outside its module³⁰. P_i values lie in the range of 0−1 with higher values corresponding to the participation of nodes in establishing links outside the modules with homogeneous distribution of its links among all modules, and if k_is is taken to represent the number of links of node i to nodes in modules s and k_i, the total degree of node i, P_i can be calculated as in Eq. (10), where, N_M is the number of modules in the network.

$${P}_{i}=1-\mathop{\sum }\limits_{s=1}^{{N}_{M}}{(\frac{{k}_{is}}{{k}_{i}})}^{2}$$

(10)
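
Eqs. (9) and (10) can be illustrated on a toy network with two modules. The sketch is not the brainGraph implementation; it assumes the population standard deviation in Eq. (9), and the module labels and function names are hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev

def within_module_z(adj, module_of, i):
    """Z_i of Eq. (9), using within-module degrees of the nodes in S_i."""
    s = module_of[i]
    k_in = {n: sum(1 for m in adj[n] if module_of[m] == s)
            for n in adj if module_of[n] == s}
    values = list(k_in.values())
    sd = pstdev(values)                      # population standard deviation
    return (k_in[i] - mean(values)) / sd if sd else 0.0

def participation(adj, module_of, i):
    """P_i of Eq. (10): 1 - sum_s (k_is / k_i)^2."""
    k_i = len(adj[i])
    if k_i == 0:
        return 0.0
    per_module = Counter(module_of[m] for m in adj[i])
    return 1 - sum((k_is / k_i) ** 2 for k_is in per_module.values())

# Toy network: module A is a star around h; module B is the pair x-y; h links to x.
toy = {"h": {"a", "b", "c", "x"}, "a": {"h"}, "b": {"h"}, "c": {"h"},
       "x": {"h", "y"}, "y": {"x"}}
module_of = {"h": "A", "a": "A", "b": "A", "c": "A", "x": "B", "y": "B"}

print(within_module_z(toy, module_of, "h"), participation(toy, module_of, "h"))
```

For node h, three of four links stay in module A, so P = 1 − (3/4)² − (1/4)² = 0.375, and its within-module degree of 3 against a module mean of 1.5 gives Z = √3.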

Rich-club analysis

Hubs in a network are generally identified through centrality measures; in particular, higher-degree nodes are commonly considered as hubs, and the existence of high-degree nodes in a network correlates with their local regulatory roles³¹. When these high-degree hubs are, in addition, densely connected among themselves, they are said to form a rich club. The formation of rich-club connections between high-degree hubs reflects the robustness of the network and its resilience when the hubs are targeted³². The existence of the rich-club phenomenon among hubs is investigated by calculating the rich-club coefficient Φ(k) across the degree range³². Φ(k) is equivalent to the clustering coefficient among a subgroup of nodes with degrees ≥k. In order to remove the random interconnection probability factor, the rich-club coefficients can be normalized by Eq. (11), where Φ_rand(k) is the rich-club coefficient of random networks with similar size and degree sequence, and Φ_norm(k) > 1 indicates rich-club formation. The rich-club phenomenon is associated with the assortative nature of networks and is important for understanding the roles played by these hubs in network integration and efficient transmission of signals³³.

$${{\Phi }}_{norm}(k)=\frac{{\Phi }(k)}{{{\Phi }}_{rand}(k)}$$

(11)
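
A minimal sketch of Eq. (11) is given below. It assumes that Φ(k) is the edge density among nodes with degree ≥ k (some implementations use > k) and that the random reference networks are generated by degree-preserving double edge swaps; the number of swaps, the number of random networks and all function names are assumptions, not the settings of the study.

```python
import random

def rich_club(adj, k):
    """Phi(k): density of links among nodes with degree >= k."""
    rich = [n for n in adj if len(adj[n]) >= k]
    if len(rich) < 2:
        return 0.0
    links = sum(1 for n in rich for m in adj[n] if len(adj[m]) >= k) / 2
    return 2 * links / (len(rich) * (len(rich) - 1))

def rewire(adj, swaps, rng):
    """Degree-preserving double edge swaps: (a-b, c-d) -> (a-d, c-b)."""
    new = {n: set(nbrs) for n, nbrs in adj.items()}
    edges = [(a, b) for a in new for b in new[a] if a < b]
    for _ in range(swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        if len({a, b, c, d}) < 4 or d in new[a] or b in new[c]:
            continue                         # would create a loop or multi-edge
        new[a].remove(b); new[b].remove(a)
        new[c].remove(d); new[d].remove(c)
        new[a].add(d); new[d].add(a)
        new[c].add(b); new[b].add(c)
        edges[i], edges[j] = (a, d), (c, b)
    return new

def rich_club_norm(adj, k, n_rand=20, seed=0):
    """Phi_norm(k) = Phi(k) / mean Phi_rand(k), Eq. (11)."""
    rng = random.Random(seed)
    n_edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    rand = [rich_club(rewire(adj, 10 * n_edges, rng), k) for _ in range(n_rand)]
    mean_rand = sum(rand) / n_rand
    return rich_club(adj, k) / mean_rand if mean_rand else float("inf")

# Toy graph: four fully connected hubs a-d, each with one pendant leaf.
toy = {"a": {"b", "c", "d", "p"}, "b": {"a", "c", "d", "q"},
       "c": {"a", "b", "d", "r"}, "d": {"a", "b", "c", "s"},
       "p": {"a"}, "q": {"b"}, "r": {"c"}, "s": {"d"}}

print(rich_club(toy, 4), rich_club_norm(toy, 4))
```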

Tracking the key regulators in the networks

The most influential genes in the PCa network were first identified by calculating the centrality measures. Since higher-degree nodes have higher centrality values, the top 103 highest-degree nodes (degree k ≥ 65) were considered among the hub nodes of the network for tracing the key regulators which may play an important role in regulating the network. The nodes were then traced from the primary network down to the motif level G(3,3) on the basis of the representation of the respective nodes (proteins) across the sub-modules obtained from the Louvain method of community detection/clustering¹⁷. Finally, the hub nodes (proteins) which were represented in the modules at every hierarchical level were considered as key regulators of the PCa network.
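
The tracing step amounts to a set intersection: a hub is retained only if it appears in some qualifying community at every level. The sketch below illustrates this with made-up gene labels (G1 to G4) and made-up degrees; only the threshold k ≥ 65 is taken from the text, and the function name is hypothetical.

```python
def trace_key_regulators(degree, levels, k_min=65):
    """Keep hubs (degree >= k_min) that are present at every hierarchical level."""
    hubs = {gene for gene, k in degree.items() if k >= k_min}
    for communities in levels:
        hubs &= set().union(*communities)    # genes present at this level
    return hubs

# Hypothetical degrees and level-wise community memberships.
degree = {"G1": 120, "G2": 80, "G3": 70, "G4": 12}
levels = [
    [{"G1", "G2", "G4"}, {"G3"}],   # level I
    [{"G1", "G2"}, {"G4"}],         # level II: G3 is no longer represented
    [{"G1"}],                       # level III: only G1 reaches this level
]

print(trace_key_regulators(degree, levels))
```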

Functional association analysis of modules

The modules at all levels of hierarchy were analysed for their functional annotations with DAVID functional annotation tool³⁴. The functions and pathways with corrected p < 0.05 were considered statistically significant.

Results

PPI network in PCa follows hierarchical scale-free topology composed of modules at five levels of hierarchy

From the interactome network of 3,871 PCa genes, the physically interacting PPI network of 2,960 proteins (2,960 nodes and 20,372 edges) was constructed as the primary network (Fig. 1). Analysis of this primary PCa network showed that the network followed power-law distributions for the probability of node degree distribution P(k), the clustering coefficient C(k) and the neighbourhood connectivity distribution C_N(k) against degree (k), with negative exponents²² (Eq. 12) (Fig. 2). This power-law feature indicates that the network exhibited hierarchical scale-free behaviour with systems-level organization of modules/communities. Further, community finding using the Louvain modularity optimization method¹⁷ led to the detection of communities and sub-communities at various levels of organization (Fig. 3A). In total, 436 communities and smaller communities were detected, out of which 38 reached level V, the level of the motif G(3,3).

Communities at the first hierarchical level also showed power-law distributions for P(k), C(k) and C_N(k) against degree with negative exponents, indicating further systems-level organization of modules (Eq. 12), except in the case of communities C8, C10 and C15, where C_N(k) exhibits a power law against degree k with positive exponents (β ~ 0.05, 0.13 and 0.14, respectively) (Fig. 2). This indicates an assortative nature of these modules and the possibility of rich-club formation in them, where hubs play a significant role in maintaining network properties and stability²².

$$(\begin{array}{c}P(k)\\ C(k)\\ {C}_{N}(k)\end{array}) \sim (\begin{array}{c}{k}^{-\gamma }\\ {k}^{-\alpha }\\ {k}^{-\beta }\end{array});(\begin{array}{c}\gamma \\ \alpha \\ \beta \end{array})\to (\begin{array}{c}0.82-2.52\\ 0.15-0.67\\ 0.02-0.57\end{array})$$

(12)
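
Exponents such as γ, α and β in Eq. (12) are commonly estimated from the slope of a straight-line fit in log-log coordinates. The source does not state which fitting procedure was used, so the following least-squares sketch is only an assumption, shown on synthetic data that follow y = 3k^−2 exactly.

```python
import math

def fit_power_law(ks, ys):
    """Least-squares fit of log y = log c + slope * log k; returns (slope, c)."""
    xs = [math.log(k) for k in ks]
    ls = [math.log(y) for y in ys]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ls) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ls))
             / sum((x - mx) ** 2 for x in xs))
    return slope, math.exp(my - slope * mx)

# Synthetic data: y = 3 * k^-2 for k = 1..10 (no noise).
ks = list(range(1, 11))
ys = [3.0 * k ** -2 for k in ks]

slope, prefactor = fit_power_law(ks, ys)
gamma = -slope                      # exponent in the convention y ~ k^-gamma
print(gamma, prefactor)
```

On real degree data, a plain log-log regression is sensitive to binning and to the sparsely populated high-degree tail, so the fitted value should be read as a rough estimate.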

Nineteen (19) novel regulators served as backbone of the network

Centrality measures are used to assess the importance of the nodes in information processing in a network. Betweenness centrality C_B, closeness centrality C_C, eigenvector centrality C_E and subgraph centrality C_S are topological properties which can determine the efficiency of signal transmission in a network²⁵. In the PCa network and the modules at the first hierarchical level, these parameters also exhibited power-law behaviour as a function of degree (k), with positive exponents, such that the centralities tend to increase with node degree (Eq. 13) (Fig. 2). This behaviour revealed the increase in efficiency of signal processing with higher-degree nodes in the network, showing the importance of these nodes in controlling the flow of information, thereby regulating and stabilizing the network. Hence, hub proteins had a significant influence in regulating the network and might be playing an important role in PCa. In order to identify the most influential key regulator proteins in the network, the top 103 hub proteins having degree (k) ≥ 65 were considered for identification of the key regulators through their representation at every topological level (Supplementary Table 1). After tracing the hubs at every topological level, 19 of them (RPL11, RPL15, RPL19, RPL23A, RPL3, RPL5, RPL6, RPLP0, RPS11, RPS8, RPSA, HSPA5, NOP2, RANBP2, SNU13, CUL7, CCT4, AHSA1 and EIF3A) (Tables 1, 2) were found to be the backbone of the network. These key regulators, along with their partners forming the motifs (Fig. 3B), might be playing the most important roles in regulating and maintaining the stability (network integrity, optimization of signal processing, dynamics etc.) of the network.

$$(\begin{array}{c}\begin{array}{c}{C}_{C}\\ {C}_{B}\\ {C}_{E}\end{array}\\ {C}_{S}\end{array}) \sim (\begin{array}{c}\begin{array}{c}{k}^{\varepsilon }\\ {k}^{\eta }\\ {k}^{\delta }\end{array}\\ {k}^{\zeta }\end{array});(\begin{array}{c}\begin{array}{c}\varepsilon \\ \eta \\ \delta \end{array}\\ \zeta \end{array})\to (\begin{array}{c}\begin{array}{c}0.09-0.14\\ 0.89-2.00\\ 0.90-1.44\end{array}\\ 0.07-3.20\end{array})$$

(13)

Table 1 Key regulators and their topological properties.

Table 2 The key regulators identified in this study and their key functions in disease conditions.

Modules of the network were associated with specific functions

Community detection of the network using the Louvain modularity optimization method led to clustering of the primary PCa network down to the level of motifs (Fig. 3A). This clustering showed that the modularity (Q) of the networks exhibited a decreasing pattern with topological levels, with the highest average modularity (Q = 0.5527) seen at the first hierarchical level and the lowest (Q = 0.0013) at level V, the motif level^35,36.

In complex PPI networks, the modules have biological meanings relating to functions, and gene ontology analyses have revealed enrichment of certain known functions and pathways in the modules³⁷. Our primary PCa network was composed of 14 modules deduced from the community detection, with mean clustering coefficients C(k) ~ 0.094−0.392 (Table 3). Among these, modules C12 and C13 were the largest and had the highest mean clustering coefficients (C(k) = 0.392 and 0.218, respectively), showing functional homogeneity in the modules. These modules were analysed for their functional annotations with the DAVID functional annotation tool³⁴ to reveal their association with different functions (Table 3).

Table 3 Average Clustering coefficients of the PCa modules at first hierarchical level.

Hubs in the PCa network coordinate the modules acting as modular hubs

In complex hierarchical networks, the modularity of sub-communities and the roles played by the nodes in the modules are defined by the nodes' within-module Z score, Z_i, along with their participation coefficients, P_i³⁰. Z_i gives the degree of the nodes within their modules, and P_i describes the influence of a node inside the module, as well as outside it, in terms of signal processing and maintaining network stability. Hence, Z_i and P_i were calculated for each node in the modules using Eqs (9) and (10), respectively. Accordingly, based on the within-module Z score and the participation coefficient, the nodes are classified as follows:

(1) Modular non-hub nodes, Z_i < 2.5: (R1) ultra-peripheral nodes: nodes linking all other nodes within their modules, P_i ≤ 0.05; (R2) peripheral nodes: nodes linking most other nodes in their modules, 0.05 < P_i ≤ 0.62; (R3) non-hub connector nodes: nodes linking many nodes in other modules, 0.62 < P_i ≤ 0.80; and (R4) non-hub kinless nodes: nodes linking all other modules, P_i > 0.80.
(2) Modular hubs, Z_i ≥ 2.5: (R5) provincial hubs: hub nodes linking the vast majority of nodes within their modules, P_i ≤ 0.30; (R6) connector hubs: hubs linking most of the other modules, 0.30 < P_i ≤ 0.75; and (R7) kinless hubs: hubs linking among all modules, P_i > 0.75.
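
The classification above is a pair of threshold tests on (Z_i, P_i) and can be written as a short lookup. The function name `classify_role` and the returned labels are introduced here only for illustration; the thresholds are those listed in the text.

```python
def classify_role(z, p):
    """Map a node's (Z_i, P_i) pair to one of the roles R1-R7."""
    if z < 2.5:                               # modular non-hub nodes
        if p <= 0.05:
            return "R1 ultra-peripheral node"
        if p <= 0.62:
            return "R2 peripheral node"
        if p <= 0.80:
            return "R3 non-hub connector node"
        return "R4 non-hub kinless node"
    if p <= 0.30:                             # modular hubs
        return "R5 provincial hub"
    if p <= 0.75:
        return "R6 connector hub"
    return "R7 kinless hub"

# Illustrative (Z_i, P_i) pairs, not values from the study.
for z, p in [(0.4, 0.0), (1.2, 0.7), (3.1, 0.2), (2.9, 0.5), (4.0, 0.9)]:
    print(z, p, classify_role(z, p))
```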

In the PCa PPI network study, many hub proteins were acting as modular hubs, helping to establish connections between the modules at different hierarchical levels. For example, CUL7 and RANBP2 were among the important key regulator protein hubs in PCa which also acted as modular kinless and connector hubs of modules C3 and C5 at the first hierarchical level (Fig. 4A,B). P53, E2F1 and c-MYC acted as kinless global hubs of module C9, connecting with all the modules and other proteins in the network. NOP56, FBL, RNF2 and NPM1 also acted as connector modular hubs of the C12 module, connecting other modules at the same level (Fig. 4A,C).

PCa network exhibited non-monotonicity in rich-club formation across the hierarchy

Identification of rich-club nodes is another common way to study the influence of hubs that form strong connections among themselves; it is done by calculating the normalized rich-club coefficient Φ_norm across the degree range k (Eq. 11). A normalized rich-club coefficient Φ_norm > 1 indicates the existence of a rich club among the nodes, which plays a key role in network integration, increasing its stability and improving the efficiency of transmission of information among hub proteins. Since the PCa network is hierarchical and disassortative in nature, with node neighbourhood connectivity C_N(k) following a power-law distribution against degree (k) with a negative exponent −β (Eq. 12), rich-club formation among the hub proteins is quite unlikely^32,38. Although rich-club formation is not exhibited among high-degree hub proteins, the proteins of intermediate degree, 19 ≤ k ≤ 107, showed higher rich-club coefficients than the hubs in the PCa network (Fig. 5). Across the hierarchy of the PCa network, different patterns of rich-club coefficients were exhibited among the modules (Fig. 5), showing non-monotonic behaviour at different hierarchical levels. Modules C12 and C13 at the first hierarchical level exhibit rich-club formation between the high-degree nodes, but the pattern changes at the lower levels. However, modules C8, C10 and C15 exhibit an assortative nature because (i) the node neighbourhood connectivity C_N(k) in these modules follows a power law with positive exponents, (ii) Φ increases monotonically with degree k, and (iii) Φ_norm approximately increases with degree k with values of Φ_norm > 1 (Fig. 6), indicating the possibility of rich-club formation among the high-degree nodes (Fig. 6A). Considering the nodes with degrees whose Φ_norm is larger than one, the approximate ranges of degrees of nodes forming rich clubs in these three modules are 61 ≥ k ≥ 14 (C8), 52 ≥ k ≥ 6 (C10) and 37 ≥ k ≥ 6 (C15), and these nodes clearly show rich-club formation in the respective network modules (red coloured nodes in the respective modules in Fig. 6).

Discussion

Real-world complex networks generally have a hierarchically organized community structure, which is evident from fractal studies and the scaling behaviour of these networks²¹. Even though there is no specific definition of communities or modules in a network, each community/module is established by densely interconnected nodes forming clusters around the hub nodes, which generally have their own local properties and organization³⁵. The hubs have the most interactions in the network due to their high degree, constitute both intra- and inter-community interactions in a hierarchical manner, and thus play a central role in information processing in the network³¹. The primary PPI PCa network constructed in this study for tracking the hubs down to the level of motifs led to the identification of 19 key regulators (hubs) from 3,871 genes found to be significantly overexpressed in human prostate adenocarcinomas. There are a limited number of community finding methods for complex networks, among which the Newman and Girvan leading eigenvector algorithm^35,36 is commonly used. However, in comparatively large complex networks, the Louvain method, which is based on maximization/optimization of the modularity Q¹⁷, is the most suitable, sensitive and comparatively faster. In our study, considering the size of the network and its sensitivity, we used the Louvain method for community detection and, while giving equal importance to the hubs, motifs and modules of the network, identified the novel key regulators. Eleven key regulators (RPL11, RPL15, RPL19, RPL23A, RPL3, RPL5, RPL6, RPLP0, RPS11, RPS8 and RPSA) belong to the family of ribosomal proteins (RPs), which are involved in ribosomal biosynthesis, and the other eight predicted regulators (HSPA5, NOP2, RANBP2, SNU13, CUL7, CCT4, AHSA1 and EIF3A) have important functions reported to be associated with various other cancers. Moreover, at the level of motifs these key regulators interact with other proteins which may also be playing important roles in PCa, establishing themselves as candidate disease genes along with the key PCa regulators (Fig. 3B).

The emergence of 11 RPs as key regulators in PCa is an important finding in this study. It could be due to the crucial role of RPs in cell growth and proliferation propagated through protein synthesis. In cancers, ribosomal biosynthesis increases to meet the requirement of rapidly growing/proliferating cells³⁹. Some RPs take part in extra-ribosomal functions involved in tumorigenesis, immune cell signalling, and development and regulating diseases through translocation across the nuclear pore complex⁴⁰. RPs have been associated with tumorigenesis either as oncoproteins or tumour suppressors, with differential roles being reported in different cancers. During ribosomal or nucleolar stress such as hypoxia, lack of nutrient, starvation, deregulation of genes etc., RPs modulate the p53-mediated apoptosis. The association of RPs with cancers as discussed in Table 2 suggests a potential unexplored function of these proteins in PCa, both as therapeutic target and predictive biomarker. An understanding of the functions and the pathways of key RPs, for example their role in stabilizing p53 during ribosomal stress and role in cell growth/proliferation in PCa patients is of immense significance as it provides new insights into the control and prevention of PCa.

The other, non-ribosomal predicted key regulators identified in this study, SNU13, CCT4, AHSA1, CUL7, EIF3A, HSPA5, NOP2 and RANBP2, are also vital in cell physiology and are equally important for their involvement in cell growth and proliferation. NHP2-like protein 1 (SNU13), identified in this study as another key regulator, is a component of the spliceosome complex⁴¹ which interacts with several RPs and thereby reinforces the role of RPs in cancers. CCT4, chaperonin containing TCP1 subunit 4, is a chaperone that, when mutated, is associated with hereditary sensory neuropathy⁴².

AHSA1, the Activator of HSP90 ATPase Activity 1, is a positive regulator of heat shock protein 90 (HSP90). When activated, HSP90 forms a complex with HSP70 which helps either in the binding of the tumour suppressor p53 to DNA or in its degradation by ubiquitination⁴³. In cancers, activated HSP90 stabilizes mutated p53, which decreases its DNA binding activity and degradation through binding with its inhibitor MDM2, thus promoting tumour progression⁴⁴. The activation and transportation of steroid hormone receptors (androgen receptor, AR, and oestrogen receptor, ER) to the nucleus is also mediated by HSP90⁴⁵; thus, AHSA1 activation of HSP90 may influence androgen metabolism in PCa. Moreover, AHSA1 is a regulator of cell growth, apoptosis, migration and invasion through the Wnt/β-catenin signalling pathway⁴⁶, which suggests its role as a candidate disease gene in PCa.

CUL7, Cullin 7, is a component of an E3 ubiquitin-protein ligase complex and interacts with p53, CUL9 and FBXW8, and is reported to be an antiapoptotic oncogene⁴⁷. CUL7 has been associated with various cancer types, and its promotion of epithelial-mesenchymal transition in metastasis and its regulation of ERK-SNAI2 signalling, which affects the expression of the cell adhesion proteins E-cadherin, fibronectin, N-cadherin and vimentin in cancer, are well studied⁴⁸. CUL7 inhibits apoptosis in lung cancer through inhibition of p53, which regulates c-MYC cell cycle progression⁴⁷. In breast cancer, CUL7 regulates cell cycle progression through cyclin A overexpression and, by influencing microtubule dynamics, affects cell migration, which is a hallmark of cancer⁴⁹. Accordingly, targeted knockdown and silencing of CUL7 has led to a decrease in cell proliferation and weaker −tubulin accumulation in microtubules, promoting their stability and decreasing cell migration (in breast, liver and lung carcinoma cells), and CUL7 has been suggested as a potential therapeutic target in various cancers^47,48,49.

Eukaryotic translation initiation factor 3 subunit A (EIF3A) forms the 43S pre-initiation complex (43S PIC) with other initiation factors and the 40S ribosome, and initiates the protein synthesis process. This complex mainly translates genes involved in cell proliferation, cell differentiation, apoptosis etc., and exerts transcriptional activation/repression through different modes of stem-loop binding to the mRNAs⁵⁰. Dysregulation of translation initiation and the role of EIF3 have been studied in cancers, and the involvement of the EIF3 complex in the regulation of the mTOR pathway⁵¹ makes it an interesting protein to study for its regulatory role in PCa.

Heat shock protein family A (HSP70) member 5 (HSPA5), or glucose-regulated protein 78 kDa (GRP78), is a chaperone localized in the endoplasmic reticulum (ER) that is involved in the folding and assembly of proteins. It plays an active role in the unfolded protein response under ER stress, promoting cell survival, which is a common way of escaping cell death in cancers^52,53. Because of this activity, HSPA5 is an emerging therapeutic drug target for cancer.

NOP2 (p120) is a putative RNA methyltransferase whose expression is detectable in proliferating normal and tumour cells but undetectable in non-proliferating normal cells⁵⁴. Its role in regulating cell cycle progression from G1 to S phase and in the transformation of normal fibroblast cells^55,56 makes NOP2 an interesting protein that could be used as a biomarker for cell transformation. Ran binding protein 2 (RANBP2) is another key regulator identified in this study. It is involved in the SUMOylation of Topoisomerase IIα before the onset of anaphase, which helps in the separation of chromatids from the centromere. Its under-expression, mutation or deficiency has been observed in various cancers, especially lung cancer and myelomonocytic leukaemia, where it acts as a tumour suppressor gene⁵⁷. Since SUMOylation plays an important role in tumour progression⁵⁸, the p150/importin/RANBP2 pathway may also play a significant role in PCa progression.

In PCa, p53 and AR are the most mutated genes reported according to COSMIC¹⁴. The protein-protein interactome of GeneMANIA¹⁵ showed that, of the 19 key regulators identified in this study, 12 (CUL7, HSPA5, CCT4, RPL19, RPL11, RPL3, RPL6, RPLP0, RANBP2, RPS8, RPL23A and RPL15) interact directly with p53, and the other key regulators interact with p53 through them (Fig. 3C). Mutation in the androgen receptor (AR) gene causes the mutated receptor to remain in an activated state and to maintain androgen receptor-mediated downstream signalling even at lower levels of circulating androgens, an association that led to the discovery of androgen independence in prostate cancer⁵⁹. A recent report describes several mutations in the AR gene in different metastatic castration-resistant prostate cancer (CRPC) patients, suggesting AR mutants as good biomarker candidates⁶⁰. β-catenin (CTNNB1) and GSK-3β are other co-regulators of the androgen receptor. Phosphorylation of AR by GSK-3β inhibits AR-driven transcription, but in prostate cancer, increased AKT activity suppresses GSK-3β through phosphorylation, which helps PCa progression⁶¹. In PCa, loss of the tumour suppressor gene PTEN also releases the inhibitory effect on AR, increasing its translocation to the nucleus and its transcriptional activity⁶². Therefore, the key regulators act on AR indirectly through p53 and β-catenin (CTNNB1) (Fig. 3C): the 12 key regulators interact with p53, which regulates GSK-3β and PTEN, the upstream regulators of AR. In addition, the key regulators RPSA and HSPA5 interact with AR indirectly through β-catenin (CTNNB1) and AKT1, suggesting an important role of the reported key regulators in regulating the functions mediated through p53 and AR in PCa. The findings reiterate the putative roles of these hubs in PCa manifestation and progression. This study may prove fundamental in characterizing potential therapeutic targets and biomarkers for sensitive intervention and diagnosis of PCa.

It is to be noted that the PCa PPI network in this study followed a hierarchical scale-free topology. Along with the conventional centrality measures C_B, C_C, C_E and C_S, the probability of degree distribution P(k), the clustering coefficient C(k) and the node neighbourhood connectivity distribution C_N(k) are used to characterize whether a network is a scale-free, random, small-world or hierarchical network²¹. The PCa PPI network followed power-law distributions against degree k, with negative exponents, for the probability of node degree distribution P(k), the clustering coefficient C(k) and the neighbourhood connectivity distribution C_N(k)²¹ (Eq. 12) (Fig. 2), indicating that the network exhibits hierarchical scale-free behaviour, which can support a systems-level organization of modules/communities.
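To illustrate how a power-law exponent of the kind reported for P(k), C(k) and C_N(k) might be estimated, the following minimal sketch fits a straight line in log-log space. The function name fit_power_law_exponent and the toy data are assumptions introduced here for illustration; the fitting procedure behind Eq. 12 is not described in this section, so this is not necessarily the procedure used in the study.

```python
import math

def fit_power_law_exponent(ks, ys):
    """Least-squares slope of log(y) against log(k).

    For data following y ~ k**s the returned value approximates s,
    so a negative result corresponds to a decaying power law.
    """
    # Keep only strictly positive pairs, since the logarithm is undefined otherwise.
    pts = [(math.log(k), math.log(y)) for k, y in zip(ks, ys) if k > 0 and y > 0]
    n = len(pts)
    if n < 2:
        raise ValueError("need at least two positive (k, y) pairs")
    mean_x = sum(x for x, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    if sxx == 0:
        raise ValueError("all k values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    return sxy / sxx

# Toy data (hypothetical): an exact power law with exponent -1.5.
ks = [1, 2, 4, 8, 16]
ys = [k ** -1.5 for k in ks]
exponent = fit_power_law_exponent(ks, ys)
```

A straight-line fit on logarithmic axes is the simplest way to read off an exponent; for noisy empirical degree distributions, binning or maximum-likelihood estimation is often preferred.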

Since the node neighbourhood connectivity distribution C_N(k) obeyed a power law with negative exponent β as a function of degree k, the network is disassortative, indicating that there is no signature of rich-club formation among the high-degree nodes in the network³². Degree centrality is the centrality measure most commonly used to define hubs, which are the high-degree nodes in the network. This disassortativity may be due to the sparse distribution of the hubs among the modules, where they play key roles in coordinating specific functions within each module as well as in establishing connections among the modules³². Furthermore, we used the Louvain modularity optimization method¹⁷ to detect communities and sub-communities and their organization at various levels (Fig. 3A). The communities/sub-communities at the various hierarchically organized levels also exhibited hierarchical scale-free topology, as was the case in the primary PCa network (Fig. 2). This hierarchical organization shows the systematic coordinating role of the emerged modules/communities and hubs in regulating and maintaining the properties of the network¹⁰. In networks of this type, the centrality-lethality rule³¹ is not obeyed, which indicates that disturbing a hub or hubs will not cause the whole network to collapse.
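The link between a decreasing C_N(k) and disassortativity can be made concrete with a small sketch. It assumes that C_N(k) is taken as the mean degree of the neighbours of a node, averaged over all nodes of degree k; the function name neighbourhood_connectivity and the star-shaped toy graph are hypothetical and are not taken from the study.

```python
from collections import defaultdict

def neighbourhood_connectivity(edges):
    """Return {k: C_N(k)} for an undirected, unweighted edge list.

    C_N(k) is computed here as the mean neighbour degree,
    averaged over all nodes that have degree k.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)
    degree = {node: len(nbrs) for node, nbrs in adj.items()}
    per_degree = defaultdict(list)
    for node, nbrs in adj.items():
        mean_nbr_degree = sum(degree[m] for m in nbrs) / degree[node]
        per_degree[degree[node]].append(mean_nbr_degree)
    return {k: sum(vals) / len(vals) for k, vals in sorted(per_degree.items())}

# Toy graph (hypothetical): one hub connected to four leaves.
star = [("hub", leaf) for leaf in ("a", "b", "c", "d")]
cn = neighbourhood_connectivity(star)
```

In this toy graph the degree-1 leaves see a neighbour of degree 4, while the degree-4 hub sees only neighbours of degree 1, so C_N(k) falls as k rises, which is the disassortative pattern described above.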

Another important feature of the PCa network is the non-monotonic behaviour of rich-club formation in the PCa PPI network and across its hierarchy (Fig. 5). The intermediate-degree nodes (19 ≤ k ≤ 107) in the PCa network showed normalised rich-club coefficients (Φ_norm > 1) greater than those of the highest-degree hubs, indicating an important role of these intermediate-degree nodes (AR also falls in this category) in regulating the network organization and maintaining stability by establishing key links between the low-degree nodes and the high-degree hubs. Hence, this category of nodes could perform key roles, especially in integrating various types of nodes in the network to optimize its topological properties. The formation of a rich club among the high-degree nodes in communities C8, C10 and C15 (Fig. 6A) indicates an increased sensitivity of these hubs to being targeted; hence they take significant roles in regulating their respective modular functions, i.e., endocytosis, proteasome and DNA repair mechanisms (Table 3). The high-degree hubs in these modules fall among the intermediate-degree nodes in the primary PCa PPI network (Fig. 6B). Thus, the varying pattern of rich-club signatures across the hierarchy may relate to the change in popularity of the proteins at different levels of organization, and hence hub proteins preserve their level-dependent influence across the hierarchy¹⁰. Such behaviour in PPI networks can be correlated with their weaker resilience and instability at the sub-system/modular level, which may be critical for certain functional modules in the event of malfunctions in the key regulator hub proteins.
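For readers unfamiliar with the rich-club coefficient, the sketch below computes the unnormalised version, i.e. the density of links among nodes whose degree exceeds k. The function name rich_club_coefficient and the toy graph are assumptions for illustration. The normalised value Φ_norm discussed above additionally requires dividing by the same quantity measured on degree-preserving randomised networks, a step that is deliberately left out here.

```python
from collections import defaultdict

def rich_club_coefficient(edges):
    """Return {k: phi(k)} for an undirected, unweighted edge list.

    phi(k) = 2 * E_k / (N_k * (N_k - 1)), where N_k is the number of
    nodes with degree greater than k and E_k the number of links among them.
    Values of k that leave fewer than two such nodes are skipped.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)
    degree = {node: len(nbrs) for node, nbrs in adj.items()}
    phi = {}
    for k in sorted(set(degree.values())):
        rich = {node for node, d in degree.items() if d > k}
        if len(rich) < 2:
            continue
        # Each link inside the club is seen from both endpoints, hence // 2.
        links = sum(1 for u in rich for v in adj[u] if v in rich) // 2
        phi[k] = 2 * links / (len(rich) * (len(rich) - 1))
    return phi

# Toy graph (hypothetical): three mutually linked hubs, each with one leaf.
edges = [("A", "B"), ("B", "C"), ("A", "C"), ("A", "a"), ("B", "b"), ("C", "c")]
phi = rich_club_coefficient(edges)
```

Here the three hubs form a fully connected club, so the coefficient for k = 1 is 1.0; in a real network the curve is compared against its randomised counterpart before any rich-club claim is made.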

Centrality measures are used to assess the importance of nodes in information processing in the network. Betweenness centrality C_B, closeness centrality C_C, eigenvector centrality C_E and subgraph centrality C_S are topological properties that can determine the efficiency of signal transmission in a network²⁵. These parameters follow a power law as a function of degree k with positive exponents, so that the centralities tend to increase with node degree (Eq. 13) (Fig. 2). This reveals an increase in the efficiency of signal processing with higher-degree nodes in the PCa network and shows the importance of hubs in controlling the flow of information, thereby regulating and stabilizing the network organization. Therefore, hub proteins have a significant influence in regulating the network although they do not control the whole network completely, which increases their risk of being targeted in the network. Hence, certain hubs might be acting as key regulators in PCa, and the 19 predicted key regulators might serve as a backbone of the network.

Community detection using the Louvain modularity optimization method clustered the primary PCa network down to the level of motifs (Fig. 3A). This clustering showed that the modularity Q of the networks varies with the topological level, with the highest average modularity (Q = 0.5527) seen at the first hierarchical level of the PCa network and the lowest (Q = 0.0013) at level V, that is, at the motif level^35,36. In complex PPI networks the modules have biological meaning, and gene ontology analyses have revealed enrichment of certain known functions and pathways in the modules³⁷. The functional homogeneity in the modules of the PCa network has been correlated with their mean clustering coefficients, as modules with higher mean clustering coefficients have a better chance of being associated with specific functions^63,64. Moreover, in a disease interactome, the disease modules, which are unique modules representing the interaction between disease genes and their neighbourhood, overlap with the topological modules derived from the network and with the functional modules associated with functions, and the three are interrelated⁶⁵. The primary PCa network is composed of 14 modules deduced from the community detection method, with mean clustering coefficients C(k) ~ 0.094–0.392 (Table 3). Among them, modules C12 and C13, which were the largest, had the highest mean clustering coefficients, C(k) = 0.392 and 0.218, respectively, showing functional homogeneity in these modules. The modules were analysed with the DAVID functional annotation tool³⁴, which revealed association with different functions (Table 3). Modules C12 and C13 are represented by ribosomal biosynthesis and transcriptional regulation, respectively. This suggests a bigger role of RPs in PCa, which is also evident from the representation of various RPs (RPL3, 5, 6, 11, 15, 19 etc.) as key regulators in the PCa network. Transcriptional regulation is the most important level of gene regulation and is accomplished mainly through the binding of transcription factors, along with their cofactors, to the promoter regions of many genes. The tumour suppressor transcription factor (TF) p53, whose gene is the most mutated in PCa, is one of the hub proteins represented in this community. Another important TF, c-MYC, an oncogene acting as a regulator of cell cycle progression and cell division, is also represented in this community. Moreover, reports on the regulation of p53 by the key ribosomal proteins (RPL5, RPL6, RPL11 etc.) and on the regulation of c-MYC by the key regulator CUL7 through p53 in several cancers suggest a critical association of transcriptional regulation in PCa.
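The modularity Q used above to compare partitions at different hierarchical levels can be sketched as follows for a fixed assignment of nodes to communities. The sketch assumes the standard Newman-Girvan form for undirected, unweighted networks without duplicate edges; the function name modularity and the toy graph are hypothetical. It only evaluates Q for a given partition and does not perform the Louvain optimization itself.

```python
from collections import defaultdict

def modularity(edges, community):
    """Newman-Girvan modularity Q of a partition.

    Q = sum over communities c of [ L_c / m - (d_c / (2 * m)) ** 2 ],
    where m is the number of edges, L_c the number of edges with both
    ends in c, and d_c the total degree of the nodes in c.
    `community` maps every node to its community label.
    """
    m = len(edges)
    if m == 0:
        raise ValueError("graph has no edges")
    internal = defaultdict(int)    # L_c
    degree_sum = defaultdict(int)  # d_c
    for u, v in edges:
        degree_sum[community[u]] += 1
        degree_sum[community[v]] += 1
        if community[u] == community[v]:
            internal[community[u]] += 1
    return sum(internal[c] / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum)

# Toy graph (hypothetical): two triangles joined by a single bridge edge.
edges = [(1, 2), (2, 3), (1, 3), (4, 5), (5, 6), (4, 6), (3, 4)]
partition = {1: "A", 2: "A", 3: "A", 4: "B", 5: "B", 6: "B"}
q = modularity(edges, partition)
```

For this toy partition Q equals 5/14 (about 0.357); putting all six nodes into a single community gives Q = 0, which shows why a well-separated partition scores higher.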

Since the study of complex hierarchical networks is incomplete without understanding the modularity of sub-communities and the roles played by the nodes in the modules, our study applied the approach to characterize the nodes in PCa network through defining their within-module Z score Z_i with their participation coefficients P_i³⁰. In the PCa network many hub proteins act as modular kinless hubs or connector modular hubs maintaining the links within the modules as well as connecting other modules at the same level (Fig. 4A–C). This shows the importance of the hub-proteins in the hierarchical organization of the network exhibiting their involvement in establishing links among the nodes in each module as well as among the modules in the network which are associated with specific functions.
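The two quantities used to classify node roles can be illustrated with a short sketch based on the usual Guimerà-Amaral definitions: the within-module degree z-score Z_i compares a node's intra-module degree with that of the other nodes in its module, and the participation coefficient P_i = 1 − Σ_s (k_is / k_i)² measures how evenly its links are spread over modules. The function name node_roles, the use of the population standard deviation and the toy graph are assumptions made here for illustration.

```python
import math
from collections import defaultdict

def node_roles(edges, community):
    """Return {node: (Z_i, P_i)} for an undirected, unweighted edge list.

    `community` maps every node to its module label.
    Z_i is set to 0.0 when all nodes of a module have the same
    within-module degree (zero standard deviation).
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)
    # Within-module degree of every node.
    k_in = {n: sum(1 for m in adj[n] if community[m] == community[n]) for n in adj}
    members = defaultdict(list)
    for n in adj:
        members[community[n]].append(n)
    roles = {}
    for n in adj:
        vals = [k_in[m] for m in members[community[n]]]
        mean = sum(vals) / len(vals)
        sd = math.sqrt(sum((x - mean) ** 2 for x in vals) / len(vals))
        z = (k_in[n] - mean) / sd if sd > 0 else 0.0
        # Count the links of node n that go to each module.
        links_to = defaultdict(int)
        for m in adj[n]:
            links_to[community[m]] += 1
        k = len(adj[n])
        p = 1 - sum((c / k) ** 2 for c in links_to.values())
        roles[n] = (z, p)
    return roles

# Toy graph (hypothetical): two triangles joined by the bridge edge (3, 4).
edges = [(1, 2), (2, 3), (1, 3), (4, 5), (5, 6), (4, 6), (3, 4)]
partition = {1: "A", 2: "A", 3: "A", 4: "B", 5: "B", 6: "B"}
roles = node_roles(edges, partition)
```

In this toy graph node 3 has two links inside its own module and one link to the other, giving P = 4/9, whereas node 1 has all its links inside one module and P = 0; in the role classification, connector and kinless hubs are the nodes that combine a high Z_i with a high P_i.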

Conclusions

This paper introduces a new method for finding key regulators in prostate adenocarcinomas using biological networks constructed from high-throughput datasets of prostate cancer patients. The network theoretical approach used here placed equal emphasis on the hubs, motifs and modules of the network to identify key regulators/regulatory pathways, rather than being restricted to overrepresented motifs or hubs. It established a relationship between hubs, modules and motifs. The network used all genes associated with the disease, rather than manually curated datasets. The highest-degree hubs (k ≥ 65) were identified, of which 19 were novel key regulators. The network, as evident from the fractal nature of its topological parameters, was self-organized and lacked a central control mechanism. The identification of novel key regulators in prostate cancer, particularly ribosomal proteins, adds a new dimension to the understanding of PCa and its treatment and to the prediction of key disease genes/pathways within a network theoretical framework. This method can be applied to any network constructed from patients’ datasets that follows a hierarchical topology.

References

Aaron, L. T., Franco, O. E. & Hayward, S. W. Review of Prostate Anatomy and Embryology and the etiology of Benign Prostatic Hyperplasia. Urologic Clinics. 43(3), 279–288 (2016).
World Cancer Report 2014. World Health Organization. pp. Chapter 5.11. ISBN 9283204298.
Tobias, J. & Hochhauser, D. Cancer and its management (7th edition). (Wiley-Blackwell, West Sussex, UK 2015).
Edge, S. B. et al. American Joint Committee on Cancer (AJCC). Cancer Staging Manual. 7th ed. Springer, New York, USA (2009).
Mateo, J., Smith, A., Ong, M. & de Bono, J. S. Novel drugs targeting the androgen receptor pathway in prostate cancer. Cancer metastasis reviews. 33, 567–579 (2014).
Ritch, C. R. & Cookson, M. S. Advances in the management of castration resistant prostate cancer. BMJ 355, i4405 (2016).
Erdogan, B., Kostek, O. & Bekirhacioglu, M. Enzalutamide in Prostate Cancer, A Review on Enzalutamide and cancer. EJMO 2(3), 121–129 (2018).
Saini, S. PSA and beyond: alternative prostate cancer biomarkers. Cellular Oncology. 39(2), 97–106 (2016).
Naji, L. et al. Digital Rectal Examination for Prostate Cancer Screening in Primary Care: A Systematic Review and Meta-Analysis. Ann. Fam. Med. 16(2), 149–154 (2018).
Ali, S. et al. Exploring novel key regulators in breast cancer network. Plos One 13(6), e0198525 (2018).
Milo, R. et al. Network Motifs: Simple Building Blocks of Complex Networks. Science 298(5594), 824–827 (2002).
Alon, U. Network motifs: theory and experimental approaches. Nature Reviews, Genetics 8, 450–461 (2007).
Dingerdissen, H. M. et al. BioMuta and BioXpress: mutation and expression knowledgebases for cancer biomarker discovery. Nucleic Acids Research 46(Database issue), D1128–D1136 (2018).
Tate, J. G. et al. COSMIC: The Catalogue Of Somatic Mutations In Cancer. Nucleic Acids Res. 47(D1), D941–D947 (2019).
Warde-Farley, D. et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220 (2010).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research 13(11), 2498–504 (2003).
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 10, P10008 (2008).
Yeger-Lotem, E. et al. Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction. PNAS 101(16), 5934–5939 (2004).
Assenov, Y., Ramírez, F., Schelhorn, S. E., Lengauer, T. & Albrecht, M. Computing topological parameters of biological networks. Bioinformatics 24(2), 282–284 (2008).
Tang, Y., Li, M., Wang, J., Pan, Y. & Wu, F.-X. CytoNCA: A cytoscape plugin for centrality analysis and evaluation of protein interaction networks. Biosystems 127, 67–72 (2015).
Ravasz, E., & Barabási, A.-L. Hierarchical organization in complex networks. Physical Review E, 67(2) (2003).
Pastor-Satorras, R., Vázquez, A. & Vespignani, A. Dynamical and correlation properties of the Internet. Physical review letters 87(25), 258701 (2001).
Barrat, A., Barthelemy, M., Pastor-Satorras, R. & Vespignani, A. The architecture of complex weighted networks. PNAS, USA 101(11), 3747–3752 (2004).
Newman, M. E. J. & Girvan, M. Finding and evaluating community structure in networks. Physical Review E 69(2), 026113 (2004).
Estrada, E. & Rodríguez-Velázquez, J. A. Subgraph centrality in complex networks. Physical Review E 71(5), 056103–1-9 (2005).
Canright, G. & Engo-Monsen, K. Roles in networks. Science of Computer Programming 53(2), 195–214 (2004).
Borgatti, S. P. & Everett, M. G. A graph-theoretic perspective on centrality. Social networks 28(4), 466–484 (2006).
Brandes, U. A faster algorithm for betweenness centrality. J. Math. Sociol. 25, 163–177 (2001).
Canright, G. S. & Engo-Monsen, K. Spreading on networks: a topographic view. Complexus 3(1–3), 131–146 (2006).
Guimerà, R. & Nunes Amaral, L. A. Functional cartography of complex metabolic networks. Nature 433(7028), 895–900 (2005).
Jeong, H., Mason, S. P., Barabasi, A. L. & Oltvai, Z. N. Lethality and centrality in protein networks. Nature 411, 41–42 (2001).
Colizza, V., Flammini, A., Serrano, M. A. & Vespignani, A. Detecting rich-club ordering in complex networks. Nature Physics 2(2), 110–115 (2006).
Rubinov, M. & Sporns, O. Complex network measures of brain connectivity: Uses and interpretations. NeuroImage 52(3), 1059–1069 (2010).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc. 4(1), 44–57 (2009).
Newman, M. E. J. Modularity and community structure in networks. PNAS 103(23), 8577–8582 (2006).
Newman, M. E. J. The Structure and Function of Complex Networks. SIAM Review 45(2), 167–256 (2003).
Dong, J. & Horvath, S. Understanding network concepts in modules. BMC Systems Biology 1(1), 24 (2007).
Zhou, S. & Mondragon, R. J. The Rich-Club Phenomenon in the Internet Topology. IEEE Communications Letters 8(3), 180–182 (2004).
Dolezal, J. M., Dash, A. P. & Prochownik, E. V. Diagnostic and prognostic implications of ribosomal protein transcript expression patterns in human cancers. BMC Cancer 18, 275 (2018).
Zhou, X., Liao, W.-J., Liao, J.-M., Liao, P. & Lu, H. Ribosomal proteins: functions beyond the ribosome. Journal of Molecular Cell Biology 7(2), 92–104 (2015).
Bertram, K. et al. Cryo-EM Structure of a Pre-catalytic Human Spliceosome Primed for Activation. Cell 170(4), 701–713.e11 (2017).
Li, J., Soroka, J. & Buchner, J. The Hsp90 chaperone machinery: Conformational dynamics and regulation by co-chaperones. Biochimica et Biophysica Acta (BBA) - Molecular Cell Research 1823(3), 624–635 (2012).
Müller, L., Schaupp, A., Walerych, D., Wegele, H. & Buchner, J. Hsp90 Regulates the Activity of Wild Type p53 under Physiological and Elevated Temperatures. Journal of Biological Chemistry 279(47), 48846–48854 (2004).
Peng, Y., Chen, L., Li, C., Lu, W. & Chen, J. Inhibition of MDM2 by hsp90 Contributes to Mutant p53 Stabilization. Journal of Biological Chemistry 276(44), 40583–40590 (2001).
Ratajczak, T., Cluning, C. & Ward, B. K. Steroid Receptor-Associated Immunophilins: A Gateway to Steroid Signalling. Clin. Biochem. Rev. 36(2), 31–52 (2015).
Shao, J., Wang, L., Zhong, C., Qi, R. & Li, Y. AHSA1 regulates proliferation, apoptosis, migration, and invasion of osteosarcoma. Biomedicine & Pharmacotherapy 77, 45–51 (2016).
Kim, S. S. et al. CUL7 Is a Novel Antiapoptotic Oncogene. Cancer Research 67(20), 9616–9622 (2007).
Tian, P., Liu, D., Sun, L. & Sun, H. Cullin7 promotes epithelial-mesenchymal transition of esophageal carcinoma via the ERK-SNAI2 signaling pathway. Molecular Medicine Reports 17(4), 5362–5367 (2018).
Qiu, N. et al. Cullin7 is a predictor of poor prognosis in breast cancer patients and is involved in the proliferation and invasion of breast cancer cells by regulating the cell cycle and microtubule stability. Oncology Reports 39, 603–610 (2017).
Lee, A. S. Y., Kranzusch, P. J., Doudna, J. A. & Cate, J. H. D. eIF3d is an mRNA cap-binding protein that is required for specialized translation initiation. Nature 536(7614), 96–99 (2016).
Schipany, K., Rosner, M., Ionce, L., Hengstschläger, M. & Kovacic, B. eIF3 controls cell size independently of S6K1-activity. Oncotarget 6, 24361–24375 (2015).
Wang, M., Wey, S., Zhang, Y., Ye, R. & Lee, A. S. Role of the Unfolded Protein Response Regulator GRP78/BiP in Development, Cancer, and Neurological Disorders. Antioxidants & Redox Signaling 11(9), 2307–2316 (2009).
Cerezo, M. & Rocchi, S. New anti-cancer molecules targeting HSPA5/BIP to induce endoplasmic reticulum stress, autophagy and apoptosis. Autophagy 13(1), 216–217 (2016).
de Beus, E., Brockenbrough, J. S., Hong, B. & Aris, J. P. Yeast NOP2 encodes an essential nucleolar protein with homology to a human proliferation marker. J. Cell Biol. 127, 1799–1813 (1994).
Perlaky, L. et al. Increased growth of NIH/3T3 cells by transfection with human p120 complementary DNA and inhibition by a p120 antisense construct. Cancer Research 52(2), 428–436 (1992).
Fonagy, A. et al. Cell cycle regulated expression of nucleolar antigen P120 in normal and transformed human fibroblasts. Journal of Cellular Physiology 154(1), 16–27 (1993).
Lim, J. H. et al. RANBP2-ALK fusion combined with monosomy 7 in acute myelomonocytic leukemia. Cancer Genetics 207(1–2), 40–45 (2014).
Eifler, K. & Vertegaal, A. C. O. SUMOylation-Mediated Regulation of Cell Cycle Progression and Cancer. Trends in Biochemical Sciences 40(12), 779–793 (2015).
De Marzo, A. M. et al. Pathological and molecular mechanisms of prostate carcinogenesis: Implications for diagnosis, detection, prevention, and treatment. Journal of Cellular Biochemistry 91(3), 459–477 (2004).
Lallous, N. et al. Functional analysis of androgen receptor mutations that confer anti-androgen resistance identified in circulating cell-free DNA from prostate cancer patients. Genome Biology 17(1) (2016).
Salas, T. R. et al. Glycogen Synthase Kinase-3β Is Involved in the Phosphorylation and Suppression of Androgen Receptor Activity. Journal of Biological Chemistry 279(18), 19191–19200 (2004).
Lin, H.-K., Hu, Y.-C., Lee, D. K. & Chang, C. Regulation of Androgen Receptor Signaling by PTEN (Phosphatase and Tensin Homolog Deleted on Chromosome 10) Tumor Suppressor through Distinct Mechanisms in Prostate Cancer Cells. Molecular Endocrinology 18(10), 2409–2423 (2004).
Colizza, V., Flammini, A., Maritan, A. & Vespignani, A. Characterization and modeling of protein–protein interaction networks. Physica A: Statistical Mechanics and Its Applications 352(1), 1–27 (2005).
Lewis, A. C., Jones, N. S., Porter, M. A. & Deane, C. M. The function of communities in protein interaction networks at multiple scales. BMC Systems Biology 4(1), 100 (2010).
Barabási, A. L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nature Reviews Genetics. 12(1), 56–68 (2011).
Golomb, L., Volarevic, S. & Oren, M. p53 and ribosome biogenesis stress: the essentials. FEBS Lett. 588, 1–9 (2014).
Meng, X. et al. RPL23 Links Oncogenic RAS Signaling to p53-Mediated Tumor Suppression. Cancer Research 76(17), 5030–5039 (2016).
Dai, M.-S., Arnold, H., Sun, X.-X., Sears, R. & Lu, H. Inhibition of c-Myc activity by ribosomal protein L11. The EMBO Journal 26(14), 3332–3345 (2007).
Gou, Y. et al. Ribosomal protein L6 promotes growth and cell cycle progression through upregulating cyclin E in gastric cancer cells. Biochemical and Biophysical Research Communications 393, 788–793 (2010).
Chen, R. et al. Proteins associated with pancreatic cancer survival in patients with resectable pancreatic ductal adenocarcinoma. Laboratory Investigation 95(1), 43–55 (2014).
Zhang, Y. Z. et al. Discovery and validation of prognostic markers in gastric cancer by genome-wide expression profiling. World J Gastroenterol 17, 1710–1717 (2011).
Mao-De, L. & Jing, X. Ribosomal Proteins and Colorectal Cancer. Current Genomics 8, 43–49 (2007).
Callari, M. et al. Gene expression analysis reveals a different transcriptomic landscape in female and male breast cancer. Breast Cancer Res Treat. 127, 601–10 (2011).
Kato, Y. et al. Gene expression pattern in oral cancer cervical lymph node metastasis. Oncology Reports 16, 1009–1014 (2006).
Teller, A. et al. Dysregulation of apoptotic signaling pathways by interaction of RPLP0 and cathepsin X/Z in gastric cancer. Pathology - Research and Practice 211(1), 62–70 (2015).
Artero-Castro, A. et al. Expression of the ribosomal proteins Rplp0, Rplp1, and Rplp2 in gynecologic tumors. Human Pathology 42(2), 194–203 (2011).
Zhang, S.-C. et al. RPSA Gene Mutants Associated with Risk of Colorectal Cancer among the Chinese Population. Asian Pac J Cancer Prev 14(12), 7127–7131 (2013).
Jiang, G. et al. A novel biomarker C6orf106 promotes the malignant progression of breast cancer. Tumor Biology 36(10), 7881–7889 (2015).
Yong, W. H. et al. Ribosomal Proteins RPS11 and RPS20, Two Stress-Response Markers of Glioblastoma Stem Cells, Are Novel Predictors of Poor Prognosis in Glioblastoma Patients. PLOS ONE 10(10), e0141334 (2015).
Sethi, M. K. et al. Quantitative proteomic analysis of paired colorectal cancer and non-tumorigenic tissues reveals signature proteins and perturbed pathways involved in CRC progression and metastasis. Journal of Proteomics 126, 54–67 (2015).
Bee, A. et al. Ribosomal Protein L19 Is a Prognostic Marker for Human Prostate Cancer. Clinical Cancer Research 12(7), 2061–2065 (2006).
Russo, A. et al. Regulatory role of rpL3 in cell response to nucleolar stress induced by Act D in tumor cells lacking functional p53. Cell Cycle 15(1), 41–51 (2015).
Bee, A. et al. siRNA Knockdown of Ribosomal Protein Gene RPL19 Abrogates the Aggressive Phenotype of Human Prostate Cancer. Plos One 6(7), e22672 (2011).


Acknowledgements

I.R.M. acknowledges Deshbandhu College, University of Delhi for the study leave to pursue doctoral research. M.Z.M. acknowledges financial assistance from Department of Health Research, Ministry of Health and Family Welfare, Government of India under Young Scientist scheme (Sanction File No. R.12014/01/2018-HR, FTS No. 3146887). S.A. acknowledges the Department of Biotechnology, Ministry of Science and Technology, Government of India for the bioinformatics facility at Jamia Hamdard under BTISNet, the Biotechnology Information System Network (Sanction no. BT/BI/25/062/2012(BIF)). S.A. and O.K. acknowledge Indian Council of Medical Research for International fellowship to S.A. to visit Emory Winship Cancer Institute, Atlanta. R.K.B.S. acknowledges Jawaharlal Nehru University and UGC for UPE-II (Sanction no. 101) for financial assistance.

Author information

These authors contributed equally: Irengbam Rocky Mangangcha and Md. Zubbair Malik.

Authors and Affiliations

School of Interdisciplinary Sciences and Technology, Jamia Hamdard, New Delhi, 110062, India
Irengbam Rocky Mangangcha & Shakir Ali
Bioinformatics Infrastructure Facility, BIF, Jamia Hamdard & Department of Biochemistry, School of Chemical and Life Sciences, Jamia Hamdard, New Delhi, 110062, India
Irengbam Rocky Mangangcha & Shakir Ali
Department of Zoology, Deshbandhu College, University of Delhi, New Delhi, 110019, India
Irengbam Rocky Mangangcha
School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110067, India
Irengbam Rocky Mangangcha, Md. Zubbair Malik & R. K. Brojen Singh
Winship Cancer Institute of Emory University, 1365 Clifton Road NE, Atlanta, GA, 30322, USA
Ömer Küçük

Contributions

R.K.B.S., S.A., I.R.M. and M.Z.M. conceived the model and conducted numerical experiments. I.R.M. and M.Z.M. prepared figures of the numerical results. I.R.M., M.Z.M., O.K., S.A. and R.K.B.S. analysed and interpreted the simulation results and wrote the manuscript. R.K.B.S. and S.A. jointly supervised the study and approved the final draft.

Corresponding authors

Correspondence to Shakir Ali or R. K. Brojen Singh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table 1. Top 103 hubs.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Mangangcha, I.R., Malik, M.Z., Küçük, Ö. et al. Identification of key regulators in prostate cancer from gene expression datasets of patients. Sci Rep 9, 16420 (2019). https://doi.org/10.1038/s41598-019-52896-x

Received: 08 July 2019
Accepted: 15 October 2019
Published: 11 November 2019
DOI: https://doi.org/10.1038/s41598-019-52896-x
