Abstract
NEET proteins belong to a unique family of iron-sulfur proteins in which the 2Fe-2S cluster is coordinated by a CDGSH domain that is followed by the “NEET” motif. They are involved in the regulation of iron and reactive oxygen metabolism, and have been associated with the progression of diabetes, cancer, aging and neurodegenerative diseases. Despite their important biological functions, the evolution and diversification of eukaryotic NEET proteins are largely unknown. Here we used the three members of the human NEET protein family (CISD1, mitoNEET; CISD2, NAF-1 or Miner 1; and CISD3, Miner2) as our guides to conduct a phylogenetic analysis of eukaryotic NEET proteins and their evolution. Our findings identified the slime mold Dictyostelium discoideum’s CISD proteins as the closest to the ancient archetype of eukaryotic NEET proteins. We further identified CISD3 homologs in fungi that were previously reported not to contain any NEET proteins, and revealed that plants lack homolog(s) of CISD3. Furthermore, our study suggests that the mammalian NEET proteins, mitoNEET (CISD1) and NAF-1 (CISD2), emerged via gene duplication around the origin of vertebrates. Our findings provide new insights into the classification and expansion of the NEET protein family, as well as offer clues to the diverged functions of the human mitoNEET and NAF-1 proteins.
Similar content being viewed by others
Introduction
NEET proteins belong to a newly discovered class of iron-sulfur (Fe-S) proteins that harbor the 3Cys-1His CDGSH 2Fe-2S binding domain [C-X-C-X2-(S/T)-X3-P-X-C-D-G-(S/A/T)-H], followed by the “NEET” motif1. They were originally classified as zinc-finger proteins based on the presence of the zf-CDGSH domain (originally identified as a zinc-finger motif), but were later discovered to contain an 2Fe-2S cluster bound to the 3Cys-1His coordinates of the CDGSH motif by biochemical and X-ray structural analysis2,3,4,5,6. NEET proteins are unique among Fe-S proteins because their 3Cys-1His cluster coordination structure allows them to be both relatively stable, as well as to easily donate their 2Fe-2S cluster to other cluster acceptor proteins7,8,9. This feature makes NEET proteins highly versatile in their biological functions and has led to the idea that they participate in the regulation of iron, Fe-S, reactive oxygen species (ROS), and/or redox metabolism of cells1,10,11,12,13,14,15,16,17,18,19,20,21,22,23. Gain- and loss-of-function analysis of mammalian and plant NEET proteins indeed revealed that the plant and mammalian proteins have at least one conserved function in maintaining the overall iron and ROS homeostasis of cells, and in particular that of the mitochondria1,19,21,24.
NEET proteins can be classified into two types: Class I NEET proteins containing a single copy of the CDGSH 2Fe-2S binding motif per polypeptide chain, and Class II NEET proteins containing two copies of the CDGSH motif within a single polypeptide chain. Class I NEET proteins function in cells as a homodimer that is anchored to a membrane, and Class II NEET proteins function as a soluble monomer1,25. In humans, Class I NEET proteins are encoded by two genes: CISD1 that encodes mitoNEET (mNT), a protein that is anchored to the mitochondrial outer membrane, and CISD2 that encodes NAF-1 (previously called Miner1), a protein that is anchored to the ER, mitochondria and their interacting membranes (MAM)1. The Class II NEET protein in humans is encoded by the CISD3 (also called Miner 2) gene, and its protein product, which is localized to the mitochondria, does not contain a membrane anchoring domain1,25. Human mNT (CISD1) and NAF-1 (CISD2) proteins share 54% identical residues and 69% similar residues (sharing similar physicochemical properties) over 99% of their sequence lengths (Supplementary Fig. S1). In contrast, human CISD3 shares 50% identical and 63% similar residues with mNT over 50% of its sequence length, and 38% identical and 50% similar residues with NAF-1 over 63% of its sequence length (Supplementary Fig. S1).
Recent studies revealed that NEET proteins play important roles in several different human diseases. For example, mNT was implicated in diabetes, obesity, and cancer1,16,17, and NAF-1 was implicated in BCL-2-Beclin-1-BIK-dependent autophagy and BCL-2-dependent apoptosis, as well as in neurodegenerative diseases, skeletal muscle maintenance, cancer, and aging1,10,11,12,13,14,15,17,18,21,22,23. In addition, a homozygous intragenic deletion, or a missense mutation, that abolishes NAF-1 function leads to a rare genetic disease called Wolfram Syndrome 2 (WFS2); phenotypes associated with this disease include hearing deficiencies, neurodegeneration, severe blindness, diabetes and a lower life expectancy1,23.
Due to the importance of NEET proteins to human health1,10,11,12,13,14,15,17,18,21,22,23, their cluster transfer flexibility potential that was found to be critical for their function in cancer cells24, and their apparent presence in different unicellular and multicellular organisms1,2,3,4,5,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,25, we decided to conduct a phylogenetic analysis of NEET proteins in eukaryotic organisms. In particular, we were interested to find how many different NEET variants exist in different species, what is the origin of the human CISD genes, and why NEET proteins are absent in fungi, as reported in the previous studies25. Providing an answer to these questions would help in choosing different model organisms to study NEET protein function, as well as shed light on the different roles the different human NEET proteins play in cells.
Results
To conduct a phylogenetic analysis of CDGSH-motif containing NEET proteins, we first examined how many different types of proteins contain the zf-CDGSH motif. At least 23 different proteins containing the CDGSH motif were found in the Pfam26 database (Fig. 1). These varied from a single or a double motif of CDGSH to combinations of the CDGSH motif(s) with other domains such as thioredoxin, ferritin-like, and Glu-synthase. Although many different proteins were found to contain multiple copies of a particular domain, no protein variants with three or more copies of the CDGSH motif were found by our search. The reason for this is currently unknown, but it could be related to either the function of this motif as a putative Fe-S binding domain involved in cluster transfer reactions, or the ability of this motif to oligimerize1. The large number of different proteins containing the CDGSH motif, coupled with the uncertainty of how many of these proteins are Fe-S proteins as opposed to zinc-finger proteins, prompted us to conduct our phylogenetic analysis using the three well-defined human CDGSH-NEET proteins, that were biochemically and structurally shown to harbor an Fe-S cluster1, as guides (Fig. 1). This strategy provided us with a set of homologs of the three human NEET proteins in different species. The three human NEET proteins are represented in Fig. 1 as CDGSH variants 1 and 3 (CISD1 and CISD2), and 2 (CISD3), respectively (highlighted with a dashed box in Fig. 1). It should be noted that the domain annotated as MitoNEET_N in the Pfam database appears in both the human mNT (CISD1) and the NAF-1 (CISD2) proteins.
To identify homologs of human CISD1-, CISD2- and CISD3-NEET proteins in different organisms from different lineages, we first determined the thresholds for the PSI-BLAST searches to be used in our analysis. For this purpose we conducted a sensitivity analysis. Human CISD1, CISD2 and CISD 3 sequences were compared to each other using different PSI-BLAST parameters and the PSI-BLAST parameters at which any of these sequences, when used as the query sequence, returned the other two sequences among the BLAST hits were determined (Supplementary Table S1). Based on this analysis we used an Expect threshold (e-value) of 10 and a PSI-BLAST threshold of 5 for our PSI-BLAST27 searches for the three human NEET proteins (CISD1–3) in each of the genomes analyzed.
To examine the phyletic pattern of NEET proteins, that is, the presence or absence of NEET protein variants in organisms from different lineages, we retrieved a common tree of species from the NCBI taxonomy site (http://www.ncbi.nlm.nih.gov/Taxonomy/CommonTree/www.cmt.cgi), and populated it with organisms with fully sequenced and annotated genomes (retrieved from: http://www.ncbi.nlm.nih.gov/genome/browse/; Supplementary Table S2). For each branch on the tree we included only a single representative organism with a fully sequenced genome (Fig. 2). We focused on eukaryotes, with prokaryotes represented by a few bacterial and archaeal species. Each genome represented on the species tree was individually subjected to a protein PSI-BLAST search using human CISD1, CISD2, or CISD3 as a query, and the presence or absence of each of the different CISD homologs was determined and indicated next to the organism name on the species tree (Fig. 2). If a NEET protein homolog could not be unambiguously classified as CISD1 or CISD2 (i.e., its similarity was below a 50% cut-off to each of these proteins), it was annotated as CISD (Fig. 2). As shown in Fig. 2, we could only clearly distinguish between CISD1 and CISD2 clades in vertebrates (Chordata), suggesting that the gene duplication that resulted in the emergence of CISD1 and CISD2 likely coincided with the origin of vertebrates. Interestingly, we could identify several fungi that contained homologs of CISD3 (see also Fig. 1). However, we did not find CISD3 homologs in plants and this was further verified by performing a PSI-BLAST search for human CISD3 in all plant sequences (Supplementary Fig. S2; the only hits we got were of one of the CDGSH domains of human CISD3 with the single CDGSH domain found in the plant Class I CISD proteins). Although most organisms represented on the species tree shown in Fig. 2 contain at least one homolog of NEET proteins, some organisms, for example, Saccharomyces cerevisiae (fungi), or Acanthamoeba castellanii (amoeba) do not appear to have homologs of NEET proteins. Likewise, some bacteria such as Escherichia coli or Pseudomonas fluorescens do not harbor homologs of NEET proteins. These findings suggest that although NEET proteins are highly conserved in most multicellular organisms, they may not be essential for some eukaryotes or prokaryotes.
To further uncover the evolution the NEET protein family in eukaryotes, we performed multiple sequence alignments (MSAs; Supplementary Figs S3 and S4) and constructed phylogenetic trees for CISD1 and CISD2 homologs (Class I; Fig. 3 and Supplementary Fig. S5), and CISD3 homologs (Class II; Fig. 4 and Supplementary Fig. S6) using a maximum likelihood method (detailed in the Methods section). For the phylogenetic analyses of protein sequences for organisms shown in Figs 3, 4, and Supplementary Figs S3–S6, and to increase the sensitivity of these trees, we used two different organisms with a fully sequenced genome for each branch of the species tree (Fig. 2) and included all protein sequences that were identified from these genomes with the CISD1/CISD2/CISD3 PSI-BLAST search described above (these sometimes included different protein sequences that originated from the same gene via alternative splicing; Supplementary Table S3). As shown in Fig. 3, CISD1 and CISD2 protein sequences formed two distinct clades, highlighting within and between clade relationships among the CISD1/CISD2 sequences, and distinguishing the CISD1 and CISD2 sequences of vertebrates from the rest of the CISD sequences (Fig. 2). Interestingly, the plant, insect and worm CISDs were closer to the CISD2 clade than to the CISD1 clade, whereas the slime molds CISDs were very distinct from either of the CISD1 or CISD2 clades, with more proximity to that of the outlier (Bacillus CISD3). These findings suggest that the slime mold CISD protein could represent an early or ancient version of eukaryotic CISDs before the emergence of CISD1 and CISD2 genes. A similar analysis performed for all protein sequences with homology to human CISD3 (all containing two copies of the CDGSH domain within a single protomer) revealed that, with the exception of lancelets and elephant shark, all vertebrate CISD3s grouped as a distinct clade. Interestingly, the CISD3 proteins of some organisms that contained more than one CISD3 protein in their genome (e.g., slime mold and worm) did not group together, suggesting that the two different CISD3 proteins found in these organisms might have acquired different functions during evolution (Fig. 4). In general, compared to CISD1 and 2 proteins (Fig. 3), CISD3 proteins from different organisms (Fig. 4) displayed a high degree of divergence in structure and function. As with CISD1 and 2 (Fig. 3), at least one of the slime mold CISD3 proteins was very distinct from the rest of the CISD3 clades, with more proximity to the outlier (Archaea CISD; Fig. 4).
Surprisingly, and contrary to previous reports25, our study revealed the presence of CISD3 proteins in fungi. To confirm that the fungal CISD3 sequences identified were indeed CISD-like proteins we aligned all fungal CISD proteins with the CISD3 proteins of human and bacteria (Bacillus subtilis). As shown in Fig. 5a, all fungal CISD3-like sequences contained two highly conserved CDGSH domains confirming that they are indeed CISD3 homologs. A phylogenetic tree constructed for the fungal, human, and bacterial CISD3 proteins further revealed that fungal CISD3-like genes belonged to two distinct groups- one that shares similarity with human and bacteria, and the other that is more distinct (Fig. 5b). These findings support the existence of NEET proteins in parasitic as well as free-living fungi and suggest that fungal NEET proteins could have diverged in their functions to facilitate adaptation to their hosts or environments.
Discussion
The conserved structure of the CDGSH domain allowed us to conduct a comparative phylogenetic analysis of NEET proteins primarily focusing on eukaryotic organisms. A previous analysis of CDGSH proteins in prokaryotes identified many different types of CDGSH proteins with a single or double CDGSH domain, but did not address the complex nature of CISD-like proteins in eukaryotic organisms25. A key finding of our analysis was the identification of the vertebrate origin as the putative point of gene duplication that yielded the two different mammalian CISD proteins: CISD1 (mNT) and CISD2 (NAF-1) (Fig. 2). The emergence of vertebrates was accompanied by several important cellular and developmental milestones. These included among others the origin of an adaptive immune system, the emergence of neural crest cells and complex neuronal networks, and the appearance of synchronized and complex symmetric segmentation patterns28,29,30. Because NAF-1 is linked to several different neurological disorders1,10,11,13,18, it is plausible that the origin of NAF-1 through ancestral CISD duplication and differentiation coincided with the origin of specialized neuronal cells in vertebrates, and that NAF-1 conferred important adaptive functions for the maintenance of these neuronal cells. This could be reflected by the important role NAF-1 currently plays in neurodegenerative diseases. Another possibility, which could be associated with NAF-1 role in the regulation of apoptosis and autophagy1,11,12,15,21, might be evolutionary linked to the appearance of the adaptive immune system and the utilization of cell death pathways by lymphocytes, as part of this system. Further studies are of course needed to establish the evolutionary significance of mNT and NAF-1 functions in vertebrates. With respect to the putative duplication event that resulted in CISD1 and CISD2, it is worth noting that although the CISD phyletic pattern overlaid on the species tree indicates that this duplication event is likely linked to the emergence of vertebrates (Fig. 2), the phylogenetic tree analysis of CISD1 and CISD2 from different organisms (Fig. 3) showed one clade of CISD proteins from snail, lancelet, hydra, lingual, sponge and sea anemone to be more closely related to CISD2 than to CISD1. This finding could suggest that CISD2 evolved first, before the vertebrates emerged, and that CISD1 appeared via gene duplication around or after the radiation of vertebrates. Of course this observation could also reflect a discrepancy between the phylogenetic gene tree and the species tree, arising as a consequence of factors such as incomplete lineage sorting, recombination, horizontal gene transfer, etc, or the inability of the currently available data to resolve the gene tree31,32.
In addition to inferring the divergence of mNT and NAF-1 using our species and gene tree analysis, our study also revealed the presence of CISD3 genes in fungi. Fungal CISD3 proteins display high sequence similarity to human CISD3 and are present in at least 5 species of fungi (Fig. 5). Because fungi lack CISD1- or CISD2-like proteins, and some do not appear to have any type of CISD protein, it is possible that only certain types of fungi with specialized requirements retained the CISD3 gene, while others lost it completely. It would be of interest in future studies to decipher the common features and growth requirements that distinguish the fungi that contain CISD proteins from the ones that do not. The possibility that some organisms lost a specific class of CISD proteins is further highlighted by our striking finding that plants do not contain the homologs of CISD3 (Fig. 2). Because plants contain chloroplasts, that took over during evolution some of the biosynthetic and metabolic pathways that are common to mitochondria in animals, and because the plant CISD protein (e.g., AtNEET19) is associated with both chloroplasts and mitochondria, it is possible that CISD3 function in mitochondria (that is largely unknown at present) is not required in plants.
Perhaps one of the most interesting questions, when dealing with a phylogenetic analysis of a protein family, is - what is the most ancestral form of the family? From the phylogenetic standpoint of eukaryotic CISD evolution it appears from our analysis that the slime mold Dictyostelium discoideum’s CISD proteins (with a single or double copy of the CDGSH domain per protomer) are the closest to the archetype of eukaryotic Class I and Class II CISD proteins. These proteins were found to be closest to the outliers in our phylogenetic analysis of CISD1/2 and CISD3 proteins (Figs 3 and 4, respectively). Of course from the standpoint of evolution of prokaryotic and eukaryotic organisms it is much harder to determine what is the most ancestral form of all CISDs, aside from speculating that the proto-CISD had only one copy of the CDGSH domain and that it either evolved to form a single-copy CDGSH CISD-like protein, or underwent a CDGSH domain duplication to yield a CISD3-like protein. Another possibility is of course that a CISD3-like ancestral protein (containing two CDGSH domains) was duplicated, with each gene copy losing one of its CDGSH domains to form CISD1- and CISD2- single CDGSH domain-like proteins that would enable a higher degree of cooperativity and regulation in their interaction and function, similar to the mammalian NAF-1 and mNT1. In this context it is worthwhile to note that both bacteria and archaea were found to contain members of Class I (single CDGSH domain) or Class II (two CDGSH domains) of the CISD family of proteins (Figs 1 and 2)25. Because CISD1 (mNT) and CISD3 localize to mitochondria in eukaryotes, whereas CISD2 (NAF-1) is primarily localized to the ER and was shown to have more diverged functions than CISD1 or CISD3 1,11,12,13,15,17,22,23, it is also tempting to speculate that the duplication of the Class I CISD gene resulting in the emergence of CISD1 and CISD2 proteins was followed by the acquisition of additional roles and functions by CISD2 (NAF-1) that coincided with the evolution of vertebrates as described above. The path of CISD evolution was therefore paved by important gene duplication events (i.e., the appearance of mNT and NAF-1), as well as gene deletions and loss of function (e.g., the absence of CISD3 in plants and some fungi). Further studies are of course required to identify the molecular, biochemical and environmental factors that affected the evolution of CISD proteins.
Methods
Selection of organisms for analysis
To determine the presence or absence of different NEET proteins in organisms from different lineages, we first retrieved a common tree of species from the NCBI taxonomy site (http://www.ncbi.nlm.nih.gov/Taxonomy/CommonTree/www.cmt.cgi), and then selected representative organisms with fully sequenced and annotated genomes (obtained from: http://www.ncbi.nlm.nih.gov/genome/browse/; Supplementary Table S2). A total of 43 eukaryotes, 3 bacteria and 2 archaea were represented as shown in Fig. 2. As indicated in Supplementary Table S2, some of the genomes used were assembled based on reference genomes and some were assembled de novo.
Protein sequence retrieval
The complete protein sequences of human CISD1, CISD2, and CISD3 were retrieved from the NCBI database (http://www.ncbi.nlm.nih.gov/protein). We used these as queries in a PSI-BLAST27 search to obtain CISD homologs from the genomes of the organisms selected for our analysis (Supplementary Table S2). To determine the thresholds for the PSI-BLAST searches we conducted a sensitivity analysis. Thus, CISD1, CISD2 and CISD 3 were compared to each other using different PSI-BLAST parameters to determine the parameter setting where the CISD1, CISD2, and CISD3 are retrieved as mutual blast hits of each other (Supplementary Table S1). Based on this analysis we used an Expect threshold (e-value) of 10 and a PSI-BLAST threshold of 5 for our PSI-BLAST27 searches for the three human NEET proteins (CISD1-3) in each of the genomes analyzed. We further generated a maximum likelihood tree using human CISD1, CISD2 and CISD3 protein sequences (Supplementary Fig. S1). This tree showed a high similarity between CISD1 and CISD2 and a low similarity between CISD1 or CISD2 and CISD3, which necessitated the more relaxed PSI-BLAST thresholds for our analysis. The PSI-BLAST parameters determined from our sensitivity analysis were therefore set to ensure that the program returns as “hits” all three human CISD sequences when any of the human CISD sequences is used as a query sequence. Using human CISD1, CISD2, and CISD3 as queries, PSI-BLAST searches were conducted against the non-redundant (NR) database of completely sequenced genomes (http://www.ncbi.nlm.nih.gov/genome/browse/; Supplementary Table S2). Iterative PSI-BLAST searches were further performed until no new CISD homologs were found. The candidate CISD 1, CISD2, CISD3 sequences obtained were further examined for the presence of the signature zf-CDGSH domain, of which the conserved sequence C-X-C-X2-(S/T)-X3-P-X-C-D-G-(S/A/T)-H is a defining feature1. We utilized the services of PFAM26 and InterProScan33 for this analysis. Partial sequences and those lacking the CDGSH domain were eliminated manually. This procedure yielded a dataset of 96 CISD1 and CISD2 candidates and 60 CISD3 candidates that were used for our phylogenetic study.
PFAM domain architectures
The unique domain architectures of zf-CDGSH were obtained from the PFAM database. A total of 489 sequences across 274 species containing one of the 23 domain architectures were recorded in the PFAM database, as shown in Fig. 1.
Sequence alignment and phylogenetic analysis
Multiple sequence alignments of the 96 candidate CISD1 and CISD2 proteins, and of the 60 candidate CISD3 proteins, or all of the candidate CISD proteins, were performed by command-line MUSCLE34 with default options. For trimming poorly aligned regions, trimAL was employed (-automated1 option) to generate better quality alignments35.
PhyML version 3.0 was employed to construct phylogenetic trees using a maximum-likelihood method36. Trees were built for CISD1 and CISD2 protein sequences, and for CISD3 sequences. For statistical reliability, the following test or parameters were used: posterior probability distribution on trees, and an approximate likelihood-ratio test (aLRT) based on logarithm of the ratio of likelihood computed for the current tree and that of the best alternative. To estimate the optimal model of substitution, ProtTest was used for each alignment37. ProtTest indicated the VT amino acid model with gamma distribution shape parameter (VT + G) as the best fitting model among the 112 examined evolutionary models, based on Akaike information criterion (AIC) statistics. The maximum likelihood trees were generated using the VT + G model. Trees were visualized and edited using the program FigTree 1.4.0 38.
Additional Information
How to cite this article: Inupakutika, M. A. et al. Phylogenetic analysis of eukaryotic NEET proteins uncovers a link between a key gene duplication event and the evolution of vertebrates. Sci. Rep. 7, 42571; doi: 10.1038/srep42571 (2017).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
Tamir, S. et al. Structure–function analysis of NEET proteins uncovers their role as key regulators of iron and ROS homeostasis in health and disease. Biochimica et Biophysica Acta (BBA) - Molecular Cell Research 1853, 1294–1315, doi: 10.1016/j.bbamcr.2014.10.014 (2015).
Conlan, A. R. et al. Crystal Structure of Miner1: The Redox-active 2Fe-2S Protein Causative in Wolfram Syndrome 2. Journal of Molecular Biology 392, 143–153, doi: 10.1016/j.jmb.2009.06.079 (2009).
Lin, J., Zhou, T., Ye, K. & Wang, J. Crystal structure of human mitoNEET reveals distinct groups of iron sulfur proteins. Proceedings of the National Academy of Sciences 104, 14640–14645, doi: 10.1073/pnas.0702426104 (2007).
Paddock, M. L. et al. MitoNEET is a uniquely folded 2Fe 2S outer mitochondrial membrane protein stabilized by pioglitazone. Proceedings of the National Academy of Sciences 104, 14342–14347, doi: 10.1073/pnas.0707189104 (2007).
Wiley, S. E. et al. The Outer Mitochondrial Membrane Protein mitoNEET Contains a Novel Redox-active 2Fe-2S Cluster. Journal of Biological Chemistry 282, 23745–23749, doi: 10.1074/jbc.c700107200 (2007).
Hou, X. et al. Crystallographic studies of human MitoNEET. Journal of Biological Chemistry 282, 33242–33246 (2007).
Tamir, S. et al. A point mutation in the [2Fe–2S] cluster binding region of the NAF-1 protein (H114C) dramatically hinders the cluster donor properties. Acta Cryst D Biol Crystallogr 70, 1572–1578, doi: 10.1107/s1399004714005458 (2014).
Zuris, J. A. et al. Engineering the Redox Potential over a Wide Range within a New Class of FeS Proteins. J. Am. Chem. Soc. 132, 13120–13122, doi: 10.1021/ja103920k (2010).
Zuris, J. A. et al. Facile transfer of [2Fe-2S] clusters from the diabetes drug target mitoNEET to an apo-acceptor protein. Proceedings of the National Academy of Sciences 108, 13047–13052, doi: 10.1073/pnas.1109986108 (2011).
Amr, S. et al. A Homozygous Mutation in a Novel Zinc-Finger Protein, ERIS, Is Responsible for Wolfram Syndrome 2. The American Journal of Human Genetics 81, 673–683, doi: 10.1086/520961 (2007).
Chang, N. C. et al. Bcl-2-associated autophagy regulator Naf-1 required for maintenance of skeletal muscle. Human molecular genetics 21, 2277–2287, doi: 10.1093/hmg/dds048 (2012).
Chang, N. C., Nguyen, M., Germain, M. & Shore, G. C. Antagonism of Beclin 1-dependent autophagy by BCL-2 at the endoplasmic reticulum requires NAF-1. EMBO J 29, 606–618, doi: 10.1038/emboj.2009.369 (2009).
Chen, Y. F. et al. Cisd2 deficiency drives premature aging and causes mitochondria-mediated defects in mice. Genes & development 23, 1183–1194, doi: 10.1101/gad.1779509 (2009).
Colca, J. R. Identification of a novel mitochondrial protein (“mitoNEET”) cross-linked specifically by a thiazolidinedione photoprobe. AJP: Endocrinology and Metabolism 286, 252E–260, doi: 10.1152/ajpendo.00424.2003 (2003).
Holt, S. H. et al. Activation of apoptosis in NAF-1-deficient human epithelial breast cancer cells. Journal of Cell Science 129, 155–165, doi: 10.1242/jcs.178293 (2015).
Kusminski, C. M. et al. MitoNEET-driven alterations in adipocyte mitochondrial activity reveal a crucial adaptive process that preserves insulin sensitivity in obesity. Nature Medicine 18, 1539–1549, doi: 10.1038/nm.2899 (2012).
Liu, L. et al. CISD2 expression is a novel marker correlating with pelvic lymph node metastasis and prognosis in patients with early-stage cervical cancer. Medical Oncology 31, doi: 10.1007/s12032-014-0183-5 (2014).
Mozzillo, E. et al. A novel CISD2 intragenic deletion, optic neuropathy and platelet aggregation defect in Wolfram syndrome type 2. BMC Medical Genetics 15, 88, doi: 10.1186/1471-2350-15-88 (2014).
Nechushtai, R. et al. Characterization of Arabidopsis NEET Reveals an Ancient Role for NEET Proteins in Iron Metabolism. The Plant Cell 24, 2139–2154, doi: 10.1105/tpc.112.097634 (2012).
Salem, A. F., Whitaker-Menezes, D., Howell, A., Sotgia, F. & Lisanti, M. P. Mitochondrial biogenesis in epithelial cancer cells promotes breast cancer tumor growth and confers autophagy resistance. Cell Cycle 11, 4174–4180, doi: 10.4161/cc.22376 (2012).
Sohn, Y.-S. et al. NAF-1 and mitoNEET are central to human breast cancer proliferation by maintaining mitochondrial homeostasis and promoting tumor growth. Proceedings of the National Academy of Sciences 110, 14676–14681, doi: 10.1073/pnas.1313198110 (2013).
Wang, L. et al. Overexpressed CISD2 has prognostic value in human gastric cancer and promotes gastric cancer cell proliferation and tumorigenesis via AKT signaling pathway. Oncotarget 7, 3791–3805, doi: 10.18632/oncotarget.6302 (2016).
Wu, C. Y. et al. A persistent level of Cisd2 extends healthy lifespan and delays aging in mice. Human molecular genetics 21, 3956–3968, doi: 10.1093/hmg/dds210 (2012).
Darash-Yahana, M. et al. Breast cancer tumorigenicity is dependent on high expression levels of NAF-1 and the lability of its Fe-S clusters. Proceedings of the National Academy of Sciences 113, 10890–10895 (2016).
Lin, J., Zhang, L., Lai, S. & Ye, K. Structure and Molecular Evolution of CDGSH Iron-Sulfur Domains. PLoS ONE 6, e24790, doi: 10.1371/journal.pone.0024790 (2011).
Finn, R. D. et al. The Pfam protein families database: towards a more sustainable future. Nucleic acids research, gkv1344 (2015).
Altschul, S. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25, 3389–3402, doi: 10.1093/nar/25.17.3389 (1997).
Muñoz, W. A. & Trainor, P. A. In Current Topics in Developmental Biology 3–26 (Elsevier BV, 2015).
Oates, A. C., Morelli, L. G. & Ares, S. Patterning embryos with oscillations: structure, function and dynamics of the vertebrate segmentation clock. Development 139, 625–639, doi: 10.1242/dev.063735 (2012).
Yuan, S., Tao, X., Huang, S., Chen, S. & Xu, A. Comparative Immune Systems in Animals. Annual Review of Animal Biosciences 2, 235–258, doi: 10.1146/annurev-animal-031412-103634 (2014).
Nichols, R. Gene trees and species trees are not the same. Trends in Ecology & Evolution 16, 358–364 (2001).
Pamilo, P. & Nei, M. Relationships between gene trees and species trees. Molecular biology and evolution 5, 568–583 (1988).
Zdobnov, E. M. & Apweiler, R. InterProScan - an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17, 847–848, doi: 10.1093/bioinformatics/17.9.847 (2001).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797, doi: 10.1093/nar/gkh340 (2004).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. TrimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Guindon, S. et al. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Systematic Biology 59, 307–321, doi: 10.1093/sysbio/syq010 (2010).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164–1165, doi: 10.1093/bioinformatics/btr088 (2011).
Rambaut, A. FigTree version 1.4. 0. Available at http://tree.bio.ed.ac.uk/software/figtree (2012).
Acknowledgements
This work was supported by the National Science Foundation IOS-1557787 awarded to P.P. and R.M., the National Science Foundation MCB-1613462 awarded to R.M., R.N. and R.K.A., Israel Science Foundation - ISF 865/13 awarded to R.N., the National Institutes of Health DK54441 awarded to P.A.J., and funds from the University of North Texas College of Arts and Sciences awarded to P.P., R.M. and R.K.A. J.N.O. was supported by the Cancer Prevention and Research Institute of Texas (CPRIT - grant R1110), by the Center for Theoretical Biological Physics sponsored by the NSF (Grant PHY- 1427654) and by NSF- CHE 1614101. The funders had no role in the design, data collection, analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
M.I. and S.S.G. performed the experiments and analyzed the data, P.P., R.K.A., P.A.J., J.N.O. and R.M. analyzed the data and designed experiments. P.P., R.N., P.A.J., R.K.A. and R.M. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Inupakutika, M., Sengupta, S., Nechushtai, R. et al. Phylogenetic analysis of eukaryotic NEET proteins uncovers a link between a key gene duplication event and the evolution of vertebrates. Sci Rep 7, 42571 (2017). https://doi.org/10.1038/srep42571
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep42571
This article is cited by
-
Intracellular targeting of Cisd2/Miner1 to the endoplasmic reticulum
BMC Molecular and Cell Biology (2021)
-
The Mitochondrial mitoNEET Ligand NL-1 Is Protective in a Murine Model of Transient Cerebral Ischemic Stroke
Pharmaceutical Research (2021)
-
Vesicular transport mediates the uptake of cytoplasmic proteins into mitochondria in Drosophila melanogaster
Nature Communications (2020)
-
The cisd gene family regulates physiological germline apoptosis through ced-13 and the canonical cell death pathway in Caenorhabditis elegans
Cell Death & Differentiation (2019)
-
Glycogen branching enzyme controls cellular iron homeostasis via Iron Regulatory Protein 1 and mitoNEET
Nature Communications (2019)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.