- Open Access
Analyzing the genes related to Alzheimer’s disease via a network and pathway-based approach
Alzheimer's Research & Therapy volume 9, Article number: 29 (2017)
Our understanding of the molecular mechanisms underlying Alzheimer’s disease (AD) remains incomplete. Previous studies have revealed that genetic factors provide a significant contribution to the pathogenesis and development of AD. In the past years, numerous genes implicated in this disease have been identified via genetic association studies on candidate genes or at the genome-wide level. However, in many cases, the roles of these genes and their interactions in AD are still unclear. A comprehensive and systematic analysis focusing on the biological function and interactions of these genes in the context of AD will therefore provide valuable insights to understand the molecular features of the disease.
In this study, we collected genes potentially associated with AD by screening publications on genetic association studies deposited in PubMed. The major biological themes linked with these genes were then revealed by function and biochemical pathway enrichment analysis, and the relation between the pathways was explored by pathway crosstalk analysis. Furthermore, the network features of these AD-related genes were analyzed in the context of human interactome and an AD-specific network was inferred using the Steiner minimal tree algorithm.
We compiled 430 human genes reported to be associated with AD from 823 publications. Biological theme analysis indicated that the biological processes and biochemical pathways related to neurodevelopment, metabolism, cell growth and/or survival, and immunology were enriched in these genes. Pathway crosstalk analysis then revealed that the significantly enriched pathways could be grouped into three interlinked modules—neuronal and metabolic module, cell growth/survival and neuroendocrine pathway module, and immune response-related module—indicating an AD-specific immune-endocrine-neuronal regulatory network. Furthermore, an AD-specific protein network was inferred and novel genes potentially associated with AD were identified.
By means of network and pathway-based methodology, we explored the pathogenetic mechanism underlying AD at a systems biology level. Results from our work could provide valuable clues for understanding the molecular mechanism underlying AD. In addition, the framework proposed in this study could be used to investigate the pathological molecular network and genes relevant to other complex diseases or phenotypes.
Alzheimer’s disease (AD) is the most prevalent neurodegenerative disorder and accounts for the majority of people diagnosed with dementia . As a complex and chronic neurological disease, AD affects about 6% of people aged 65 years and older , and is responsible for about 480,000 deaths per year around the world . In addition to its affect on the life quality of those suffering from the disorder and their families, AD also causes a severe burden on society. In the USA alone, the health-care costs related to AD are about $172 billion per year .
AD can be diagnosed by symptoms such as short-term memory loss, mood swings, learning impairments, and disruptions in daily activities . However, as an age-related and progressive disease, some pathological features of AD (e.g., amyloid deposition, accumulation of neurofibrillary tangles, as well as function and structure changes of brain regions involved in memory) often appear many years prior to clinical manifestations [6, 7]. These pathological changes eventually lead to the damage and death of specific neurons, resulting in the emergence of clinical symptoms.
The cause of AD is still poorly understood although much effort has been dedicated to exploring the pathological and molecular mechanisms of AD via various approaches—e.g., animal models, gene expression profiling, genome-wide association studies (GWAS), neuroimaging techniques, or a systems biology framework [2, 8–11]. It is agreed that AD develops as a result of the combination of multiple factors, including genetic factors, a history of head injuries, depression, or hypertension. Among these factors, it is estimated that about 70% of the risk for AD is attributable to genetics [1, 12]. Established genetic causes of AD include the dominant mutations of genes encoding amyloid precursor protein (APP), presenilin 1 (PSEN1), and presenilin 1 (PSEN2). However, these genes are only responsible for the pathogenesis of AD in about 5% of patients with clinical symptoms appearing in midlife. On the other hand, genetic analyses have suggested that, in complex disorders like AD, individual differences can be caused by many genes and their variants. Genes with various biological functions may act in coordination to increase the risk of AD, with a moderate or small effect exerted by each gene . Consistent with this view, more and more genes—e.g., apolipoprotein E (APOE), glycogen synthase kinase 3 beta (GSK3B), dual specificity tyrosine-phosphorylation-regulated kinase 1A (DYRK1A), and Tau—have been found to be potentially associated with AD [1, 13]. For these genes, although a few plausible candidate genes have been partially replicated, some are considered problematic. This is especially true as high-throughput methods like GWAS are being more widely applied to genetic studies of AD. Under such circumstances, a comprehensive analysis of potential causal genes of AD within a pathway and/or a network framework may not only provide us with important insights beyond the conventional single-gene analyses, but also offer consolidated validation for the individual candidate gene.
In the current study, we implemented a comprehensive curation of AD-related genes from genetic association studies. We then conducted biological enrichment analyses to detect the significant functional themes within these genetic factors and analyzed the interactions among the enriched biochemical pathways by pathway crosstalk analysis. Furthermore, an AD-specific protein network was inferred and evaluated with the human protein–protein interaction network as the background. This study should offer valuable hints for understanding the molecular mechanisms of AD from a perspective of systems biology.
Identification of AD-related genes
The genes genetically associated with AD were collected by retrieving the human genetic association studies deposited in PubMed (http://www.ncbi.nlm.nih.gov/pubmed/). We retrieved publications associated with AD with the searching term ‘(Alzheimer’s Disease [MeSH]) AND (Polymorphism [MeSH] OR Genotype [MeSH] OR Alleles [MeSH]) NOT (Neoplasms [MeSH])’. By July 7, 2015, a total of 5298 reports were retrieved. After reviewing all abstracts of these publications, only the genetic association studies on AD were selected. From the obtained publication pool, we then concentrated on those studies reporting a significant association of gene(s) with AD. In order to reduce the number of potential false-positive genes, the studies reporting insignificant or negative associations were excluded even though some genes in these studies might actually be truly associated with AD. We then reviewed the full reports of each selected publication to make sure that the conclusion was consistent with its contents. In several studies, some genes were found to function cooperatively to exert significant influences on AD, with each gene having a small or mild impact; these genes were also included in our list. In addition, the genes from several GWAS analyses on AD, showing genetic association at a genome-wide significance level, were also included.
Functional enrichment analysis of genes related to AD
WebGestalt  and ToppGene  were utilized to detect the biological themes of the AD-related genes. As a web-based bioinformation-mining platform, WebGestalt integrates information from multiple resources to determine the biological themes, including identifying the overrepresented Gene Ontology (GO) terms, amid the candidate gene listing. In this study, only the GO biological process terms with false discovery rate (FDR) value smaller than 0.05 were kept as the significantly enriched ones. ToppGene was used to identify and analyze the enriched biological pathways in the input genes. Pathways with FDR < 0.05 were considered to be significantly enriched.
Analysis of crosstalks among pathways
We further built crosstalks among pathways to investigate interlinks and interactions of the enriched pathways. To measure the overlap between two pathways, the overlap coefficient (OC) and the Jaccard coefficient (JC) were calculated using the corresponding formulas:
in which A and B are the lists of genes of the two examined pathways. Briefly, the following procedure was adopted to construct the pathway crosstalks:
Only pathways with FDR < 0.05 were kept for crosstalk analysis. Meanwhile, pathways with five or fewer candidate genes were discarded because pathways with too few candidate genes might present few or biased connections with other pathways.
Counting the common candidate genes of each pathway pair—those pathway pairs with less than two overlapped genes were removed.
Measuring the overlap in every pathway pair by the corresponding JC and OC values.
Constructing the pathway crosstalk with Cytoscape software .
Compilation of the human protein–protein interaction network
To explore the correlation and interaction among the AD-related genes, we compiled a comprehensive protein–protein interaction (PPI) network, based on which the protein network topological properties of the gene set related to AD were calculated and analyzed. Briefly, the human protein–protein interaction data were obtained from the Protein Interaction Network Analysis (PINA) database (latest release version: May 21, 2014)  by pooling and curating the unique physical interaction information from six main public protein interaction databases: BioGRID, IntAct, DIP, MINT, MIPS/MPact, and HPRD. In the meantime, another interactome for Homo sapiens  that contained 141,296 edges (physical protein interactions) among 13,460 nodes (proteins), consisting of metabolic pathway-related interactions, regulatory and protein–protein interactions, and interaction pairs for kinase and specific substrate, was selected as an additional source of interactome data. After merging the two interactome data by excluding the self-interacting and redundant pairs, the proteins in the list were mapped onto Entrez protein-coding genes for Homo sapiens via the Uniprot ID mapping tool (http://www.uniprot.org/uploadlists). Finally, we compiled a relatively comprehensive human physical interactome, which comprised 16,022 genes/proteins and 228,122 interactions (see Additional file 1).
Construction of the AD-specific protein subnetwork
A subnetwork specific to a given disease can provide us with hints for how the disease-related molecules interact with each other. A network parsimony principle has been demonstrated in the context of biological processes ; that is, the molecular networks/pathways often follow the shortest molecular paths between known disease-associated components (disease-related genes or proteins in our case). The Steiner minimal tree algorithm coincides with this biological principle, which uses a greedy heuristic strategy to iteratively link the smaller trees to larger ones until there is only one tree connecting all seed nodes . GenRev  was utilized to identify the pathological subnetwork from the human interactome using the curated AD-related genes as input. To assess the non-randomness of the constructed network, 1000 random networks with the same number of vertices and interactions as the AD-specific network were generated using the Erdos-Renyi model in R igraph package .
Compilation of genes associated with AD
Genes associated with AD were compiled through searching the published genetic association studies on AD in PubMed. Only the publications reporting gene(s) significantly associated with the disease were pooled, and those reporting a negative or insignificant association were excluded. Altogether, from 823 reports, we collected 430 genes reported to be associated with AD (Additional file 2: Table S1; the gene list is referred to as Alzgset). Among them were seven apolipoprotein genes (APOA1, APOA4, APOC1, APOC2, APOC4, APOD, and APOE), five genes encoding subunits of nicotinic acetylcholine receptors (CHRNA3, CHRNA4, CHRNA7, CHRNB2, and CHRFAM7A), four adrenoceptors (ADRA2B, ADRB1, ADRB2, and ADRB3), two serotonin receptors (HTR2A and HTR6), three dopamine degradation genes (COMT, DBH, and MAOA), and one dopamine receptor (DRD4). A few transport-related genes were also collected, such as ATP-binding cassette transporters (ABCA1, ABCA2, ABCA7, ABCC2, ABCG1, and ABCG2), a dopamine transporter (SLC6A3), a serotonin transporter (SLC6A4), two glucose transporters (SLC2A9 and SLC2A14), a folate transporter (SLC19A1), and ion transporters (SLC24A4). The other genes were those involved in the biological processes related to nitric oxide synthesis (NOS1 and NOS3), immune response (e.g., IL1A, IL6, IL10, and NLRC3), as well as mitochondria-specific function (e.g., MT-ATP6, MT-CO1, MT-CYB, and MTHFR). Clearly, the genes significantly associated with AD were diverse in function, consistent with the complexity of this mental disorder.
Biological function enrichment analysis of Alzgset
Functional enrichment analysis revealed a more detailed biological function spectrum of these AD-related genes (see Additional file 2: Table S2). Among the GO terms overrepresented in Alzgset, those related to lipid and/or lipoprotein-related processes, drug reactions, neural development, or synaptic transmission were included. GO terms associated with drug reactions (e.g., response to ethanol, response to nicotine, and response to cocaine) and metabolic processes (e.g., xenobiotic metabolic process) were overrepresented. These results were in line with previous findings that complicated correlations existed between the pathophysiological state of AD and drug abuse [23, 24]. Of significance, top-ranked terms included some lipid/lipoprotein-related processes, including phospholipid efflux, reverse cholesterol transport, cholesterol homeostasis, and lipoprotein metabolic processes. Biological process terms related to synaptic transmission (e.g., positive regulation of transmission of nerve impulse; synaptic transmission, cholinergic; regulation of synaptic transmission, dopaminergic; and regulation of neurotransmitter secretion), dopamine metabolism (dopamine metabolic process), and other neural functions (e.g., synaptic vesicle transport, regulation of neuronal synaptic plasticity, neuron migration, and memory) were also enriched. Meanwhile, GO terms related to immunological function (e.g., T-helper 1 type immune response, positive regulation of interleukin-6 production, and chronic inflammatory response) were overrepresented. The diversity in the function of AD-related genes demonstrated the complexity of the disease.
Biochemical pathway enriched in Alzgset
Detecting the biological pathways overrepresented among Alzgset may provide useful information about the pathogenic molecular mechanism underlying AD. For Alzgset, 68 enriched pathways were identified (Table 1). Among them, several pathways related to immune processes were included (e.g., cytokines and inflammatory response, cytokine network, dendritic cells in regulating TH1 and TH2 development, and IL-5 signaling), consistent with previous studies [25, 26]. Also, neurotransmitter signaling-related pathways were identified, such as cholinergic synapse, dopaminergic synapse, serotonergic synapse, and so forth. Additionally, in the Alzgset enriched pathway list, there were some pathways related to cell growth and/or survival, including neurotrophin signaling, PI3K-Akt signaling, mTOR signaling, Notch signaling, and so forth, which are vital for cell growth/survival state of neurons in the process of AD [27, 28]. Moreover, metabolism-related pathways, consisting of drug metabolism (cytochrome P450), glutathione metabolism, and metabolism of xenobiotics by cytochrome P450, were also significantly enriched, indicating that related metabolism processes were involved in the etiology and development processes of AD. What is more, the pathway of the intestinal immune network for IgA production was enriched, which might suggest a connection between AD and the intestinal microbiota [29, 30]. Furthermore, pathways involved in osteoclast differentiation and adipocytokine signaling were also detected, complying with prior studies [31–33].
Crosstalks among significantly enriched pathways
To explore the correlations between the pathways, we implemented a pathway crosstalk analysis for the 68 enriched pathways. Here we assumed that crosstalk existed in a pathway pair if they had a proportion of common genes in Alzgset . There were 41 pathways including six or more members in Alzgset, of which 37 pathways met the criterion for crosstalk analysis; that is, each pathway shared at least two genes with one or more other pathways. All of the pathway pairs (207 crosstalks among 37 pathways) were used for constructing the pathway crosstalk network and the overlap significance of each pathway pair was evaluated based on the average of JC and OC.
Based on their crosstalks, these pathways could be roughly divided into three major modules, with pathways in each group having more crosstalks with each other than with those outside of this module and more likely being related to the same or similar biological process (Fig. 1). The first module primarily included neuronal-related and xenobiotic or drug metabolism-related pathways (e.g., calcium signaling, dopaminergic synapse, cholinergic synapse, serotonergic synapse and neurotrophin signaling, metabolism of xenobiotics by cytochrome P450, and drug metabolism—cytochrome P450). The major theme of the second module was cell growth/survival and neuroendocrine-related pathways (e.g., PI3K-Akt signaling, mTOR signaling, notch signaling, prolactin signaling, etc.). The third module included immune response-related pathways (e.g., toll-like receptor signaling, Fc epsilon RI signaling pathway). At the same time, the three modules were interlinked with each other, indicating the existence of an AD-specific immune-endocrine-neuronal regulatory network.
AD-specific protein network
To further examine the potential pathological protein network of Alzgset, we constructed a subnetwork for AD from the human protein–protein interaction network via the Steiner minimal tree algorithm. This method tries to connect the largest number of input nodes (genes included in Alzgset in our case) via the least number of interlinking nodes. As shown in Fig. 2, the protein network of AD comprised 496 nodes and 1521 edges (interactions).
As shown, 393 out of 430 Alzgset genes were included in the AD-specific network, which accounted for 79.2% of 496 genes in the network and 91.4% of Alzgset, demonstrating a high coverage of Alzgset in the subnetwork. There were 103 genes in the AD-specific molecular network outside of Alzgset (Table 2). Given that these intermediate genes interacted closely with those known to be related to AD, they might also be involved in the pathological process of the disease phenotype. Notably, a number of the genes—e.g., epidermal growth factor receptor (EGFR), nuclear respiratory factor 1 (NRF1), somatostatin receptor 2 (SSTR2), and sortilin 1 (SORT1)—had been shown related to AD in several previous studies [35–38]. Some of these genes have not been reported to be directly involved in the pathophysiological condition of AD, but genes linking to them or other members of the same protein family may have been found to play a role in such processes. For instance, ATP binding cassette subfamily G member 5 (ABCG5), a member of a transport system superfamily, involved in ATP binding and transporting of substrates across cytomembranes, was a node in the AD-specific network but was out of Alzgset. However, six members from the same family were included in Alzgset (ABCA1, ABCA2, ABCA7, ABCC2, ABCG1, and ABCG2), and there was experimental evidence for their involvement in AD; for example, the expression reduction or loss of function of ABCA7 could alter Alzheimer amyloid processing . Solute carrier family 40 member 1 (SLC40A1), encoding a cytomembrane protein that may be linked to iron export from duodenal epithelial cells, was also included in the AD-specific network. SLC40A1can interact with Golgi membrane protein 1 (GOLM1) and hepcidin antimicrobial peptide (HAMP). The former was a gene in Alzgset and its mutation may be related to reduced regional gray matter volume in AD patients , and the expression of HAMP was significantly reduced in hippocampal lysates from AD brains . Thus, it is likely that some of the 103 genes in the AD-specific network may play roles in AD susceptibility and can be novel targets for further exploration.
We have made great progress in exploring the molecular mechanisms of Alzheimer’s disease in recent years. With the advancement and maturity of high-throughput technology, we are able to identify the elements related to this disease on much larger scales. Although more and more genes/proteins potentially involved in the disease have been reported, a thorough analysis of the biochemical processes associated with the pathogenesis of AD from the molecular aspect is still missing. In such cases, a systematic analysis of AD-related genes via a pathway-based and network-based analytical framework will provide us with insight into the disease beyond the single candidate gene-based analyses [42–44]. In this study, by pooling and curating human genes related to AD from genetic studies, and systematically delineating the interconnection of these genes by means of pathway-based and network-based analyses, we analyzed AD-related biochemical processes and their interactions.
Compared with the candidate gene(s)-based approach, a comprehensive analysis on AD-related genes conducted in this study has its own advantages. By implementing an extensive compilation and curation of human genes from genetic association studies on AD, we could obtain valuable gene source data for further analysis. Especially, because the risk of AD susceptibility can be attributed to many genes, with multiple genes functioning in a concerted manner and each gene exerting a small effect , we took this into consideration by also retrieving genes jointly showing significant genetic association with AD. At the same time, by focusing on the biological correlation of genes, pathway and network analysis can not only give us a more comprehensive view for the pathological mechanisms of AD, but are also more robust to the influence of false-positive genes.
As revealed by function enrichment analysis, genes in Alzgset may play important roles in lipid/lipoprotein-related procedures, the immune system, the metabolic process, drug response processes, and neurodevelopment. For example, terms such as reverse cholesterol transport, positive regulation of interleukin-6 production, response to ethanol, lipoprotein metabolic process, diol metabolic process, xenobiotic metabolic process, and regulation of neuronal synaptic plasticity were overrepresented among Alzgset genes, implying the important roles of these processes in the pathological processes of AD. Furthermore, we noticed several terms of memory, visual learning, social behavior, sleep, axon regeneration, and axon guidance also emerged in the enriched list, concurrent with a-priori biological findings for AD [46–50].
Our biochemical pathway analysis showed that immune-related pathways were enriched among Alzgset, which further highlighted the connections between AD and immune-related biological activities. Previous studies have shown the involvement of neuroinflammation in AD pathology, with inflammatory cytokines exerting central efforts [51, 52]. Simultaneously, four pathways associated with neurotransmitters were found to be overrepresented in Alzgset, coinciding with their essential roles in the etiology and progression of AD. Acetylcholine, dopamine, and serotonin are major neurotransmitters, involved in advanced neuronal functions (e.g., learning, memory, language, etc.), exerting key effects in the pathologic processes of AD. These neurotransmitters could be involved in the damaging procedure of synaptic plasticity like long-term potentiation and long-term depression in AD subjects or animal models, which in turn may impair some synapse-based higher brain functions such as memory and cognition [53–55]. Moreover, our results detected several pathways pertaining to neuroendocrine activities (i.e., ovarian steroidogenesis and prolactin signaling), cuing endocrine processes for the pathogenesis of AD [56, 57]. In addition, the adipocytokine signaling pathway was enriched in Alzgset. Adipocytokines, including leptin, adiponectin, NAMPT, RBP-4, and other proinflammatory cytokines, have attracted much attention due to their close connection with AD [32, 57, 58]. Detection of the adipocytokine signaling pathway in this study provides further evidence for the relationship between adipocytokine and the development and progression of AD, and may also support the idea that AD could be a metabolic disease [59–61]. As suggested by the results shown, the molecular mechanisms underlying AD are pretty complicated, calling for further thorough studies to decode the underlying pathologic mechanisms.
Of significance, we detected three major pathway groups through pathway crosstalk analysis. One group basically involved the pathways related to the nervous system and metabolism-related activities. Amid these pathways, cholinergic synapse, the calcium signaling pathway, dopaminergic synapse, serotonergic synapse, and neurotrophin signaling have been well dissected to function in the progress of AD [62–65]. In the second module, pathways were largely dominated by immune response or related functions, and by cell growth/survival and neuroendocrine pathways for the third group. Furthermore, we could notice that these three pathway modules were interconnected and acted as an immune-endocrine-neuronal regulatory network for the AD-related pathological conditions. Of note, one pathway (i.e., intestinal immune network for IgA production) was found to be a component part of the immune module. These results might suggest that the gut–brain axis, made up of immune, neuroendocrine, and neuronal components, was involved in the pathogenesis of AD [66–68], in line with results from pathway crosstalk analysis (i.e., there being three similar modules containing Alzgset-enriched pathways). Subsequently, via in-depth examination, we observed that the immune module has plenty of pathway crosstalks and plenty of crosstalk strength. In turn, the cell growth/survival and neuroendocrine module has lower number and less strength, compared with the immune module; however, in terms of the neural module, the number and strength of crosstalks are greater and larger. In spite of the limited number of crosstalks, there exist paramount crosstalk levels among metabolic pathways. These observed results might provide causal and regulatory hints for AD. Integrating results from biochemical pathway and pathway crosstalk analyses and the a-priori biological knowledge base, the major pathways related to AD could be summarized in a diagram (Fig. 3).
Further, we extracted an AD-specific protein network on the basis of the human protein–protein interaction network. It is worth noting that some linking genes outside Alzgset but included in the human protein–protein interaction network may be potentially related to AD. For example, nuclear respiratory factor-1 (NRF1) could be affected by early changes in genes participating in the insulin and energy metabolism pathways in an APP/PS1 transgenic mouse model of AD . TYROBP, a transmembrane signaling protein, appeared in our AD-specific subnetwork. By constructing gene regulatory networks in 1647 postmortem brain tissues from late-onset Alzheimer’s disease (LOAD) patients and normal subjects, an immune and microglia-related module dominated by genes participating in pathogen phagocytosis was identified, with TYROBP as a key causal regulator upregulated in LOAD . CDH2, a classical cadherin playing roles in the development of the nervous system, was found with the pathogenic copy number variations from 261 early-onset familial Alzheimer’s disease and early/mixed-onset pedigree individuals using high-density DNA microarrays . By applying cell-based studies and FBXO2 knockout mice, it was found that FBXO2 could regulate amyloid precursor protein-related activities in the brain and might modulate AD pathogenesis, coupling with our result to consolidate its involvement in AD . Although no evidence indicated that VSTM2L, one of the intermediate genes, was directly related to AD, it interacted with ataxin 1 (ATXN1) of Alzgset , whose biological function is presently unknown, and also might be a secreted antagonist of Humanin (HN)  which mediated attenuation of AD-related memory impairment and Aβ-induced AD-like pathological changes [75, 76]. As specified by the results detailed, this protein subnetwork predicting approach could not only engender a significant predicted subnetwork of Alzgset for AD, but could also possess the potentiality to detect promising relevant genes.
There have been several available datasets or projects focused on the curation of AD-related genes, including AlzGene , Alzheimer’s Disease Neuroimaging Initiative (ADNI) , the Alzheimer Disease & Frontotemporal Dementia Mutation Database (AD&FTDMDB) , and AlzBase . While AlzGene maintains a comprehensive catalog of genetic association studies on AD and also includes results from meta-analysis of polymorphisms with genotype data available in several GWAS projects on AD, AD&FTDMDB is dedicated to the known mutations of genes associated with AD and frontotemporal dementias from the published reports or presentations at scientific meetings. The ADNI project aims at facilitating the investigation of genetic influences on AD onset and progression reflected in imaging changes, fluid biomarkers, and cognitive status. It has reported several neuroimaging GWAS with imaging quotas as quantitative phenotypes, such as hippocampal volume and hippocampal gray matter density. On the other hand, AlzBase is an integrative database for genes dysregulated in AD and related diseases, and comprises annotations and expression information on more than 7800 differentially expressed genes collected from multiple microarray datasets. These datasets with different features provide valuable information on genes and/or phenotypes for exploring and understanding AD and its mechanisms.
Similar to AlzGene, Alzgset is also a compilation of AD-related genes identified in genetic association studies. While AlzGene includes both genes showing positive and negative association with AD, Alzgset focuses only on the genes reported to be positively associated with AD by the original authors. Because AlzGene has not been updated since April 2011, results from many recent genetic association studies may not be included. In association with studies on candidate genes, some genes may each possess a mild to moderate p value, but two or more genes could collectively show a more significant association with AD due to the fact they probably act in a concerted manner. In such cases, all of these candidates were included in Alzgset as long as the original authors could provide sufficient evidence. On the other hand, the genes in AlzGene were selected from meta-analyses for each polymorphism and a relative uniform criterion was adopted, so the genes mentioned may be neglected. Thus, Alzgset should offer an informative supplement for AlzGene and serve as a useful dataset for AD investigation.
However, there were several limitations in this study. First, our pathway-based and network-based analyses results relied on genes in the publications reported to be associated with AD. In view of the fact that identification of risk genes for AD is still an ongoing task, the GO biological process terms, biochemical pathways, and results derived from network analysis should also be treated in the similar manner. Second, we adopted the results and conclusions offered by the original authors of each selected report when collecting the genes, which inevitably impacts our results due to possible bias and insufficiency in the available reports. Then, in order to decrease the false-positive rate of AD-associated genes, we eliminated reports with insignificant or negative results. Nevertheless, we cannot avoid the fact that some genes in those studies might be actually associated with the disease phenotype. Additionally, although the GO terms enriched in Alzgset could provide valuable hints and might serve as an important resource for understanding the molecular mechanisms of AD, it should be noted that GO is biased towards fields like cancer biology and the concepts related to neurology are underrepresented . Thus, some important neurological processes related to AD may be missed in our analysis. At the same time, despite overall levels of protein–protein interaction databases having been greatly improved, the present human interactome is still incomplete and some false-positive data may also be included . Thus, the present research status of the human interactome may also influence our results. It can be expected that, as the protein–protein interaction data become more comprehensive and accurate, the inferred AD-specific subnetwork can become more reliable and valuable.
In summary, via a systems biology approach, we investigated the pathways and molecular networks related to AD based on the genes associated with the disease. Integrating biological function, biochemical pathway, and pathway crosstalk analyses, we identified that biochemical processes and pathways linked with lipid and/or lipoprotein-related processes, metabolism, the immune system, and neural development were overrepresented among Alzgset and there existed three inter-connected pathway modules: neuronal and metabolic module, cell growth/survival and neuroendocrine clique, and immunological cluster. What is more, an AD-specific protein network was built via the Steiner minimal tree algorithm and some novel genes latently associated with AD were predicted. Such analysis of genes involved in AD will not only improve our understanding of the contribution of genetic factors and their interaction with environmental factors to the pathogenesis of this disease, but will also help us to identify potential biomarkers for further exploration. It could be anticipated that as more genetic factors related to AD are identified, a systematic and comprehensive analysis such as the one adopted in this study will be more useful to explore the molecular mechanisms underlying AD.
Alzheimer’s disease gene set
- APOE :
- APP :
Amyloid precursor protein
- DYRK1A :
Dual specificity tyrosine-phosphorylation-regulated kinase 1A
False discovery rate
- GSK3B :
Glycogen synthase kinase 3 beta
Genome-wide association study
- PESN1 :
Protein interaction network analysis
Ballard C, Gauthier S, Corbett A, Brayne C, Aarsland D, Jones E. Alzheimer’s disease. Lancet. 2011;377(9770):1019–31.
Burns A, Iliffe S. Alzheimer’s disease. BMJ. 2009;338:b158.
Lozano R, Naghavi M, Foreman K, Lim S, Shibuya K, Aboyans V, Abraham J, Adair T, Aggarwal R, Ahn SY, et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012;380(9859):2095–128.
Reitz C, Mayeux R. Alzheimer disease: epidemiology, diagnostic criteria, risk factors and biomarkers. Biochem Pharmacol. 2014;88(4):640–51.
Ager RR, Davis JL, Agazaryan A, Benavente F, Poon WW, LaFerla FM, Blurton-Jones M. Human neural stem cells improve cognition and promote synaptic growth in two complementary transgenic models of Alzheimer’s disease and neuronal loss. Hippocampus. 2015;25(7):813–26.
Bateman RJ, Xiong C, Benzinger TL, Fagan AM, Goate A, Fox NC, Marcus DS, Cairns NJ, Xie X, Blazey TM, et al. Clinical and biomarker changes in dominantly inherited Alzheimer’s disease. N Engl J Med. 2012;367(9):795–804.
Solomon A, Mangialasche F, Richard E, Andrieu S, Bennett DA, Breteler M, Fratiglioni L, Hooshmand B, Khachaturian AS, Schneider LS, et al. Advances in the prevention of Alzheimer’s disease and dementia. J Intern Med. 2014;275(3):229–50.
Ryu JK, Cho T, Choi HB, Jantaratnotai N, McLarnon JG. Pharmacological antagonism of interleukin-8 receptor CXCR2 inhibits inflammatory reactivity and is neuroprotective in an animal model of Alzheimer’s disease. J Neuroinflammation. 2015;12:144.
Allen M, Zou F, Chai HS, Younkin CS, Crook J, Pankratz VS, Carrasquillo MM, Rowley CN, Nair AA, Middha S, et al. Novel late-onset Alzheimer disease loci variants associate with brain gene expression. Neurology. 2012;79(3):221–8.
Naj AC, Jun G, Reitz C, Kunkle BW, Perry W, Park YS, Beecham GW, Rajbhandary RA, Hamilton-Nelson KL, Wang LS, et al. Effects of multiple genetic loci on age at onset in late-onset Alzheimer disease: a genome-wide association study. JAMA Neurol. 2014;71(11):1394–404.
Cabral C, Morgado PM, Campos Costa D, Silveira M. Alzheimers Disease Neuroimaging Initiative. Predicting conversion from MCI to AD with FDG-PET brain images at different prodromal stages. Comput Biol Med. 2015;58:101–9.
Gatz M, Reynolds CA, Fratiglioni L, Johansson B, Mortimer JA, Berg S, Fiske A, Pedersen NL. Role of genes and environments for explaining Alzheimer disease. Arch Gen Psychiatry. 2006;63(2):168–74.
Ertekin-Taner N. Genetics of Alzheimer disease in the pre- and post-GWAS era. Alzheimers Res Ther. 2010;2(1):3.
Zhang B, Kirov S, Snoddy J. WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic Acids Res. 2005;33(Web Server issue):W741–8.
Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res. 2009;37(Web Server issue):W305–11.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Cowley MJ, Pinese M, Kassahn KS, Waddell N, Pearson JV, Grimmond SM, Biankin AV, Hautaniemi S, Wu J. PINA v2.0: mining interactome modules. Nucleic Acids Res. 2012;40(Database issue):D862–5.
Menche J, Sharma A, Kitsak M, Ghiassian SD, Vidal M, Loscalzo J, Barabasi AL. Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science. 2015;347(6224):1257601.
Barabasi AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68.
Klein P, Ravi R. A nearly best-possible approximation algorithm for node-weighted Steiner trees. J Algorithms. 1995;19(1):104–15.
Zheng S, Zhao Z. GenRev: exploring functional relevance of genes in molecular networks. Genomics. 2012;99(3):183–8.
Erdos P, Rényi A. On the evolution of random graphs. Publ Math Inst Hungar Acad Sci. 1960;5:17–61.
Roussotte FF, Daianu M, Jahanshad N, Leonardo CD, Thompson PM. Neuroimaging and genetic risk for Alzheimer’s disease and addiction-related degenerative brain disorders. Brain Imaging Behav. 2014;8(2):217–33.
Anstey KJ, Mack HA, Cherbuin N. Alcohol consumption as a risk factor for dementia and cognitive decline: meta-analysis of prospective studies. Am J Geriatr Psychiatry. 2009;17(7):542–55.
Gjoneska E, Pfenning AR, Mathys H, Quon G, Kundaje A, Tsai LH, Kellis M. Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer’s disease. Nature. 2015;518(7539):365–9.
Heneka MT, Golenbock DT, Latz E. Innate immunity in Alzheimer’s disease. Nat Immunol. 2015;16(3):229–36.
Wang C, Zhang X, Teng Z, Zhang T, Li Y. Downregulation of PI3K/Akt/mTOR signaling pathway in curcumin-induced autophagy in APP/PS1 double transgenic mice. Eur J Pharmacol. 2014;740:312–20.
Polychronidou E, Vlachakis D, Vlamos P, Baumann M, Kossida S. Notch signaling and ageing. Adv Exp Med Biol. 2015;822:25–36.
Wang D, Ho L, Faith J, Ono K, Janle EM, Lachcik PJ, Cooper BR, Jannasch AH, D’Arcy BR, Williams BA, et al. Role of intestinal microbiota in the generation of polyphenol-derived phenolic acid mediated attenuation of Alzheimer’s disease beta-amyloid oligomerization. Mol Nutr Food Res. 2015;59(6):1025–40.
Alam MZ, Alam Q, Kamal MA, Abuzenadah AM, Haque A. A possible link of gut microbiota alteration in type 2 diabetes and Alzheimer’s disease pathogenicity: an update. CNS Neurol Disord Drug Targets. 2014;13(3):383–90.
Roos PM. Osteoporosis in neurodegeneration. J Trace Elem Med Biol. 2014;28(4):418–21.
Letra L, Santana I, Seica R. Obesity as a risk factor for Alzheimer’s disease: the role of adipocytokines. Metab Brain Dis. 2014;29(3):563–8.
Teixeira AL, Diniz BS, Campos AC, Miranda AS, Rocha NP, Talib LL, Gattaz WF, Forlenza OV. Decreased levels of circulating adiponectin in mild cognitive impairment and Alzheimer’s disease. Neruomol Med. 2013;15(1):115–21.
Jia P, Kao CF, Kuo PH, Zhao Z. A comprehensive network and pathway analysis of candidate genes in major depressive disorder. BMC Syst Biol. 2011;5 Suppl 3:S12.
Leal MC, Magnani N, Villordo S, Buslje CM, Evelson P, Castano EM, Morelli L. Transcriptional regulation of insulin-degrading enzyme modulates mitochondrial amyloid beta (Abeta) peptide catabolism and functionality. J Biol Chem. 2013;288(18):12920–31.
Conejero-Goldberg C, Hyde TM, Chen S, Dreses-Werringloer U, Herman MM, Kleinman JE, Davies P, Goldberg TE. Molecular signatures in post-mortem brain tissue of younger individuals at high risk for Alzheimer’s disease as based on APOE genotype. Mol Psychiatry. 2011;16(8):836–47.
Adori C, Gluck L, Barde S, Yoshitake T, Kovacs GG, Mulder J, Magloczky Z, Havas L, Bolcskei K, Mitsios N, et al. Critical role of somatostatin receptor 2 in the vulnerability of the central noradrenergic system: new aspects on Alzheimer’s disease. Acta Neuropathol. 2015;129(4):541–63.
Capsoni S, Amato G, Vignone D, Criscuolo C, Nykjaer A, Cattaneo A. Dissecting the role of sortilin receptor signaling in neurodegeneration induced by NGF deprivation. Biochem Biophys Res Commun. 2013;431(3):579–85.
Satoh K, Abe-Dohmae S, Yokoyama S, St George-Hyslop P, Fraser PE. ATP-binding cassette transporter A7 (ABCA7) loss of function alters Alzheimer amyloid processing. J Biol Chem. 2015;290(40):24152–65.
Inkster B, Rao AW, Ridler K, Filippini N, Whitcher B, Nichols TE, Wetten S, Gibson RA, Borrie M, Kertesz A, et al. Genetic variation in GOLM1 and prefrontal cortical volume in Alzheimer’s disease. Neurobiol Aging. 2012;33(3):457–65.
Raha AA, Vaishnav RA, Friedland RP, Bomford A, Raha-Chowdhury R. The systemic iron-regulatory proteins hepcidin and ferroportin are reduced in the brain in Alzheimer’s disease. Acta Neuropathol Commun. 2013;1:55.
Kong W, Zhang J, Mou X, Yang Y. Integrating gene expression and protein interaction data for signaling pathway prediction of Alzheimer’s disease. Comput Math Methods Med. 2014;2014:340758.
Ponzoni I, Nueda M, Tarazona S, Gotz S, Montaner D, Dussaut J, Dopazo J, Conesa A. Pathway network inference from gene expression data. BMC Syst Biol. 2014;8 Suppl 2:S7.
Sun Y, Bresell A, Rantalainen M, Hoglund K, Lebouvier T, Salter H. Alzheimer Disease Neuroimaging Initiative. An integrated bioinformatics approach for identifying genetic markers that predict cerebrospinal fluid biomarker p-tau181/Abeta1-42 ratio in ApoE4-negative mild cognitive impairment patients. J Alzheimers Dis. 2015;45(4):1061–76.
Williams-Skipp C, Raman T, Valuck RJ, Watkins H, Palmer BE, Scheinman RI. Unmasking of a protective tumor necrosis factor receptor I-mediated signal in the collagen-induced arthritis model. Arthritis Rheum. 2009;60(2):408–18.
Parra MA, Saarimaki H, Bastin ME, Londono AC, Pettit L, Lopera F, Della Sala S, Abrahams S. Memory binding and white matter integrity in familial Alzheimer’s disease. Brain. 2015;138(Pt 5):1355–69.
Ahmadian-Attari MM, Dargahi L, Mosaddegh M, Kamalinejad M, Khallaghi B, Noorbala F, Ahmadiani A. Impairment of rat spatial learning and memory in a new model of cold water-induced chronic hypothermia: implication for Alzheimer’s disease. Neurotox Res. 2015;28(2):95–107.
Peter-Derex L, Yammine P, Bastuji H, Croisile B. Sleep and Alzheimer’s disease. Sleep Med Rev. 2015;19:29–38.
Suzuki C, Yokote Y, Takahashi T. Changes in daily cognition and behavior of Alzheimer’s patients over time: a three-year evaluation using a daily cognition and behavior for Alzheimer’s disease scale. Dementia. 2015;14(1):126–35.
Satoh J, Tabunoki H, Ishida T, Saito Y, Arima K. Accumulation of a repulsive axonal guidance molecule RGMa in amyloid plaques: a possible hallmark of regenerative failure in Alzheimer’s disease brains. Neuropathol Appl Neurobiol. 2013;39(2):109–20.
Landlinger C, Oberleitner L, Gruber P, Noiges B, Yatsyk K, Santic R, Mandler M, Staffler G. Active immunization against complement factor C5a: a new therapeutic approach for Alzheimer’s disease. J Neuroinflammation. 2015;12:150.
Alcolea D, Martinez-Lage P, Sanchez-Juan P, Olazaran J, Antunez C, Izagirre A, Ecay-Torres M, Estanga A, Clerigue M, Guisasola MC, et al. Amyloid precursor protein metabolism and inflammation markers in preclinical Alzheimer disease. Neurology. 2015;85(7):626–33.
Wang X, Hu X, Yang Y, Takata T, Sakurai T. Systemic pyruvate administration markedly reduces neuronal death and cognitive impairment in a rat model of Alzheimer’s disease. Exp Neurol. 2015;271:145–54.
Ahmed T, Blum D, Burnouf S, Demeyer D, Buee-Scherrer V, D’Hooge R, Buee L, Balschun D. Rescue of impaired late-phase long-term depression in a tau transgenic mouse model. Neurobiol Aging. 2015;36(2):730–9.
Koch G, Di Lorenzo F, Bonni S, Ponzo V, Caltagirone C, Martorana A. Impaired LTP- but not LTD-like cortical plasticity in Alzheimer’s disease patients. J Alzheimers Dis. 2012;31(3):593–9.
Bethea CL, Reddy AP. Ovarian steroids regulate gene expression related to DNA repair and neurodegenerative diseases in serotonin neurons of macaques. Mol Psychiatry. 2015;20(12):1565–78.
Folch J, Patraca I, Martinez N, Pedros I, Petrov D, Ettcheto M, Abad S, Marin M, Beas-Zarate C, Camins A. The role of leptin in the sporadic form of Alzheimer’s disease. Interactions with the adipokines amylin, ghrelin and the pituitary hormone prolactin. Life Sci. 2015;140:19–28.
Magalhaes CA, Carvalho MG, Sousa LP, Caramelli P, Gomes KB. Leptin in Alzheimer’s disease. Clin Chim Acta. 2015;450:162–8.
de la Monte SM, Tong M. Brain metabolic dysfunction at the core of Alzheimer’s disease. Biochem Pharmacol. 2014;88(4):548–59.
Merlo S, Spampinato S, Canonico PL, Copani A, Sortino MA. Alzheimer’s disease: brain expression of a metabolic disorder? Trends Endocrinol Metab. 2010;21(9):537–44.
Demetrius LA, Driver J. Alzheimer’s as a metabolic disease. Biogerontology. 2013;14(6):641–9.
Perez SE, He B, Nadeem M, Wuu J, Scheff SW, Abrahamson EE, Ikonomovic MD, Mufson EJ. Resilience of precuneus neurotrophic signaling pathways despite amyloid pathology in prodromal Alzheimer’s disease. Biol Psychiatry. 2015;77(8):693–703.
Potter PE, Rauschkolb PK, Pandya Y, Sue LI, Sabbagh MN, Walker DG, Beach TG. Pre- and post-synaptic cortical cholinergic deficits are proportional to amyloid plaque presence and density at preclinical stages of Alzheimer’s disease. Acta Neuropathol. 2011;122(1):49–60.
Pimenova AA, Thathiah A, De Strooper B, Tesseur I. Regulation of amyloid precursor protein processing by serotonin signaling. PLoS One. 2014;9(1):e87014.
Egorova P, Popugaeva E, Bezprozvanny I. Disturbed calcium signaling in spinocerebellar ataxias and Alzheimer’s disease. Semin Cell Dev Biol. 2015;40:127–33.
Scheperjans F. Can microbiota research change our understanding of neurodegenerative diseases? Neurodegener Dis Manag. 2016;6(2):81–5.
Ghaisas S, Maher J, Kanthasamy A. Gut microbiome in health and disease: linking the microbiome-gut-brain axis and environmental factors in the pathogenesis of systemic and neurodegenerative diseases. Pharmacol Ther. 2016;158:52–62.
Catanzaro R, Anzalone M, Calabrese F, Milazzo M, Capuana M, Italia A, Occhipinti S, Marotta F. The gut microbiota and its correlations with the central nervous system disorders. Panminerva Med. 2015;57(3):127–43.
Pedros I, Petrov D, Allgaier M, Sureda F, Barroso E, Beas-Zarate C, Auladell C, Pallas M, Vazquez-Carrera M, Casadesus G, et al. Early alterations in energy metabolism in the hippocampus of APPswe/PS1dE9 mouse model of Alzheimer’s disease. Biochim Biophys Acta. 2014;1842(9):1556–66.
Zhang B, Gaiteri C, Bodea LG, Wang Z, McElwee J, Podtelezhnikov AA, Zhang C, Xie T, Tran L, Dobrin R, et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell. 2013;153(3):707–20.
Hooli BV, Kovacs-Vajna ZM, Mullin K, Blumenthal MA, Mattheisen M, Zhang C, Lange C, Mohapatra G, Bertram L, Tanzi RE. Rare autosomal copy number variations in early-onset familial Alzheimer’s disease. Mol Psychiatry. 2014;19(6):676–81.
Atkin G, Hunt J, Minakawa E, Sharkey L, Tipper N, Tennant W, Paulson HL. F-box only protein 2 (Fbxo2) regulates amyloid precursor protein levels and processing. J Biol Chem. 2014;289(10):7038–48.
Lim J, Hao T, Shaw C, Patel AJ, Szabo G, Rual JF, Fisk CJ, Li N, Smolyar A, Hill DE, et al. A protein-protein interaction network for human inherited ataxias and disorders of Purkinje cell degeneration. Cell. 2006;125(4):801–14.
Rossini L, Hashimoto Y, Suzuki H, Kurita M, Gianfriddo M, Scali C, Roncarati R, Franceschini D, Pollio G, Trabalzini L, et al. VSTM2L is a novel secreted antagonist of the neuroprotective peptide Humanin. FASEB J. 2011;25(6):1983–2000.
Matsuoka M. Protective effects of Humanin and calmodulin-like skin protein in Alzheimer’s disease and broad range of abnormalities. Mol Neurobiol. 2015;51(3):1232–9.
Chai GS, Duan DX, Ma RH, Shen JY, Li HL, Ma ZW, Luo Y, Wang L, Qi XH, Wang Q, et al. Humanin attenuates Alzheimer-like cognitive deficits and pathological changes induced by amyloid beta-peptide in rats. Neurosci Bull. 2014;30(6):923–35.
Bertram L, McQueen MB, Mullin K, Blacker D, Tanzi RE. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nat Genet. 2007;39(1):17–23.
Saykin AJ, Shen L, Foroud TM, Potkin SG, Swaminathan S, Kim S, Risacher SL, Nho K, Huentelman MJ, Craig DW, et al. Alzheimer’s Disease Neuroimaging Initiative biomarkers as quantitative phenotypes: Genetics core aims, progress, and plans. Alzheimers Dement. 2010;6(3):265–73.
Cruts M, Theuns J, Van Broeckhoven C. Locus-specific mutation databases for neurodegenerative brain diseases. Hum Mutat. 2012;33(9):1340–4.
Bai Z, Han G, Xie B, Wang J, Song F, Peng X, Lei H. AlzBase: an integrative database for gene dysregulation in Alzheimer’s disease. Mol Neurobiol. 2016;53(1):310–9.
Roncaglia P, Martone ME, Hill DP, Berardini TZ, Foulger RE, Imam FT, Drabkin H, Mungall CJ, Lomax J. The Gene Ontology (GO) cellular component ontology: integration with SAO (Subcellular Anatomy Ontology) and other recent developments. J Biomed Semantics. 2013;4(1):20.
Ideker T, Sharan R. Protein networks in disease. Genome Res. 2008;18(4):644–52.
The authors thank Dr Tao Zhang, Dr Xianfu Yi and Dr Haixuan Qiao for helpful discussions in preparation of the manuscript.
This project was supported in part by grants from the National Key Research and Development Program of China (No. 2016YFC0906300), the National Natural Science Foundation of China (No. 31271411 and No. 61202379), and the Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry of China. The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
Provided as additional supporting files.
Y-SH, JX, LZ, and JW designed the experiments. Y-SH, JX, YH, and JW performed the experiments and data analysis. Y-SH, LZ, and JW wrote the manuscript. All authors read and approved the final manuscript.
Y-SH, JX, YH, and JW are from the School of Biomedical Engineering, Tianjin Medical University, Tianjin, China. LZ is from the School of Computer Science and Technology, Tianjin University, Tianjin, China.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Is a list of the human interactome utilized in this study. The human protein interaction network contains 16,022 genes/proteins and 228,122 interactions. (TXT 5293 kb)
Is presenting a list of genes associated with Alzheimer’s disease and Table S2 presenting the GO biological process terms enriched in Alzgset. (DOC 990 kb)
About this article
Cite this article
Hu, YS., Xin, J., Hu, Y. et al. Analyzing the genes related to Alzheimer’s disease via a network and pathway-based approach. Alz Res Therapy 9, 29 (2017). https://doi.org/10.1186/s13195-017-0252-z
- Alzheimer’s disease
- Functional enrichment analysis
- Network analysis
- Pathway crosstalk