- Open Access
A large-scale polygenic risk score analysis identified candidate proteins associated with anxiety, depression and neuroticism
Molecular Brain volume 15, Article number: 66 (2022)
Psychiatric disorders and neuroticism are closely associated with central nervous system, whose proper functioning depends on efficient protein renewal. This study aims to systematically analyze the association between anxiety / depression / neuroticism and each of the 439 proteins. 47,536 pQTLs of 439 proteins in brain, plasma and cerebrospinal fluid (CSF) were collected from recent genome-wide association study. Polygenic risk scores (PRS) of the 439 proteins were then calculated using the UK Biobank cohort, including 120,729 subjects of neuroticism, 255,354 subjects of anxiety and 316,513 subjects of depression. Pearson correlation analyses were performed to evaluate the correlation between each protein and each of the mental traits by using calculated PRSs as the instrumental variables of protein. In general population, six correlations were identified in plasma and CSF such as plasma protease C1 inhibitor (C1-INH) with neuroticism score (r = − 0.011, P = 2.56 × 10− 9) in plasma, C1-INH with neuroticism score (r = -0.010, P = 3.09 × 10− 8) in CSF, and ERBB1 with self-reported depression (r = − 0.012, P = 4.65 × 10− 5) in CSF. C1-INH and ERBB1 may induce neuroticism and depression by affecting brain function and synaptic development. Gender subgroup analyses found that BST1 was correlated with neuroticism score in male CSF (r = − 0.011, P = 1.80 × 10− 5), while CNTN2 was correlated with depression score in female brain (r = − 0.013, P = 6.43 × 10− 4). BST1 and CNTN2 may be involved in nervous system metabolism and brain health. Six common candidate proteins were associated with all three traits (P < 0.05) and were confirmed in relevant proteomic studies, such as C1-INH in plasma, CNTN2 and MSP in the brain. Our results provide novel clues for revealing the roles of proteins in the development of anxiety, depression and neuroticism.
Neuroticism is a complex health-related personality factor that includes anxiety, moodiness, worrying, and negative emotions, and people affected by neuroticism feel, notice, and report more distress, symptoms and pain . Generalized anxiety disorder is characterized by chronic, pervasive anxiety and worry accompanied by nonspecific physical and psychological symptoms, including restlessness, fatigue, difficulty concentrating, irritability, muscle tension, or difficulty sleeping . Depression often presents with low self‑esteem, low mood, anhedonia, feeling of worthlessness, fatigue, sense of rejection and guilt, suicidal thoughts, among others . Anxiety and depression are common psychiatric disorders with lifetime prevalence of 12.9% (reported in 2014)  and 16.2% (reported in 2003)  respectively. Neuroticism is an important contributing factor for both anxiety and depression . Recent regression analyses concluded that neuroticism significantly predicted depression and anxiety . Nagel et al. performed Mendelian randomization analysis and observed bidirectional associations between neuroticism and depression . Although many genetic variants associated with neuroticism and anxiety/depression were identified, the relationships between these traits at the protein level remains elusive .
Changes of protein abundance in human brain were associated with psychiatric disorders and neurodegenerative diseases [9, 10], involving multiple regulatory mechanisms in transcription and translation, such as miRNA control and ubiquitin proteasome dependent degradation [11, 12]. Felger et al. identified the clusters of cerebrospinal fluid (CSF) inflammatory markers that were correlated with depressive symptom severity . Wang et al. integrated multiple proteomes including cortex, CSF and serum in Alzheimer’s disease (AD), and identified 37 proteins emerged as potential AD biomarker across these three tissues . Studies have shown that the proteins involved in brain, CSF and plasma were significantly different in people with mental disorders than in the general population [15, 16]. Therefore, a systematic study is needed to explore the relationships between anxiety, depression, and neuroticism with proteins in brain, CSF and plasma from a genetic perspective.
Although historically research has focused on transcription as the central governor of protein expression, protein translation is now increasingly being recognized as a major factor for determining protein levels within cells . SNPs in coding region or non-coding region may be associated with expression quantitative trait locus (eQTL) or altered protein quantitative trait loci (pQTL) . Many eQTLs have been identified to be associated with the mRNA expression of psychiatric disorders [19, 20]. However, the mRNA expression of many genes is poorly correlated with protein levels, in part due to the influence of many post-transcriptional factors such as protein translation and degradation . Compared with eQTL, pQTL mapping analysis showed that pQTL could provide more effective insights into the effects of genetic variation . Increasing evidence also suggested that impaired mRNA translation is a common feature found in numerous complex diseases [23, 24]. Thus, pQTL may play a key role in the post-transcriptional regulation mechanism of complex disease-related proteins.
Genome-wide association studies (GWASs) have identified multiple risk variants for complex diseases [8, 25]. Nevertheless, to what extent the risk variants of complex diseases can lead to cumulative risk of individual remains largely unknown. Polygenic risk score (PRS) was proposed to solve this dilemma, which reflects the sum of all known risk loci . PRS is an individual-level score calculated based on the number of risk variants, and weighted by SNP effect sizes derived from an independent large-scaled discovery GWAS . The effect sizes of multiple SNPs are combined into a single aggregate score that can be used to predict the risks of human diseases . Recently, PRS has shown promise in investigating the association between different psychiatric disease . Lin et al. tested the ability to predict brain disorders in postmortem expression datasets and clinical cohorts, and found that PRScis−eQTL scores were associated with late-life depression .
The present study systematically analyzed the association between protein in brain, CSF and plasma with neuroticism, anxiety and depression. The PRS scores of proteins in different tissues were calculated using the genotype data from the UK Biobank cohort, respectively. Pearson correlation analyses were then performed to investigate whether each protein was correlated with neuroticism, anxiety and depression by using calculated PRSs as the instrumental variables of protein. Our study may provide new insights into the application of pQTL data, and highlight the significant impact of proteins on the risks of neuroticism, anxiety and depression.
Neuroticism, anxiety and depression phenotypes in the UK Biobank cohort
The phenotypic and genotypic data used here were derived from the UK Biobank, which has recruited 502,656 participants aged between 40 and 69 years, and conducted a large prospective cohort study from 2006 to 2010 . The UK Biobank has collected a large collection of phenotypic, health-related information for each participant, including biometric and physical measurements, lifestyle indicators and genome-wide genotyping data. The present study accessed health-related records of each participant, including age, sex, smoking and alcohol use, Townsend deprivation index (TDI), body mass index (BMI), and education scores from screenshot question or verbal interview within assessment center. Neuroticism (data fields: 20,217) was defined based on Eysenck personality questionnaire (EPQ) and revised short form (FPQ-R-S) . Anxiety (data fields: 20,421 and 20,420) was defined based on general anxiety disorder (GAD-7) and composite international diagnostic interview short-form (CIDI-SF), while depression (data fields: 20,002, 20,126 and 20,544) was defined based on patient health questionnaire (PHQ-9) and CIDI-SF [32, 33]. In this study, neuroticism used symptom scores, while anxiety and depression used both case-control status and symptom scores. For the case-control phenotype, PHQ score ≤ 5 and GAD score < 5 were defined as the control cut-off for depression and anxiety, respectively. Ethical approval of UK Biobank was granted by the National Health Service National Research Ethics Service (reference 11/NW/0382). Neuroticism, anxiety and depression score were mean-centered and normalized to one standard deviation (SD) before further analysis. The detailed phenotype definitions of neuroticism, anxiety and depression in this study are shown in Additional file 1.
UK Biobank genotyping, imputation and quality control
Genome-wide genotyping was conducted in 489,212 participants with 812,428 SNPs using either the Affymetrix UK BiLEVE Axiom or Affymetrix UK Biobank Axiom array. Imputation was conducted by IMPUTE2 using the reference panel of the Haplotype Reference Consortium, 1000 Genomes and UK10K projects . The SNPs with high linkage disequilibrium (r2 > 0.5) were removed to select high-quality SNPs. 488,377 participants and 805,426 SNPs were kept after applying quality control measures. The researchers provided a list of 409,728 participants who self-report ethnicity as “British” and who have very similar genetic ancestral backgrounds according to the PCA. This set of individuals was referred as the “white British ancestry subset” (UK Biobank field ID: 21000) . After removing participants who reported inconsistencies between self-reported gender and genetic gender, as well as whom missing covariate information, 376,806 participants were retained for further analysis. Details of the array design, genotyping, and quality control procedures were published elsewhere .
Polygenic risk score datasets of neurological proteins
2678 pQTLs of 70 proteins in brain, 11,605 pQTLs of 152 proteins in plasma and 33,253 pQTLs of 217 proteins in CSF were collected from the proteome atlas of neurological disorders (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521603/) . Briefly, protein samples from 1537 participants included three tissue types: CSF (collected from living individuals), plasma (collected from living individuals) and brain (collected from fresh frozen human parietal lobes). The proteomics data were processed using SomaDataIO (v1.8.0) and Biobase (v2.42.0). Proteins were mapped to UniProt identifiers and Entrez Gene symbols. Ensembl gene IDs and genomic position mapping was performed using gencode version 30 . Based on the original research , QC on both proteins and samples were described as follows. The protein level QC, starting from 1305 proteins; after step-1, Limit Of Detection VS 2-StDeviation, 807 CSF, 1301 plasma, 1109 brain proteins were kept with a pass-rate ≥85%; after step-2, given Max Difference of Scale Factor < 0.5, 749 CSF, 956 plasma, 1107 brain proteins were kept; after step-3, given Coefficient of Variation (of calibrator) < 0.15 and step-4, given IQR, sum(outliers) < 15%, 746 CSF, 955 plasma, 1106 brain proteins were kept. After step-5, 713 CSF, 931 plasma, 1079 brain proteins that shared by < 30 samples, < 10 samples, and < 21 samples (shared by ~ 80% of the subject outliers) were kept, respectively. The sample level QC, the proteomics from 1300 CSF, 648 plasma and 459 brain samples were profiled within each tissue. 971 CSF, 636 plasma and 458 brain samples were from unique donors in proteomics data. 965 CSF, 633 plasma and 450 brain samples were kept with available genotyping array data. 875 CSF, 561 plasma and 426 brain samples were kept with a European ancestry after adjusting for principal components. Moreover, 853 CSF, 542 plasma and 400 brain samples were kept that were not closely related with one another (PI_HAT < 0.05) after checking identity by descent. Finally, 835 CSF, 529 plasma and 380 brain samples remained by passing both the genotype and protein data QC. After removing low-quality SNPs, genotype imputation was performed using the Impute2 program with haplotypes derived from the 1000 Genomes Project. SNPs with an info-score quality of less than 0.3 reported by Impute2, with a MAF < 0.02 or out of HWE were removed . A total of 14,059,245 imputed and directly genotyped SNPs were used for final analyses. To test the association between genetic variants and protein levels, a linear regression with additive model was performed using age, sex, principal component factors from population stratification and genotype platform as covariates . The detailed information of sample collection, aptamer-based proteomics, proteomic and genomic data QC process, pQTL identification, and annotation of pQTL were described elsewhere .
PRS calculation of pQTL in the UK Biobank cohort
The linkage disequilibrium independent SNPs (r2 > 0.5) were first pruned for each protein using PLINK 2.0. According to the standard approach, PLINK 2.0 was used to calculate the PRS of each study subject for each protein using linkage disequilibrium independent SNPs and individual genotype data from the UK Biobank (http://www.cog-genomics.org/plink/2.0/) . Briefly, we set PRSn denotes the PRS value of pQTL for the nth subject, defined as:
where l denotes the total number of pQTL associated SNPs; Ei denotes the effect size of significant pQTL associated SNP i; Din denotes the dosage of the risk allele of the ith SNP for the nth individual (0 is coded for homozygous protective genotype, 1 for heterozygous and 2 for homozygous polymorphic genotypes).
Covariates in regression models
Alcohol use frequency/week, smoking frequency/day, body mass index (BMI), education score and Townsend deprivation index (TDI) were used as the covariates in regression models to improve the accuracy of our analysis. The association between smoking with depression and anxiety was found to be bidirectional, with occasional smoking initially used to alleviate symptoms, but in fact worsening them over time . A longitudinal follow-up study suggested that alcohol consumption as a risk factor for anxiety and depression . Torgersen et al. found a shared genetic structure between neuroticism and BMI, of which 61 of the shared loci with BMI are novel for neuroticism . Recently, we found the relevance of the TDI to psychiatric disorders such as anxiety and depression, and identified several candidate genes interacting with the TDI . TDI (data field: 189) was calculated immediately prior to participant joining UK Biobank, based on the preceding national census output areas. Each participant is assigned a score corresponding to the output area in which their postcode is located. Education scores (level 1–5 variables, representing the level of education from low to high) were constructed by mapping each major educational qualification in UK Biobank (data field: 6138) to an International Standard Classification of Education (ISCED) category .
The neuroticism score, anxiety score and depression score were firstly adjusted for top 3 principle components of population structure (PC1-PC3), sex, age, alcohol use frequency/week, smoking frequency/day, TDI, BMI, and education score as covariates using linear regression models. The self-reported anxiety and self-reported depression were firstly adjusted for the same covariates as above using logistic regression models. The residuals from regression models were then used as the phenotypic values for Pearson correlation analysis. Pearson correlation analysis was then performed to evaluate the correlation between each protein and each of the phenotypes by using calculated PRS as the instrumental variables of protein. The R software (version R 3.5.3) was used to conduct Pearson correlation analysis. The significant association thresholds should be P < 0.05/(number of independent protein) after strict Bonferroni correction. There were 70, 152, and 217 independent proteins in brain, plasma, and CSF, respectively. Therefore, P value thresholds were set at 7.14 × 10− 4 for brain, 3.29 × 10− 4 for plasma, and 2.30 × 10− 4 for CSF.
Validation of candidate proteins in proteomic studies
The association signals of proteins and pQTLs from previous proteomic studies were used to validate our results. Firstly, relevant proteomic studies were searched to verify the common proteins associated with anxiety, depression, and neuroticism in our study. In detail, a comprehensive literature search was conducted in PubMed up until December 1, 2021. The keywords in the search strategy included (proteomic[Title]) AND (anxiety[Title/Abstract]), (proteomic[Title]) AND (depression[Title/Abstract]), and (proteomic[Title]) AND (neuroticism[Title/Abstract]). The pQTL data in QTLbase were subsequently used to validate the significant relevant pQTLs in our study. QTLbase organizes and compiles genome-wide QTL summary statistics for many human molecular traits across over 70 tissue or cell types . The database comprises tens of millions significant genotype-molecular trait associations under different conditions. Search by trait in QUERY option was used to verify pQTLs in brain tissues according to protein name and the corresponding EntrezGeneSymbol.
Descriptive characteristics of study samples
The descriptive characteristics of participants in this study are presented in Table 1. There were 120,729, 316,513 and 255,354 study subjects for neuroticism score, depression score and anxiety score, respectively. Correlation matrix among covariates and neuroticism, anxiety, and depression score are presented in Additional file 2.
Disease/trait-associated proteins in general population
In total samples, one significant association were observed in plasma (Bonferroni-adjusted P value threshold: 3.29 × 10− 4), plasma protease C1 inhibitor (C1-Esterase Inhibitor) vs. neuroticism score (r = -0.01, P = 2.56 × 10− 9) (Fig. 1). In CSF (Bonferroni-adjusted P value threshold: 2.30 × 10− 4), five significant association were observed such as C1-Esterase Inhibitor vs. neuroticism score (r = -0.01, P = 3.09 × 10− 8), and NADPH-cytochrome P450 reductase (NADPH-P450 Oxidoreductase) vs. neuroticism score (r = -0.008, P = 2.51 × 10− 5) (Fig. 1).
The gender characteristics of disease/trait-associated proteins
In male participants of the UK Biobank cohort, 3 significant association was observed in CSF, such as ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 2 (BST1) vs. neuroticism score (r = -0.01, P = 1.80 × 10− 5), while there was no significant signal in plasma (Fig. 2). In female participants, 3 significant association were observed, including CNTN2 vs. depression score in brain (Bonferroni-adjusted P value threshold: 7.14 × 10− 4, r = 0.01, P = 6.43 × 10− 4), C1-Esterase Inhibitor vs. neuroticism score in CSF (r = -0.01, P = 1.83 × 10− 5) and plasma (r = -0.01, P = 5.73 × 10− 7) (Fig. 2).
Differences in disease/trait-associated proteins in brain, cerebrospinal fluid, and plasma
After combining the candidate proteins (P < 0.05) of anxiety score and self-reported anxiety, depression score and self-reported depression, respectively, we obtained 15, 17 and 29 candidate proteins for anxiety in brain, plasma and CSF; 13, 29 and 38 candidate proteins for depression in brain, plasma and CSF; 6, 15 and 37 candidate proteins for neuroticism in brain, plasma and CSF, respectively (Fig. 3).We detected several disorder-specific and common proteins among the three traits. For example, human chorionic gonadotropin (r = -0.007, P = 1.86 × 10− 2) and luteinizing hormone (r = − 0.007, P = 1.95 × 10− 2) were associated only with anxiety in plasma. Copine-1 (r = 0.007, P = 1.13 × 10− 2) and complement C4b (r = − 0.007, P = 1.31 × 10− 2) were associated only with depression in brain. Plasminogen (r = -0.004, P = 4.26 × 10− 2) and macrophage-capping protein (r = − 0.005, P = 7.22 × 10− 3) were associated only with neuroticism in CSF. C1-Esterase Inhibitor, BST1, UBP25 and Siglec-3 were associated with all the three traits in plasma.
Validation of common disease/trait-associated proteins in independent proteomic studies
To confirm the validity of our analysis, we validated the candidate proteins in our results that were commonly associated with neuroticism, depression and anxiety in other independent omics studies. As shown in Table 2, a total of 6 candidate proteins in our study have association signals in independent proteomic studies. For example, Contactin-2 (CNTN2) and Hepatocyte growth factor-like protein (MSP) were significantly correlated with anxiety and depression in the study of mouse proteomes by combining mass spectrometry. Ubiquitin carboxyl-terminal hydrolase 25 (UBP25) and Proprotein convertase subtilisin/kexin type 9 (PCSK9) were associated with neuroticism in human isobaric tags for relative and absolute quantification (iTRAQ)-based quantitative proteomic analysis. C1-esterase Inhibitor was significantly down-regulated in human CSF proteome of depression vs. control study.
Validation of disease/trait-associated pQTLs in independent proteomic studies
Our results were confirmed in other proteomic studies. As shown in Fig. 4, 8 pQTLs of NADPH-P450 Oxidoreductase (POR) were detected in the brain-spinal cord and linked to Alzheimer’s disease using genomic and multi-tissue proteomic integration. 18 pQTLs of BST1 were detected in the brain and linked to psychiatric disorders using genome-wide quantitative trait loci mapping of the human cerebrospinal fluid proteome such as depression. Detailed results of effectiveness evaluation are shown in Additional file 3.
In this study, we conducted a large observational and genetic PRS analyze to systematically evaluate the correlations between proteins and complex traits (anxiety, depression and neuroticism) using the UK Biobank cohort. We observed multiple significant correlations in plasma, CSF and brain. Further analysis provided evidence for gender differences between complex traits with protein in different tissues.
The proteomic dataset we used here is the largest brain and CSF pQTL analyses to date, as well as the first neurologically-relevant multi-tissue pQTL study, and a unique resource for leveraging multi-tissue pQTL to understand neurological traits . The large sample sizes and well study design ensure the accuracy of tissue-specific protein identification. Understanding the tissue-specific genetic controls of protein level is essential to uncover mechanisms of post-transcriptional gene regulation. We used this proteomic dataset to explore the tissue-specific protein characteristics for anxiety, depression and neuroticism. Recent research confirmed that neurology and psychiatry both addressed disorders of the nervous system . For example, psychiatric and neurologic depression seem to share common abnormalities and similar lesions in specific brain areas .
C1-Esterase inhibitor (C1-INH) in CSF and plasma were found to be significantly associated with neuroticism score in our study. The biologic activities of C1-INH may be divided into regulation of vascular permeability and anti-inflammatory functions . We also found a negative correlation between C1-INH and neuroticism score. Recently, a number of studies confirmed the neuroprotection role of C1-INH that supports our results. For example, Mercurio et al. indicated that recombinant human C1-INH exhibited stronger neuroprotective effects than the corresponding plasma-derived protein after experimental ischemia/reperfusion injury in the brain . Earlier studies found that C1-INH was produced in normal brain, whereas in Alzheimer disease (AD), C1-INH was significantly responsive to abnormal neuronal processes, such as dystrophic neurites and neuropil threads .
High neuroticism is a well-established risk for present and future depression and anxiety, as well as an emerging target for treatment and prevention . It was notable that there were gender-specific proteins in samples with neuroticism. For the neuroticism-related proteins, BST1 was detected in males CSF, while C1-INH was detected in female plasma and CSF. BST1 is associated with the metabolism of nervous system and anxiety / depression-like behaviors. Higashida et al. tested BST1 knockout mice of various ages to assess the relationship between the presence of BST1 in the brain and its enzymatic activity, and indicated that BST1 might play a role in the embryonic and adult nervous systems . After knocking out the BST1, the BST1−/− male mice exhibited anxiety-related and depression-like behaviors compared with wild-type mice . The SNP of BST1 gene was found to be associated with multiple neurological and psychiatric conditions, including ASD, PD, and SCZ . In addition, rs4698412 allele variant in BST1 was shown to regulate lingual gyrus function and might be associated with brain activation and balance dysfunctions in PD .
Growing evidence highlights the similarities in psychoactive metabolites and microbiota-gut-brain axis among ASD, PD and AD. For example, psychobiotics are effective in improving neurodegenerative and neurodevelopmental disorders, including ASD, PD and AD . The alterations in gut microbiome composition or diversity are implicated in the pathophysiology of neuropsychiatric disorders such as depression and anxiety, behavioural disorders such as ASD, and neurodegenerative disorders such as AD and PD . Recent research confirmed that neurology and psychiatry both addressed disorders of the nervous system . For example, psychiatric and neurologic depression seem to share common abnormalities and similar lesions in specific brain areas . Depression and anxiety are common neuropsychiatric comorbidities of PD, and the somatic symptoms of depression often overlap with the motor symptoms of PD .
NADPH-P450 Oxidoreductase (POR) in CSF is another protein negatively associated with neuroticism score in our study. POR has a major role in metabolism of drugs and steroids . Appropriate regulation of retinoic acid levels and tissue distribution by POR is essential for early embryonic development, brain morphogenesis and molecular patterning . POR was detected in multiple brain areas. For example, immunohistochemistry test in rats showed that POR was located in the dopaminergic rich region of the periventricular hypothalamus and arcuate nucleus . Haglund et al. found POR in the nigra, locus coereleus, dorsal raphe, hypothalamus, striatum, nucleus accumbens and olfactory tubercle . Studies in the past decade have shown that POR could affect brain function and nervous system through the metabolism of nitric oxide , synapses forming , activity of Ca2+ channels , and cellular defense . Together, these findings suggest that POR may relate to neuroticism by affecting brain function and neural development, but need more direct evidence.
Apolipoprotein L1 (APOL1) in plasma was found to be suggestively associated with self-reported depression in our study. APOL1 is ubiquitously expressed in human central nervous system (CNS), but at lower levels than that in peripheral tissues . Situ hybridization studies also demonstrated pan-neuronal expression of APOL1 mRNA in human frontal cortex . Recent studies found that APOL1 was involved in brain structure and psychiatric disorders. For example, high-throughput molecular spectroscopy studies proved that APOL was an important factor in psychiatric disorders . Hwang et al. used RNA-seq data from postmortem brain tissue hippocampus, and found that APOL1 was one of the differentially up-regulated genes in patients with SCZ . Hence, it can be inferred that APOL1 may induce abnormal function in the hippocampus, and may play a vital role in depression development.
This is the first systematic study of the relationship between proteins and depression / anxiety / neuroticism. However, our study does have certain limitations. The protein PRS data were collected from European ancestry, at the age averaged 82.2 years. The protein PRSs were calculated using the UK Biobank cohort, which were mainly middle-aged European populations. Thus, our findings should be carefully applied to young-aged and other ethnic populations. Besides, the pQTL dataset included both neurological disorder individuals and cognitively normal controls, which may result in slight bias on our results. Thirdly, since weight gain can be a symptom of depression, our adjustment for BMI in the regression analysis of depression phenotypes may have a potential bias for the results. In future studies, we need to validate our results using independent clinical samples or cohorts and explore the potential biological mechanism underlying the observed association between candidate proteins with depression, anxiety, and neuroticism.
Taken together, we systematically investigated the associations between proteins with depression, anxiety, and neuroticism utilizing UK Biobank individual level traits and genotype data and publicly available protein PRS data of brain, CSF, and plasma. Our study highlights the associations between neuroticism with C1-INH and POR, and may provide novel insights to uncover the roles of protein on the development of depression, anxiety and neuroticism.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Protein quantitative trait loci
Polygenic risk scores
Expression quantitative trait locus
Genome-wide association studies
Townsend deprivation index
Body mass index
Autism spectrum disorder
Friedman HS. Neuroticism and health as individuals age. Personal Disord. 2019;10(1):25–32.
DeMartini J, Patel G, Fancher TL. Generalized anxiety disorder. Ann Intern Med. 2019;170(7):Itc49-64.
Villas Boas GR, Boerngen de Lacerda R, Paes MM, Gubert P, Almeida W, Rescia VC, de Carvalho PMG, de Carvalho AAV, Oesterreich SA. Molecular aspects of depression: a review from neurobiology to treatment. Eur J Pharmacol. 2019;851:99–121.
Steel Z, Marnane C, Iranpour C, Chey T, Jackson JW, Patel V, Silove D. The global prevalence of common mental disorders: a systematic review and meta-analysis 1980–2013. Int J Epidemiol. 2014;43(2):476–493.
Kessler RC, Berglund P, Demler O, Jin R, Koretz D, Merikangas KR, Rush AJ, Walters EE, Wang PS. The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). JAMA. 2003;289(23):3095–3105.
Liao A, Walker R, Carmody TJ, Cooper C, Shaw MA, Grannemann BD, Adams P, Bruder GE, McInnis MG, Webb CA, et al. Anxiety and anhedonia in depression: Associations with neuroticism and cognitive control. J Affect Disord. 2019;245:1070–1078.
Smith EM, Reynolds S, Orchard F, Whalley HC, Chan SW. Cognitive biases predict symptoms of depression, anxiety and wellbeing above and beyond neuroticism in adolescence. J Affect Disord. 2018;241:446–453.
Nagel M, Jansen PR, Stringer S. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat Genet. 2018;50(7):920–927.
Liu J, Li X, Luo XJ. Proteome-wide association study provides insights into the genetic component of protein abundance in psychiatric disorders. Biol Psychiatry. 2021;S0006-3223(21):01431–01431.
Raychaudhuri S, Dey S, Bhattacharyya NP, Mukhopadhyay D. The role of intrinsically unstructured proteins in neurodegenerative diseases. PLoS ONE. 2009;4(5):e5566.
Babu MM, van der Lee R, de Groot NS, Gsponer J. Intrinsically disordered proteins: regulation and disease. Curr Opin Struct Biol. 2011;21(3):432–440.
Vacic V, Iakoucheva LM. Disease mutations in disordered regions-exception to the rule? Mol BioSyst. 2012;8(1):27–32.
Felger JC, Haroon E. What does plasma CRP tell us about peripheral and central inflammation in depression? Mol Psychiatry. 2020;25(6):1301–1311.
Wang H, Dey KK, Chen PC, Li Y, Niu M, Cho JH, Wang X, Bai B, Jiao Y, Chepyala SR, et al. Integrated analysis of ultra-deep proteomes in cortex, cerebrospinal fluid and serum reveals a mitochondrial signature in Alzheimer’s disease. Mol Neurodegener. 2020;15(1):43.
Saia-Cereda VM, Cassoli JS, Martins-de-Souza D, Nascimento JM. Psychiatric disorders biochemical pathways unraveled by human brain proteomics. Eur Arch Psychiatry Clin NeuroSci. 2017;267(1):3–17.
Al Shweiki MR, Oeckl P, Steinacker P, Hengerer B, Schönfeldt-Lecuona C, Otto M. Major depressive disorder: insight into candidate cerebrospinal fluid protein biomarkers from proteomics studies. Expert Rev Proteomics. 2017;14(6):499–514.
Laguesse S, Ron D. Protein Translation and Psychiatric Disorders. The Neuroscientist: a review journal bringing neurobiology neurology and psychiatry. 2020;26(1):21–42.
Zheng Z, Huang D, Wang J, Zhao K, Zhou Y, Guo Z, Zhai S, Xu H, Cui H, Yao H, et al. QTLbase: an integrative resource for quantitative trait loci across multiple human molecular phenotypes. Nucleic Acids Res. 2020;48(D1):D983-d991.
Morgan LZ, Rollins B, Sequeira A, Byerley W, DeLisi LE, Schatzberg AF, Barchas JD, Myers RM, Watson SJ, Akil H, et al. Quantitative trait locus and brain expression of HLA-DPA1 Offers evidence of shared immune alterations in psychiatric disorders. Microarrays. 2016;5(1):6.
Yang Z, Zhou D, Li H, Cai X, Liu W, Wang L, Chang H, Li M. The genome-wide risk alleles for psychiatric disorders at 3p21.1 show convergent effects on mRNA expression, cognitive function, and mushroom dendritic spine. Mol Psychiatry. 2020;25(1):48–66.
Robins C, Liu Y, Fan W, Duong DM, Meigs J, Harerimana NV, Gerasimov ES, Dammer EB, Cutler DJ, Beach TG, et al. Genetic control of the human brain proteome. Am J Hum Genet. 2021;108(3):400–410.
Wu L, Candille SI, Choi Y, Xie D, Jiang L, Li-Pook-Than J, Tang H, Snyder M. Variation and genetic control of protein abundance in humans. Nature. 2013;499(7456):79–82.
Aguilar-Valles A, Haji N, De Gregorio D, Matta-Camacho E, Eslamizade MJ, Popic J. Translational control of depression-like behavior via phosphorylation of eukaryotic translation initiation factor 4E. Nat Commun. 2018;9(1):2459.
English JA, Fan Y, Föcking M, Lopez LM, Hryniewiecka M, Wynne K, Dicker P, Matigian N, Cagney G, Mackay-Sim A, et al. Reduced protein synthesis in schizophrenia patient-derived olfactory cells. Transl Psychiatry. 2015;5(10):e663.
Howard DM, Adams MJ. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci. 2019;22(3):343–352.
Purcell SM, Wray NR, Stone JL, Visscher PM, O’Donovan MC, Sullivan PF, Sklar P. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460(7256):748–752.
Dudbridge F. Polygenic Epidemiology. Genet Epidemiol. 2016;40(4):268–272.
Shen X, Howard DM. A phenome-wide association and Mendelian Randomisation study of polygenic risk for depression in UK Biobank. Nat Commun. 2020;11(1):2301.
Lin CW, Chang LC, Ma T, Oh H, French B. Older molecular brain age in severe mental illness. Mol Psychiatry. 2021;26(7):3646–3656.
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, Motyer A, Vukcevic D, Delaneau O, O’Connell J, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–209.
Eysenck SBG, Eysenck HJ, Barrett P. A revised version of the psychoticism scale. Pers Individ Dif. 1990;6(1):21–9.
Kroenke K, Spitzer RL, Williams JB, Löwe B. The patient health questionnaire somatic, anxiety, and depressive symptom scales: a systematic review. Gen Hosp Psychiatry. 2010;32(4):345–59.
Davis KAS, Cullen B. Indicators of mental disorders in UK Biobank-A comparison of approaches. Int J Methods Psychiatr Res. 2019;28(3):e1796.
Yang C, Farias FHG. Genomic atlas of the proteome from brain, CSF and plasma prioritizes proteins implicated in neurological disorders. Nat Neurosci. 2021;24(9):1302–1312.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–575.
Fluharty M, Taylor AE, Grabski M, Munafò MR. The association of cigarette smoking with depression and anxiety: a systematic review. Nicotine Tob Res. 2017;19(1):3–13.
Haynes JC, Farrell M, Singleton N, Meltzer H, Araya R, Lewis G, Wiles NJ. Alcohol consumption as a risk factor for anxiety and depression: results from the longitudinal follow-up of the National Psychiatric Morbidity Survey. Br J Psychiatry J Mental Sci. 2005;187:544–51.
Torgersen K, Bahrami S, Frei O, Shadrin A. Shared genetic architecture between neuroticism, coronary artery disease and cardiovascular risk factors. Transl Psychiatry. 2021;11(1):368.
Ye J, Wen Y, Sun X, Chu X, Li P, Cheng B, Cheng S, Liu L, Zhang L, Ma M, et al. Socioeconomic Deprivation Index Is Associated With Psychiatric Disorders: An Observational and Genome-wide Gene-by-Environment Interaction Analysis in the UK Biobank Cohort. Biol Psychiatry. 2021;89(9):888–895.
Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, Turley P, Chen GB, Emilsson V, Meddens SF, et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533(7604):539–542.
Silbersweig D. Integrating Models of Neurologic and Psychiatric Disease. JAMA Neurol. 2017;74(7):759–760.
Benedetti F, Bernasconi A, Pontiggia A. Depression and neurological disorders. Curr Opin Psychiatry. 2006;19(1):14–18.
Davis AE 3rd, Mejia P, Lu F. Biological activities of C1 inhibitor. Mol Immunol. 2008;45(16):4057–4063.
Mercurio D, Piotti A, Valente A, Oggioni M, Ponstein Y, Van Amersfoort E, Gobbi M, Fumagalli S, De Simoni MG. Plasma-derived and recombinant C1 esterase inhibitor: Binding profiles and neuroprotective properties in brain ischemia/reperfusion injury. Brain Behav Immun. 2021;93:299–311.
Walker DG, Yasuhara O, Patston PA, McGeer EG, McGeer PL. Complement C1 inhibitor is produced by brain tissue and is cleaved in Alzheimer disease. Brain Res. 1995;675(1–2):75–82.
Vittengl JR. Who pays the price for high neuroticism? Moderators of longitudinal risks for depression and anxiety. Psychol Med. 2017;47(10):1794–1805.
Higashida H, Liang M, Yoshihara T, Akther S, Fakhrul A, Stanislav C, Nam TS, Kim UH, Kasai S, Nishimura T, et al. An immunohistochemical, enzymatic, and behavioral study of CD157/BST-1 as a neuroregulator. BMC Neurosci. 2017;18(1):35.
Lopatina O, Yoshihara T, Nishimura T, Zhong J, Akther S, Fakhrul AA, Liang M, Higashida C, Sumi K, Furuhara K, et al. Anxiety- and depression-like behavior in mice lacking the CD157/BST1 gene, a risk factor for Parkinson’s disease. Front Behav Neurosci. 2014;8:133.
Higashida H, Hashii M, Tanaka Y, Matsukawa S, Higuchi Y, Gabata R, Tsubomoto M, Seishima N, Teramachi M, Kamijima T, et al. CD38, CD157, and RAGE as molecular determinants for social behavior. Cells. 2019;9(1):62.
Shen YT, Wang JW, Wang M, Zhi Y, Li JY, Yuan YS, Wang XX, Zhang H, Zhang KZ. BST1 rs4698412 allelic variant increases the risk of gait or balance deficits in patients with Parkinson’s disease. CNS Neurosci Ther. 2019;25(4):422–429.
Cheng LH, Liu YW, Wu CC, Wang S, Tsai YC. Psychobiotics in mental health, neurodegenerative and neurodevelopmental disorders. J Food Drug Anal. 2019;27(3):632–48.
Wiley NC, Cryan JF, Dinan TG, Ross RP, Stanton C. Production of psychoactive metabolites by gut bacteria. Mod Trends Psychiatry. 2021;32:74–99.
Ray S, Agarwal P. Depression and Anxiety in Parkinson Disease. Clin Geriatr Med. 2020;36(1):93–104.
Riddick DS, Ding X, Wolf CR, Porter TD, Pandey AV, Zhang QY, Gu J, Finn RD, Ronseaux S, McLaughlin LA, et al. NADPH-cytochrome P450 oxidoreductase: roles in physiology, pharmacology, and toxicology. Drug Metab Dispos. 2013;41(1):12–23.
Ribes V, Otto DM, Dickmann L, Schmidt K, Schuhbaur B, Henderson C, Blomhoff R, Wolf CR, Tickle C, Dollé P. Rescue of cytochrome P450 oxidoreductase (Por) mouse mutants reveals functions in vasculogenesis, brain and limb patterning linked to retinoic acid homeostasis. Dev Biol. 2007;303(1):66–81.
Riedl AG, Watts PM, Edwards RJ, Boobis AR, Jenner P, Marsden CD. Selective localisation of P450 enzymes and NADPH-P450 oxidoreductase in rat basal ganglia using anti-peptide antisera. Brain Res. 1996;743(1–2):324–328.
Haglund L, Köhler C, Haaparanta T, Goldstein M, Gustafsson JA. Presence of NADPH-cytochrome P450 reductase in central catecholaminergic neurones. Nature. 1984;307(5948):259–262.
Hall CN, Keynes RG, Garthwaite J. Cytochrome P450 oxidoreductase participates in nitric oxide consumption by rat brain. Biochem J. 2009;419(2):411–418.
Li Z, Chadalapaka G, Ramesh A, Khoshbouei H, Maguire M, Safe S, Rhoades RE, Clark R, Jules G, McCallister M, et al. PAH particles perturb prenatal processes and phenotypes: protection from deficits in object discrimination afforded by dampening of brain oxidoreductase following in utero exposure to inhaled benzo(a)pyrene. Toxicol Sci. 2012;125(1):233–47.
Qu W, Bradbury JA, Tsao CC, Maronpot R, Harry GJ, Parker CE, Davis LS, Breyer MD, Waalkes MP, Falck JR, et al. Cytochrome P450 CYP2J9, a new mouse arachidonic acid omega-1 hydroxylase predominantly expressed in brain. J Biol Chem. 2001;276(27):25467–25479.
Ben-Shlomo R, Akhtar RA, Collins BH, Judah DJ, Davies R, Kyriacou CP. Light pulse-induced heme and iron-associated transcripts in mouse brain: a microarray analysis. Chronobiol Int. 2005;22(3):455–471.
Duchateau PN, Pullinger CR, Orellana RE, Kunitake ST, Naya-Vigne J, O’Connor PM, Malloy MJ, Kane JP. Apolipoprotein L, a new human high density lipoprotein apolipoprotein expressed by the pancreas. Identification, cloning, characterization, and plasma distribution of apolipoprotein L. J Biol Chem. 1997;272(41):25576–25582.
Mimmack ML, Ryan M, Baba H, Navarro-Ruiz J, Iritani S, Faull RL, McKenna PJ, Jones PB, Arai H, Starkey M, et al. Gene expression analysis in schizophrenia: reproducible up-regulation of several members of the apolipoprotein L family located in a high-susceptibility locus for schizophrenia on chromosome 22. Proc Natl Acad Sci USA. 2002;99(7):4680–4685.
Sutcliffe JG, Thomas EA. The neurobiology of apolipoproteins in psychiatric disorders. Mol Neurobiol. 2002;26(2–3):369–388.
Hwang Y, Kim J, Shin JY, Kim JI, Seo JS, Webster MJ, Lee D, Kim S. Gene expression profiling by mRNA sequencing reveals increased expression of immune/inflammation-related genes in the hippocampus of individuals with schizophrenia. Transl Psychiatry. 2013;3(10):e321.
Tang M, Huang H, Li S, Zhou M, Liu Z, Huang R, Liao W, Xie P, Zhou J. Hippocampal proteomic changes of susceptibility and resilience to depression or anxiety in a rat model of chronic mild stress. Transl Psychiatry. 2019;9(1):260.
Tian L, You HZ, Wu H, Wei Y, Zheng M, He L, Liu JY, Guo SZ, Zhao Y, Zhou RL, et al. iTRAQ-based quantitative proteomic analysis provides insight for molecular mechanism of neuroticism. Clin Proteomics. 2019;16:38.
Yuan X, Chen B. Depression and anxiety in patients with active ulcerative colitis: crosstalk of gut microbiota, metabolomics and proteomics. Gut Microbes. 2021;13(1):1987779.
Ditzen C, Tang N, Jastorff AM, Teplytska L, Yassouridis A, Maccarrone G, Uhr M, Bronisch T, Miller CA, Holsboer F, et al. Cerebrospinal fluid biomarkers for major depression confirm relevance of associated pathophysiology. Neuropsychopharmacology. 2012;37(4):1013–25.
Wingo TS, Liu Y, Gerasimov ES. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat Neurosci. 2021;24(6):810–817.
This work was supported by the National Natural Scientific Foundation of China (81922059), the Natural Science Basic Research Plan in Shaanxi Province of China (2021JCW-08), and the Fundamental Research Funds for the Central Universities.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1.
Definitions of criterion for phenotypes in UK Biobank cohort
Additional file 2.
Correlation matrix among the covariates and neuroticism, anxiety, and depression score
Additional file 3.
Detailed results of effectiveness evaluation of protein PRS method in independent omics studies
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Cheng, B., Yang, X., Cheng, S. et al. A large-scale polygenic risk score analysis identified candidate proteins associated with anxiety, depression and neuroticism. Mol Brain 15, 66 (2022). https://doi.org/10.1186/s13041-022-00954-3
- Polygenic risk score
- Protein quantitative trait loci