Dr Zhanna Balkhiiarova
Publications
Introduction Polygenic Score (PGS) is a valuable method for assessing the estimated genetic liability to a given outcome or genetic variability contributing to a quantitative trait. While PRSs are widely used for complex traits, their application in uncovering shared genetic predisposition between phenotypes, i.e. when genetic variants influence more than one phenotype, remains limited. Methods We developed an R package, comorbidPGS, which facilitates a systematic evaluation of shared genetic effects among (cor)related phenotypes using PGSs. The comorbidPGS package takes as input a set of Single Nucleotide Polymorphisms (SNPs) along with their established effects on the original phenotype (Po), referred to as Po-PGS. It generates a comprehensive summary of effect(s) of Po-PGS on target phenotype(s) (Pt) with customisable graphical features. Results We applied comorbidPGS to investigate the shared genetic predisposition between phenotypes defining elevated blood pressure (Systolic Blood Pressure, SBP; Diastolic Blood Pressure, DBP; Pulse Pressure, PP) and several cancers (Breast Cancer, BrC; Pancreatic Cancer, PanC; Kidney Cancer, KidC; Prostate Cancer, PrC; Colorectal Cancer, CrC) using the European ancestry UK Biobank individuals and GWAS meta-analyses summary statistics from independent set of European ancestry individuals. We report a significant association between elevated DBP and the genetic risk of PrC (β (SE)=0.066 (0.017), P-value=9.64×10^(-5)), as well as between CrC PGS and both, lower SBP (β (SE)=-0.10 [0.029], P-value=3.83×10^(-4))) and lower DBP (β (SE)=-0.055 [0.017], P-value=1.05×10^(-3)). Our analysis highlights two nominally significant relationships for individuals with genetic predisposition to elevated SBP leading to higher risk of KidC (OR [95%CI]=1.04 [1.0039-1.087], P-value=2.82×10^(-2)) and PrC (OR [95%CI]=1.02 [1.003-1.041], P-value=2.22×10^(-2)). Conclusion Using comorbidPGS, we underscore mechanistic relationships between blood pressure regulation and susceptibility to three comorbid malignancies. This package offers valuable means to evaluate shared genetic susceptibility between (cor)related phenotypes through polygenic scores.
Polycystic ovary syndrome (PCOS) is a very common endocrine condition in women in India. Gut microbiome alterations were shown to be involved in PCOS, yet it is remarkably understudied in Indian women who have a higher incidence of PCOS as compared to other ethnic populations. During the regional PCOS screening program among young women, we recruited 19 drug naive women with PCOS and 20 control women at the Sher-i-Kashmir Institute of Medical Sciences, Kashmir, North India. We profiled the gut microbiome in faecal samples by 16S rRNA sequencing and included 40/58 operational taxonomic units (OTUs) detected in at least 1/3 of the subjects with relative abundance (RA) ≥ 0.1%. We compared the RAs at a family/genus level in PCOS/non-PCOS groups and their correlation with 33 metabolic and hormonal factors, and corrected for multiple testing, while taking the variation in day of menstrual cycle at sample collection, age and BMI into account. Five genera were significantly enriched in PCOS cases: , , and previously reported for PCOS , and confirmed by different statistical models. At the family level, the relative abundance of was enriched, whereas was decreased among cases. We observed increased relative abundance of and with higher fasting blood glucose levels, and and with larger hip, waist circumference, weight, and with lower prolactin levels. We also detected a novel association between and follicle-stimulating hormone levels and between and alkaline phosphatase, independently of the BMI of the participants. Our report supports that there is a relationship between gut microbiome composition and PCOS with links to specific reproductive health metabolic and hormonal predictors in Indian women.
Background Genome-wide association studies have captured a large proportion of genetic variation related to type 1 diabetes mellitus (T1D). However, most of these studies are performed in populations of European ancestry and therefore the disease risk estimations can be inaccurate when extrapolated to other world populations. Methods We conducted a case-control study in 1866 individuals from the three major populations of the Republic of Bashkortostan (Russians, Tatars, and Bashkirs) in Russian Federation, using single-locus and multilocus approach to identify genetic predictors of T1D. Results We found that LTA rs909253 and TNF rs1800629 polymorphisms were associated with T1D in the group of Tatars. Meta-analysis of the association study results in the three ethnic groups has confirmed the association between the T1D risk and LTA rs909253 genetic variant. LTA rs909253 and TNF rs1800629 loci were also featured in combinations most significantly associated with T1D. Conclusion Our findings suggest that LTA rs909253 and TNF rs1800629 polymorphisms are associated with the risk of T1D both independently and in combination with polymorphic markers in other inflammatory genes, and the analysis of multi-allelic combinations provides valuable insight in the study of polygenic traits.
Pulmonary arterial hypertension (PAH) is characterised by pulmonary vascular remodelling causing premature death from right heart failure. Established DNA variants influence PAH risk, but susceptibility from epigenetic changes is unknown. We addressed this through epigenome-wide association study (EWAS), testing 865,848 CpG sites for association with PAH in 429 individuals with PAH and 1226 controls. Three loci, at Cathepsin Z (CTSZ, cg04917472), Conserved oligomeric Golgi complex 6 (COG6, cg27396197), and Zinc Finger Protein 678 (ZNF678, cg03144189), reached epigenome-wide significance (p
Obesity and type 2 diabetes (T2D) are associated with increased risk of pancreatic cancer. Here we assessed the relationship between pancreatic cancer and two distinct measures of obesity, namely total adiposity, using BMI, versus abdominal adiposity, using BMI adjusted waist-to-hip ratio (WHRadjBMI) by utilising polygenic scores (PGS) and Mendelian randomisation (MR) analyses. We constructed z-score weighted PGS for BMI and WHRadjBMI using publicly available data and tested for their association with pancreatic cancer defined in UK biobank (UKBB). Using publicly available summary statistics, we then performed bi-directional MR analyses between the two obesity traits and pancreatic cancer. PGS(BMI) was significantly (multiple testing-corrected) associated with pancreatic cancer (OR[95%CI] = 1.0804[1.025-1.14], P = 0.0037). The significance of association declined after T2D adjustment (OR[95%CI] = 1.073[1.018-1.13], P = 0.00904). PGS(WHRadjBMI) association with pancreatic cancer was at the margin of statistical significance (OR[95%CI] = 1.047[0.99-1.104], P = 0.086). T2D adjustment effectively lost any suggestive association of PGS(WHRadjBMI) with pancreatic cancer (OR[95%CI] = 1.039[0.99-1.097], P = 0.14). MR analyses showed a nominally significant causal effect of WHRadjBMI on pancreatic cancer (OR[95%CI] = 1.00095[1.00011-1.0018], P = 0.027) but not for BMI on pancreatic cancer. Overall, we show that abdominal adiposity measured using WHRadjBMI, may be a more important causal risk factor for pancreatic cancer compared to total adiposity, with T2D being a potential driver of this relationship.
Background The prevalence of depression is higher among those with diabetes than in the general population. The Patient Health Questionnaire (PHQ-9) is commonly used to assess depression in people with diabetes, but measurement invariance of the PHQ-9 across groups of people with and without diabetes has not yet been investigated. Methods Data from three independent cohorts from the USA (n=1,886 with diabetes, n=4,153 without diabetes), Quebec, Canada (n= 800 with diabetes, n= 2,411 without diabetes), and the UK (n=4,981 with diabetes, n=145,570 without diabetes), were used to examine measurement invariance between adults with and without diabetes. A series of multiple group confirmatory factor analyses were performed, with increasingly stringent model constraints applied to assess configural, equal thresholds, and equal thresholds and loadings invariance, respectively. One-factor and two-factor (somatic and cognitive-affective items) models were examined. Results Results demonstrated that the most stringent models, testing equal loadings and thresholds, had satisfactory model fit in the three cohorts for one-factor models (RMSEA = .063 or below and CFI = .978 or above) and two-factor models (RMSEA = .042 or below and CFI = .989 or above). Limitations Data were from Western countries only and we could not distinguish between type of diabetes. Conclusions Results provide support for measurement invariance between groups of people with and without diabetes, using either a one-factor or a two-factor model. While the two-factor solution has a slightly better fit, the one-factor solution is more parsimonious. Depending on research or clinical needs, both factor structures can be used.
Conventional measurements of fasting and postprandial blood glucose levels investigated in genome-wide association studies (GWAS) cannot capture the effects of DNA variability on 'around the clock' glucoregulatory processes. Here we show that GWAS meta-analysis of glucose measurements under nonstandardized conditions (random glucose (RG)) in 476,326 individuals of diverse ancestries and without diabetes enables locus discovery and innovative pathophysiological observations. We discovered 120 RG loci represented by 150 distinct signals, including 13 with sex-dimorphic effects, two cross-ancestry and seven rare frequency signals. Of these, 44 loci are new for glycemic traits. Regulatory, glycosylation and metagenomic annotations highlight ileum and colon tissues, indicating an underappreciated role of the gastrointestinal tract in controlling blood glucose. Functional follow-up and molecular dynamics simulations of lower frequency coding variants in glucagon-like peptide-1 receptor (GLP1R), a type 2 diabetes treatment target, reveal that optimal selection of GLP-1R agonist therapy will benefit from tailored genetic stratification. We also provide evidence from Mendelian randomization that lung function is modulated by blood glucose and that pulmonary dysfunction is a diabetes complication. Our investigation yields new insights into the biology of glucose regulation, diabetes complications and pathways for treatment stratification. Genome-wide association analyses of blood glucose measurements under nonstandardized conditions provide insights into the biology of glucose regulation, diabetes complications and pathways for treatment stratification.
Pubertal growth patterns correlate with future health outcomes. However, the genetic mechanisms mediating growth trajectories remain largely unknown. Here, we modeled longitudinal height growth with Super-Imposition by Translation And Rotation (SITAR) growth curve analysis on ~ 56,000 trans-ancestry samples with repeated height measurements from age 5 years to adulthood. We performed genetic analysis on six phenotypes representing the magnitude, timing, and intensity of the pubertal growth spurt. To investigate the lifelong impact of genetic variants associated with pubertal growth trajectories, we performed genetic correlation analyses and phenome-wide association studies in the Penn Medicine BioBank and the UK Biobank. Large-scale growth modeling enables an unprecedented view of adolescent growth across contemporary and 20th-century pediatric cohorts. We identify 26 genome-wide significant loci and leverage trans-ancestry data to perform fine-mapping. Our data reveals genetic relationships between pediatric height growth and health across the life course, with different growth trajectories correlated with different outcomes. For instance, a faster tempo of pubertal growth correlates with higher bone mineral density, HOMA-IR, fasting insulin, type 2 diabetes, and lung cancer, whereas being taller at early puberty, taller across puberty, and having quicker pubertal growth were associated with higher risk for atrial fibrillation. We report novel genetic associations with the tempo of pubertal growth and find that genetic determinants of growth are correlated with reproductive, glycemic, respiratory, and cardiac traits in adulthood. These results aid in identifying specific growth trajectories impacting lifelong health and show that there may not be a single "optimal" pubertal growth pattern.
Obesity affects over 700 million people worldwide and its prevalence keeps growing steadily. The problem is particularly relevant due to the increased risk of COVID-19 complications and mortality in obese patients. Obesity prevalence increase is often associated with the influence of environmental and behavioural factors, leading to stigmatization of people with obesity due to beliefs that their problems are caused by poor lifestyle choices. However, hereditary predisposition to obesity has been established, likely polygenic in nature. Morbid obesity can result from rare mutations having a significant effect on energy metabolism and fat deposition, but the majority of patients does not present with monogenic forms. Microbiome low diversity significantly correlates with metabolic disorders (inflammation, insulin resistance), and the success of weight loss (bariatric) surgery. However, data on the long-term consequences of bariatric surgery and changes in the microbiome composition and genetic diversity before and after surgery are currently lacking. In this review, we summarize the results of studies of the genetic characteristics of obesity patients, molecular mechanisms of obesity, contributing to the unfavourable course of coronavirus infection, and the evolution of their microbiome during bariatric surgery, elucidating the mechanisms of disease development and creating opportunities to identify potential new treatment targets and design effective personalized approaches for the diagnosis, management, and prevention of obesity.
Depression is a common comorbidity of type 2 diabetes. We assessed the causal relationships and shared genetics between them. We applied two-sample, bidirectional Mendelian randomization (MR) to assess causality between type 2 diabetes and depression. We investigated potential mediation using two-step MR. To identify shared genetics, we performed 1) genome-wide association studies (GWAS) separately and 2) multiphenotype GWAS (MP-GWAS) of type 2 diabetes (19,344 case subjects, 463,641 control subjects) and depression using major depressive disorder (MDD) (5,262 case subjects, 86,275 control subjects) and self-reported depressive symptoms (n = 153,079) in the UK Biobank. We analyzed expression quantitative trait loci (eQTL) data from public databases to identify target genes in relevant tissues. MR demonstrated a significant causal effect of depression on type 2 diabetes (odds ratio 1.26 [95% CI 1.11-1.44], P = 5.46 × 10-4) but not in the reverse direction. Mediation analysis indicated that 36.5% (12.4-57.6%, P = 0.0499) of the effect from depression on type 2 diabetes was mediated by BMI. GWAS of type 2 diabetes and depressive symptoms did not identify shared loci. MP-GWAS identified seven shared loci mapped to TCF7L2, CDKAL1, IGF2BP2, SPRY2, CCND2-AS1, IRS1, CDKN2B-AS1. MDD has not brought any significant association in either GWAS or MP-GWAS. Most MP-GWAS loci had an eQTL, including single nucleotide polymorphisms implicating the cell cycle gene CCND2 in pancreatic islets and brain and the insulin signaling gene IRS1 in adipose tissue, suggesting a multitissue and pleiotropic underlying mechanism. Our results highlight the importance to prevent type 2 diabetes at the onset of depressive symptoms and the need to maintain a healthy weight in the context of its effect on depression and type 2 diabetes comorbidity.
We tested associations between 13 established genetic variants and type 2 diabetes (T2D) in 1371 study participants from the Volga-Ural region of the Eurasian continent, and evaluated the predictive ability of the model containing polygenic scores for the variants associated with T2D in our dataset, alone and in combination with other risk factors such as age and sex. Using logistic regression analysis, we found associations with T2D for the CCL20 rs6749704 (OR = 1.68, P-FDR = 3.40 x 10(-5)), CCR5 rs333 (OR = 1.99, P-FDR = 0.033), ADIPOQ rs17366743 (OR = 3.17, P-FDR = 2.64 x 10(-4)), TCF7L2 rs114758349 (OR = 1.77, P-FDR = 9.37 x 10(-5)), and CCL2 rs1024611 (OR = 1.38, P-FDR = 0.033) polymorphisms. We showed that the most informative prognostic model included weighted polygenic scores for these five loci, and non-genetic factors such as age and sex (AUC 85.8%, 95%CI 83.7-87.8%). Compared to the model containing only non-genetic parameters, adding the polygenic score for the five T2D-associated loci showed improved net reclassification (NRI = 37.62%, 1.39 x 10(-6)). Inclusion of all 13 tested SNPs to the model with age and sex did not improve the predictive ability compared to the model containing five T2D-associated variants (NRI = -17.86, p = 0.093). The five variants associated with T2D in people from the Volga-Ural region are linked to inflammation (CCR5, CCL2, CCL20) and glucose metabolism regulation (TCF7L, ADIPOQ2). Further studies in independent groups of T2D patients should validate the prognostic value of the model and elucidate the molecular mechanisms of the disease development.
The current epidemics of cardiovascular and metabolic noncommunicable diseases have emerged alongside dramatic modifications in lifestyle and living environments. These correspond to changes in our “modern” postwar societies globally characterized by rural-to-urban migration, modernization of agricultural practices, and transportation, climate change, and aging. Evidence suggests that these changes are related to each other, although the social and biological mechanisms as well as their interactions have yet to be uncovered. LongITools, as one of the 9 projects included in the European Human Exposome Network, will tackle this environmental health equation linking multidimensional environmental exposures to the occurrence of cardiovascular and metabolic noncommunicable diseases.
Epidemic obesity is the most important risk factor for prediabetes and type 2 diabetes (T2D) in youth as it is in adults. Obesity shares pathophysiological mechanisms with T2D and is likely to share part of the genetic background. We aimed to test if weighted genetic risk scores (GRSs) for T2D, fasting glucose (FG) and fasting insulin (FI) predict glycaemic traits and if there is a causal relationship between obesity and impaired glucose metabolism in children and adolescents. Genotyping of 42 SNPs established by genome-wide association studies for T2D, FG and FI was performed in 1660 Italian youths aged between 2 and 19 years. We defined GRS for T2D, FG and FI and tested their effects on glycaemic traits, including FG, FI, indices of insulin resistance/beta cell function and body mass index (BMI). We evaluated causal relationships between obesity and FG/FI using one-sample Mendelian randomization analyses in both directions. GRS-FG was associated with FG (beta = 0.075 mmol/l, SE = 0.011, P = 1.58 × 10 −11) and beta cell function (beta = −0.041, SE = 0.0090 P = 5.13 × 10 −6). GRS-T2D also demonstrated an association with beta cell function (beta = −0.020, SE = 0.021 P = 0.030). We detected a causal effect of increased BMI on levels of FI in Italian youths (beta = 0.31 ln (pmol/l), 95%CI [0.078, 0.54], P = 0.0085), while there was no effect of FG/FI levels on BMI. Our results demonstrate that the glycaemic and T2D risk genetic variants contribute to higher FG and FI levels and decreased beta cell function in children and adolescents. The causal effects of adiposity on increased insulin resistance are detectable from childhood age.
Although physical activity and sedentary behavior are moderately heritable, little is known about the mechanisms that influence these traits. Combining data for up to 703,901 individuals from 51 studies in a multi-ancestry meta-analysis of genome-wide association studies yields 99 loci that associate with self-reported moderate-to-vigorous intensity physical activity during leisure time (MVPA), leisure screen time (LST) and/or sedentary behavior at work. Loci associated with LST are enriched for genes whose expression in skeletal muscle is altered by resistance training. A missense variant in ACTN3 makes the alpha-actinin-3 filaments more flexible, resulting in lower maximal force in isolated type II muscle fibers, and possibly protection from exercise-induced muscle damage. Finally, Mendelian randomization analyses show that beneficial effects of lower LST and higher MVPA on several risk factors and diseases are mediated or confounded by body mass index (BMI). Our results provide insights into physical activity mechanisms and its role in disease prevention.