Novel applications of SNP array data in the analysis of the genetic structure of Europeans and in genetic association studies
Date
2012-09-25
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Inimese iga fenotüüpiline tunnus kujuneb elukestvalt genotüübi ning keskkonna vahelise koosmõju tagajärjel. Inimesegeneetika üks peamisi eesmärke on mõista komplekshaiguste geneetilisi põhjuseid ning läbi selle muuta haiguste diagnoosimist täpsemaks ja ravi tõhusamaks. Inimese genoomi täisjärjestuse avaldamine on viinud genotüpiseerimistehnoloogiate kiirele arengule ning teinud võimalikuks tuvastada sadades DNA proovides samaaegselt miljoneid järjestusvariatsioone. Laiapõhjaliste genoomiuuringute tulemusena on rohkem kui 3000 DNA järjestusvariatsiooni seostatud enam kui 600 erineva komplekstunnusega. Kuna üksikud järjestusvariatsioonid kirjeldavad enamasti ära vähem kui 1% tunnuse pärilikust komponendist on ülegenoomsetes assotsiatsioonuuringutes vaja kombineerida paljude kohortide andmestikke, et oleks võimalik formuleerida statistiliselt usaldusväärseid järeldusi. Üheks selliseks populatsioonipõhiseks kohortidiks on Tartu Ülikooli Eesti Geenivaramu biobank, mis sisaldab informatsiooni 51 000 eestlase elustiili ja põetud haiguste kohta ning samas talletab ka geenidoonorite bioloogilist materjali. Käesolevas töös uuriti Kirde-Euroopa populatsioonide, sh eestlaste, paiknemist Euroopa alleelisageduste geneetilise struktuuri kaardil. Selgus, et soomlased distantseeruvad nii rootslastest kui ka teistest Loode-Euroopa populatsioonidest, samas kui eestlased paiknevad lähestikku lätlaste, leedulaste ning loode-venelastega. Antud tulemus on võimaldanud kaasata Geenivaramu andmestikku rohkem kui kaheksakümnesse rahvusvahelisse geneetilisse assotsiatsioonuuringusse. Käesolevas töös on kirjeldatud kahte uut komplekstunnusseoselist DNA järjestusvarianti. Tuginedes antud uuringutele võib väita, et ülegenoomsed genotüpiseerimiskiibi andmed võimaldavad tuvastada valimi geneetilist struktureeritust ja leida haigusriski suurendavaid DNA järjestusvariatsioone.
Human complex traits arise from the new mutations as well as from the interplay between the existing genetic variants and exposure to environmental conditions. The ultimate goals of human genetics are to understand the genetic architecture of complex traits and transfer the genetic findings into medical field in order to improve disease diagnosis and treatment. The draft sequence of the human genome has lead to development of high-throughput genotyping techniques that enable cost-effective detection of millions of DNA sequence variants in large numbers of samples. These advances have made it possible to increase the number of validated complex human trait-associated loci to 3,000 independent genetic variants related to more than 600 distinct traits and diseases. A single sequence variant usually explains less than 1% of the phenotypic variance or genetic predispositions and hundreds of thousands of samples must be pooled in GWAS meta-analyses to detect statistically sound associations. The Estonian Biobank, which includes a wide range of health information, biological samples, and high-resolution genomics data for more than 50,000 samples from the Estonian population, is a valuable resource for human genetic research. The aim of the current dissertation was to fill in the gaps in the knowledge of the genetic structure of northeastern European popu¬lations using the whole-genome SNP allele frequency data. The results revealed that Finns positioned distantly from Swedes and the other northeastern Europeans, while Estonians clustered next to their geographical neighbors (Latvians, Lithuanians and northwestern Russians), and demonstrated, that the Estonian Biobank samples could be analyzed together with the other cohorts of European ancestry in large-scale gene discovery studies. Two novel associations between a complex trait and a DNA sequence variant were established. In summary, the results of this study demonstrated the importance of the whole-genome genotyping array data in detecting the genetic structure and seeking the DNA sequence variants that underlie the dynamic complexity of human phenotypes.
Human complex traits arise from the new mutations as well as from the interplay between the existing genetic variants and exposure to environmental conditions. The ultimate goals of human genetics are to understand the genetic architecture of complex traits and transfer the genetic findings into medical field in order to improve disease diagnosis and treatment. The draft sequence of the human genome has lead to development of high-throughput genotyping techniques that enable cost-effective detection of millions of DNA sequence variants in large numbers of samples. These advances have made it possible to increase the number of validated complex human trait-associated loci to 3,000 independent genetic variants related to more than 600 distinct traits and diseases. A single sequence variant usually explains less than 1% of the phenotypic variance or genetic predispositions and hundreds of thousands of samples must be pooled in GWAS meta-analyses to detect statistically sound associations. The Estonian Biobank, which includes a wide range of health information, biological samples, and high-resolution genomics data for more than 50,000 samples from the Estonian population, is a valuable resource for human genetic research. The aim of the current dissertation was to fill in the gaps in the knowledge of the genetic structure of northeastern European popu¬lations using the whole-genome SNP allele frequency data. The results revealed that Finns positioned distantly from Swedes and the other northeastern Europeans, while Estonians clustered next to their geographical neighbors (Latvians, Lithuanians and northwestern Russians), and demonstrated, that the Estonian Biobank samples could be analyzed together with the other cohorts of European ancestry in large-scale gene discovery studies. Two novel associations between a complex trait and a DNA sequence variant were established. In summary, the results of this study demonstrated the importance of the whole-genome genotyping array data in detecting the genetic structure and seeking the DNA sequence variants that underlie the dynamic complexity of human phenotypes.
Description
Väitekirja elektrooniline versioon ei sisalda publikatsioone.
Keywords
populatsioonigeneetika, genoomid, DNA analüüs, genotüüp, Euroopa, population genetics, genomes, DNA analysis, genotype