Genetic differences within European populations

By Razib Khan | September 7, 2010 3:15 am

genmap3One of the more popular posts on this weblog (going by StumbleUpon and search engine referrers) focuses on genetic variation in Europe as a function of geography. In some ways the results are common sense; populations closer to each other are more genetically related. Why not? Historically people have married their neighbors and so gene flow is often well modeled as isolation by distance. The scientific rationale for these studies is to smoke out population stratification in medical genetics research programs which attempt to find associations between genes and particular diseases. By population stratification I mean the fact that different populations will naturally have different gene frequencies, and if those populations exhibit different frequencies of the disease/trait under investigation then one may have to deal with spurious correlations. If, for example, your study population includes many people of African and European descent, presumably cautious researchers would immediately by aware of this problem and attempt to take it into account. But what about populations which are genetically closer, or whose genetic difference may not be so well manifest in physical characteristics which might clue you in to the issue of stratification?

ResearchBlogging.orgThat’s why the sorts of results which might seem common sense in the aggregate are useful. One can ask questions as to the genetic closeness of Irish and English, or Irish and Spanish, in a rigorous sense. In the United States research programs which are constrained to white cases and controls may hide population stratification because of the ethnic diversity of the American population. A primary motivation for studies of Jewish genetics are the cluster of “Jewish diseases” which are common within that population. In our age it is fashionable to focus on what binds us together as a species, but genetic differences matter a great deal. Ask the parents of multiracial children who require bone marrow transplants.

A new paper in Human Heredity examines a large sample of five European populations, and goes over the between population allele frequency differences with a fine tooth comb. Genetic Differences between Five European Populations:

We sought to examine the magnitude of the differences in SNP allele frequencies between five European populations (Scotland, Ireland, Sweden, Bulgaria and Portugal) and to identify the loci with the greatest differences…We found 40,593 SNPs which are genome-wide significantly…The largest differences clustered in gene ontology categories for immunity and pigmentation. Some of the top loci span genes that have already been reported as highly stratified: genes for hair color and pigmentation (HERC2, EXOC2, IRF4), the LCT gene, genes involved in NAD metabolism, and in immunity (HLA and the Toll-like receptor genes TLR10, TLR1, TLR6). However, several genes have not previously been reported as stratified within European populations, indicating that they might also have provided selective advantages: several zinc finger genes, two genes involved in glutathione synthesis or function, and most intriguingly, FOXP2, implicated in speech development. Conclusion: Our analysis demonstrates that many SNPs show genome-wide significant differences within European populations and the magnitude of the differences correlate with the geographical distance. At least some of these differences are due to the selective advantage of polymorphisms within these loci

They looked at ~350,000 SNPs across the five populations. The sample sizes were pretty large: 1,129 individuals from Bulgaria, 1,142 from Ireland, 656 from Scotland, 620 from Sweden, and 563 from Portugal. In the supplements they had a figure where they displayed the genetic variation on the two largest principal components for their sample and color-coded by region of origin. Next to this they transposed the PCA onto a map of Europe.

euro51

This confirms previous findings that the largest component of variation in Europe is north-south (at least evaluating to the west of a particular geographical cutoff), with a secondary east-west dimension. But the focus of the paper wasn’t really phylogenetic relationships between the populations as such, but the patterns of genetic differences across them. Table 1 shows the population to population differences in SNPs. Rescaled here means that the results were rescaled for sample size, which differed between populations, along with the value after a Bonferroni correction.

euro52

The pairwise differences are what you’d expect from the PCA. Most of the between population difference is probably due to history; populations random walk into their own gene frequencies through isolation by distance. But there’s more to the story than that, as is clear in table 2.

euro53

As noted by the authors genes in specific categories or classes are overrepresented among those with large between population differences. In particular, they focus on genes related to immune function and pigmentation. The reason for variation on the former is relatively straightforward, research on patterns of natural selection in the human genome have long pinpointed loci implicated in immune function as having been particularly shaped by this evolutionary genetic parameter, no doubt because disease resistance has a major impact on reproductive fitness. Additionally, it seems likely that immune related function is constantly being buffeted by selection because of the prominence of frequency dependent dynamics. As for pigmentation, it has also shown up as a major target of natural selection in many of the more recent papers, and it’s a trait whose genetic architecture we have a reasonably good grasp of now.  They also found that the NAD synthetase 1 gene was stratified. They note that this impacts metabolism and has been found to have a relationship to the disease pellagra. Loci related to diet also seem to be disproportionately affected by natural selection, and that stands to reason as the shift to agriculture was relatively recent and many populations may still be going through transients (e.g., gluten sensitivity). The densities and diets of European populations even today vary a great deal. Italy is about an order of magnitude more dense in population than Sweden, and this has likely been the case for many millennia due to differences in primary agricultural productivity. Finally, the authors observe that FOXP2 is also stratified. This is the famous “language gene,” which regularly makes press every few years. The short of it is that FOXP2 seems to be involved in complex vocalization, and been subject to selection in tetrapod lineages where vocal ability is pronounced (birds, humans, etc.). They don’t make much of the variation in the paper, but it seemed warranted to note that the gene had popped up in their tests.

The authors freely admit that their findings are provisional:

Our paper focuses on the top 11 loci and suggests plausible mechanisms for most of them. However, the total number of genome-wide significant SNPs is 150,000 and the top hits clustered in several GO categories. We cannot judge which ones are due to the effects of selection or to other mechanisms. We present a full list of genes with the best and median p values for SNPs within them (separately for the full sample and for controls only), so that others can make use of this information in future studies…

Citation: Moskvina V, Smith M, Ivanov D, Blackwood D, Stclair D, Hultman C, Toncheva D, Gill M, Corvin A, O’Dushlaine C, Morris DW, Wray NR, Sullivan P, Pato C, Pato MT, Sklar P, Purcell S, Holmans P, O’Donovan MC, Owen MJ, & Kirov G (2010). Genetic Differences between Five European Populations. Human heredity, 70 (2), 141-149 PMID: 20616560

CATEGORIZED UNDER: Anthroplogy, Genetics, Genomics
  • Longma

    This may sound like a stupid question, but if there is a strong North- South axis and a weaker East – West axis in Europe…my guess is that Northern Europeans are really a subset of Southern Europeans who repopulated areas in Northern Europe after the last ice age and some folks, maybe in the East may have absorbed more outlier populations (like Finnic tribes) than in the West, which was pretty empty…?

    In any case, my question, is there a group in Europe that can be seen as “genetically average” for a European? I”m guessing many Eastern Germans/Czechs, maybe Austrians?

  • http://blogs.discovermagazine.com/gnxp Razib Khan

    This may sound like a stupid question, but if there is a strong North- South axis and a weaker East – West axis in Europe…my guess is that Northern Europeans are really a subset of Southern Europeans

    yes. this is a pretty robust finding from what i can tell. nothern euros have less genetic variation than southern euros. the only issue i might quibble which is that post-ice age movements may have scrambled things a fair amount. today the finnic peoples are seen as outliers, and their influence on swedes and russians (the latter of whom are probably slavicized finns to a great extent in the northern areas) is seen as a perturbation from the european norm. some exogenous impact. but i think there is a modest probability that the finnic peoples are indigenous to europe and represent a “purer” paleolithic element than other peoples.

  • http://blogs.discovermagazine.com/gnxp Razib Khan

    In any case, my question, is there a group in Europe that can be seen as “genetically average” for a European? I”m guessing many Eastern Germans/Czechs, maybe Austrians?

    the answer may differ based on whether you weight by population or not.

  • Antonio

    Silly side question: I don’t get very well why the Finish are outliers: in the PCA maps I have seen here they don’t look really outliers to me, at least no more than Portuguese or Bulgarians or Russians.

  • Bolek

    swedes and russians (the latter of whom are probably slavicized finns to a great extent in the northern areas)

    Razib, why do you say so? I think that Corded Ware Proto-Slavic R1a1a was there earlier than Ugro-Finnic N1c which arrived from Siberia in Early Middle Ages probably. Isn’t it a similar story to what happened in Hungary? On PCA plots Estonians are very close to Russians who are close to Poles who don’t have any N1c. What is the explanation of it? I think it shows that Estonians are partially Uralicised Slavs.
    See Nelis at al. 2009:
    http://www.plosone.org/article/slideshow.action?uri=info:doi/10.1371/journal.pone.0005472&imageURI=info:doi/10.1371/journal.pone.0005472.g002

    [please don’t use markdown!]

  • http://blogs.discovermagazine.com/gnxp Razib Khan

    bolek

    1) haven’t seen any data on hungarians which implies it wasn’t elite emulation (i.e., not much impact on genes)

    2) you could be right about balts

    3) i was talking more of the russians northeast of the line between novgorod and moscow

    http://blogs.discovermagazine.com/gnxp/2008/01/from-where-came-the-slavs/

    my confidence in the assertion is modest btw. i don’t have a horse in the game, so i haven’t looked that closely (since you’re polish i assume you have more interest in the topic?).

  • onur

    I don’t get very well why the Finish are outliers: in the PCA maps I have seen here they don’t look really outliers to me, at least no more than Portuguese or Bulgarians or Russians.

    Some of the PCA maps involving Finns that Razib posted are consistent with your observation, but some are apparently not, and surely not this one:

    http://blogs.discovermagazine.com/gnxp/2009/05/genetic-structure-of-eastern-european-populations/

  • Colugo

    I thought that the Sami and Basques were the truly indigenous European groups, and they’re related to Berbers. And the Finns have some Sami admixture.

  • http://blogs.discovermagazine.com/gnxp Razib Khan

    I thought that the Sami and Basques were the truly indigenous European groups, and they’re related to Berbers. And the Finns have some Sami admixture.

    i think you’re going off some older mtDNA stuff. not necessarily refuted, but the newer stuff seems to reduce those phylogenetic connections in their importance (in particular, i believe haplogroup U5 may not be hunter-gatherer even….).

  • Pingback: Tweets that mention Genetic differences within European populations | Gene Expression | Discover Magazine -- Topsy.com()

  • Pingback: Linkage is Good for You: Diversity Edition (NSFW)()

NEW ON DISCOVER
OPEN
CITIZEN SCIENCE
ADVERTISEMENT

Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

Gene Expression

This blog is about evolution, genetics, genomics and their interstices. Please beware that comments are aggressively moderated. Uncivil or churlish comments will likely get you banned immediately, so make any contribution count!

About Razib Khan

I have degrees in biology and biochemistry, a passion for genetics, history, and philosophy, and shrimp is my favorite food. In relation to nationality I'm a American Northwesterner, in politics I'm a reactionary, and as for religion I have none (I'm an atheist). If you want to know more, see the links at http://www.razib.com

ADVERTISEMENT

See More

ADVERTISEMENT

RSS Razib’s Pinboard

Edifying books

Collapse bottom bar
+

Login to your Account

X
E-mail address:
Password:
Remember me
Forgot your password?
No problem. Click here to have it e-mailed to you.

Not Registered Yet?

Register now for FREE. Registration only takes a few minutes to complete. Register now »