Eurogenes 500K SNP BioGeographicAncestry Project

By Razib Khan | November 23, 2010 12:11 am

Since I have been promoting the Dodecad Ancestry Project, it seems only fair to bring to your attention Eurogenes 500K SNP BioGeographicAncestry Project. The sample populations are a bit different from Dodecad, but again ADMIXTURE is the primary tool. But the author also makes recourse to other methodologies to explore more than simply population level variation. For example, his most recent post is Locating and visualizing minority non-European admixtures across our genomes:

Imagine, for example, a white American carrying a couple of tiny segments of West African origin, from an ancestor who lived 250 years ago, and an eastern Finn with no Asian ancestors in the last 4000 years or more. If we run an inter-continental ADMIXTURE analysis with these two, it’s very likely the American will score 100% European, while the eastern Finn will probably come out around 9% North and East Asian due to really old Uralic influence.

That sort of thing isn’t a huge problem when comparing the genetic structure of populations. Obviously, overall, eastern Finns rather than white Americans are genetically closer to North Asians, and that’s basically what ADMIXTURE picks up. However, if the focus is also on individuals, this certainly can become an issue. Our hypothetical American might be aware of that African ancestor, with solid paperwork backing up their genealogical connection, but he’s pulling his hair out because nothing’s showing up via genetic tests.

So let’s take a look at a real life example of how RHHcounter can pick up segments of potentially recent Sub-Saharan African origin…

Olivia Munn & Uyghur woman

The basic issue here is that in terms of genomic variation old admixture looks different from new admixture. Someone who is a first generation Eurasian, with a Chinese and European parent, may be about the same ancestral mix proportionally as a Uyghur. They would resemble a Uyghur on STRUCTURE and be placed within that cluster on a PCA chart (this is what happens in 23andMe). But, the Uyghur “Eastern” and “Western” genetic heritage has been reshuffled to a great extent by recombination over the past 1,000-2,000 years. In contrast, a first generation Eurasian will have huge swaths of their genome which are Eastern or Western on alternating strands (from their respective parents). In population genetic language a group of first generations hybrids would be exhibit a lot of linkage disequilibrium (LD). In a panmictic hybrid population LD will decay due to recombination, which breaks apart the distinctive allelic associations inherited from the parental populations.

This is the key to differentiating between the old “Asian” ancestry which sometimes falls out of the genetic variation of Finns at low frequencies, and more recent Asian ancestry. For example, the paleoanthropologist Vance Haynes is apparently a great-grandson of one of the original “Siamese Twins,” Chang Bunker. Chang Bunker was a Chinese Thai, so presumably Vance Haynes would come out to be ~10% Asian, and would be shifted toward the Asian cluster in relation to other Europeans. On the other hand, a closer look at his genome would indicate differences from a Turk who was ~10% Asian, because Vance Haynes’ Asian ancestry has only had three generations for recombination to break apart the original allelic associations which were passed down from Chang Bunker. After only these few generations the genome would still show many segments of clustered ancestry with distinctive sets of markers characteristic of Han Chinese.

Let’s make this more concrete. Below are two “ancestry paintings” from 23andMe. One is of a reference example, a Uyghur woman, and another is of a Eurasian individual. The difference is pretty obvious:


23andmeclusFor the record, 23andMe says that the Eurasian man is 50% Asian, 50% European. For the Uyghur woman, 52% European, 48% Asian. As I indicated above, Eurasian individuals who are projected onto the variation of the HGDP sample tend to cluster with the Uyghurs. In the image to the left the black mark indicates the Eurasian man. The Uyghurs are green. The purple rectangles are Hazaras.

But obviously this is a trivial example. What’s the point of sniffing around for non-European ancestry in individuals whose non-European ancestry is 1) visible, and 2) recent and immediate. No, a bigger question here are claims and suggestions by some white Americans that they have significant non-European ancestry. Usually this is Native American. But in the case of one of the European-origin samples which “Polako” (the principal behind the BGA Project) analyzed it seems there is a suggestion of West African ancestry.

dandonThis individual is Dr. Don Conrad of Genomes Unzipped. In particular, Polako found that there were two nearby segments on two chromosomes which exhibited a pattern of population atypical heterozygosity in Dr. Don Conrad’s genome. Look at chromosomes 7 and 13. Contrast the pattern with my distant paternal cousin, Dr. Daniel MacArthur. He also exhibits points of heterozygosity, but they’re randomly distributed across the genome. It’s old admixture or just noise.

Polako doesn’t make much of Dr. Don Conrad’s results, and neither do I (presumably as Dr. Don Conrad is a member of Genomes Unzipped it’s easy to talk about his results without any of the ethical or moral hassles about confidentiality). On the other hand, unlike Dr. Dan MacArthur, a little utilization of the powers of the interwebs indicates that Dr. Don Conrad is an American. In particular, of recent Midwestern background. Though I’m not a total creep, so I didn’t start poking around But after the Pickrell affair I am probably just a touch more hesitant to laugh off peculiar results from these sorts of analyses as simply algorithms-gone-meshugana.

Image Credit: Colegota

CATEGORIZED UNDER: Genetics, Genomics

Comments (5)

  1. pconroy

    This is totally fascinating – I eagerly await my own results!

    I see also a somewhat related post:

    My parents IE6 (Dad) and IE7 (Mom) are featured as part of the Irish samples in a Northern Eurasian analysis, at K=8:
    Dad = 1% Selkup (North Eurasian)
    Mom = 1% Chukcki/Koryak (Bering SEA)

    All I can say is WOW!

  2. Well, at least now we know where DC got his impeccable fashion sense.

  3. Perhaps the Uyghur people look different then the admixed Americans for another reason. The Uyghur western Eurasian component is mostly from Indo-Iranian and /or Eastern European populations groups that have somewhat different physical (and genetic) features then the typical western Europeans of the United States. Also much of the Uyghur’s eastern Eurasian component might come from the Mongols – which are generally not found in the united states and genetically speaking cluster somewhat away from the agriculturalist Coastal Eastern Asians.


Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

Gene Expression

This blog is about evolution, genetics, genomics and their interstices. Please beware that comments are aggressively moderated. Uncivil or churlish comments will likely get you banned immediately, so make any contribution count!

About Razib Khan

I have degrees in biology and biochemistry, a passion for genetics, history, and philosophy, and shrimp is my favorite food. In relation to nationality I'm a American Northwesterner, in politics I'm a reactionary, and as for religion I have none (I'm an atheist). If you want to know more, see the links at


See More


RSS Razib’s Pinboard

Edifying books

Collapse bottom bar