And they came from Central Asia

By Razib Khan | August 7, 2010 7:24 am

There’s a new paper in AJHG out, Whole-Genome Genetic Diversity in a Sample of Australians with Deep Aboriginal Ancestry, which I’ll hit later. It doesn’t have anything too surprising, but in the supplements they have a figure which shows frappe and Structure plots for the HGDP populations as well as their Australian Aboriginal sample. These methods take an individual’s genome and assign elements to one of K ancestral populations. For African Americans this is highly illuminating, as K = 2 simply breaks down along European/African ancestral lines. The mean turns out to be ~20% for the minority quantum, exactly what had previously been ascertained through genealogy, classical autosomal markers (e.g., Duffy), and the average of uniparental lineages for European ancestry (African Americans tend to be enriched for European Y chromosomal markers, and have less than the expected European mtDNA markers. Again, totally intelligible in light of the history of relations in the old South).

These abstractions extract visually intelligible information out of the hundreds of thousands of concrete variant bases within human populations. They have clear and immediate utility when you have some inkling of the population history of a given sample. But when you attempt the same with populations whose histories are less clear and distinct, or who do not have such an obvious and well known genesis as African Americans, then things get murkier.

Therefore when it comes to higher values of K in many of these papers I just avoid reading too much into the results because the human mind is a pattern recognition machine, and it’s very easy to tell stories which have no way of being validated or falsified. Most of the authors of these papers tend to agree as higher K plots are usually nested in the supplements, not the main paper itself. But with all that caution entered into the record, I thought that K = 8 in the supplemental figure 1 was of some interest, and I want to focus on it just a little bit. I reedited it, removing many populations, and shifting the frappe and Structure plots at K = 8 next to each other. I also added some population labels for clarity, though if you’re familiar with the HGDP data set it’s clear what the abbreviations are.

camefromcentralasiaFirst, it seems that at K = 8 the fact that the non-indigenous ancestry of the Australian Aboriginal sample is Western European is pretty clear even without the known history (Dienekes noted this as well). The only question is distinguishing which Western Eurasian populations the contribution came from, and this is of some interest because of a possible connection between India and Australia. Many South Asians have a vague resemblance to Australian Aboriginals, and many Indian tribal groups are termed “Australoid.” More recently a very distant mtDNA link between Indian tribal groups and Aboriginals has been validated. But that’s totally expected, as all populations to the east of South Asia probably went through that region on the way out of Africa. A coalescence time on the order of 50,000 years ago seems to suggest that that is the connection, not a more recent migration as some have hypothesized, and which could give a phylogenetic causal basis to morphological similarities.

In the frappe plot, to the right, note that the South Asians are enriched for the orange shaded ancestral group. It’s residual in most Europeans, and almost absent in Australian Aboriginals. In the Structure plot, to the left, it’s the blue segment which is enriched in South Asians, and residual in Europeans. Again, it’s nearly absent in Aboriginals. That, combined with the attested presence of a high frequency of European diagnostic markers, such as the blue-eye OCA2 SNPs, should seal the deal in regards to the question of any more recent admixture from the initial settlement of the current indigenous stock with any group but Europeans.

But the reason I’m posting isn’t because of Aboriginal genetics. There are a few coarse clusters of human populations. Roughly, Amerindians, East Asians, Oceanians, West Eurasians + North Africans, and Sub-Saharan Africans. But within these clusters are further differences. Among the Mozabites (an Algerian Berber group with substantial Sub-Saharan African admixture), the Basque, and Sardinians, there seem to be an element which is nearly absent, but which increases in frequency as one goes east toward the heart of Eurasia. I am referring here to the aforementioned segments which I highlighted as the components whose lack suggests that Aboriginals received their non-indigenous ancestry from Europeans.

It makes me think about Li et al.’s argument that skewed population coverage has resulted in the omission of a major Central Eurasian ancestral population cluster between those of the west and east. If there was a major demographic pulse out of the center of Eurasia it would make sense that groups on the western fringe of the World Island, those in the western Mediterranean region, would show the least sign of it. I have no model for what such a pulse would be. Perhaps it wasn’t a pulse, but just isolation-by-distance and clinal variation which pops out in a discrete fashion if one cranks up the K’s. My initial thought is that it was the Indo-European languages, but it’s well represented in the Levant, and the Adygei (ADY) are not Indo-European anyway (though they could be distantly related to Indo-European and so exhibit some of the same genetic variation as the original population). I think there’s a good chance that here I’m confusing the analytical methods, frappe and Structure, for reality. But I thought I’d throw it out there since I’ve noticed this pattern for several years now….

CATEGORIZED UNDER: Genetics, Genomics
MORE ABOUT: Genetics, Genomics

Comments (6)

  1. “More recently a very distant mtDNA link between Indian tribal groups and Aboriginals has been validated.”

    The two mutations identified by Kumar (G8251A and A9156T) as shared between some Indian sequences and Australian M42 is better than the one-mutation (8793) link between Australian M42 and East Asian M10 (Chinese, Japanese, South Siberia) proposed by Hudjashov but in both cases the connection looks too slim to be significant. It’s possible that in both cases we’re dealing with homoplasy. Plus M42 is a minor haplogroup in Australia. The major Australian mtDNA haplogroup S has no parallels in India. As far as Y-DNA goes, C4 is very frequent in Australia, and India does have C5 but it’s again the similarity is very generic as various C lineages are widely dispersed in East Asia, America and Oceania.

    “My initial thought is that it was the Indo-European languages, but it’s well represented in the Levant, and the Adygei (ADY) are not Indo-European anyway (though they could be distantly related to Indo-European and so exhibit some of the same genetic variation as the original population). ”

    Consider also Sino-Caucasian (or Dene-Caucasian) connecting small families (North Caucasian, to which Adygey belongs) and isolates (Basque and Burushaski) in Eurasia with Sino-Tibetan. I’m not very fond of macrofamilies, and this one in particular has been called into question multiple times (Johanna Nichols also considers Nakh-Dagestanian languages as a separate family from Abkhazo-Adyghean), but at least the idea is out there and, like IE, it connects languages across Eurasia just like the genetic cline you’ve observed.

  2. Ponto

    I think it is a great pity that samples were not taken from unmixed Aborigines instead of the more common mixed Aborigine. It is hardly a surprise to any Australian that the mixed Aborigines have European ancestry and that it is NW European in origin. I understand there are issues with using samples from native populations and well as obtaining consent which would be difficult to do these times with the large number of authorities and laws existing to protect Aborigines from exploitation and loss of their biological heritage. I still think it would have been better to get unmixed Aborigines for the study.

  3. Sandgroper

    #3 – Yes, and it’s a bit odd that they only sampled in the Riverina. If they really want to settle the question completely beyond doubt, they would have done better to sample Aboriginal people in remote settlements in the north, particularly in the far north of Queensland, some of the non-Pama-Nyungan language groups in Arnhem Land, and also the Noongar people in the south west (although they will not find any unmixed people deriving from there at this point), where there is reliably dated evidence of human occupation c. 48,000 years ago. Sampling a few unmixed blonde kids from the western desert and some of the red heads in Arnhem Land wouldn’t hurt either. Having said that, it is a highly politicised issue, so it’s easier said than done. They obviously know what I just said.

    I’m pretty happy that the question has been more or less settled anyway, I think we’re beholden to the people of the Riverina for being willing to participate and live with the outcome, which won’t have disappointed them, and I’m a bit surprised that there does not seem to be more discussion about that yet, but there are plenty who will no doubt continue to argue otherwise based on morphological and language grounds.

    Sorry, that’s opining, but I’m just tickled pink to see these data. Nice one.

  4. 1) i am to understand that they tried to get unmixed samples, but the ethical agreements fell through

    2) one man in the riverina sample is unmixed


Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

Gene Expression

This blog is about evolution, genetics, genomics and their interstices. Please beware that comments are aggressively moderated. Uncivil or churlish comments will likely get you banned immediately, so make any contribution count!

About Razib Khan

I have degrees in biology and biochemistry, a passion for genetics, history, and philosophy, and shrimp is my favorite food. In relation to nationality I'm a American Northwesterner, in politics I'm a reactionary, and as for religion I have none (I'm an atheist). If you want to know more, see the links at


See More


RSS Razib’s Pinboard

Edifying books

Collapse bottom bar