Some have asked what the point is in poking around African population structure when Tishkoff et al. and Henn et al. have done such a good job in terms of coverage. First, it is nice to run your own analyses so you can slice & dice to your preference, and not rely on the constrained menu provided by others. There’s value in home cooking; you can flavor to your taste. Second, you never know what data people might leave on your doorstep. I’ve received the genotypes of three Somalis. Nothing too surprising, a touch more Cushitic than the Ethiopians in Behar et al., but interesting nonetheless.
Also, you can see how ADMIXTURE tends to come to weird conclusions in certain circumstances. Below is a K = 12 run ~50,000 SNPs. I’ve included in a few Behar et al. and HGDP populations to the Henn et al. set, as well as pruned a lot of the African groups which seem redundant in terms of information. I’ve added a few geographically informative labels as well.
Observe below that there is a Fulani cluster. I think this is pretty much an artifact. At K = 7 the Fulani have a majority component which is modal in West Africa & Bantu speakers, and a minority component which is identical to the one modal in Mozabite Berbers from Algeria. The Mozabites reside in the far northern Sahara, and their modal component drops off as one goes east toward western Asia and the eastern Mediterranean. I suspect that what is showing up in ADMIXTURE is the ancient hybridization of the Fulani, and perhaps their demographic expansion from this core group. We have some glimmers of the prehistory of the Fulani, and no expectation for them to be such a distinctive cluster, so I naturally jump to these inferences. But it does make me reconsider the nature of the “Sandawe,” “Mbuti” or “San” clusters in ADMIXTURE. These populations are culturally distinctive in deep ways from their neighbors, so a reflexive inference one might make is that they’re “pure” ancient substrate groups which have been overlain and marginalized by their Bantu neighbors. But their prehistory is far murkier than the Fulani because of their geographical isolation, so there is far less to go on. These “ancient” isolated groups themselves may have gone through the same sort of distinctive recent ethnogenesis processes which we presume occurred with the Fulani (also, in the plot below the Biaka are pure; but in most of the bar plots they have a minor element which they share with their neighbors, probably due to greater admixture and interaction between western Pygmies and their Bantu neighbors than among the easter ones).
Image Credit: Mark Dingemanse
I recall years ago someone on the blog of Jonathan Edelstein, a soc.history.what-if alum as well, mentioning offhand that archaeologists had “debunked” the idea of the Bantu demographic expansion. Because, unfortunately, much of archaeology consists of ideologically contingent fashion it was certainly plausible to me that archaeologists had “debunked” the expansion of the Bantu peoples. But how to explain the clear linguistic uniformity of the Bantu dialects, from Xhosa of South Africa, up through Angola and Kenya, to Cameroon? One extreme model could be a sort of rapid cultural diffusion, perhaps mediated by a trivial demographic impact. The spread of English exhibits this hybrid dynamic. In some areas (e.g., Australia) there was a substantial, even dominant, English demographic migration coincident with the rise of Anglo culture. In other areas, such as Jamaica, by and large the crystallization of an Anglophone culture arose atop a different demographic substrate, which synthesized with the Anglo institutions (e.g., English language and Protestant religion). The United States could arguably be held up as a in-between case, with an English founding core population, around which there was an accretion of a non-Anglo-Saxon stream of immigrants who serial adopted the Anglo culture, more or less. Sometimes this co-option of Anglo-Saxon norms may surprise. “Black English” (i.e., Ebonics) actually seems to be a genetic descendant of lower class northern English dialects. Other distinctive components of black American (e.g., “jumping the broom“) culture can also plausibly be derived back to the British Isles.
So cultural change is in the “its complicated” segment of dynamics. We have to go on a case-by-case basis. For the Bantu expansion though we have a good answer now thanks to genetics: this cultural change almost certainly was accompanied by a massive demographic migration. Thanks to Brenna Henn and company you can even run some analyses on your desktop to confirm the reality of this model. I pulled down the 55,000 SNPs from various African populations, merged with Palestinians, Tuscans, and Maya as outgroups, and pruned down to ~40,000 after removing those which were missing in more than 1% of the cases. The Hadza are also gone, as they’re such a small isolated group who always hogged up K’s all by themselves. I ran a bunch of different ADMIXTURES, from K = 2 to 12. You can see all 12 here, but let’s just focus on the 12th.
Below is a bar plot, somewhat sorted by ADMIXTURE elements. I’ve reedited some of the labels for clarity, adding regions. I’m sure some of you are ignorant of where the Brong people (Ghana) are from as I was before I looked them up. Also, please be careful about ADMIXTURE. There is a “Fulani” ancestral component below, but I’m 90% sure that’s just an artifact of recent Fulani demograhics + their unique genetic admixture.
Last weekend I mentioned a paper, The Genetic Structure and History of Africans and African Americans, which had the best coverage of disparate African populations we’ve seen so far. The map to the left shows the various ancestral population clusters inferred from the samples they had. Really the only failing is that they didn’t have samples from Angola, Zambia, Zimbabwe and Mozambique. Unfortunately, that’s not totally trivial. These are regions which were effected by the Bantu Expansion, with southern Angola in particular still having remnants of Khoisan language speakers which likely attest to the pre-Bantu populations. Luckily for us innovation and scientific ingenuity are such that minor questions can quickly be answered because of how cheap the basic methods have become. A new paper in The European Journal of Human Genetics tackles Mozambique in particular, and discerns a heretofore unknown possible population cluster. A genomic analysis identifies a novel component in the genetic structure of sub-Saharan African populations: