DISCOVER Magazine. Science, Technology and The Future
Current Issue
Subscribe Today »
  • Renew
  • Give a Gift
  • Archives
  • Customer Service
  • Facebook
  • Twitter
  • Newsletter
  • Health & Medicine
  • Mind & Brain
  • Technology
  • Space
  • Human Origins
  • Living World
  • Environment
  • Physics & Math
  • Video
  • Photos
  • Podcast
  • RSS
Gene Expression
« Open thread – August 22, 2010
A thousand little adaptive platoons »

Genetic variation within Africa (and the world)

Last year a paper came out in Science which made a rather large splash, The Genetic Structure and History of Africans and African Americans by Tishkoff et al. Since it’s more than a year old I recommend that those of you curious about the details of the paper and don’t have academic access go through the free registration, as you can then read it in full. Unlike Reich et al. the Science paper didn’t unveil a new method of analysis. It was the standard bread & butter, with PCA’s & STRUCTURE plots & phylogenetic trees. But the coverage of populations within Africa was massive. They had a lot of results and relationships to cover, and ended up with a 100 page supplement.

I commend the whole paper to you. But there are two elements I want to highlight. First, a three dimensional PCA plot. It has the first, second and third principal components of variation. In other words, the three largest independent dimensions in terms of explanatory power of genetic variation. Panel A includes all world populations, and panel B just Africans.


3DPCA

For panel A, PC1 = 20% of the variance, PC2 = 5%, and PC3 = 3.5%. For panel B the PCs didn’t drop off quite so much, PC1 = 11%, PC2 = 6%, PC3 = 5% and PC4 = 4%. In case you don’t know, the Hazda are Africa’s last obligate hunter-gatherers, and speak a language with clicks in it, just as the Bushmen do. The big division highlighted in this paper is that between the “indigenous” relict populations, the Hazda, Sandawe, Bushmen and Pygmies, and those who belong to the more widespread agriculturalist and pastoralist societies of Africa. Implicit within the paper is the model of a Bantu Expansion of farmers, as well as a possible later Nilotic expansion (which brought the Tutsi and Masaai) of herders, in a north-south direction. In the process they assimilated/and or/displaced the indigenous populations, of whom the aforementioned peoples are relict islands persisting in ecologically isolated or unfavorable domains.

324_1035_F5The map to the left shows the population coverage within this paper of African groups. The pie graphs simply show ancestral quanta as inferred by STRUCTURE. You can read the paper for the blow-by-blow. But ultimately it seems there will be need for a finer-grained coverage to the south of the equator. If the Bantu expansion is as recent as archaeologists and linguists assume, on the order of ~2,000 years ago, then the gradients of genetic signals should persist. From what I can tell it is assumed on both genetic and phenotypic grounds that the Xhosa have a higher load of Khoisan ancestry than the Zulu or Tswana. The Bantu Expansion is recent enough that the semi-legendary Phoenician circumnavigation of Africa would have encountered many Khoisan peoples along the eastern coast.

Below are a selection of figures from the above paper. After selecting an image it is probably best to hit F11 for “Full Screen” if you aren’t a on a very big monitor (you can copy image location and view it in a separate window as well).

[Show as slideshow]
[View with PicLens]
Genetic Variation by Distance Genetic Variation by Distance
Unrooted Tree Africa Unrooted Tree Africa
Unrooted Tree Unrooted Tree
Genetic Distance Tree 1 Genetic Distance Tree 1
Genetic Distance Tree 2 Genetic Distance Tree 2
Genetic Distance Tree 3 Genetic Distance Tree 3
Share

August 22nd, 2010 Tags: African Genetics, African Genomics, Genetics, Variation
by Razib Khan in Genetics, Genomics | 5 comments | RSS feed | Trackback >

5 Responses to “Genetic variation within Africa (and the world)”

  1. 1.   Tweets that mention Genetic variation within Africa (and the world) | Gene Expression | Discover Magazine -- Topsy.com Says:
    August 22nd, 2010 at 2:01 pm

    [...] This post was mentioned on Twitter by razib khan, World Amazing Things. World Amazing Things said: Genetic variation within Africa (and the world) | Gene Expression: Last year a paper came out in Science which mad… http://bit.ly/9aKIiC [...]

  2. 2.   Mike the Mad Biologist Says:
    August 22nd, 2010 at 5:49 pm

    I don’t mean to bring up a tangential point to the post, but why does the field of human genetics use PCA to visualize relationships? When I see plots like those shown here that have a ‘geometric pattern’ to them (the sharp right angles; another common pattern is a Y-shape), that tells me that there are lots of samples with zeros for many of the Y-variables (i.e., alleles that are unique to certain populations). Thus, the spatial arrangement of the points is largely an artifact of an inappropriate method: how does one calculate a correlation matrix when many of things one is correlating have values of zero?

    If one really was keen on using PCA, one could calculate a pairwise distance matrix and then use that instead of the correlation matrix (Principal Coordinates Analysis).

    Just curious (Really. I’m not implying that TEH DARWINISMZ ARE FALSIFIED! or anything ludicrous like that).

  3. 3.   German Dziebel Says:
    August 23rd, 2010 at 6:19 am

    “The big division highlighted in this paper is that between the “indigenous” relict populations, the Hazda, Sandawe, Bushmen and Pygmies .” There’s no doubt that Hadza, Sandawe and San are distinct from other African populations. The authors do include Pygmies into the mix, but this inclusion looks forced and derived from a long-standing belief that Pygmies, because they are short, must have lost their original languages. At some point the authors reveal the following: “Both language and geography explained a significant proportion of the genetic variance, but differences exist between and within the language families (table S5 and fig. S33, A to C) (4). For example, among the Niger-Kordofanian speakers, with or without the Pygmies, more of the genetic variation is explained by linguistic variation (r2 = 0.16 versus 0.11, respectively; P < 0.0001 for both) than by geographic variation (r2 = 0.02 for both; P < 0.0001 for both).” It looks like Pygmy genetic and linguistic affiliations are in sync, which means that either the Niger-Congo family is very old (and the Bantu expansion took place much earlier than we think), or, more likely, that Pygmies have always spoken Niger-Congo (and Bantu) languages and are not a relic population but a foraging “arm” of the Niger-Congo expansion. In mtDNA and Y-DNA terms, Pygmies and Bantu belong to different closely related subclades of the same clades, which is in perfect alignment with their linguistic kinship. In Razib’s “Genetic Distance Tree 1″ Pygmies and Bantu again cluster next to each other, with San populations being an outgroup for both.

    Tishkoff et al. observe close correspondence between genes and languages in Africa, and this is one good case in which languages and genes tell the same story. This story is different, however, from the common belief that Pygmies are relic African populations with roots Upper and Middle Pleistocene.

    Another quote of note: “Thus, modern humans have existed continuously in Africa longer than
    in any other geographic region and have maintained relatively large effective population sizes,
    resulting in high levels of within-population genetic diversity (1, 2). Africa contains more than
    2000 distinct ethnolinguistic groups representing nearly one-third of the world’s languages (3).
    Except for a few isolates that show no clear relationship with other languages, these languages
    have been classified into four major macrofamilies…” The fact languages and genes in Africa seem to be closely aligned with each other and that African linguistic diversity falls into a limited number of families again suggests that African genetic diversity may not be as old as it’s usually assumed. In contrast, in America between-population diversity is high (see Razib’s recent posts on Fst), languages fall into 140 language families, which may suggest high antiquity.

  4. 4.   Just pushing buttons | Gene Expression | Discover Magazine Says:
    August 24th, 2010 at 12:05 am

    [...] the Mad Biologist, whose bailiwick is the domain of the small, asks in the comments: I don’t mean to bring up a tangential point to the post, but why does the field of human [...]

  5. 5.   People, not pots, in Africa | Gene Expression | Discover Magazine Says:
    August 29th, 2010 at 1:40 am

    [...] Last weekend I mentioned a paper, The Genetic Structure and History of Africans and African Americans, which had the best coverage of disparate African populations we’ve seen so far. The map to the left shows the various ancestral population clusters inferred from the samples they had. Really the only failing is that they didn’t have samples from Angola, Zambia, Zimbabwe and Mozambique. Unfortunately, that’s not totally trivial. These are regions which were effected by the Bantu Expansion, with southern Angola in particular still having remnants of Khoisan language speakers which likely attest to the pre-Bantu populations. Luckily for us innovation and scientific ingenuity are such that minor questions can quickly be answered because of how cheap the basic methods have become. A new paper in The European Journal of Human Genetics tackles Mozambique in particular, and discerns a heretofore unknown possible population cluster. A genomic analysis identifies a novel component in the genetic structure of sub-Saharan African populations: Studies of large sets of single nucleotide polymorphism (SNP) data have proven to be a powerful tool in the analysis of the genetic structure of human populations. In this work, we analyze genotyping data for 2841 SNPs in 12 sub-Saharan African populations, including a previously unsampled region of southeastern Africa (Mozambique). We show that robust results in a world-wide perspective can be obtained when analyzing only 1000 SNPs. Our main results both confirm the results of previous studies, and show new and interesting features in sub-Saharan African genetic complexity. There is a strong differentiation of Nilo-Saharans, much beyond what would be expected by geography. Hunter-gatherer populations (Khoisan and Pygmies) show a clear distinctiveness with very intrinsic Pygmy (and not only Khoisan) genetic features. Populations of the West Africa present an unexpected similarity among them, possibly the result of a population expansion. Finally, we find a strong differentiation of the southeastern Bantu population from Mozambique, which suggests an assimilation of a pre-Bantu substrate by Bantu speakers in the region. [...]





    • About Gene Expression

      Razib Khan’s degrees are in biochemistry and biology. He has blogged about genetics since 2002, previously worked in software development, is an Unz Foundation Junior Fellow and lives in the western US. He loves habaneros.

    • Search

    • Recent Comments

      • Donn on The Iranian Genome Project
      • Razib Khan on The Iranian Genome Project
      • Donn on The Iranian Genome Project
      • Razib Khan on The Iranian Genome Project
      • Hephaestus on The Iranian Genome Project
    • Must Read List

      • Principles of Population Genetics
      • Quantitative Genetics
      • The Horse, the Wheel, and Language
      • Albion's Seed
      • The Blank Slate
    • Links

      Blogroll

      Blogroll

      • A Replicated Typo
      • Archives at unz.org
      • Brown Pundits
      • Deep Sea News
      • Dienekes
      • Gene Expression Classic
      • Harappa Ancestry Project
      • John Hawks
      • Less Wrong
      • Randall Parker
      • Razib on Books
      • Razib's Aggregator Blog
      • Secular Right
      • Sepia Mutiny
      • Steve Sailer
      • West Hunter
      Q & A

      Q & A

      • A. W. F. Edwards
      • Adam K. Webb
      • Armand Leroi
      • Bruce Lahn
      • Charles C. Mann
      • Charles Murray
      • Dan Sperber
      • David Haig
      • Heather Mac Donald
      • Hugh Pope
      • James F. Crow
      • John Derbyshire
      • Jon Entine
      • Judith Rich Harris
      • Justin L. Barrett
      • Ken Miller
      • Matthew Stewart
      • Parag Khanna
      • Peter Turchin
      • Warren Treadgold
      Books

      Books

      • 1491
      • 1848
      • A Beautiful Math
      • A Concise Economic History of the World
      • A Farewell to Alms
      • A History of Christianity
      • A History of Iran
      • A History of the Byzantine State and Society
      • A Reason for Everything
      • A Separate Creation
      • A Splendid Exchange
      • A Theory of Religion
      • A World History
      • Aboriginal Australians
      • Adaptation and Natural Selection
      • After Tamerlane
      • After the Ice
      • Age of Abundance
      • Albion's Seed
      • American Judaism
      • Banana
      • Before the Dawn
      • Behavioral Genetics in the Postgenomic Era
      • Biometry
      • Blood of the Isles
      • Bones, Stones and Molecules
      • Born That Way
      • Calculus Made Easy
      • Castes of Mind
      • Catholicism and Freedom
      • Causes of Evolution
      • Children of the Revolution
      • China in World History
      • China's Cosmopolitan Empire
      • China: A New History
      • Clash of Extremes
      • Contours of the World Economy 1-2030 AD
      • Darwin's Cathedral
      • Dawn of Human Culture
      • Deep Ancestry
      • Defenders of the Truth
      • Descartes' Baby
      • Divided by the Faith
      • Dragon Bone Hill
      • Empires and Barbarians
      • Empires of the Silk Road
      • Empires of the Word
      • End of the Bronze Age
      • Endless Forms Most Beautiful
      • Epistasis and Evolutionary Process
      • Europe
      • Europe After Rome
      • Europe Between the Oceans
      • Evolution
      • Evolution and the Genetics of Populations
      • Evolution for Everyone
      • Evolutionary Dynamics
      • Evolutionary Genetics
      • Evolutionary Human Genetics
      • Evolutionary Quantitative Genetics
      • Explaining Culture
      • Fooled By Randomness
      • Fourth Crusade & the Sack of Constantinople
      • Freedom Just Around the Corner
      • From Plato to Nato
      • Genetical Theory of Natural Selection
      • Genetics and Analysis of Quantitative Traits
      • Genetics and Origins of Species
      • Genetics of Populations
      • Genghis Khan & the Making of the Modern World
      • Genome
      • Geography of Thought
      • Global Capitalism
      • God's War
      • Grand New Party
      • Grooming, Gossip, and the Evolution of Language
      • Guns, Germs, and Steel
      • Historical Dynamics
      • History of Rome
      • How Pleasure Works
      • How Rome Fell
      • How We Decide
      • In Gods We Trust
      • In Search of the Trojan War
      • India: A New History
      • Infidels
      • Journey of Man
      • Keepers of the Keys of Heaven
      • Knowledge and the Wealth of Nations
      • Mapping Human History
      • Marketplace of the Gods
      • Mathematical Models in Biology
      • Molecular Evolution
      • Molecular Markers, Natural History, and Evolution
      • Mother Nature
      • Mutants
      • Narrow Roads of Gene Land 1
      • Narrow Roads of Gene Land 2
      • Narrow Roads of Gene Land 3
      • Natural Selection and Social Theory
      • Nature via Nurture
      • No Two Alike
      • Of Moths and Men
      • Origin and Evolution of Cultures
      • Origins of Theoretical Population Genetics
      • Out of Thin Air
      • Pandora's Seed
      • Plagues and Peoples
      • Population Genetics and Microevolutionary Theory
      • Population Genetics, Molecular Evolution, and the Neutral Theory
      • Postwar
      • Power and Plenty
      • Predictably Irrational
      • Prehistory of the Mind
      • Principles of Population Genetics
      • Pursuit of Glory
      • Quantitative Genetics
      • R.A. Fisher, the Life of a Scientist
      • Reading in the Brain
      • Religion Explained
      • Rome and Jersalem
      • Sailing to Byzantium
      • Sewall Wright and Evolutionary Biology
      • Sociobiology
      • Speciation
      • Statistical Methods in Molecular Evolution
      • Supernatural Selection
      • Survival of the Prettiest
      • Synaptic Self
      • Tempo and Mode in Evolution
      • The 10,000 Year Explosion
      • The Age of Confucian Rule
      • The Age of Lincoln
      • The Altruism Equation
      • The Ancestor's Tale
      • The Ascent of Money
      • The Barbarian Conversion
      • The Black Swan
      • The Blank Slate
      • The Classical World
      • The Creationists
      • The Cultural Origins of Human Cognition
      • The Darwin Wars
      • The Descent of Man
      • The Early Chinese Empires
      • The Essential Difference
      • The Evolutionists
      • The Faith Instinct
      • The Fall of Rome
      • The Fall of the Roman Empire
      • The g Factor
      • The Genetics of Human Populations
      • The Germanization of Early Medieval Christianity
      • The Great Arab Conquests
      • The Great Divergence
      • The Great Human Diasporas
      • The Great Upheaval
      • The History and Geography of Human Genes
      • The Horse, the Wheel, and Language
      • The Human Web
      • The Imitation Factor
      • The Invisible Gorilla
      • The Language Instinct
      • The Making of a Christian Aristoracy
      • The Math Gene
      • The Mating Mind
      • The Meme Machine
      • The Moral Animal
      • The Number Sense
      • The Nurture Assumption
      • The Origin of Species
      • The Origin Of The Mind
      • The Origins of Virtue
      • The Power of Babel
      • The Price of Altruism
      • The Red Queen
      • The Reformation
      • The Rise of Western Christendom
      • The Sacred Chain
      • The Selfish Gene
      • The Seven Daughters of Eve
      • The Stuff of Thought
      • The Symbolic Species
      • The Tenth Parallel
      • The Troubled Empire
      • The Vertigo Years
      • The Vikings
      • Throes of Democracy
      • Unknown Quantity
      • Unto Others
      • War and Peace and War
      • War, Wine, and Taxes
      • We Are Doomed
      • Wealth and Poverty of Nations
      • What Hath God Wrought
      • When Baghdad Ruled the Muslim World
      • When Genius Failed
      • Why Sex Matters
      • Why Some Like It Hot
    • Elsewhere on DISCOVER

      RSS Genetics in DISCOVER mag

      Genetics in DISCOVER

      • The Spider Assassin That Acts Like Prey and Cloaks Itself With Wind
      • How Did LEGO Become More About Limits Than Possibilities?
      • Top 100 Stories of 2011: #48: Strongest Repellent Found

      • Top 100 Stories of 2011: #35: Fossil Stirs Debate Over 
Dinosaurs’ Last Days
      • Top 100 Stories of 2011: #30: New Fossil Casts Doubt on Oldest Bird

      • Top 100 Stories of 2011: #63: How Many Species Inhabit the Earth?

      • Top 100 Stories of 2011: #74: Meet the Megavirus

      • Top 100 Stories of 2011: #61: Aging Effects 
Reversed in Mice

    • Gene Expression content

      RSS Recent Posts

      Recent Posts

      • The Iranian Genome Project
      • Socialized personal genomics?
      • A personal note
      • Everlasting permanence
      • ChromoPainter & fineSTRUCTURE on a South Asian data set
      • Secular liberals the tip of the Islamist spear
      • Out of who knows where
      • Monogamous societies superior to polygamous societies
      Categories

      Categories

      • Administration
      • Agriculture
      • Anthroplogy
      • Ask a ScienceBlogger
      • Barbarism
      • Behavior Genetics
      • Bioethics
      • Biology
      • Biotech
      • Blog
      • Books
      • Cognitive Science
      • Creationism
      • Culture
      • Data Analysis
      • Demographics
      • Development
      • Ecology
      • Economics
      • Education
      • Environment
      • Evolution
      • Evolutionary Genetics
      • Evolutionary Psychology
      • Fantasy
      • Food
      • Futurism
      • Genetics
      • Genomics
      • Geography
      • GSS
      • Health
      • History
      • Human Evolution
      • Human Evolutionary Genetics
      • Human Evolutionary Genomics
      • Human Genetics
      • Human Genomics
      • International Affairs
      • Linguistics
      • Medicine
      • Paleontology
      • Personal Genomics
      • philosophy
      • Politics
      • Population Genetics
      • Psychology
      • Quantitative Genetics
      • Religion
      • Science
      • Science Fiction
      • Select
      • Social Science
      • Space
      • Sports
      • Statistics
      • Technology
      • Transhumanism
      • Uncategorized
      Archives

      Archives

      • February 2012
      • January 2012
      • December 2011
      • November 2011
      • October 2011
      • September 2011
      • August 2011
      • July 2011
      • June 2011
      • May 2011
      • April 2011
      • March 2011
      • February 2011
      • January 2011
      • December 2010
      • November 2010
      • October 2010
      • September 2010
      • August 2010
      • July 2010
      • June 2010
      • May 2010
      • April 2010
      • March 2010
      • February 2010
      • January 2010
      • December 2009
      • November 2009
      • October 2009
      • September 2009
      • August 2009
      • July 2009
      • June 2009
      • May 2009
      • April 2009
      • March 2009
      • February 2009
      • January 2009
      • December 2008
      • November 2008
      • October 2008
      • September 2008
      • August 2008
      • July 2008
      • June 2008
      • May 2008
      • April 2008
      • March 2008
      • February 2008
      • January 2008
      • December 2007
      • November 2007
      • October 2007
      • September 2007
      • August 2007
      • July 2007
      • June 2007
      • May 2007
      • April 2007
      • March 2007
      • February 2007
      • January 2007
      • December 2006
      • November 2006
      • October 2006
      • September 2006
      • August 2006
      • July 2006
      • June 2006
      • May 2006
      • April 2006
      • March 2006
      • February 2006
      • January 2006
    • Meta

      • Log in
      • Entries RSS
      • Comments RSS
      • WordPress.org
    • RSS Razib’s Pinboard Feed

      • If you’ve seen one elephant, have you seen them all? | Uda Walawe Elephants
      • Functional genomics: The changes that count : Nature : Nature Publishing Group
      • College Rankings :: Political Affiliation of the Students
      • Economics of Family Life, as Taught by a Power Couple - NYTimes.com
      • Steve Sailer's iSteve Blog: Why does Britain have so many yobs these days?
      • Which population in the 1000 Genomes Project samples has the most Neandertal similarity? | john hawks weblog
      • Neanderthal demise due to many influences, including cultural changes
      • Atheism in America: Why won’t the U.S. accept its atheists? - Slate Magazine
      • For Ron Paul, a Distinctive Worldview of Long Standing - NYTimes.com
      • Killers’ Families Left to Confront Fear and Shame - NYTimes.com
      • 911 IS A JOKE - WWW.THEDAILY.COM
      • When Counseling and Conviction Collide — Beliefs - NYTimes.com
      • Rhodes Trust Gives Account of Quarterback’s Candidacy - NYTimes.com
      • The Powerful Resist Change to Greek Tax System - NYTimes.com
      • Effort to Rebrand Arab Spring Backfires in Iran - NYTimes.com
      • For Founders to Decorators, Facebook Riches - NYTimes.com
      • To Combat Modern Ills, Korea Looks to the Past - NYTimes.com
      • Shit Programmers Say - YouTube
      • Muslim Brotherhood Blocks Protest in Egypt - NYTimes.com
      • Bread and Circuses: Aishwarya Rai, Valerio Massimo Manfredi and The Last Legion


  • Kalmbach Publishing Co.

    Copyright © 2012, Kalmbach Publishing Co.

    Privacy - Terms - Reader Services - Subscribe Today - Advertise - About Us