DISCOVER Magazine. Science, Technology and The Future
Current Issue
Subscribe Today »
  • Renew
  • Give a Gift
  • Archives
  • Customer Service
  • Facebook
  • Twitter
  • Newsletter
  • Health & Medicine
  • Mind & Brain
  • Technology
  • Space
  • Human Origins
  • Living World
  • Environment
  • Physics & Math
  • Video
  • Photos
  • Podcast
  • RSS
Gene Expression
« The study of humankind: questions, answers, and good faith
Around the Web – December 13th, 2010 »

Live not by visualization alone

pc1
Synthetic map

In the age of 500,000 SNP studies of genetic variation across dozens of populations obviously we’re a bit beyond lists of ABO blood frequencies. There’s no real way that a conventional human is going to be able to discern patterns of correlated allele frequency variations which point to between population genetic differences on this scale of marker density. So you rely on techniques which extract the general patterns out of the data, and present them to you in a human-comprehensible format. But, there’s an unfortunate tendency for humans to imbue the products of technique with a particular authority which they always should not have.

ResearchBlogging.orgThe History and Geography of Human Genes is arguably the most important historical genetics work of the past generation. It has surely influenced many within the field of genetics, and because of its voluminous elegant visual displays of genetic data it is also a primary source for those outside of genetics to make sense of phylogenetic relations between human populations. And yet one aspect of this great work which never caught on was the utilization of “synthetic maps” to visualize components of genetic variation between populations. This may have been fortuitous, a few years ago a paper was published, Interpreting principal components analyses of spatial population genetic variation, which suggested that the gradients you see on the map above may be artifacts:

Nearly 30 years ago, Cavalli-Sforza et al. pioneered the use of principal component analysis (PCA) in population genetics and used PCA to produce maps summarizing human genetic variation across continental regions. They interpreted gradient and wave patterns in these maps as signatures of specific migration events. These interpretations have been controversial, but influential, and the use of PCA has become widespread in analysis of population genetics data. However, the behavior of PCA for genetic data showing continuous spatial variation, such as might exist within human continental groups, has been less well characterized. Here, we find that gradients and waves observed in Cavalli-Sforza et al.’s maps resemble sinusoidal mathematical artifacts that arise generally when PCA is applied to spatial data, implying that the patterns do not necessarily reflect specific migration events. Our findings aid interpretation of PCA results and suggest how PCA can help correct for continuous population structure in association studies.

A paper earlier this year took the earlier work further and used a series of simulations to show how the nature of the gradients varied. In light of recent preoccupations the results are of interest. Principal Component Analysis under Population Genetic Models of Range Expansion and Admixture:

In a series of highly influential publications, Cavalli-Sforza and colleagues used principal component (PC) analysis to produce maps depicting how human genetic diversity varies across geographic space. Within Europe, the first axis of variation (PC1) was interpreted as evidence for the demic diffusion model of agriculture, in which farmers expanded from the Near East ∼10,000 years ago and replaced the resident hunter-gatherer populations with little or no interbreeding. These interpretations of the PC maps have been recently questioned as the original results can be reproduced under models of spatially covarying allele frequencies without any expansion. Here, we study PC maps for data simulated under models of range expansion and admixture. Our simulations include a spatially realistic model of Neolithic farmer expansion and assume various levels of interbreeding between farmer and resident hunter-gatherer populations. An important result is that under a broad range of conditions, the gradients in PC1 maps are oriented along a direction perpendicular to the axis of the expansion, rather than along the same axis as the expansion. We propose that this surprising pattern is an outcome of the “allele surfing” phenomenon, which creates sectors of high allele-frequency differentiation that align perpendicular to the direction of the expansion.

The first figure shows the general framework with which they performed the simulations:

pcab1

You have a lattice which consists of demes, population units, all across Europe. They modulated parameters such as population growth (r), carrying capacity (C), and migration (m). Additionally, they had various scenarios of expansion from the southwest or southeast, as well as two expansions one after another to mimic the re-population of Europe after the Ice Age by Paleolithic groups, and their later replacement by Neolithic groups. They modulated admixture and introgression of genes from the Paleolithic group to the Neolithics so that you had the full range where the final European were mostly Neolithic or mostly Paleolithic.

Below are some of the figures which show the results:

[Show as slideshow]
[View with PicLens]
Figure 4 & 5 Figure 4 & 5
Figure 2 Figure 2

allesurAs you can see the strange thing is that in some models the synthetic map gradient is rotated 90 degrees from the axis of demographic expansion! In this telling the famous synthetic map showing Neolithic expansion might be showing expansion from Iberia. Perhaps a radiation from a post-Ice Age southern refuge?

One explanation might be “allele surfing” on the demographic “wave of advance.” Basically as a population expands very rapidly stochastic forces such as random genetic drift and bottlenecks could produce diversification along the edge of the population wave front. The reason for this is that these rapidly expanding populations explode out of serial bottlenecks and demographic expansions, which will produce genetic distinctiveness among the many differentiated demes bubbling along the edge of expansion. Alleles which may have been at low frequency in the ancestral population can “fix” in descendant populations on the edge of the demographic wave of advance. This is the explanation, more or less, that one group gave last year for the very high frequencies of R1b1b2 in Western Europe. With this, they overturned the classic assumption that R1b1b2 was a Paleolithic marker, and suggested it was a Neolithic one.

Here’s their conclusion from the paper:

A previous study showed that the original patterns observed in PCA might not reflect any expansion events (Novembre and Stephens 2008). Here, we find that under very general conditions, the pattern of molecular diversity produced by an expansion may be different than what was expected in the literature. In particular, we find conditions where an expansion of Neolithic farmers from the southeast produces a greatest axis of differentiation running from the southwest to the northeast. This surprising result is seemingly due to allele surfing leading to sectors that create differentiation perpendicular to the expansion axis. Although a lot of our results can be explained by the surfing phenomenon, some interesting questions remain open. For example, the phase transition observed for relatively small admixture rates between Paleolithic resident and Neolithic migrant populations occurs at a value that is dependent on our simulation settings, and further investigations would be needed to better characterize this critical value as a function of all the model parameters. Another unsolved question is to know why the patterns generally observed in PC2 maps for our simulation settings sometimes arise in PC1 maps instead. These unexplained examples remind us that PCA is summarizing patterns of variation in the sample due to multiple factors (ancestral expansions and admixture, ongoing limited migration, habitat boundary effects, and the spatial distribution of samples). In complex models such as our expansion models with admixture in Europe, it may be difficult to tease apart what processes give rise to any particular PCA pattern. Our study emphasizes that PC (and AM) should be viewed as tools for exploring the data but that the reverse process of interpreting PC and AM maps in terms of past routes of migration remains a complicated exercise. Additional analyses—with more explicit demographic models—are more than ever essential to discriminate between multiple explanations available for the patterns observed in PC and AM maps. We speculate that methods exploiting the signature of alleles that have undergone surfing may be a powerful approach to study range expansions.

What’s the big picture here? In the textbook Human Evolutionary Genetics it is asserted that synthetic maps never became very popular compared to PCA itself. I think this is correct. But, the original synthetic maps have become prominent for many outside of genetics. They figure in Peter Bellwood’s First Farmers, and are taken as a given by many pre-historians, such as Colin Renfrew. And yet a reliance on these sorts of tools must not be blind to the reality that the more layers of abstraction you put between your perception and comprehension of concrete reality, the more likely you are to be led astray by quirks and biases of method.

In this case I do think first-order intuition would tell us that synthetic maps which display PCs would be showing gradients as a function of demographic pulses. And yet the intuition may not be right, and with the overturning of old orthodoxies in the past generation of inferences from the variation patterns in modern populations, we should be very cautious.

Citation: Olivier François, Mathias Currat, Nicolas Ray, Eunjung Han, Laurent Excoffier, & John Novembre (2010). Principal Component Analysis under Population Genetic
Models of Range Expansion and Admixture Mol Biol Evol

Share

December 13th, 2010 Tags: Anthropology, Archaeogenetics, Archaeology, European Genetics, Neolithic, Paleolithic, Prehistory
by Razib Khan in Genetics, Genomics, History | 3 comments | RSS feed | Trackback >

3 Responses to “Live not by visualization alone”

  1. 1.   Tweets that mention Live not by visualization alone | Gene Expression | Discover Magazine -- Topsy.com Says:
    December 13th, 2010 at 2:30 am

    [...] This post was mentioned on Twitter by Al Poe, Maggie, World Amazing Things, 0001_xml, m and others. m said: Live not by visualization alone | Gene Expression: Synthetic map In the age of 500,000 SNP studies of genetic v… http://bit.ly/gxmGu8 [...]

  2. 2.   bob sykes Says:
    December 13th, 2010 at 4:46 am

    Perhaps we should be looking for Noah’s log book.

  3. 3.   dearieme Says:
    December 13th, 2010 at 5:05 am

    It was fun in the old days, when our ancestors wandered, and REFLUXED, like billy-oh.





    • About Gene Expression

      Razib Khan’s degrees are in biochemistry and biology. He has blogged about genetics since 2002, previously worked in software development, is an Unz Foundation Junior Fellow and lives in the western US. He loves habaneros.

    • Search

    • Recent Comments

      • Razib Khan on An Orientalist fantasy
      • Wulf Kurtoglu on An Orientalist fantasy
      • Larry, San Francisco on Vaccination as heterodoxy
      • Onur on The utility and reality of species
      • DK on The utility and reality of species
    • Must Read List

      • Principles of Population Genetics
      • Quantitative Genetics
      • The Horse, the Wheel, and Language
      • Albion's Seed
      • The Blank Slate
    • Links

      Blogroll

      Blogroll

      • A Replicated Typo
      • Archives at unz.org
      • Brown Pundits
      • Deep Sea News
      • Dienekes
      • Gene Expression Classic
      • Harappa Ancestry Project
      • John Hawks
      • Less Wrong
      • Randall Parker
      • Razib on Books
      • Razib's Aggregator Blog
      • Secular Right
      • Sepia Mutiny
      • Steve Sailer
      • West Hunter
      Q & A

      Q & A

      • A. W. F. Edwards
      • Adam K. Webb
      • Armand Leroi
      • Bruce Lahn
      • Charles C. Mann
      • Charles Murray
      • Dan Sperber
      • David Haig
      • Heather Mac Donald
      • Hugh Pope
      • James F. Crow
      • John Derbyshire
      • Jon Entine
      • Judith Rich Harris
      • Justin L. Barrett
      • Ken Miller
      • Matthew Stewart
      • Parag Khanna
      • Peter Turchin
      • Warren Treadgold
      Books

      Books

      • 1491
      • 1848
      • A Beautiful Math
      • A Concise Economic History of the World
      • A Farewell to Alms
      • A History of Christianity
      • A History of Iran
      • A History of the Byzantine State and Society
      • A Reason for Everything
      • A Separate Creation
      • A Splendid Exchange
      • A Theory of Religion
      • A World History
      • Aboriginal Australians
      • Adaptation and Natural Selection
      • After Tamerlane
      • After the Ice
      • Age of Abundance
      • Albion's Seed
      • American Judaism
      • Banana
      • Before the Dawn
      • Behavioral Genetics in the Postgenomic Era
      • Biometry
      • Blood of the Isles
      • Bones, Stones and Molecules
      • Born That Way
      • Calculus Made Easy
      • Castes of Mind
      • Catholicism and Freedom
      • Causes of Evolution
      • Children of the Revolution
      • China in World History
      • China's Cosmopolitan Empire
      • China: A New History
      • Clash of Extremes
      • Contours of the World Economy 1-2030 AD
      • Darwin's Cathedral
      • Dawn of Human Culture
      • Deep Ancestry
      • Defenders of the Truth
      • Descartes' Baby
      • Divided by the Faith
      • Dragon Bone Hill
      • Empires and Barbarians
      • Empires of the Silk Road
      • Empires of the Word
      • End of the Bronze Age
      • Endless Forms Most Beautiful
      • Epistasis and Evolutionary Process
      • Europe
      • Europe After Rome
      • Europe Between the Oceans
      • Evolution
      • Evolution and the Genetics of Populations
      • Evolution for Everyone
      • Evolutionary Dynamics
      • Evolutionary Genetics
      • Evolutionary Human Genetics
      • Evolutionary Quantitative Genetics
      • Explaining Culture
      • Fooled By Randomness
      • Fourth Crusade & the Sack of Constantinople
      • Freedom Just Around the Corner
      • From Plato to Nato
      • Genetical Theory of Natural Selection
      • Genetics and Analysis of Quantitative Traits
      • Genetics and Origins of Species
      • Genetics of Populations
      • Genghis Khan & the Making of the Modern World
      • Genome
      • Geography of Thought
      • Global Capitalism
      • God's War
      • Grand New Party
      • Grooming, Gossip, and the Evolution of Language
      • Guns, Germs, and Steel
      • Historical Dynamics
      • History of Rome
      • How Pleasure Works
      • How Rome Fell
      • How We Decide
      • In Gods We Trust
      • In Search of the Trojan War
      • India: A New History
      • Infidels
      • Journey of Man
      • Keepers of the Keys of Heaven
      • Knowledge and the Wealth of Nations
      • Mapping Human History
      • Marketplace of the Gods
      • Mathematical Models in Biology
      • Molecular Evolution
      • Molecular Markers, Natural History, and Evolution
      • Mother Nature
      • Mutants
      • Narrow Roads of Gene Land 1
      • Narrow Roads of Gene Land 2
      • Narrow Roads of Gene Land 3
      • Natural Selection and Social Theory
      • Nature via Nurture
      • No Two Alike
      • Of Moths and Men
      • Origin and Evolution of Cultures
      • Origins of Theoretical Population Genetics
      • Out of Thin Air
      • Pandora's Seed
      • Plagues and Peoples
      • Population Genetics and Microevolutionary Theory
      • Population Genetics, Molecular Evolution, and the Neutral Theory
      • Postwar
      • Power and Plenty
      • Predictably Irrational
      • Prehistory of the Mind
      • Principles of Population Genetics
      • Pursuit of Glory
      • Quantitative Genetics
      • R.A. Fisher, the Life of a Scientist
      • Reading in the Brain
      • Religion Explained
      • Rome and Jersalem
      • Sailing to Byzantium
      • Sewall Wright and Evolutionary Biology
      • Sociobiology
      • Speciation
      • Statistical Methods in Molecular Evolution
      • Supernatural Selection
      • Survival of the Prettiest
      • Synaptic Self
      • Tempo and Mode in Evolution
      • The 10,000 Year Explosion
      • The Age of Confucian Rule
      • The Age of Lincoln
      • The Altruism Equation
      • The Ancestor's Tale
      • The Ascent of Money
      • The Barbarian Conversion
      • The Black Swan
      • The Blank Slate
      • The Classical World
      • The Creationists
      • The Cultural Origins of Human Cognition
      • The Darwin Wars
      • The Descent of Man
      • The Early Chinese Empires
      • The Essential Difference
      • The Evolutionists
      • The Faith Instinct
      • The Fall of Rome
      • The Fall of the Roman Empire
      • The g Factor
      • The Genetics of Human Populations
      • The Germanization of Early Medieval Christianity
      • The Great Arab Conquests
      • The Great Divergence
      • The Great Human Diasporas
      • The Great Upheaval
      • The History and Geography of Human Genes
      • The Horse, the Wheel, and Language
      • The Human Web
      • The Imitation Factor
      • The Invisible Gorilla
      • The Language Instinct
      • The Making of a Christian Aristoracy
      • The Math Gene
      • The Mating Mind
      • The Meme Machine
      • The Moral Animal
      • The Number Sense
      • The Nurture Assumption
      • The Origin of Species
      • The Origin Of The Mind
      • The Origins of Virtue
      • The Power of Babel
      • The Price of Altruism
      • The Red Queen
      • The Reformation
      • The Rise of Western Christendom
      • The Sacred Chain
      • The Selfish Gene
      • The Seven Daughters of Eve
      • The Stuff of Thought
      • The Symbolic Species
      • The Tenth Parallel
      • The Troubled Empire
      • The Vertigo Years
      • The Vikings
      • Throes of Democracy
      • Unknown Quantity
      • Unto Others
      • War and Peace and War
      • War, Wine, and Taxes
      • We Are Doomed
      • Wealth and Poverty of Nations
      • What Hath God Wrought
      • When Baghdad Ruled the Muslim World
      • When Genius Failed
      • Why Sex Matters
      • Why Some Like It Hot
    • Elsewhere on DISCOVER

      RSS Genetics in DISCOVER mag

      Genetics in DISCOVER

      • Can Stuffing Germs up Ferrets Unleash a Human Pandemic?
      • 20 Things You Didn't Know About... Allergies
      • The Brain: Hidden Epidemic: 
Tapeworms Living Inside People's Brains
      • The Hagfish's Special Trick for Warding Off Predators: Thick, Sticky Mucus
      • The Big, Overlooked Factor in the Rise of Pandemics: The Human Vector
      • Does Rain Come From Life in the Clouds?
      • Gallery | 6 Creepy-Crawlies We Hate But Couldn't Do Without
      • Plants Repel Bacteria's Assaults by Spying on Their Chatter
    • Gene Expression content

      RSS Recent Posts

      Recent Posts

      • A quick note on comments policy
      • An Orientalist fantasy
      • Vaccination as heterodoxy
      • Hispanos and Sephardic ancestry
      • Are Hispanics that socially conservative?
      • The utility and reality of species
      • The American Community Survey: mend it, don’t end it!
      • GEDmatch
      Categories

      Categories

      • Administration
      • Agriculture
      • Anthroplogy
      • Ask a ScienceBlogger
      • Barbarism
      • Behavior Genetics
      • Bioethics
      • Biology
      • Biotech
      • Blog
      • Books
      • Cognitive Science
      • Creationism
      • Culture
      • Data Analysis
      • Demographics
      • Development
      • Ecology
      • Economics
      • Education
      • Environment
      • Evolution
      • Evolutionary Genetics
      • Evolutionary Psychology
      • Fantasy
      • Food
      • Futurism
      • Genetics
      • Genomics
      • Geography
      • GSS
      • Health
      • History
      • Human Evolution
      • Human Evolutionary Genetics
      • Human Evolutionary Genomics
      • Human Genetics
      • Human Genomics
      • International Affairs
      • Linguistics
      • Medicine
      • Paleontology
      • Personal Genomics
      • philosophy
      • Politics
      • Population Genetics
      • Psychology
      • Quantitative Genetics
      • Race
      • Religion
      • Science
      • Science Fiction
      • Select
      • Social Science
      • Space
      • Sports
      • Statistics
      • Technology
      • Transhumanism
      • Uncategorized
      Archives

      Archives

      • May 2012
      • April 2012
      • March 2012
      • February 2012
      • January 2012
      • December 2011
      • November 2011
      • October 2011
      • September 2011
      • August 2011
      • July 2011
      • June 2011
      • May 2011
      • April 2011
      • March 2011
      • February 2011
      • January 2011
      • December 2010
      • November 2010
      • October 2010
      • September 2010
      • August 2010
      • July 2010
      • June 2010
      • May 2010
      • April 2010
      • March 2010
      • February 2010
      • January 2010
      • December 2009
      • November 2009
      • October 2009
      • September 2009
      • August 2009
      • July 2009
      • June 2009
      • May 2009
      • April 2009
      • March 2009
      • February 2009
      • January 2009
      • December 2008
      • November 2008
      • October 2008
      • September 2008
      • August 2008
      • July 2008
      • June 2008
      • May 2008
      • April 2008
      • March 2008
      • February 2008
      • January 2008
      • December 2007
      • November 2007
      • October 2007
      • September 2007
      • August 2007
      • July 2007
      • June 2007
      • May 2007
      • April 2007
      • March 2007
      • February 2007
      • January 2007
      • December 2006
      • November 2006
      • October 2006
      • September 2006
      • August 2006
      • July 2006
      • June 2006
      • May 2006
      • April 2006
      • March 2006
      • February 2006
      • January 2006
    • Meta

      • Log in
      • Entries RSS
      • Comments RSS
      • WordPress.org
    • RSS Razib’s Pinboard Feed

      • Abortion polls, gay marriage polls: Why are we becoming liberal on some issues but not others? - Slate Magazine
      • At CUNY’s Top Colleges, Black and Hispanic Freshmen Enrollments Drop - NYTimes.com
      • Megafaunal Extinctions
      • New Details Are Released in Shooting of Trayvon Martin - NYTimes.com
      • White American babies are now in the minority. Why does the census divide people by race, anyway? - Slate Magazine
      • When you eat matters, not just what you eat
      • Can You Call a 9-Year-Old a Psychopath? - NYTimes.com
      • A Circle of Tech in Silicon Valley - Collect Payout, Do a Start-Up - NYTimes.com
      • Archaeologists Unearth Ancient Maya Calendar Writing - NYTimes.com
      • Repeat act: Parallel selection tweaks many of the same genes to make big and heavy mice
      • Blond as a window to ancient pigmentation variation
      • Eugenics, Malthusianism, and Trepidation, Bryan Caplan | EconLog | Library of Economics and Liberty
      • Textuality: The Jews Are a Race, Geneticist Says
      • The designer baby factory: Eggs from beautiful Eastern Europeans. Sperm from wealthy Westerners. And embryos implanted in desperate women. | Mail Online
      • Arab Spring Stirs Palestinian Journalists to Test Free Speech Limits - NYTimes.com
      • Barack Obama | Racial Diversity | Civil Rights | 2012 Election | The Daily Caller
      • Could These Start-Ups Become the Next Big Thing? - NYTimes.com
      • Steve Sailer's iSteve Blog: Pym Fortuyn, RIP
      • Never mind Europe; worry about India's economic growth - The Economic Times
      • 9 Swing States, Critical to Presidential Race, Are Mixed Lot - NYTimes.com


  • Kalmbach Publishing Co.

    Copyright © 2012, Kalmbach Publishing Co.

    Privacy - Terms - Reader Services - Subscribe Today - Advertise - About Us