DISCOVER Magazine. Science, Technology and The Future
Current Issue
Subscribe Today »
  • Renew
  • Give a Gift
  • Archives
  • Customer Service
  • Facebook
  • Twitter
  • Newsletter
  • Health & Medicine
  • Mind & Brain
  • Technology
  • Space
  • Human Origins
  • Living World
  • Environment
  • Physics & Math
  • Video
  • Photos
  • Podcast
  • RSS
Gene Expression
« American medicine & American red-tape
Non-overlapping magisteria for the social and biological? »

Clusters where they “shouldn’t be”….


Uyghur girls

A few people have pointed me to the paper, Implications for health and disease in the genetic signature of the Ashkenazi Jewish population. You should check it out if you don’t have academic access to papers, it’s not gated. Rather, I want to focus on a methodological issue.

In the genetics reader survey only 20 percent of you agreed that you understood how to read an ADMIXTURE plot. After looking at some of the results in this paper I have a lot of sympathy. Understanding what’s going on requires more prior information than is often present in the legends of the figures.

It is known that to a first approximation Ashkenazi Jews, that is, the Jews of Europe, can be understood as an admixture between a European population and a Middle Eastern one. But Ashkenazi Jews also exhibit their own genetic distinctiveness, probably due to long term endogamy. This shows up in various genetic statistics. In this paper the authors show that Ashkenazi form their own cluster in both PCA and ADMIXTURE, two ways in which to ascertain population structure. Below I’ve reedited and highlighted some populations of note in one of their ADMIXTURE plots. It’s rather informative of the bigger problem with interpreting these sorts of results in the absence of context.


As you can see there is an ancestral element which is predominant in the Ashkenazi. A individual analysis also implies that most of those with a lower fraction of this element who identify as Ashkenazi probably have recent admixture (e.g., only three out of four grandparents were Jewish). What I found striking is that the Uyghur and Hazara both also shook out as of a particular ancestral element. The reality is that we know this is a total artifact of the ADMIXTURE software; the Hazara have a historical narrative of being the product of intermarriage between Mongols and Persians. The historical evidence for the origin of the Uyghur is sketchier and more confused, but it can be reconstructed. And the genetics make it likely that both these groups emerged over the past 2,000 years, as an admixture between a Western and Eastern Eurasian set of populations.

What does this have to do with Ashkenazi Jews? I think one should be skeptical of an “Ashkenazi Jewish” modal element when we already know that this plot has useless clusters. It does not seem like they included any “real” East Asian reference populations, so the Hazara and Uyghur stepped up and took that position, despite both populations having ~50 percent West Eurasian admixture. The ADMIXTURE software transformed a clearly hybridized population into its own ‘ancestral’ population. Something similar might be happening with the Jews, especially in light of the fact that the authors had a relatively large Jewish population their data, geared to exploring the nature of Jewish genetic relationships. This is a case where we know that we probably don’t know that much.

The moral: don’t think you can read a scientific figure plainly without any context.

Image credit: Wikipedia

Share

February 13th, 2012 Tags: Jews
by Razib Khan in Human Genetics, Human Genomics | 13 comments | RSS feed | Trackback >

13 Responses to “Clusters where they “shouldn’t be”….”

  1. 1.   Nick Patterson Says:
    February 14th, 2012 at 7:40 am

    This is a shrewd and important note illuminating a key issue.
    I work a lot on recovering phylogeny as some readers of this blog will know.

    I use PCA (and ADMIXTURE) output mostly for data exploration rather than formal modeling;
    PCA in some sense has no underlying model at all, and ADMIXTURE’s model is
    a “star shaped phylogeny” where populations all split from a root population at the same time
    (and then remix). Sometimes this model is unrealistic and the results can be misleading,
    as Razib points out.

    Nick Patterson (Broad Institute)

  2. 2.   Charles Nydorf Says:
    February 14th, 2012 at 7:51 am

    I am looking at the provisional version of the article and finding it frustrating. The authors found a small amount of geographical intra-population variation within the Ashkenazim but ‘Addistional File 5′ that shows this does not have a legend explaining the color coding.

  3. 3.   marcel Says:
    February 14th, 2012 at 7:52 am

    A minor niggle, and more history of European Jews than you may be interested in.

    It is known that to a first approximation Ashkenazi Jews, that is, the Jews of Europe

    I do not believe that this is quite right. While the Ashkenazim seem to have come originally from groups up and down the Rhine river and neighboring regions, they moved east during and after the Crusades, especially the first Crusade. Until the 19thC, the small Jewish populations in France, the Lowlands, and England (I don’t think there were any populations of note elsewhere in the UK) were almost entirely descended from people exiled from Spain a couple of hundred years earlier. The same is true of Greek Jews, and a bit less firmly, of Italian Jews, though here I think as you move further south, especially away from Austria, the Ashkenazi influence becomes negligible.

    It is certainly true that at the end of the 18th C, there were few Jews in these parts of Europe, that the Ashkenazim were the lion’s share of European Jews, so to that extent, the quoted assertion is not incorrect, but there remained an identifiable minority of Jews in Europe who were not Ashkenazi. Over the course of the 19th and early 20thC, Ashkenazim immigrated in increasing numbers in areas where the Jewish populations had been predominantly Sephardic (i.e., from Iberia)

  4. 4.   Razib Khan Says:
    February 14th, 2012 at 8:32 am

    It is certainly true that at the end of the 18th C, there were few Jews in these parts of Europe, that the Ashkenazim were the lion’s share of European Jews, so to that extent, the quoted assertion is not incorrect

    yes. i am aware of all the rest (e.g., romaniots, etc.).

  5. 5.   Onur Says:
    February 14th, 2012 at 9:13 am

    Technically speaking, Ashkenazi Jews are eastern European (not to be confused with East Europe) Jews and Sephardi Jews are western European (not to be confused with West Europe) Jews. The former community originated along the Rhine area and the latter community originated in Iberia. Both communities seem to be a mixture of ancient West Asian Jews (whose places of origin are not clear) with European indigenous populations.

  6. 6.   Charles Nydorf Says:
    February 14th, 2012 at 11:16 am

    A few words on the geographical extent of Ashkenazic Jewry. Ashkenazic Jews are traditionally defined by following some religious practices that are slightly different from those of other religious groups. The area within which these practices were followed coincided with the territory in which Jews spoke Yiddish up till the early 19th century. This area includes Germany along with Alsace-Lorraine, a small part of Switzerland, Austria and Holland. To the southeast it extended into the Czech regions and an area south of the Carpathians taking in Slovakia, Hungary, and subcarpathian Ukraine and Transylvanian Romania. To the north and east of the Carpathians Bukovina and Moldavia were traditional Ashkenazic area. The demographic heartland of the Ashkenaz was the medieval kingdom of Poland-Lithuania. The part belonging to the old Lithuanian Duchy embraced Lithuania, eastern Latvia, Belarus and parts of Ukraine and Poland. The part belonging the Kingdom of Poland included Poland and most of Ukraine.

  7. 7.   ohwilleke Says:
    February 14th, 2012 at 2:25 pm

    I wonder how “well mixed” a hybrid population has to be to appear to be an ancestral rather than hybrid component in an ADMIXTURE analysis. The Ashkenazi example suggests that in the right circumstances that it takes less than about sixty generations. The Uygur and Hazara examples trim that a bit and suggest that the circumstances necessary don’t have to be that exceptional.

    Would five hundred years with less endogamy than the Ashkenazi suffice? For example, would a Mayflower Puritan cluster be treated by ADMIXTURE as an ancestral population (presumably very similar to CEU) distinct from the source populations for them in the United Kingdom?

  8. 8.   Eurologist Says:
    February 14th, 2012 at 5:48 pm

    The solution to the “problem” of getting artificial ancestral populations out of ADMIXTURE or PCA runs seems to be trivial: use proper reference populations, and project the population you want to study onto that (i.e., exclude it from the initial analysis).

  9. 9.   Razib Khan Says:
    February 14th, 2012 at 7:20 pm

    #8, oh wow, you should go school the guys at the broad institute! everything is so trivial for you!

    more seriously, this paper illustrates a ‘trivial’ problem with getting ‘proper’ reference populations:

    http://www.nature.com/nature/journal/v461/n7263/abs/nature08365.html

  10. 10.   Razib Khan Says:
    February 14th, 2012 at 7:23 pm

    #7, reductio ad absurdum: a mixed-race family would be an ‘ancestral population.’ that’s why you remove relatives out of these analyses. but it shows that with enough IBD in an admixed group you can create a new group. one can actually argue that that is what the ashkenazi jews are.

  11. 11.   Eurologist Says:
    February 15th, 2012 at 2:16 am

    Razib,

    I only addressed one of several issues, and that one indeed seems trivial compared to the others.

    Given that I never shy away from admitting that I am no expert, I certainly didn’t mean to sound condescending. As I mentioned, it seems to me if one wants to find out ancestral admixtures to a specific population, then it’s best to exclude that one and do a projection, instead. That way, the studied population doesn’t pretend to be it’s own source and can never form its own cluster (so as to render the analysis useless). So, that’s the trivial part (conceptually; and practically, in PCA – not so sure about ADMIXTURE).

    What are “proper” reference populations? Looks like for this to be meaningful, you want a lot more populations than K-levels, widely spread, and yes, at one point it becomes model- or prior-knowledge dependent (when selecting as clean or cleanly admixed reference populations as possible, and not overloading a particular region or irrelevant outlying groups). For minor contribution, it probably doesn’t matter much, and it’s probably OK to use as few “stand-in” populations as possible. But sooner or later, interpretation also depends on when admixture took place, and whether you are open-minded enough to allow this to have happened any time in the past 50,000 years or so, in Eurasia — rather than starting from the preconceived notion that, in order of descending importance, most was done in the iron age, the bronze age, or towards the final stages of the dominance of agriculture.

    The Reich paper has some of those latter elements of “interpretation difficulty.” For example, as I have mentioned many times, large parts of West Asia and Europe clearly descend from populations originally residing in the subcontinent – ~45,000ya. Add some continuous gene flow and the well-known climatic barrier within India, just before LGM parts of Pakistan and NW India likely were genetically closer to people in the Caucasus/Anatolia/Eastern Europe than to people in S and SE India and SE Asia. So, IMO, what today may look like recent / IE admixture may have a base-level cause that dates back 25,000ya (the large diversity of y-DNA haplogroups R, R1, and R1a/ R1b in the subcontinent is also of interest in this context).

    Conversely, if the goal is to identify a person as belonging to a particular group, one could do the opposite and climb up the k-ladder until (almost) every group has become distinct.

    Finally, if the algorithm has no way to put proper weights on the groups, then it makes most sense to me to pare down to equally-sized groups.

  12. 12.   Interesting Says:
    February 16th, 2012 at 8:02 pm

    This maybe khazar ancestry showing up.

    As a Jew I have no problem with this. However, it is kind of a hot button issue among some. As it has been a claim that anti-Semites have made in the past.

  13. 13.   Onur Says:
    February 17th, 2012 at 8:08 am

    This maybe khazar ancestry showing up.

    Where do you see that?





    • About Gene Expression

      Razib Khan’s degrees are in biochemistry and biology. He has blogged about genetics since 2002, previously worked in software development, is an Unz Foundation Junior Fellow and lives in the western US. He loves habaneros.

    • Search

    • Recent Comments

      • DK on The utility and reality of species
      • Razib Khan on An Orientalist fantasy
      • Wulf Kurtoglu on An Orientalist fantasy
      • Larry, San Francisco on Vaccination as heterodoxy
      • Onur on The utility and reality of species
    • Must Read List

      • Principles of Population Genetics
      • Quantitative Genetics
      • The Horse, the Wheel, and Language
      • Albion's Seed
      • The Blank Slate
    • Links

      Blogroll

      Blogroll

      • A Replicated Typo
      • Archives at unz.org
      • Brown Pundits
      • Deep Sea News
      • Dienekes
      • Gene Expression Classic
      • Harappa Ancestry Project
      • John Hawks
      • Less Wrong
      • Randall Parker
      • Razib on Books
      • Razib's Aggregator Blog
      • Secular Right
      • Sepia Mutiny
      • Steve Sailer
      • West Hunter
      Q & A

      Q & A

      • A. W. F. Edwards
      • Adam K. Webb
      • Armand Leroi
      • Bruce Lahn
      • Charles C. Mann
      • Charles Murray
      • Dan Sperber
      • David Haig
      • Heather Mac Donald
      • Hugh Pope
      • James F. Crow
      • John Derbyshire
      • Jon Entine
      • Judith Rich Harris
      • Justin L. Barrett
      • Ken Miller
      • Matthew Stewart
      • Parag Khanna
      • Peter Turchin
      • Warren Treadgold
      Books

      Books

      • 1491
      • 1848
      • A Beautiful Math
      • A Concise Economic History of the World
      • A Farewell to Alms
      • A History of Christianity
      • A History of Iran
      • A History of the Byzantine State and Society
      • A Reason for Everything
      • A Separate Creation
      • A Splendid Exchange
      • A Theory of Religion
      • A World History
      • Aboriginal Australians
      • Adaptation and Natural Selection
      • After Tamerlane
      • After the Ice
      • Age of Abundance
      • Albion's Seed
      • American Judaism
      • Banana
      • Before the Dawn
      • Behavioral Genetics in the Postgenomic Era
      • Biometry
      • Blood of the Isles
      • Bones, Stones and Molecules
      • Born That Way
      • Calculus Made Easy
      • Castes of Mind
      • Catholicism and Freedom
      • Causes of Evolution
      • Children of the Revolution
      • China in World History
      • China's Cosmopolitan Empire
      • China: A New History
      • Clash of Extremes
      • Contours of the World Economy 1-2030 AD
      • Darwin's Cathedral
      • Dawn of Human Culture
      • Deep Ancestry
      • Defenders of the Truth
      • Descartes' Baby
      • Divided by the Faith
      • Dragon Bone Hill
      • Empires and Barbarians
      • Empires of the Silk Road
      • Empires of the Word
      • End of the Bronze Age
      • Endless Forms Most Beautiful
      • Epistasis and Evolutionary Process
      • Europe
      • Europe After Rome
      • Europe Between the Oceans
      • Evolution
      • Evolution and the Genetics of Populations
      • Evolution for Everyone
      • Evolutionary Dynamics
      • Evolutionary Genetics
      • Evolutionary Human Genetics
      • Evolutionary Quantitative Genetics
      • Explaining Culture
      • Fooled By Randomness
      • Fourth Crusade & the Sack of Constantinople
      • Freedom Just Around the Corner
      • From Plato to Nato
      • Genetical Theory of Natural Selection
      • Genetics and Analysis of Quantitative Traits
      • Genetics and Origins of Species
      • Genetics of Populations
      • Genghis Khan & the Making of the Modern World
      • Genome
      • Geography of Thought
      • Global Capitalism
      • God's War
      • Grand New Party
      • Grooming, Gossip, and the Evolution of Language
      • Guns, Germs, and Steel
      • Historical Dynamics
      • History of Rome
      • How Pleasure Works
      • How Rome Fell
      • How We Decide
      • In Gods We Trust
      • In Search of the Trojan War
      • India: A New History
      • Infidels
      • Journey of Man
      • Keepers of the Keys of Heaven
      • Knowledge and the Wealth of Nations
      • Mapping Human History
      • Marketplace of the Gods
      • Mathematical Models in Biology
      • Molecular Evolution
      • Molecular Markers, Natural History, and Evolution
      • Mother Nature
      • Mutants
      • Narrow Roads of Gene Land 1
      • Narrow Roads of Gene Land 2
      • Narrow Roads of Gene Land 3
      • Natural Selection and Social Theory
      • Nature via Nurture
      • No Two Alike
      • Of Moths and Men
      • Origin and Evolution of Cultures
      • Origins of Theoretical Population Genetics
      • Out of Thin Air
      • Pandora's Seed
      • Plagues and Peoples
      • Population Genetics and Microevolutionary Theory
      • Population Genetics, Molecular Evolution, and the Neutral Theory
      • Postwar
      • Power and Plenty
      • Predictably Irrational
      • Prehistory of the Mind
      • Principles of Population Genetics
      • Pursuit of Glory
      • Quantitative Genetics
      • R.A. Fisher, the Life of a Scientist
      • Reading in the Brain
      • Religion Explained
      • Rome and Jersalem
      • Sailing to Byzantium
      • Sewall Wright and Evolutionary Biology
      • Sociobiology
      • Speciation
      • Statistical Methods in Molecular Evolution
      • Supernatural Selection
      • Survival of the Prettiest
      • Synaptic Self
      • Tempo and Mode in Evolution
      • The 10,000 Year Explosion
      • The Age of Confucian Rule
      • The Age of Lincoln
      • The Altruism Equation
      • The Ancestor's Tale
      • The Ascent of Money
      • The Barbarian Conversion
      • The Black Swan
      • The Blank Slate
      • The Classical World
      • The Creationists
      • The Cultural Origins of Human Cognition
      • The Darwin Wars
      • The Descent of Man
      • The Early Chinese Empires
      • The Essential Difference
      • The Evolutionists
      • The Faith Instinct
      • The Fall of Rome
      • The Fall of the Roman Empire
      • The g Factor
      • The Genetics of Human Populations
      • The Germanization of Early Medieval Christianity
      • The Great Arab Conquests
      • The Great Divergence
      • The Great Human Diasporas
      • The Great Upheaval
      • The History and Geography of Human Genes
      • The Horse, the Wheel, and Language
      • The Human Web
      • The Imitation Factor
      • The Invisible Gorilla
      • The Language Instinct
      • The Making of a Christian Aristoracy
      • The Math Gene
      • The Mating Mind
      • The Meme Machine
      • The Moral Animal
      • The Number Sense
      • The Nurture Assumption
      • The Origin of Species
      • The Origin Of The Mind
      • The Origins of Virtue
      • The Power of Babel
      • The Price of Altruism
      • The Red Queen
      • The Reformation
      • The Rise of Western Christendom
      • The Sacred Chain
      • The Selfish Gene
      • The Seven Daughters of Eve
      • The Stuff of Thought
      • The Symbolic Species
      • The Tenth Parallel
      • The Troubled Empire
      • The Vertigo Years
      • The Vikings
      • Throes of Democracy
      • Unknown Quantity
      • Unto Others
      • War and Peace and War
      • War, Wine, and Taxes
      • We Are Doomed
      • Wealth and Poverty of Nations
      • What Hath God Wrought
      • When Baghdad Ruled the Muslim World
      • When Genius Failed
      • Why Sex Matters
      • Why Some Like It Hot
    • Elsewhere on DISCOVER

      RSS Genetics in DISCOVER mag

      Genetics in DISCOVER

      • Can Stuffing Germs up Ferrets Unleash a Human Pandemic?
      • 20 Things You Didn't Know About... Allergies
      • The Brain: Hidden Epidemic: 
Tapeworms Living Inside People's Brains
      • The Hagfish's Special Trick for Warding Off Predators: Thick, Sticky Mucus
      • The Big, Overlooked Factor in the Rise of Pandemics: The Human Vector
      • Does Rain Come From Life in the Clouds?
      • Gallery | 6 Creepy-Crawlies We Hate But Couldn't Do Without
      • Plants Repel Bacteria's Assaults by Spying on Their Chatter
    • Gene Expression content

      RSS Recent Posts

      Recent Posts

      • A quick note on comments policy
      • An Orientalist fantasy
      • Vaccination as heterodoxy
      • Hispanos and Sephardic ancestry
      • Are Hispanics that socially conservative?
      • The utility and reality of species
      • The American Community Survey: mend it, don’t end it!
      • GEDmatch
      Categories

      Categories

      • Administration
      • Agriculture
      • Anthroplogy
      • Ask a ScienceBlogger
      • Barbarism
      • Behavior Genetics
      • Bioethics
      • Biology
      • Biotech
      • Blog
      • Books
      • Cognitive Science
      • Creationism
      • Culture
      • Data Analysis
      • Demographics
      • Development
      • Ecology
      • Economics
      • Education
      • Environment
      • Evolution
      • Evolutionary Genetics
      • Evolutionary Psychology
      • Fantasy
      • Food
      • Futurism
      • Genetics
      • Genomics
      • Geography
      • GSS
      • Health
      • History
      • Human Evolution
      • Human Evolutionary Genetics
      • Human Evolutionary Genomics
      • Human Genetics
      • Human Genomics
      • International Affairs
      • Linguistics
      • Medicine
      • Paleontology
      • Personal Genomics
      • philosophy
      • Politics
      • Population Genetics
      • Psychology
      • Quantitative Genetics
      • Race
      • Religion
      • Science
      • Science Fiction
      • Select
      • Social Science
      • Space
      • Sports
      • Statistics
      • Technology
      • Transhumanism
      • Uncategorized
      Archives

      Archives

      • May 2012
      • April 2012
      • March 2012
      • February 2012
      • January 2012
      • December 2011
      • November 2011
      • October 2011
      • September 2011
      • August 2011
      • July 2011
      • June 2011
      • May 2011
      • April 2011
      • March 2011
      • February 2011
      • January 2011
      • December 2010
      • November 2010
      • October 2010
      • September 2010
      • August 2010
      • July 2010
      • June 2010
      • May 2010
      • April 2010
      • March 2010
      • February 2010
      • January 2010
      • December 2009
      • November 2009
      • October 2009
      • September 2009
      • August 2009
      • July 2009
      • June 2009
      • May 2009
      • April 2009
      • March 2009
      • February 2009
      • January 2009
      • December 2008
      • November 2008
      • October 2008
      • September 2008
      • August 2008
      • July 2008
      • June 2008
      • May 2008
      • April 2008
      • March 2008
      • February 2008
      • January 2008
      • December 2007
      • November 2007
      • October 2007
      • September 2007
      • August 2007
      • July 2007
      • June 2007
      • May 2007
      • April 2007
      • March 2007
      • February 2007
      • January 2007
      • December 2006
      • November 2006
      • October 2006
      • September 2006
      • August 2006
      • July 2006
      • June 2006
      • May 2006
      • April 2006
      • March 2006
      • February 2006
      • January 2006
    • Meta

      • Log in
      • Entries RSS
      • Comments RSS
      • WordPress.org
    • RSS Razib’s Pinboard Feed

      • Abortion polls, gay marriage polls: Why are we becoming liberal on some issues but not others? - Slate Magazine
      • At CUNY’s Top Colleges, Black and Hispanic Freshmen Enrollments Drop - NYTimes.com
      • Megafaunal Extinctions
      • New Details Are Released in Shooting of Trayvon Martin - NYTimes.com
      • White American babies are now in the minority. Why does the census divide people by race, anyway? - Slate Magazine
      • When you eat matters, not just what you eat
      • Can You Call a 9-Year-Old a Psychopath? - NYTimes.com
      • A Circle of Tech in Silicon Valley - Collect Payout, Do a Start-Up - NYTimes.com
      • Archaeologists Unearth Ancient Maya Calendar Writing - NYTimes.com
      • Repeat act: Parallel selection tweaks many of the same genes to make big and heavy mice
      • Blond as a window to ancient pigmentation variation
      • Eugenics, Malthusianism, and Trepidation, Bryan Caplan | EconLog | Library of Economics and Liberty
      • Textuality: The Jews Are a Race, Geneticist Says
      • The designer baby factory: Eggs from beautiful Eastern Europeans. Sperm from wealthy Westerners. And embryos implanted in desperate women. | Mail Online
      • Arab Spring Stirs Palestinian Journalists to Test Free Speech Limits - NYTimes.com
      • Barack Obama | Racial Diversity | Civil Rights | 2012 Election | The Daily Caller
      • Could These Start-Ups Become the Next Big Thing? - NYTimes.com
      • Steve Sailer's iSteve Blog: Pym Fortuyn, RIP
      • Never mind Europe; worry about India's economic growth - The Economic Times
      • 9 Swing States, Critical to Presidential Race, Are Mixed Lot - NYTimes.com


  • Kalmbach Publishing Co.

    Copyright © 2012, Kalmbach Publishing Co.

    Privacy - Terms - Reader Services - Subscribe Today - Advertise - About Us