<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Analyzing ancestry with ADMIXTURE, step by step</title>
	<atom:link href="http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/</link>
	<description></description>
	<lastBuildDate>Sat, 18 May 2013 12:05:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.2</generator>
	<item>
		<title>By: Input determining output in ADMIXTURE &#124; Gene Expression &#124; Discover Magazine</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31307</link>
		<dc:creator>Input determining output in ADMIXTURE &#124; Gene Expression &#124; Discover Magazine</dc:creator>
		<pubDate>Mon, 21 Mar 2011 22:39:11 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31307</guid>
		<description>[...] reason I posted about how to run ADMIXTURE was that the more readers themselves were familiar with the biases of the program the more they [...] </description>
		<content:encoded><![CDATA[<p>[...] reason I posted about how to run ADMIXTURE was that the more readers themselves were familiar with the biases of the program the more they [...] </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Antonio</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31306</link>
		<dc:creator>Antonio</dc:creator>
		<pubDate>Mon, 21 Mar 2011 18:28:04 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31306</guid>
		<description>&quot;Do people ever make pie charts for these things?&quot; what else could be worse than that?</description>
		<content:encoded><![CDATA[<p>&#8220;Do people ever make pie charts for these things?&#8221; what else could be worse than that?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jean</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31305</link>
		<dc:creator>Jean</dc:creator>
		<pubDate>Mon, 21 Mar 2011 01:17:50 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31305</guid>
		<description>Is the dataset listed above complete or might there be other popuations that were not included?

Thanks</description>
		<content:encoded><![CDATA[<p>Is the dataset listed above complete or might there be other popuations that were not included?</p>
<p>Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: biologist</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31304</link>
		<dc:creator>biologist</dc:creator>
		<pubDate>Fri, 18 Mar 2011 08:12:17 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31304</guid>
		<description>&lt;i&gt;2) What information is actually contained within these files? ADMIXTURE’s page says it’s “multilocus SNP genotype datasets.” What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms? Are these standard points for examination? How many points per individual?&lt;/i&gt;

Each SNP is a single base-pair in the genome at which two alternative versions are measured. Given two possible alleles (say A and B), there are three possible genotypes  (AA, AB, BB). The data are genotypes.

The chromosome and base-pair coordinate is given in the map file. These coordinates are with respect to a genome build, such as hg18. http://genome.ucsc.edu/cgi-bin/hgGateway?db=hg18

These SNPs were likely selected because they are otherwise well characterized, so in essence they are standardized.</description>
		<content:encoded><![CDATA[<p><i>2) What information is actually contained within these files? ADMIXTURE’s page says it’s “multilocus SNP genotype datasets.” What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms? Are these standard points for examination? How many points per individual?</i></p>
<p>Each SNP is a single base-pair in the genome at which two alternative versions are measured. Given two possible alleles (say A and B), there are three possible genotypes  (AA, AB, BB). The data are genotypes.</p>
<p>The chromosome and base-pair coordinate is given in the map file. These coordinates are with respect to a genome build, such as hg18. <a href="http://genome.ucsc.edu/cgi-bin/hgGateway?db=hg18" rel="nofollow">http://genome.ucsc.edu/cgi-bin/hgGateway?db=hg18</a></p>
<p>These SNPs were likely selected because they are otherwise well characterized, so in essence they are standardized.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: biologist</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31303</link>
		<dc:creator>biologist</dc:creator>
		<pubDate>Fri, 18 Mar 2011 08:06:50 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31303</guid>
		<description>&lt;i&gt;1) What does “pedigree formatted” mean?&lt;/i&gt;

http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#ped

&lt;i&gt;Are these beds, bims, and fams in a binary or ascii format? I assume binary because I wasn’t able to open them in a text editor.&lt;/i&gt;

bed and bim are binary

http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#bed

&lt;i&gt;Besides ADMIXTURE (which I can’t use because like most people in the world, I use Windows :) ), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?&lt;/i&gt;

you can convert bed/bim to ped/map using plink (see links above)</description>
		<content:encoded><![CDATA[<p><i>1) What does “pedigree formatted” mean?</i></p>
<p><a href="http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#ped" rel="nofollow">http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#ped</a></p>
<p><i>Are these beds, bims, and fams in a binary or ascii format? I assume binary because I wasn’t able to open them in a text editor.</i></p>
<p>bed and bim are binary</p>
<p><a href="http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#bed" rel="nofollow">http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#bed</a></p>
<p><i>Besides ADMIXTURE (which I can’t use because like most people in the world, I use Windows <img src='http://blogs.discovermagazine.com/gnxp/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?</i></p>
<p>you can convert bed/bim to ped/map using plink (see links above)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: iron0037</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31302</link>
		<dc:creator>iron0037</dc:creator>
		<pubDate>Wed, 16 Mar 2011 20:22:16 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31302</guid>
		<description>Hi Razib,
Thank you for the quick answers. I hadn&#039;t made it to your next post at the time of my comment! 27000 SNPs huh?  That&#039;s quite a bit of data. I see that I can open the bim and fam files in a text editor, so there&#039;s hope I can import all of it into MATLAB and play with it. I&#039;ll have to read your posts more carefully and do some investigation as to the meaning of each row of the file. We&#039;ll see if anything comes of it.

Cheers</description>
		<content:encoded><![CDATA[<p>Hi Razib,<br />
Thank you for the quick answers. I hadn&#8217;t made it to your next post at the time of my comment! 27000 SNPs huh?  That&#8217;s quite a bit of data. I see that I can open the bim and fam files in a text editor, so there&#8217;s hope I can import all of it into MATLAB and play with it. I&#8217;ll have to read your posts more carefully and do some investigation as to the meaning of each row of the file. We&#8217;ll see if anything comes of it.</p>
<p>Cheers</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Razib Khan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31301</link>
		<dc:creator>Razib Khan</dc:creator>
		<pubDate>Wed, 16 Mar 2011 18:08:41 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31301</guid>
		<description>&lt;i&gt;Besides ADMIXTURE (which I can’t use because like most people in the world, I use Windows  ), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?&lt;/i&gt;

no idea about windows. .bed is binary. the text version is .ped.

&lt;i&gt;What information is actually contained within these files? ADMIXTURE’s page says it’s “multilocus SNP genotype datasets.” What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms? Are these standard points for examination? How many points per individual?&lt;/i&gt;

the data have snps and individuals. also stuff like sex, phenotype, etc., but that&#039;s extraneous for our purpose. in these data 27,000 snps per individual.

&lt;i&gt; Or to put it another way, it’s creating K fictitious populations that are as genetically distinct as possible, and showing the proportion each modern population has of these fictitious populations?&lt;/i&gt;

yes

&lt;i&gt;4) Is it possible to specify that one of your K populations is completely one of the modern populations? Could you say, as an example, that you want the French and the Moroccans to show up as 100% one of the resultant clusters?&lt;/i&gt;

yes. see follow up post.

&lt;i&gt;These bar charts that you end up with are rather hard to follow. Do people ever make pie charts for these things? How about a map of the Earth with the ranges of each of the K populations? You can clearly see in your K=12 map, for example, that the pink population is most prevalent in northeastern Europe and dissipates moving outward from there.&lt;/i&gt;

yes. i have thought of that. need to get fluid with R&#039;s map functions. i don&#039;t want to learn GIS. the thematic heat maps are easy with paid software, but i&#039;m not going to shell out $400 for that :)</description>
		<content:encoded><![CDATA[<p><i>Besides ADMIXTURE (which I can’t use because like most people in the world, I use Windows  ), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?</i></p>
<p>no idea about windows. .bed is binary. the text version is .ped.</p>
<p><i>What information is actually contained within these files? ADMIXTURE’s page says it’s “multilocus SNP genotype datasets.” What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms? Are these standard points for examination? How many points per individual?</i></p>
<p>the data have snps and individuals. also stuff like sex, phenotype, etc., but that&#8217;s extraneous for our purpose. in these data 27,000 snps per individual.</p>
<p><i> Or to put it another way, it’s creating K fictitious populations that are as genetically distinct as possible, and showing the proportion each modern population has of these fictitious populations?</i></p>
<p>yes</p>
<p><i>4) Is it possible to specify that one of your K populations is completely one of the modern populations? Could you say, as an example, that you want the French and the Moroccans to show up as 100% one of the resultant clusters?</i></p>
<p>yes. see follow up post.</p>
<p><i>These bar charts that you end up with are rather hard to follow. Do people ever make pie charts for these things? How about a map of the Earth with the ranges of each of the K populations? You can clearly see in your K=12 map, for example, that the pink population is most prevalent in northeastern Europe and dissipates moving outward from there.</i></p>
<p>yes. i have thought of that. need to get fluid with R&#8217;s map functions. i don&#8217;t want to learn GIS. the thematic heat maps are easy with paid software, but i&#8217;m not going to shell out $400 for that <img src='http://blogs.discovermagazine.com/gnxp/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: iron0037</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31300</link>
		<dc:creator>iron0037</dc:creator>
		<pubDate>Wed, 16 Mar 2011 15:41:01 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31300</guid>
		<description>Thanks for this useful post. I am an engineer with a casual interest in genetics. So I have a decent math background but understand little about the domain.  Up til now, all these admixture analyses have been like black magic to me. Now I at least have a bit better of an understanding.  And yet, your post begs more questions...

1) What does &quot;pedigree formatted&quot; mean? Are these beds, bims, and fams in a binary or ascii format? I assume binary because I wasn&#039;t able to open them in a text editor. Besides ADMIXTURE (which I can&#039;t use because like most people in the world, I use Windows :)), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?

2) What information is actually contained within these files? ADMIXTURE&#039;s page says it&#039;s &quot;multilocus SNP genotype datasets.&quot; What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms?  Are these standard points for examination?  How many points per individual?

3) If I understand correctly, ADMIXTURE is doing is a cluster analysis? It&#039;s crunching the data, finding the eigenvectors for the given K, and displaying the vector components of each population mapped onto the eigenvectors? Or to put it another way, it&#039;s creating K fictitious populations that are as genetically distinct as possible, and showing the proportion each modern population has of these fictitious populations?

4) Is it possible to specify that one of your K populations is completely one of the modern populations? Could you say, as an example, that you want the French and the Moroccans to show up as 100% one of the resultant clusters?

5) These bar charts that you end up with are rather hard to follow. Do people ever make pie charts for these things? How about a map of the Earth with the ranges of each of the K populations? You can clearly see in your K=12 map, for example, that the pink population is most prevalent in northeastern Europe and dissipates moving outward from there.</description>
		<content:encoded><![CDATA[<p>Thanks for this useful post. I am an engineer with a casual interest in genetics. So I have a decent math background but understand little about the domain.  Up til now, all these admixture analyses have been like black magic to me. Now I at least have a bit better of an understanding.  And yet, your post begs more questions&#8230;</p>
<p>1) What does &#8220;pedigree formatted&#8221; mean? Are these beds, bims, and fams in a binary or ascii format? I assume binary because I wasn&#8217;t able to open them in a text editor. Besides ADMIXTURE (which I can&#8217;t use because like most people in the world, I use Windows <img src='http://blogs.discovermagazine.com/gnxp/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> ), is there any way to read these files? Could I write a custom binary processor? Is there any ascii version of the data? Is that what the csv is?</p>
<p>2) What information is actually contained within these files? ADMIXTURE&#8217;s page says it&#8217;s &#8220;multilocus SNP genotype datasets.&#8221; What the heck is that? What part of the genome are they examining for single nucleotide polymorphisms?  Are these standard points for examination?  How many points per individual?</p>
<p>3) If I understand correctly, ADMIXTURE is doing is a cluster analysis? It&#8217;s crunching the data, finding the eigenvectors for the given K, and displaying the vector components of each population mapped onto the eigenvectors? Or to put it another way, it&#8217;s creating K fictitious populations that are as genetically distinct as possible, and showing the proportion each modern population has of these fictitious populations?</p>
<p>4) Is it possible to specify that one of your K populations is completely one of the modern populations? Could you say, as an example, that you want the French and the Moroccans to show up as 100% one of the resultant clusters?</p>
<p>5) These bar charts that you end up with are rather hard to follow. Do people ever make pie charts for these things? How about a map of the Earth with the ranges of each of the K populations? You can clearly see in your K=12 map, for example, that the pink population is most prevalent in northeastern Europe and dissipates moving outward from there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eurasia, ADMIXTURE supervised &#38; unpservised &#124; Gene Expression &#124; Discover Magazine</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31299</link>
		<dc:creator>Eurasia, ADMIXTURE supervised &#38; unpservised &#124; Gene Expression &#124; Discover Magazine</dc:creator>
		<pubDate>Wed, 16 Mar 2011 07:45:16 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31299</guid>
		<description>[...] RSS                   &#171; Analyzing ancestry with ADMIXTURE, step by step [...] </description>
		<content:encoded><![CDATA[<p>[...] RSS                   &laquo; Analyzing ancestry with ADMIXTURE, step by step [...] </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Garvan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31298</link>
		<dc:creator>Garvan</dc:creator>
		<pubDate>Wed, 16 Mar 2011 05:34:48 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31298</guid>
		<description>Thanks, I downloaded the script using a vista computer on another network. I still have no idea why I can not access it here. But I have it now. Garvan</description>
		<content:encoded><![CDATA[<p>Thanks, I downloaded the script using a vista computer on another network. I still have no idea why I can not access it here. But I have it now. Garvan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31297</link>
		<dc:creator>James</dc:creator>
		<pubDate>Wed, 16 Mar 2011 05:23:16 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31297</guid>
		<description>Thanks a bunch for this....can&#039;t wait to graduate college so I can mess around with this...</description>
		<content:encoded><![CDATA[<p>Thanks a bunch for this&#8230;.can&#8217;t wait to graduate college so I can mess around with this&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Razib Khan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31296</link>
		<dc:creator>Razib Khan</dc:creator>
		<pubDate>Tue, 15 Mar 2011 18:25:23 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31296</guid>
		<description>try &lt;a href=&quot;http://gnxp.com/Rstuff.txt&quot; rel=&quot;nofollow&quot;&gt;Rstuff.txt&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>try <a href="http://gnxp.com/Rstuff.txt" rel="nofollow">Rstuff.txt</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Garvan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31295</link>
		<dc:creator>Garvan</dc:creator>
		<pubDate>Tue, 15 Mar 2011 12:49:22 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31295</guid>
		<description>The Rstuff.R link gives me a 403 Forbidden error message. Could you check the link please?

Thanks,
Garvan</description>
		<content:encoded><![CDATA[<p>The Rstuff.R link gives me a 403 Forbidden error message. Could you check the link please?</p>
<p>Thanks,<br />
Garvan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Moreno</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31294</link>
		<dc:creator>Moreno</dc:creator>
		<pubDate>Tue, 15 Mar 2011 07:41:00 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31294</guid>
		<description>Thanks Razib for this great tutorial! I&#039;ve always wanted to play with this kind of things, and this post will save me a lot of time.</description>
		<content:encoded><![CDATA[<p>Thanks Razib for this great tutorial! I&#8217;ve always wanted to play with this kind of things, and this post will save me a lot of time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RK</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31293</link>
		<dc:creator>RK</dc:creator>
		<pubDate>Tue, 15 Mar 2011 05:24:17 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31293</guid>
		<description>Zack, yeah, the latest version of gtool, which shellfish depends on, seems to be broken. I downgraded to version 0.6.1 and it works fine: http://www.well.ox.ac.uk/~cfreeman/software/gwas/gtool_v0.6.1.html

I&#039;m using Python 2.6.6 as well, on Debian unstable.</description>
		<content:encoded><![CDATA[<p>Zack, yeah, the latest version of gtool, which shellfish depends on, seems to be broken. I downgraded to version 0.6.1 and it works fine: <a href="http://www.well.ox.ac.uk/~cfreeman/software/gwas/gtool_v0.6.1.html" rel="nofollow">http://www.well.ox.ac.uk/~cfreeman/software/gwas/gtool_v0.6.1.html</a></p>
<p>I&#8217;m using Python 2.6.6 as well, on Debian unstable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zack</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31292</link>
		<dc:creator>Zack</dc:creator>
		<pubDate>Tue, 15 Mar 2011 02:31:01 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31292</guid>
		<description>RK: I downloaded shellfish just for that reason, but it gives me (Python 2.6.6 on Ubuntu 10.10) errors in the initial conversion step. Have you been able to get it to work? I haven&#039;t had time to look into it.</description>
		<content:encoded><![CDATA[<p>RK: I downloaded shellfish just for that reason, but it gives me (Python 2.6.6 on Ubuntu 10.10) errors in the initial conversion step. Have you been able to get it to work? I haven&#8217;t had time to look into it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Razib Khan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31291</link>
		<dc:creator>Razib Khan</dc:creator>
		<pubDate>Tue, 15 Mar 2011 01:34:29 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31291</guid>
		<description>hey, anyone can leave a question here. didn&#039;t want 2 overload the post.</description>
		<content:encoded><![CDATA[<p>hey, anyone can leave a question here. didn&#8217;t want 2 overload the post.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Razib Khan</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31290</link>
		<dc:creator>Razib Khan</dc:creator>
		<pubDate>Tue, 15 Mar 2011 01:24:44 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31290</guid>
		<description>yeah, i put the --extract stuff there. also, no issue with flipping when i did a test run. it seemed oriented 23andMe direction. haven&#039;t used shellfish, but i&#039;ll check it out.</description>
		<content:encoded><![CDATA[<p>yeah, i put the &#8211;extract stuff there. also, no issue with flipping when i did a test run. it seemed oriented 23andMe direction. haven&#8217;t used shellfish, but i&#8217;ll check it out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RK</title>
		<link>http://blogs.discovermagazine.com/gnxp/2011/03/analyzing-ancestry-with-admixture-step-by-step/#comment-31289</link>
		<dc:creator>RK</dc:creator>
		<pubDate>Tue, 15 Mar 2011 00:57:32 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/gnxp/?p=10311#comment-31289</guid>
		<description>Wow, this is great -- especially the merged dataset. Thanks for putting it together. I haven&#039;t used the dataset, but shouldn&#039;t you be filtering your 23andMe file for the SNPs found in the merged dataset? After all, the latter has many fewer SNPs than the former. Also, I&#039;m not sure if it&#039;s needed for this dataset, but people should familiarize themselves with how to &lt;a href=&quot;http://pngu.mgh.harvard.edu/~purcell/plink/dataman.shtml&quot; rel=&quot;nofollow&quot;&gt;flip strands&lt;/a&gt; when merging datasets.

Have you ever used &lt;a href=&quot;http://www.stats.ox.ac.uk/~davison/software/shellfish/shellfish.php&quot; rel=&quot;nofollow&quot;&gt;Shellfish&lt;/a&gt; to do PCA? It lets you create &quot;loadings&quot; for reference populations, so that you can see where samples fall on the dimensions generated by the reference data without changing the dimensions themselves. Basically, it&#039;s like the advanced genetic similarity plot on 23andMe. I&#039;m trying to create a loading for the HGDP populations, but now I may as well add these in.</description>
		<content:encoded><![CDATA[<p>Wow, this is great &#8212; especially the merged dataset. Thanks for putting it together. I haven&#8217;t used the dataset, but shouldn&#8217;t you be filtering your 23andMe file for the SNPs found in the merged dataset? After all, the latter has many fewer SNPs than the former. Also, I&#8217;m not sure if it&#8217;s needed for this dataset, but people should familiarize themselves with how to <a href="http://pngu.mgh.harvard.edu/~purcell/plink/dataman.shtml" rel="nofollow">flip strands</a> when merging datasets.</p>
<p>Have you ever used <a href="http://www.stats.ox.ac.uk/~davison/software/shellfish/shellfish.php" rel="nofollow">Shellfish</a> to do PCA? It lets you create &#8220;loadings&#8221; for reference populations, so that you can see where samples fall on the dimensions generated by the reference data without changing the dimensions themselves. Basically, it&#8217;s like the advanced genetic similarity plot on 23andMe. I&#8217;m trying to create a loading for the HGDP populations, but now I may as well add these in.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
