Friday, April 29, 2011

K=12 ADMIXTURE results for selected participants

In comparison to the previous K=11 run:
  • I have included some more general populations, e.g., Italian_D and Scandinavian_D, so that some people who did not fit in the previous populations, e.g., Danes or North Italians could be included
  • The number of individuals is 1,010 now, the number of populations 65
  • I have upped the number of markers to ~173k after linkage-disequilibrium pruning
  • I am continuing to play around with ways to frame West Eurasia, so now I have included Pakistan_H, North Kannadi and Sakilli for South Asia, She, Miaozu, Chinese, Yakut, and Selkup for North/Eastern Eurasia, and Yoruba, Maasai, Bantu, Ethiopians, and Ethiopian Jews for Africa
The previous analysis had stopped at K=11 because a couple of Iraqi Jews formed a spurious "family" cluster. On inspection, these were a couple of likely first-degree relatives, so I removed one of them to proceed.

The clusters of the previous run were more or less recreated, but please check the table of Fst distances to see how the different names are related to each other. The new addition at K=12 is the split of East Eurasians into East Asian and North Eurasian. The latter is centered on the Uralic Selkup and the Altaic Yakut.

I would say that this is a substantial improvement over the standard K=10 analysis of the Project, as:
  • Two main components (North and South European) have been replaced by four new ones (NW/NE European, Sardinian, Basque) that have interesting distributions.
  • The five "framing" components (Sub-Saharan, E African, S Asian, N Eurasian, E Asian) correspond largely to the pre-existing ones, but with more diverse framing populations to make them a little better defined.
The search continues...

Admixture proportion and individual results can be found here. Population portraits from here or here.

PS: As I've noticed before, at this level of resolution "noise" becomes a real problem, as evidenced by the emergence of a few tenths of a percent of components where one might not expect them.


  1. I think the Sardinian to Basque ratios are really significant. I expect that like Sephardim, Ashkenazim will lean toward the Sardinian pole.

  2. Charles,

    I would be pretty sure of that too!

    My Sicilian Grandmother-in-law DOD097 has the following:
    Sardinian 30.58%
    Basque 0.85%

    She seems to have some Sephardic ancestry too.

  3. How do you discern which bar you are in your respective population portrait? The Indian_D one for example -

    Is there a spreadsheet that lists the numbers against the Dodecad ID, perhaps?

  4. Individual results are in the second tab of the spreadsheet.

  5. hi and thanks for the study I am user ashraf
    The west asian/southwest asian ration in eastern and southern europeans seems like showing an armenian origin for that wave.
    The north africans are not homegenous and should be split-I think-into egyptians+moroccans (already done) and also libyans and algerotunisians
    Btw you can include libyans from The Libyans from the Henn et all public dataset, there was much greek colonization in eastern libya (pentapolis) and also western libya (tripolis)

  6. Yes, I am aware of that. It would be nice to have an image of the individual admixture bar, too so I was wondering which bar represented which DOD. Thanks either way.

  7. @dienekes

    I wanted to see dod307 in the individual results for this but do you need the person's, swissgirl, permission? Also is that still the only Swiss person you have?

  8. Only populations with 5+ individuals are included in this.

  9. So Dienekes, which bar is which DOD-ID?

  10. As I said countless times, I never identify DODs with ethnic groups as per the Project's policy.