- I have included some more general populations, e.g., Italian_D and Scandinavian_D, so that people who did not fit in the previous populations, e.g., Danes or North Italians, can be included
- There are now 1,010 individuals in 65 populations
- I have upped the number of markers to ~173k after linkage-disequilibrium pruning
- I am continuing to play around with ways to frame West Eurasia, so now I have included Pakistan_H, North Kannadi, and Sakilli for South Asia; She, Miaozu, Chinese, Yakut, and Selkup for North/Eastern Eurasia; and Yoruba, Maasai, Bantu, Ethiopians, and Ethiopian Jews for Africa
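For readers unfamiliar with linkage-disequilibrium pruning, here is a minimal sketch of the general idea: walk along the markers and drop any marker that is too strongly correlated with a nearby marker already kept. The function names, the window size, and the r² threshold are all illustrative assumptions; the post does not state which tool or settings were used, so this is not necessarily the procedure behind the ~173k figure.

```python
from statistics import mean


def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors (0/1/2 allele counts)."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic marker: correlation undefined, treat as unlinked
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return (sxy * sxy) / (sxx * syy)


def ld_prune(genotypes, window=50, r2_threshold=0.4):
    """Greedy pruning over markers in genomic order.

    genotypes: list of markers, each a list of per-individual allele counts.
    window and r2_threshold are assumed values, chosen only for illustration.
    Returns the indices of the markers that are kept.
    """
    kept = []
    for i, g in enumerate(genotypes):
        # Only compare against kept markers within the preceding window.
        recent = [j for j in kept if i - j <= window]
        if all(r_squared(g, genotypes[j]) < r2_threshold for j in recent):
            kept.append(i)
    return kept


# Toy data: marker 1 duplicates marker 0, marker 2 is only weakly correlated.
toy = [
    [0, 1, 2, 1, 0, 2],
    [0, 1, 2, 1, 0, 2],
    [2, 0, 1, 1, 0, 2],
]
kept_markers = ld_prune(toy)
```

In the toy data the duplicated marker is dropped and the other two are retained, which is the behavior pruning is meant to have: fewer, more nearly independent markers.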
The clusters of the previous run were more or less recreated, but please check the table of Fst distances to see how the different names are related to each other. The new addition at K=12 is the split of East Eurasians into East Asian and North Eurasian. The latter is centered on the Uralic Selkup and the Altaic Yakut.
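As a rough illustration of what a table of Fst distances between components summarizes, the sketch below computes a simple pairwise Fst from per-marker allele frequencies, using a ratio-of-sums form with no sample-size correction. The component labels and frequencies are made-up toy values, and the estimator is an assumption chosen for simplicity; the table referred to above may have been produced differently.

```python
def fst(p1, p2):
    """Pairwise Fst from two lists of per-marker allele frequencies.

    Per marker: numerator (p1 - p2)^2, denominator p1(1 - p2) + p2(1 - p1).
    The sums are taken over markers before dividing (ratio of sums).
    """
    num = sum((a - b) ** 2 for a, b in zip(p1, p2))
    den = sum(a * (1 - b) + b * (1 - a) for a, b in zip(p1, p2))
    return num / den if den > 0 else 0.0


def fst_table(freqs):
    """Build a symmetric {(name_a, name_b): fst} table for all component pairs."""
    names = list(freqs)
    return {(a, b): fst(freqs[a], freqs[b]) for a in names for b in names}


# Hypothetical components with toy allele frequencies at four markers.
toy_freqs = {
    "comp_A": [0.10, 0.50, 0.80, 0.30],
    "comp_B": [0.15, 0.45, 0.75, 0.35],
    "comp_C": [0.90, 0.10, 0.20, 0.70],
}
table = fst_table(toy_freqs)
```

Components with similar allele frequencies (comp_A and comp_B in the toy data) get a small Fst, while strongly differentiated ones (comp_A and comp_C) get a large one, which is how the table helps relate differently named components across runs.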
I would say that this is a substantial improvement over the standard K=10 analysis of the Project, as:
- Two main components (North and South European) have been replaced by four new ones (NW/NE European, Sardinian, Basque) that have interesting distributions.
- The five "framing" components (Sub-Saharan, E African, S Asian, N Eurasian, E Asian) correspond largely to the pre-existing ones, but with more diverse framing populations to make them a little better defined.
Admixture proportions and individual results can be found here. Population portraits are available here or here.
PS: As I've noticed before, at this level of resolution "noise" becomes a real problem, as evidenced by the emergence of a few tenths of a percent of components where one might not expect them.
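One simple way to keep this in mind when reading individual results is to flag very small components rather than interpret them. The sketch below does that for a single set of proportions; the 1% cutoff, the function name, and the component labels are assumptions for illustration only, not a threshold used by the Project.

```python
def flag_noise(proportions, threshold=0.01):
    """Return the non-zero components that fall below the threshold.

    proportions: {component_name: fraction}, fractions summing to about 1.
    threshold: assumed cutoff (1% here) below which a component is
    treated as possible noise rather than real ancestry.
    """
    return {name: value for name, value in proportions.items() if 0 < value < threshold}


# Toy individual with hypothetical component labels.
toy_individual = {
    "comp_1": 0.620,
    "comp_2": 0.375,
    "comp_3": 0.003,
    "comp_4": 0.002,
}
suspect = flag_noise(toy_individual)
```

Flagging rather than zeroing keeps the original proportions intact, so the reader can still decide case by case whether a small component is plausible.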