With 14 MDS dimensions retained, there were 50 clusters inferred in the optimal solution by MCLUST.
The results spreadsheet has rows for the 54 project participants in the first rows: each row is the probability that you belong to a particular cluster. This is followed by the reference populations where each row has the number of individuals (for that populations) that is assigned to a particular cluster.
There are also some outliers in this analysis:
FFD002 FFD004 FFD007 FFD012 FFD015 FFD016 FFD021 FFD022 FFD023 FFD038 FFD046
Check what an outlier is in the context of this analysis, and what it means.
Interestingly, because of the smaller number of Family Finder participants some previously defined clusters (for 23andMe data) such as the "Finnish" cluster do not appear here. This is not surprising at all, because for a cluster to be defined several individuals from that population must be present in the data.
Many continental Europeans of this type ended up in cluster #2. Some others, like FFD048 who is Lithuanian were assigned to the proper cluster #9, centered on Lithuanians.
This underscores the importance of having more people join the Project at the next available opportunity. This will not only create new clusters for individuals who are currently the only representatives of their populations, but it may also split already existing clusters if regional sub-populations are detected.
It is also important for project participants to drop a note at the ancestry thread, to help others make better sense of their results.