Clustering tools are a great way to quickly identify groups of matches with a relationship to each other.
As we know shared matches are 'clues', whilst shared segments are 'evidence' of a shared ancestor. All cluster tools identify matches that are likely to be clues of common relationships. The best types of clusters in my opinion, are shared segment clusters, however every cluster analysis you do is likely to give you new ideas about how your matches might relate to you.
This post is aimed at identifying where clustering can be undertaken and whether they provide shared match or shared segment clusters. Click on the heading hyperlinks for more information, plus please refer to the blog posts at the end of this page for more information about using each of the different tools.
Using the cluster tool at DNAGedcom requires a subscription which can be taken out for a minimum of one month. The DNAGedcom client provides clusters based on the Collins Leeds Method for AncestryDNA and FamilyTreeDNA.
As AncestryDNA does not provide segment data, the clusters generated are 'shared match' clusters. Generating clusters with matches >30cMs will generally given an indication of shared ancestry, however clusters generated with less than that amount may give false leads with matches potentially sharing different more distant ancestors.
There are now 2 cluster tools at GEDmatch, they require a Tier 1 subscription which can be taken out for a minimum of one month.
Genetic Affairs provides many other useful tools on its site, including AutoKinship, AutoTree and AutoPedigree.
Connected DNA used to provide fabulous network maps of shared match clusters for AncestryDNA, FamilyTreeDNA, and 23andMe. However, as at October 2021 Shelley is taking a break and no orders are being taken at the moment. We hope she will be back soon.
DNA Painter Cluster Auto Painter
You can upload clusters created from DNAGedcom (FTDNA only), My Heritage, GEDmatch and Genetic Affairs to the DNA Painter Cluster Auto Painter to visualise the segments and make notes about the results of your analysis. This allows for a more visual approach to analysing your segment data. The image below has been generated from the output of the AutoSegment at GEDmatch.
RootsFinder is a family tree building and DNA analysis website. The premium level has DNA features for a subscription fee. The triangulation (cluster) view allows you to view your matches in clusters – otherwise known as a network graph.
Ideas for exploring your cluster
* Explore the matches in the cluster and check if the shared matches are also sharing the same segments;
* Are there any triangulated groups within the cluster? Explore these matches first. Remember you can expand each triangulated group by checking your segment data for others not appearing in the cluster report, who may not have met the cluster criteria.
* Are there any 'bridge' matches in the cluster? Use these to help you to find others who match in the same segment area at other sites.
* If the cluster includes matches not triangulating with the core group, explore those segment areas as these may provide additional clues to the possible relationship of those in the cluster.
* Do the genealogy! Build research trees for your matches and revisit your own pedigree to search for the what is in common between the cluster group - surnames, locations, ethnicities?
* Once you have identified a 'side' or MRCA make notes on your master list and at the DNA site for all the matches in the triangulated group.
* You may also wish to allocate a reference number to your Cluster for future reference. Remember all the tools have their own numbering systems which constantly change with each report.
* Make sure you are systematic with your notes so that next time you generate a cluster report you can easily see matches that have been worked on before.
* Don't forget to consider the other side of the chromosome - use what you now know to mark the segments on the opposing side. Explore the other side of the 'specific segment area' for more triangulated groups. By looking at both sides concurrently you can then mark others not triangulating as potentially false positive matches, saving analysis time in the future.
My group numbering system has changed many times over the years, it has been adapted from the Jim Bartlett method. It doesn't need to be as complex as this, but it needs to be meaningful for you.
More information:
Check out these links for useful blogs that may help you interpret how the different cluster sites work:
* The Leeds Method, Dana Leeds, 2018
* What to do after Clustering (You Tube), Dana Leeds, 2024
* DNA Sleuth, 2019
* Walking the Clusters Back, Jim Bartlett 2019
* Cluster Auto Painter, Jonny Perl 2019
* Shared Clustering - A great tool! Jim Bartlett, 2019
* Connecting the Dots, My Heritage 2020
* Fast Ways to Cluster your DNA Matches, Family Locket 2020
* Genetic Affairs Tools and how to use them, Roberta Estes 2020
* Walking Back the Clusters, Veronica Williams 2020. Demonstration of applying Jim's method.
* Using DNA tools to solve a family mystery, Vicki Hails 2020
* Annotating a Cluster Auto Painter Map, Jonny Pearl 2020
* Auto Segment Triangulation Tool at GEDmatch, Roberta Estes 18 Oct 2021
* RootsFinder Network Graphs, Family Locket Oct 2021
* RootsFinder - Making Triangulated Network Graphs, Family Locket Dec 2021
AncestryDNA
Whilst AncestryDNA does not provide chromosome information it is always best to start your analysis there, due to the large numbers in the database and its many pedigrees. AncestryDNA Clustering can be done using the DNAGedcom CLM tool, Shared Clustering Tool and the DNA2 Tree app. You will often find Ancestry testers at other sites, particularly GEDmatch and My Heritage (a growing database with lots of family trees). Always look for 'bridge matches' between the sites as they can help to expand your research pool and help you tie your triangulated groups together with broader shared match clusters.
Veronica Williams
First published: 24 Oct 2021
Last updated: September 2024
2024 NOTE: Since September 2023 many downloads have been unavailable due to restrictions at the DNA companies, this includes the very handy DNA2Tree Tool. GEDmatch has remained available throughout this time and finally FTDNA returned their reports in late August 2024. We hope things will change at My Heritage and 23andMe soon.