{"id":3077,"date":"2013-10-21T14:00:47","date_gmt":"2013-10-21T14:00:47","guid":{"rendered":"http:\/\/dna-explained.com\/?p=3077"},"modified":"2019-07-29T18:21:43","modified_gmt":"2019-07-29T18:21:43","slug":"why-are-my-predicted-cousin-relationships-wrong","status":"publish","type":"post","link":"https:\/\/dna-explained.com\/2013\/10\/21\/why-are-my-predicted-cousin-relationships-wrong\/","title":{"rendered":"Why Are My Predicted Cousin Relationships Wrong?"},"content":{"rendered":"<p>The answer is, because inherited DNA segments do not always follow the 50% rule.\u00a0 I guess maybe no one told them???<\/p>\n<p>Many times, when we receive our autosomal DNA results, we wonder why predicted relationships, particularly distant ones, aren\u2019t accurate.\u00a0 Sometimes people estimated to be 3<sup>rd<\/sup> cousins, or maybe 2<sup>nd<\/sup> to 4<sup>th<\/sup> cousins, turn out to be 6<sup>th<\/sup> cousins, for example.\u00a0 This happens because <a href=\"http:\/\/www.isogg.org\/wiki\/Autosomal_DNA_statistics\">genetic predictions must use math models and averages<\/a>, but our actual DNA doesn\u2019t follow those rules.<\/p>\n<p>Dr. <a href=\"http:\/\/www.clfs.umd.edu\/labs\/mount\/\">Steve Mount<\/a> is an Associate Professor of Cell Biology and Molecular Genetics at the University of Maryland.\u00a0 In February 2011, he wrote an article about his experience submitting his DNA to 23andMe and his experiences matching his cousins.\u00a0 More specifically, he became interested in one particular segment of DNA trackable to a specific ancestor.<\/p>\n<p>He shares these insights.<\/p>\n<ul>\n<li>Distant relatives (4<sup>th<\/sup>\u00a0cousins and beyond) often share no genetic material at all.<\/li>\n<li>It is possible to share a segment with\u00a0very distant relatives.<\/li>\n<li>Sometimes, more distant relationships\u00a0are more likely.<\/li>\n<li>Most of your relatives may be descended\u00a0from a small fraction of your ancestors.<\/li>\n<\/ul>\n<p>In genetic genealogy, people who deal with autosomal DNA spend a lot of time trying to figure out which segments are IBD vs IBS \u2013 Identical by Descent versus Identical by State.\u00a0 In laymen\u2019s terms, identical by descent means that you do in fact share a common ancestor in a timeframe in which you might be able to identify them.\u00a0 Identical by state really implies, technically, that you just happen to have the same DNA due to spontaneous mutations, not because you share a common ancestor.\u00a0 In reality, it\u2019s taken to mean that you descend from a common population &#8211;\u00a0 in other words, you do share a common ancestor but the segment is so small that it implies that the ancestor is so far back in time that you can\u2019t possibly identify them. \u00a0Some people call these matches \u201cfalse positives\u201d which really isn\u2019t accurate.<\/p>\n<p>Far from being useless, these small segments are very useful in <a href=\"http:\/\/dna-explained.com\/2013\/06\/02\/the-autosomal-me-summary-and-pdf-file\/\">identifying different ethnic populations<\/a> found in your ancestral tree and can, often in conjunction with larger segments also be useful in identifying ancestral lines.\u00a0 Discounting small segments, especially if you share a common ancestor, is akin to throwing away pennies because they aren\u2019t as useful and are more difficult to manage than quarters or dollars.\u00a0 Furthermore, small segments may be our only way of identifying ancestors that are many generations back in our tree.\u00a0 After all, we inherited all of our DNA from some ancestor, no matter how small the segments are today.<\/p>\n<p>Because we have no better rule of thumb (or statistical model), we utilize the theory that one inherits about 50% of the DNA of each ancestor in each generation.\u00a0 We know this is absolutely true between Mom and Dad, but you don\u2019t receive exactly 25% of each of your grandparents\u2019 DNA.\u00a0 However, the mixture of what and how much of your grandparents\u2019 DNA you do inherit is approximately 25% and appears to be random, like a card shuffle.\u00a0 If it\u2019s not random, we don\u2019t know what the rules of inheritance are.<\/p>\n<p>In the past few years, as we\u2019ve come to work more closely with autosomal results, we have learned that while the rules of thumb about how much DNA you inherit from specific ancestors are useful, they are not absolute.\u00a0 In other words, it\u2019s certainly possible to inherit a very large chunk of DNA from a very specific distant ancestor when the rules of probability and the rule of thumb of 50% would indicate that you should not.<\/p>\n<p>This is shown clearly in the Vannoy project where 5 cousins who descend from Elijah Vannoy born in 1786 (5 generations removed) share a very\u00a0significant portion of chromosome 15.\u00a0 These people are all 5 generations or more distantly related from the common ancestor, (approximate 4<sup>th<\/sup> cousins) and should share less than 1% of their DNA in total, and certainly no large, unbroken\u00a0segments.\u00a0 \u00a0As you can see, below, that\u2019s not the case.\u00a0 We don\u2019t know why or how some DNA clumps together like this and is transmitted in complete (or nearly complete) segments, but they obviously are.\u00a0 We often call these \u201csticky segments\u201d for lack of a better term.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/dna-explained.com\/wp-content\/uploads\/2013\/10\/cousin-1.png\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/dna-explained.com\/wp-content\/uploads\/2013\/10\/cousin-1.png?resize=584%2C820\" alt=\"cousin 1\" width=\"584\" height=\"820\" \/><\/a><\/p>\n<p>I downloaded this information into a spreadsheet where I can sort it by chromosome.\u00a0 Below you can see the segments on chromosome 15 where these cousins match me.\u00a0 Note that Buster is also a cousin from a second ancestor.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/dna-explained.com\/wp-content\/uploads\/2013\/10\/cousin-2.png\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/i0.wp.com\/dna-explained.com\/wp-content\/uploads\/2013\/10\/cousin-2.png?resize=444%2C290\" alt=\"cousin 2\" width=\"444\" height=\"290\" \/><\/a><\/p>\n<p>Given these incidental discoveries and the very large amount of DNA I share with these cousins on chromosome 15, I was quite interested in Dr. Mount\u2019s following commentary:<\/p>\n<p>\u201cThe probability that fourth cousins share at least one IBD [identical by descent] segment is 77%, and the expected length of this segment is 10 cM.&#8221; Now consider the next step. There is a 50% chance that that one shared segment will not be transmitted at all, but a 90% chance that if it is transmitted it will be <strong>just as big as it was<\/strong> (the same 10 cM.). What this means for genealogy on 23andMe is that for two people sharing one segment identical by descent there is no way to reliably estimate how far back the common ancestor was. Furthermore, no improvement in software can possibly change that, because the limitation is imposed by the genetics itself.\u201d<\/p>\n<p>Well, there goes the 50% rule &#8211; flying right out the window.\u00a0 The 50% rule of thumb says that in any given transmission, there is a 50% chance that it will be transmitted (so good so far) and that if it is transmitted, roughly half of it would be transmitted, or approximately 5 cM..\u00a0 That&#8217;s obviously not what is happening.<\/p>\n<p>Dr. Mount goes on to say that, \u201cNo matter how far back you go, every nucleotide of one&#8217;s genome is derived from some ancestor, and even going back 20 generations, the chance that the bit which has been inherited is part of a block 5 cM. or greater is still appreciable. In fact, even for 19th cousins, there is a real chance (13%) that any segment of DNA they have inherited in common will be 5 cM. or greater. Of course, as mentioned above, there is very little chance that two 19th cousins will share any IBD segments at all, but this is offset if one has many 19th cousins, which is often the case.\u201d<\/p>\n<p>5cM is the line-in-the-sand cutoff number many genetic genealogists use to determine whether DNA segments are IBD or IBS.<\/p>\n<p>What this really means is that the more distant, or 19<sup>th<\/sup>, cousins that you have, the greater the chance that one or more of them will test and\u00a0will indeed share a piece of DNA large enough to be identified by the testing companies as relevant.\u00a0 The software companies\u00a0will then apply their relationship estimating software to the size of the match and number of SNPs.\u00a0 The results are often inaccurate, as Dr. Mount says.\u00a0 Not inaccurate in that the match is incorrect, but the estimated relationship\u00a0is incorrect because the DNA did not divide in half as the mathematical model says it should.\u00a0 The &#8220;problem&#8221; is not in the software, but in the DNA itself.<\/p>\n<p>\u201c23andMe reports a &#8220;predicted relationship&#8221; (e.g. &#8220;4th cousin&#8221;) and a &#8220;relationship range&#8221; (e.g. &#8220;3rd to 7th cousin&#8221;). However, these ranges are likely to be wildly inaccurate, because the likely distance to a common ancestor, given only the information that two people share a single IBD segment, can vary enormously, based largely on how many relatives one has.\u201d<\/p>\n<p>And I will add, it will also vary by how and how much the DNA has or has not divided in every generation.<\/p>\n<p>Dr. Mount goes on to provide the math and probability formulas for these various calculations, and explains what they mean, in English, then he summarizes by saying, \u201c<\/p>\n<p>\u201cThus, if you have many more distant cousins, as would be expected if your ancestors had large families, then someone who shares a single IBD segment is more likely to be a distant cousin, because you have so many more distant cousins. The point where the increase in the number of cousins outweighs the loss of shared segments is five children per family. This is not extremely uncommon.\u201d<\/p>\n<p>This actually makes a lot of sense when I look at my results.\u00a0 One of my ancestors, Abraham Estes (1647-1720) had at least 12 children of which 11 reproduced and had very large families.\u00a0 This line was extremely prolific.\u00a0 Many of my autosomal matches include Estes descendants.\u00a0 Some of my other lines where my ancestor was one of just a few children have far fewer matches, likely because there are far fewer people out there descended from them.<\/p>\n<p>Dr. Mount confirms this by saying that, \u201cIf one family among [your] 32 [great-great-great-grandparents] had five children and their descendants did as well, while others in the family reproduced at replacement rates (two children per family), then your more prolific ancestors (the parents of just one of your 31 great-great-grandparents) would account for over 3\/4 of your fourth cousins.\u201d<\/p>\n<p>So what is the take away message to us from all of this?<\/p>\n<ul>\n<li>The autosomal testing companies are doing the best they can predicting your cousin-level relationships with what they have to work with.<\/li>\n<li>Real life genetic transmission does not follow the 50% rule of thumb beyond the first generation (parent-child).<\/li>\n<li>The predictions get more uncertain and therefore unreliable the more distant they are.<\/li>\n<li>Based on the unmeasureable randomness of the genetic transmission involved, there is no way for the testing companies to improve their predictions.<\/li>\n<li>Expect more matches to your more prolific lines, and less to lines who had fewer children.<\/li>\n<li>Beyond about the first or second cousin level, understand that predictions are only suggestions based on math.\u00a0 Given that you understand why and how reality can vary, you can then utilize this information when analyzing your matches.<\/li>\n<li>Drawing an arbitrary cM line for IBS vs IBD and utilizing only the segments above that threshold may eliminate the small segments you need to identify ancestors many generations removed.<\/li>\n<li>Endogamous populations throw a monkey wrench into estimates and calculations, because population members are likely related many times over in unknown ways.\u00a0 This makes the estimate of relatedness of two people\u00a0appear closer than it is genealogically.\u00a0 At least one of the testing companies, Family Tree DNA, attempts to correct for this mathematically when they are aware of the situation, such as in Jewish families.<\/li>\n<\/ul>\n<p>You can read Dr. Mount\u2019s article including his mathematical proofs, <a href=\"http:\/\/ongenetics.blogspot.com\/2011\/02\/genetic-genealogy-and-single-segment.html?m=1\">here.<\/a><\/p>\n<p>______________________________________________________________<\/p>\n<p><strong>Disclosure<\/strong><\/p>\n<p>I receive a small contribution when you click on some of the links to vendors in my articles. This does NOT increase the price you pay but helps me to keep the lights on and this informational blog free for everyone. Please click on the links in the articles or to the vendors below if you are purchasing products or DNA testing.<\/p>\n<p>Thank you so much.<\/p>\n<p><strong>DNA Purchases and Free Transfers<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/click.linksynergy.com\/deeplink?id=uLStJfWJvi8&#038;mid=53717&#038;murl=https%3A%2F%2Fwww.familytreedna.com%2F\">Family Tree DNA<\/a><\/li>\n<li><a href=\"https:\/\/www.myheritage.com\/dna?utm_source=partner_dnaexplained&amp;utm_medium=partner&amp;utm_campaign=dnaexplained_nov17_dnasale&amp;tr_funnel=mh+dna\">MyHeritage DNA<\/a> only<\/li>\n<li><a href=\"https:\/\/www.myheritage.com\/health?utm_source=partner_dnaexplained&amp;utm_medium=partner&amp;utm_campaign=dnaexplainedhealth_june19&amp;tr_funnel=health\">MyHeritage DNA plus Health<\/a><\/li>\n<li><a href=\"https:\/\/www.myheritage.com\/dna\/upload?utm_source=partner_dnaexplained&amp;utm_medium=partner&amp;utm_campaign=dnaexplained_mar18_dnaupload&amp;tr_funnel=dna+upload\">MyHeritage FREE DNA file upload<\/a><\/li>\n<li><a href=\"https:\/\/amzn.to\/2DjSkmQ\">AncestryDNA<\/a><\/li>\n<li><a href=\"https:\/\/amzn.to\/2DfWtIr\">23andMe Ancestry<\/a><\/li>\n<li><a href=\"https:\/\/amzn.to\/2XtkAep\">23andMe Ancestry Plus Health<\/a><\/li>\n<li><a href=\"http:\/\/www.anrdoezrs.net\/click-6754800-12748243\">LivingDNA<\/a><\/li>\n<\/ul>\n<p><strong>Genealogy Services<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/ftbfamilytreebuilder.com\/home?utm_source=partner_dnaexplained&amp;utm_medium=partner&amp;utm_campaign=dnaexplained_jan19_ftb&amp;tr_adgroup=_ftb&amp;utm_content=funnel=ftb\">MyHeritage FREE Tree Builder<\/a><\/li>\n<li><a href=\"https:\/\/www.myheritage.com\/complete-genealogy-package?keyword=partners&amp;utm_source=partner_dnaexplained&amp;utm_medium=partner&amp;utm_campaign=dnaexplained_march19_ftfc&amp;tr_placement&amp;tr_funnel=complete\">MyHeritage Subscription with Free Trial<\/a><\/li>\n<\/ul>\n<p><strong>Genealogy Research<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/www.legacytree.com\/dnaexplain\">Legacy Tree Genealogists<\/a> for genealogy research<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>The answer is, because inherited DNA segments do not always follow the 50% rule.\u00a0 I guess maybe no one told them??? Many times, when we receive our autosomal DNA results, we wonder why predicted relationships, particularly distant ones, aren\u2019t accurate.\u00a0 &hellip; <a href=\"https:\/\/dna-explained.com\/2013\/10\/21\/why-are-my-predicted-cousin-relationships-wrong\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wpas_customize_per_network":false,"jetpack_post_was_ever_published":false},"categories":[2739125,84839],"tags":[362070305],"class_list":["post-3077","post","type-post","status-publish","format-standard","hentry","category-autosomal","category-inheritance","tag-introductory-dna"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p2AwO4-ND","jetpack_likes_enabled":true,"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/posts\/3077","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/comments?post=3077"}],"version-history":[{"count":9,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/posts\/3077\/revisions"}],"predecessor-version":[{"id":26034,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/posts\/3077\/revisions\/26034"}],"wp:attachment":[{"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/media?parent=3077"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/categories?post=3077"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dna-explained.com\/wp-json\/wp\/v2\/tags?post=3077"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}