Quantcast
Channel: Family Tree DNA – DNAeXplained – Genetic Genealogy
Viewing all 367 articles
Browse latest View live

Family Tree DNA Mystery Rewards – Week 5 – Merry Christmas

$
0
0

ftdna mystery reward package

This posting should be subtitled, “oh, my aching credit card.”

But this sale, and these coupons, have just been too good to pass up.  Given that Christmas is this week, and these coupons expire on 12-28-2014, it’s doubtful that there will be additional coupons issued on 12-29, but that has not been confirmed by Family Tree DNA – just my speculation.  The holiday sale ends on December 31st.

One of the things I’ve learned this past year is just how valuable it is to test cousins….and I mean autosomally testing every single cousin you can get your hands on.

Why?

Because those cousins are going to match people that you aren’t.  Each cousin may be the person to hold that gold nugget of ancestral DNA that you’re going to need to break down that brick wall.

I just had that experience myself, if you’ve been reading along.  If not, this article about my new Hickerson cousin will warm your heart.  And the really important part – Bill Hickerson did not match me at Family Tree DNA, he matched one of my cousins.  My Vannoy family has a group of about 20 people that we’ve convinced to test and that cousin cluster has proved invaluable.  We simply could not have broken this brick wall without the cousins!

We’re doing the same cousin cluster research approach with the Henry Bolton line and maybe, if we’re lucky, thanks to cousins, we may have more news on this line in 2015.  We’re within sniffing distance.

So, I can’t say this loudly enough – TEST YOUR COUSINS.

And I mean – EVERY COUSIN.

Yes, pay for it if you have to.  Merry Christmas to you (or however you celebrate whatever holiday you celebrate) – it’s the gift that keeps on giving.  Maybe a great way to spend any $$ found in your stocking!

Just think of it as a new reference book, or costing much less than a trip to a distant courthouse.

And don’t forget, if you discover that cousins have already tested at Ancestry or at 23andMe with the V3 chip, you can use one of the general $5 or $10 off coupons to transfer their results to Family Tree DNA – which drops the cost below the normal $39 (to $29 or $34) to unlock the results.

So, click here to sign on and see what mystery coupon you received this week.  If you’d like to trade, list your available coupons and their codes in the comments!

Here’s wishing you many newfound cousins and ancestors in the coming year!



Family Tree DNA – Final Mystery Rewards Coupon

$
0
0

happy holidays

Did you receive money in your Christmas stocking?  Well, now is the time to spend it during the last 3 days of Family Tree DNA’s holiday sale.

New coupons were issued today, but it seems that while Christmas is over, April Fool’s Day has arrived early.

There was a bit of an error in the Mystery Rewards notification e-mails sent today.

From Family Tree DNA:

If you received an email about a Mystery Reward that showed no discount and that the reward was expired, please disregard it.

****PLEASE LOG DIRECTLY INTO THE KIT TO GET THE VALID MYSTERY REWARD CODE.*****

We apologize for any confusion or inconvenience this may have caused.

So, never mind the e-mail and click here to sign in to your account and click on your mystery reward gift box one last time!  Post any codes you’re willing to share in the comments.  And remember, the sale ends at midnight, Houston time, on the 31st.

Happy New Year!


2014 Top Genetic Genealogy Happenings – A Baker’s Dozen +1

$
0
0

It’s that time again, to look over the year that has just passed and take stock of what has happened in the genetic genealogy world.  I wrote a review in both 2012 and 2013 as well.  Looking back, these momentous happenings seem quite “old hat” now.  For example, both www.GedMatch.com and www.DNAGedcom.com, once new, have become indispensable tools that we take for granted.  Please keep in mind that both of these tools (as well as others in the Tools section, below) depend on contributions, although GedMatch now had a tier 1 subscription offering for $10 per month as well.

So what was the big news in 2014?

Beyond the Tipping Point

Genetic genealogy has gone over the tipping point.  Genetic genealogy is now, unquestionably, mainstream and lots of people are taking part.  From the best I can figure, there are now approaching or have surpassed three million tests or test records, although certainly some of those are duplicates.

  • 500,000+ at 23andMe
  • 700,000+ at Ancestry
  • 700,000+ at Genographic

The organizations above represent “one-test” companies.  Family Tree DNA provides various kinds of genetic genealogy tests to the community and the have over 380,000 individuals with more than 700,000 test records.

In addition to the above mentioned mainstream firms, there are other companies that provide niche testing, often in addition to Family Tree DNA Y results.

In addition, there is what I would refer to as a secondary market for testing as well which certainly attracts people who are not necessarily genetic genealogists but who happen across their corporate information and decide the test looks interesting.  There is no way of knowing how many of those tests exist.

Additionally, there is still the Sorenson data base with Y and mtDNA tests which reportedly exceeded their 100,000 goal.

Spencer Wells spoke about the “viral spread threshold” in his talk in Houston at the International Genetic Genealogy Conference in October and terms 2013 as the year of infection.  I would certainly agree.

spencer near term

Autosomal Now the New Normal

Another change in the landscape is that now, autosomal DNA has become the “normal” test.  The big attraction to autosomal testing is that anyone can play and you get lots of matches.  Earlier in the year, one of my cousins was very disappointed in her brother’s Y DNA test because he only had a few matches, and couldn’t understand why anyone would test the Y instead of autosomal where you get lots and lots of matches.  Of course, she didn’t understand the difference in the tests or the goals of the tests – but I think as more and more people enter the playground – percentagewise – fewer and fewer do understand the differences.

Case in point is that someone contacted me about DNA and genealogy.  I asked them which tests they had taken and where and their answer was “the regular one.”  With a little more probing, I discovered that they took Ancestry’s autosomal test and had no clue there were any other types of tests available, what they could tell him about his ancestors or genetic history or that there were other vendors and pools to swim in as well.

A few years ago, we not only had to explain about DNA tests, but why the Y and mtDNA is important.  Today, we’ve come full circle in a sense – because now we don’t have to explain about DNA testing for genealogy in general but we still have to explain about those “unknown” tests, the Y and mtDNA.  One person recently asked me, “oh, are those new?”

Ancient DNA

This year has seen many ancient DNA specimens analyzed and sequenced at the full genomic level.

The year began with a paper titled, “When Populations Collide” which revealed that contemporary Europeans carry between 1-4% of Neanderthal DNA most often associated with hair and skin color, or keratin.  Africans, on the other hand, carry none or very little Neanderthal DNA.

http://dna-explained.com/2014/01/30/neanderthal-genome-further-defined-in-contemporary-eurasians/

A month later, a monumental paper was published that detailed the results of sequencing a 12,500 Clovis child, subsequently named Anzick or referred to as the Anzick Clovis child, in Montana.  That child is closely related to Native American people of today.

http://dna-explained.com/2014/02/13/clovis-people-are-native-americans-and-from-asia-not-europe/

In June, another paper emerged where the authors had analyzed 8000 year old bones from the Fertile Crescent that shed light on the Neolithic area before the expansion from the Fertile Crescent into Europe.  These would be the farmers that assimilated with or replaced the hunter-gatherers already living in Europe.

http://dna-explained.com/2014/06/09/dna-analysis-of-8000-year-old-bones-allows-peek-into-the-neolithic/

Svante Paabo is the scientist who first sequenced the Neanderthal genome.  Here is a neanderthal mangreat interview and speech.  This man is so interesting.  If you have not read his book, “Neanderthal Man, In Search of Lost Genomes,” I strongly recommend it.

http://dna-explained.com/2014/07/22/finding-your-inner-neanderthal-with-evolutionary-geneticist-svante-paabo/

In the fall, yet another paper was released that contained extremely interesting information about the peopling and migration of humans across Europe and Asia.  This was just before Michael Hammer’s presentation at the Family Tree DNA conference, so I covered the paper along with Michael’s information about European ancestral populations in one article.  The take away messages from this are two-fold.  First, there was a previously undefined “ghost population” called Ancient North Eurasian (ANE) that is found in the northern portion of Asia that contributed to both Asian populations, including those that would become the Native Americans and European populations as well.  Secondarily, the people we thought were in Europe early may not have been, based on the ancient DNA remains we have to date.  Of course, that may change when more ancient DNA is fully sequenced which seems to be happening at an ever-increasing rate.

http://dna-explained.com/2014/10/21/peopling-of-europe-2014-identifying-the-ghost-population/

Lazaridis tree

Ancient DNA Available for Citizen Scientists

If I were to give a Citizen Scientist of the Year award, this year’s award would go unquestionably to Felix Chandrakumar for his work with the ancient genome files and making them accessible to the genetic genealogy world.  Felix obtained the full genome files from the scientists involved in full genome analysis of ancient remains, reduced the files to the SNPs utilized by the autosomal testing companies in the genetic genealogy community, and has made them available at GedMatch.

http://dna-explained.com/2014/09/22/utilizing-ancient-dna-at-gedmatch/

If this topic is of interest to you, I encourage you to visit his blog and read his many posts over the past several months.

https://plus.google.com/+FelixChandrakumar/posts

The availability of these ancient results set off a sea of comparisons.  Many people with Native heritage matched Anzick’s file at some level, and many who are heavily Native American, particularly from Central and South America where there is less admixture match Anzick at what would statistically be considered within a genealogical timeframe.  Clearly, this isn’t possible, but it does speak to how endogamous populations affect DNA, even across thousands of years.

http://dna-explained.com/2014/09/23/analyzing-the-native-american-clovis-anzick-ancient-results/

Because Anzick is matching so heavily with the Mexican, Central and South American populations, it gives us the opportunity to extract mitochondrial DNA haplogroups from the matches that either are or may be Native, if they have not been recorded before.

http://dna-explained.com/2014/09/23/analyzing-the-native-american-clovis-anzick-ancient-results/

Needless to say, the matches of these ancient kits with contemporary people has left many people questioning how to interpret the results.  The answer is that we don’t really know yet, but there is a lot of study as well as speculation occurring.  In the citizen science community, this is how forward progress is made…eventually.

http://dna-explained.com/2014/09/25/ancient-dna-matches-what-do-they-mean/

http://dna-explained.com/2014/09/30/ancient-dna-matching-a-cautionary-tale/

More ancient DNA samples for comparison:

http://dna-explained.com/2014/10/04/more-ancient-dna-samples-for-comparison/

A Siberian sample that also matches the Malta Child whose remains were analyzed in late 2013.

http://dna-explained.com/2014/11/12/kostenki14-a-new-ancient-siberian-dna-sample/

Felix has prepared a list of kits that he has processed, along with their GedMatch numbers and other relevant information, like gender, haplogroup(s), age and location of sample.

http://www.y-str.org/p/ancient-dna.html

Furthermore, in a collaborative effort with Family Tree DNA, Felix formed an Ancient DNA project and uploaded the ancient autosomal files.  This is the first time that consumers can match with Ancient kits within the vendor’s data bases.

https://www.familytreedna.com/public/Ancient_DNA

Recently, GedMatch added a composite Archaic DNA Match comparison tool where your kit number is compared against all of the ancient DNA kits available.  The output is a heat map showing which samples you match most closely.

gedmatch ancient heat map

Indeed, it has been a banner year for ancient DNA and making additional discoveries about DNA and our ancestors.  Thank you Felix.

Haplogroup Definition

That SNP tsunami that we discussed last year…well, it made landfall this year and it has been storming all year long…in a good way.  At least, ultimately, it will be a good thing.  If you asked the haplogroup administrators today about that, they would probably be too tired to answer – as they’ve been quite overwhelmed with results.

The Big Y testing has been fantastically successful.  This is not from a Family Tree DNA perspective, but from a genetic genealogy perspective.  Branches have been being added to and sawed off of the haplotree on a daily basis.  This forced the renaming of the haplogroups from the old traditional R1b1a2 to R-M269 in 2012.  While there was some whimpering then, it would be nothing like the outright wailing now that would be occurring as haplogroup named reached 20 or so digits.

Alice Fairhurst discussed the SNP tsunami at the DNA Conference in Houston in October and I’m sure that the pace hasn’t slowed any between now and then.  According to Alice, in early 2014, there were 4115 individual SNPs on the ISOGG Tree, and as of the conference, there were 14,238 SNPs, with the 2014 addition total at that time standing at 10,213.  That is over 1000 per month or about 35 per day, every day.

Yes, indeed, that is the definition of a tsunami.  Every one of those additions requires one of a number of volunteers, generally haplogroup project administrators to evaluate the various Big Y results, the SNPs and novel variants included, where they need to be inserted in the tree and if branches need to be rearranged.  In some cases, naming request for previously unknown SNPs also need to be submitted.  This is all done behind the scenes and it’s not trivial.

The project I’m closest to is the R1b L-21 project because my Estes males fall into that group.  We’ve tested several, and I’ll be writing an article as soon as the final test is back.

The tree has grown unbelievably in this past year just within the L21 group.  This project includes over 700 individuals who have taken the Big Y test and shared their results which has defined about 440 branches of the L21 tree.  Currently there are almost 800 kits available if you count the ones on order and the 20 or so from another vendor.

Here is the L21 tree in January of 2014

L21 Jan 2014 crop

Compare this with today’s tree, below.

L21 dec 2014

Michael Walsh, Richard Stevens, David Stedman need to be commended for their incredible work in the R-L21 project.  Other administrators are doing equivalent work in other haplogroup projects as well.  I big thank you to everyone.  We’d be lost without you!

One of the results of this onslaught of information is that there have been fewer and fewer academic papers about haplogroups in the past few years.  In essence, by the time a paper can make it through the peer review cycle and into publication, the data in the paper is often already outdated relative to the Y chromosome.  Recently a new paper was released about haplogroup C3*.  While the data is quite valid, the authors didn’t utilize the new SNP naming nomenclature.  Before writing about the topic, I had to translate into SNPese.  Fortunately, C3* has been relatively stable.

http://dna-explained.com/2014/12/23/haplogroup-c3-previously-believed-east-asian-haplogroup-is-proven-native-american/

10th Annual International Conference on Genetic Genealogy

The Family Tree DNA International Conference on Genetic Genealogy for project administrators is always wonderful, but this year was special because it was the 10th annual.  And yes, it was my 10th year attending as well.  In all these years, I had never had a photo with both Max and Bennett.  Everyone is always so busy at the conferences.  Getting any 3 people, especially those two, in the same place at the same time takes something just short of a miracle.

roberta, max and bennett

Ten years ago, it was the first genetic genealogy conference ever held, and was the only place to obtain genetic genealogy education outside of the rootsweb genealogy DNA list, which is still in existence today.  Family Tree DNA always has a nice blend of sessions.  I always particularly appreciate the scientific sessions because those topics generally aren’t covered elsewhere.

http://dna-explained.com/2014/10/11/tenth-annual-family-tree-dna-conference-opening-reception/

http://dna-explained.com/2014/10/12/tenth-annual-family-tree-dna-conference-day-2/

http://dna-explained.com/2014/10/13/tenth-annual-family-tree-dna-conference-day-3/

http://dna-explained.com/2014/10/15/tenth-annual-family-tree-dna-conference-wrapup/

Jennifer Zinck wrote great recaps of each session and the ISOGG meeting.

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy/

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy-isogg-meeting/

http://www.ancestorcentral.com/decennial-conference-on-genetic-genealogy-sunday/

I thank Family Tree DNA for sponsoring all 10 conferences and continuing the tradition.  It’s really an amazing feat when you consider that 15 years ago, this industry didn’t exist at all and wouldn’t exist today if not for Max and Bennett.

Education

Two educational venues offered classes for genetic genealogists and have made their presentations available either for free or very reasonably.  One of the problems with genetic genealogy is that the field is so fast moving that last year’s session, unless it’s the very basics, is probably out of date today.  That’s the good news and the bad news.

http://dna-explained.com/2014/11/12/genetic-genealogy-ireland-2014-presentations 

http://dna-explained.com/2014/09/26/educational-videos-from-international-genetic-genealogy-conference-now-available/

In addition, three books have been released in 2014.emily book

In January, Emily Aulicino released Genetic Genealogy, The Basics and Beyond.

richard hill book

In October, Richard Hill released “Guide to DNA Testing: How to Identify Ancestors, Confirm Relationships and Measure Ethnicity through DNA Testing.”

david dowell book

Most recently, David Dowell’s new book, NextGen Genealogy: The DNA Connection was released right after Thanksgiving.

 

Ancestor Reconstruction – Raising the Dead

This seems to be the year that genetic genealogists are beginning to reconstruct their ancestors (on paper, not in the flesh) based on the DNA that the ancestors passed on to various descendants.  Those segments are “gathered up” and reassembled in a virtual ancestor.

I utilized Kitty Cooper’s tool to do just that.

http://dna-explained.com/2014/10/03/ancestor-reconstruction/

henry bolton probablyI know it doesn’t look like much yet but this is what I’ve been able to gather of Henry Bolton, my great-great-great-grandfather.

Kitty did it herself too.

http://blog.kittycooper.com/2014/08/mapping-an-ancestral-couple-a-backwards-use-of-my-segment-mapper/

http://blog.kittycooper.com/2014/09/segment-mapper-tool-improvements-another-wold-dna-map/

Ancestry.com wrote a paper about that fact that they have figure out how to do this as well in a research environment.

http://corporate.ancestry.com/press/press-releases/2014/12/ancestrydna-reconstructs-partial-genome-of-person-living-200-years-ago/

http://www.thegeneticgenealogist.com/2014/12/16/ancestrydna-recreates-portions-genome-david-speegle-two-wives/

GedMatch has created a tool called, appropriately, Lazarus that does the same thing, gathers up the DNA of your ancestor from their descendants and reassembles it into a DNA kit.

Blaine Bettinger has been working with and writing about his experiences with Lazarus.

http://www.thegeneticgenealogist.com/2014/10/20/finally-gedmatch-announces-monetization-strategy-way-raise-dead/

http://www.thegeneticgenealogist.com/2014/12/09/recreating-grandmothers-genome-part-1/

http://www.thegeneticgenealogist.com/2014/12/14/recreating-grandmothers-genome-part-2/

Tools

Speaking of tools, we have some new tools that have been introduced this year as well.

Genome Mate is a desktop tool used to organize data collected by researching DNA comparsions and aids in identifying common ancestors.  I have not used this tool, but there are others who are quite satisfied.  It does require Microsoft Silverlight be installed on your desktop

The Autosomal DNA Segment Analyzer is available through www.dnagedcom.com and is a tool that I have used and found very helpful.  It assists you by visually grouping your matches, by chromosome, and who you match in common with.

Charting Companion from Progeny Software, another tool I use, allows you to colorize and print or create pdf files that includes X chromosome groupings.  This greatly facilitates seeing how the X is passed through your ancestors to you and your parents.

x fan

WikiTree is a free resource for genealogists to be able to sort through relationships involving pedigree charts.  In November, they announced Relationship Finder.

Probably the best example I can show of how WikiTree has utilized DNA is using the results of King Richard III.

wiki richard

By clicking on the DNA icon, you see the following:

wiki richard 2

And then Richard’s Y, mitochondrial and X chromosome paths.

wiki richard 3

Since Richard had no descendants, to see how descendants work, click on his mother, Cecily of York’s DNA descendants and you’re shown up to 10 generations.

wiki richard 4

While this isn’t terribly useful for Cecily of York who lived and died in the 1400s, it would be incredibly useful for finding mitochondrial descendants of my ancestor born in 1802 in Virginia.  I’d love to prove she is the daughter of a specific set of parents by comparing her DNA with that of a proven daughter of those parents!  Maybe I’ll see if I can find her parents at WikiTree.

Kitty Cooper’s blog talks about additional tools.  I have used Kitty’s Chromosome mapping tools as discussed in ancestor reconstruction.

Felix Chandrakumar has created a number of fun tools as well.  Take a look.  I have not used most of these tools, but there are several I’ll be playing with shortly.

Exits and Entrances

With very little fanfare, deCODEme discontinued their consumer testing and reminded people to download their date before year end.

http://dna-explained.com/2014/09/30/decodeme-consumer-tests-discontinued/

I find this unfortunate because at one time, deCODEme seemed like a company full of promise for genetic genealogy.  They failed to take the rope and run.

On a sad note, Lucas Martin who founded DNA Tribes unexpectedly passed away in the fall.  DNA Tribes has been a long-time player in the ethnicity field of genetic genealogy.  I have often wondered if Lucas Martin was a pseudonym, as very little information about Lucas was available, even from Lucas himself.  Neither did I find an obituary.  Regardless, it’s sad to see someone with whom the community has worked for years pass away.  The website says that they expect to resume offering services in January 2015. I would be cautious about ordering until the structure of the new company is understood.

http://www.dnatribes.com/

In the last month, a new offering has become available that may be trying to piggyback on the name and feel of DNA Tribes, but I’m very hesitant to provide a link until it can be determined if this is legitimate or bogus.  If it’s legitimate, I’ll be writing about it in the future.

However, the big news exit was Ancestry’s exit from the Y and mtDNA testing arena.  We suspected this would happen when they stopped selling kits, but we NEVER expected that they would destroy the existing data bases, especially since they maintain the Sorenson data base as part of their agreement when they obtained the Sorenson data.

http://dna-explained.com/2014/10/02/ancestry-destroys-irreplaceable-dna-database/

The community is still hopeful that Ancestry may reverse that decision.

Ancestry – The Chromosome Browser War and DNA Circles

There has been an ongoing battle between Ancestry and the more seasoned or “hard-core” genetic genealogists for some time – actually for a long time.

The current and most long-standing issue is the lack of a chromosome browser, or any similar tools, that will allow genealogists to actually compare and confirm that their DNA match is genuine.  Ancestry maintains that we don’t need it, wouldn’t know how to use it, and that they have privacy concerns.

Other than their sessions and presentations, they had remained very quiet about this and not addressed it to the community as a whole, simply saying that they were building something better a better mousetrap.

In the fall, Ancestry invited a small group of bloggers and educators to visit with them in an all-day meeting, which came to be called DNA Day.

http://dna-explained.com/2014/10/08/dna-day-with-ancestry/

In retrospect, I think that Ancestry perceived that they were going to have a huge public relations issue on their hands when they introduced their new feature called DNA Circles and in the process, people would lose approximately 80% of their current matches.  I think they were hopeful that if they could educate, or convince us, of the utility of their new phasing techniques and resulting DNA Circles feature that it would ease the pain of people’s loss in matches.

I am grateful that they reached out to the community.  Some very useful dialogue did occur between all participants.  However, to date, nothing more has happened nor have we received any additional updates after the release of Circles.

Time will tell.

http://dna-explained.com/2014/11/18/in-anticipation-of-ancestrys-better-mousetrap/

http://dna-explained.com/2014/11/19/ancestrys-better-mousetrap-dna-circles/

DNA Circles 12-29-2014

DNA Circles, while interesting and somewhat useful, is certainly NOT a replacement for a chromosome browser, nor is it a better mousetrap.

http://dna-explained.com/2014/11/30/chromosome-browser-war/

In fact, the first thing you have to do when you find a DNA Circle that you have not verified utilizing raw data and/or chromosome browser tools from either 23andMe, Family Tree DNA or Gedmatch, is to talk your matches into transferring their DNA to Family Tree DNA or download to Gedmatch, or both.

http://dna-explained.com/2014/11/27/sarah-hickerson-c1752-lost-ancestor-found-52-ancestors-48/

I might add that the great irony of finding the Hickerson DNA Circle that led me to confirm that ancestry utilizing both Family Tree DNA and GedMatch is that today, when I checked at Ancestry, the Hickerson DNA Circle is no longer listed.  So, I guess I’ve been somehow pruned from the circle.  I wonder if that is the same as being voted off of the island.  So, word to the wise…check your circles often…they change and not always in the upwards direction.

The Seamy Side – Lies, Snake Oil Salesmen and Bullys

Unfortunately a seamy side, an underbelly that’s rather ugly has developed in and around the genetic genealogy industry.  I guess this was to be expected with the rapid acceptance and increasing popularity of DNA testing, but it’s still very unfortunate.

Some of this I expected, but I didn’t expect it to be so…well…blatant.

I don’t watch late night TV, but I’m sure there are now DNA diets and DNA dating and just about anything else that could be sold with the allure of DNA attached to the title.

I googled to see if this was true, and it is, although I’m not about to click on any of those links.

google dna dating

google dna diet

Unfortunately, within the ever-growing genetic genealogy community a rather large rift has developed over the past couple of years.  Obviously everyone can’t get along, but this goes beyond that.  When someone disagrees, a group actively “stalks” the person, trying to cost them their employment, saying hate filled and untrue things and even going to far as to create a Facebook page titled “Against<personname>.”  That page has now been removed, but the fact that a group in the community found it acceptable to create something like that, and their friends joined, is remarkable, to say the least.  That was accompanied by death threats.

Bullying behavior like this does not make others feel particularly safe in expressing their opinions either and is not conducive to free and open discussion. As one of the law enforcement officers said, relative to the events, “This is not about genealogy.  I don’t know what it is about, yet, probably money, but it’s not about genealogy.”

Another phenomenon is that DNA is now a hot topic and is obviously “selling.”  Just this week, this report was published, and it is, as best we can tell, entirely untrue.

http://worldnewsdailyreport.com/usa-archaeologists-discover-remains-of-first-british-settlers-in-north-america/

There were several tip offs, like the city (Lanford) and county (Laurens County) is not in the state where it is attributed (it’s in SC not NC), and the name of the institution is incorrect (Johns Hopkins, not John Hopkins).  Additionally, if you google the name of the magazine, you’ll see that they specialize in tabloid “faux reporting.”  It also reads a lot like the King Richard genuine press release.

http://urbanlegends.about.com/od/Fake-News/tp/A-Guide-to-Fake-News-Websites.01.htm

Earlier this year, there was a bogus institutional site created as well.

On one of the DNA forums that I frequent, people often post links to articles they find that are relevant to DNA.  This week, there was an interesting article, which has now been removed, correlating DNA results with latitude and altitude.  I thought to myself, I’ve never heard of that…how interesting.   Here’s part of what the article said:

Researchers at Aberdeen College’s Havering Centre for Genetic Research have discovered an important connection between our DNA and where our ancestors used to live.

Tiny sequence variations in the human genome sometimes called Single Nucleotide Polymorphisms (SNPs) occur with varying frequency in our DNA.  These have been studied for decades to understand the major migrations of large human populations.  Now Aberdeen College’s Dr. Miko Laerton and a team of scientists have developed pioneering research that shows that these differences in our DNA also reveal a detailed map of where our own ancestors lived going back thousands of years.

Dr. Laerton explains:  “Certain DNA sequence variations have always been important signposts in our understanding of human evolution because their ages can be estimated.  We’ve known for years that they occur most frequently in certain regions [of DNA], and that some alleles are more common to certain geographic or ethnic groups, but we have never fully understood the underlying reasons.  What our team found is that the variations in an individual’s DNA correlate with the latitudes and altitudes where their ancestors were living at the time that those genetic variations occurred.  We’re still working towards a complete understanding, but the knowledge that sequence variations are connected to latitude and altitude is a huge breakthrough by itself because those are enough to pinpoint where our ancestors lived at critical moments in history.”

The story goes on, but at the bottom, the traditional link to the publication journal is found.

The full study by Dr. Laerton and her team was published in the September issue of the Journal of Genetic Science.

I thought to myself, that’s odd, I’ve never heard of any of these people or this journal, and then I clicked to find this.

Aberdeen College bogus site

About that time, Debbie Kennett, DNA watchdog of the UK, posted this:

April Fools Day appears to have arrived early! There is no such institution as Aberdeen College founded in 1394. The University of Aberdeen in Scotland was founded in 1495 and is divided into three colleges: http://www.abdn.ac.uk/about/colleges-schools-institutes/colleges-53.php

The picture on the masthead of the “Aberdeen College” website looks very much like a photo of Aberdeen University. This fake news item seems to be the only live page on the Aberdeen College website. If you click on any other links, including the link to the so-called “Journal of Genetic Science”, you get a message that the website is experienced “unusually high traffic”. There appears to be no such journal anyway.

We also realized that Dr. Laerton, reversed, is “not real.”

I still have no idea why someone would invest the time and effort into the fake website emulating the University of Aberdeen, but I’m absolutely positive that their motives were not beneficial to any of us.

What is the take-away of all of this?  Be aware, very aware, skeptical and vigilant.  Stick with the mainstream vendors unless you realize you’re experimenting.

King Richard

King Richard III

The much anticipated and long-awaited DNA results on the remains of King Richard III became available with a very unexpected twist.  While the science team feels that they have positively identified the remains as those of Richard, the Y DNA of Richard and another group of men supposed to have been descended from a common ancestor with Richard carry DNA that does not match.

http://dna-explained.com/2014/12/09/henry-iii-king-of-england-fox-in-the-henhouse-52-ancestors-49/

http://dna-explained.com/2014/12/05/mitochondrial-dna-mutation-rates-and-common-ancestors/

Debbie Kennett wrote a great summary article.

http://cruwys.blogspot.com/2014/12/richard-iii-and-use-of-dna-as-evidence.html

More Alike than Different

One of the life lessons that genetic genealogy has held for me is that we are more closely related that we ever knew, to more people than we ever expected, and we are far more alike than different.  A recent paper recently published by 23andMe scientists documents that people’s ethnicity reflect the historic events that took place in the part of the country where their ancestors lived, such as slavery, the Trail of Tears and immigration from various worldwide locations.

23andMe European African map

From the 23andMe blog:

The study leverages samples of unprecedented size and precise estimates of ancestry to reveal the rate of ancestry mixing among American populations, and where it has occurred geographically:

  • All three groups – African Americans, European Americans and Latinos – have ancestry from Africa, Europe and the Americas.
  • Approximately 3.5 percent of European Americans have 1 percent or more African ancestry. Many of these European Americans who describe themselves as “white” may be unaware of their African ancestry since the African ancestor may be 5-10 generations in the past.
  • European Americans with African ancestry are found at much higher frequencies in southern states than in other parts of the US.

The ancestry proportions point to the different regional impacts of slavery, immigration, migration and colonization within the United States:

  • The highest levels of African ancestry among self-reported African Americans are found in southern states, especially South Carolina and Georgia.
  • One in every 20 African Americans carries Native American ancestry.
  • More than 14 percent of African Americans from Oklahoma carry at least 2 percent Native American ancestry, likely reflecting the Trail of Tears migration following the Indian Removal Act of 1830.
  • Among self-reported Latinos in the US, those from states in the southwest, especially from states bordering Mexico, have the highest levels of Native American ancestry.

http://news.sciencemag.org/biology/2014/12/genetic-study-reveals-surprising-ancestry-many-americans?utm_campaign=email-news-weekly&utm_source=eloqua

23andMe provides a very nice summary of the graphics in the article at this link:

http://blog.23andme.com/wp-content/uploads/2014/10/Bryc_ASHG2014_textboxes.pdf

The academic article can be found here:

http://www.cell.com/ajhg/home

2015

So what does 2015 hold? I don’t know, but I can’t wait to find out. Hopefully, it holds more ancestors, whether discovered through plain old paper research, cousin DNA testing or virtually raised from the dead!

What would my wish list look like?

  • More ancient genomes sequenced, including ones from North and South America.
  • Ancestor reconstruction on a large scale.
  • The haplotree becoming fleshed out and stable.
  • Big Y sequencing combined with STR panels for enhanced genealogical research.
  • Improved ethnicity reporting.
  • Mitochondrial DNA search by ancestor for descendants who have tested.
  • More tools, always more tools….
  • More time to use the tools!

Here’s wishing you an ancestor filled 2015!

 


Just One Cousin

$
0
0

Recently, someone wrote to me and said that they thought the autosomal DNA matching between groups of family members was wonderful, but they have “just one first cousin” and feel left out.  So, I decided to see what could be done with just two cousins.  In this case, the two cousins are full siblings and both first cousins to my mother, Barbara.  This would be the same process whether there was one or two cousins, since the two are siblings. Utilizing two cousins who are siblings just gives me the advantage of additional matching and triangulation capabilities.

This does presume that both people involved are willing to share and do a bit of comparison work on their various DNA accounts.  In other words, you can’t do this by yourself without cooperation from your cousin.

Here’s the common ancestor of our testers.

Miller Ferverda chart

Barbara, Cheryl and Don took a Family Finder autosomal DNA test at Family Tree DNA.

The DNA shared by Barbara, Cheryl and Don is from their common ancestral couple, Hiram B. Ferverda and Evaline Louise Miller.

Some of that shared DNA will be Hiram’s Ferverda DNA and some will be Evaline’s Miller DNA.  The only way to differentiate between the Ferverda and Miller DNA is to test people who are only Ferverda or only Miller, descendants of people upstream of Hiram and Evaline, and if there are any common segments between the testers and those Ferverda or Miller individuals, you can then assign that DNA segment to that side of the family – Miller or Ferverda.

I’m using Barbara’s chromosome as the “match to” background, below.  Cheryl, in orange, and Don, in blue, are shown as matches to Barbara.  You can see that these three people share a lot of their grandparents DNA.  You can also see where Don and Cheryl didn’t inherit the same DNA from their father, in some instances, like on chromosome 1 below, where Cheryl (orange) matches Barbara on a much larger part of the chromosome than Don does (blue.)  But then look at chromosome 13 where Barbara and Don match on a huge segment and Cheryl, just a small portion.  Don and Cheryl inherited different DNA from their parents at these locations.

Two cousins browser

The three testers’ common DNA segments on chromosome 1 are shown in the table below.  I’ve colored Cheryl’s pink and her brother, Don’s, blue.  You can see that Barbara matches some segments with Don that Cheryl didn’t inherit from her parents.  All of the DNA Barbara matches with Cheryl on this chromosome is also matched, at least in part, in that location, with Don.  The chart below, matches the graphic above, for chromosome 1 and is the “view data in a table” option on the chromosome browser as well as the leftmost “download to excel” option.  The download to excel option at right downloads all of the matches for the individual, not just the ones currently showing in the chromosome browser.

Two cousins combined

When at least two known relatives have tested, we have something to compare against.  In this case, we have a total of 3 people, 2 siblings and a first cousin, before we start matching outside known family.  We don’t know which of their shared DNA comes from which ancestor, but we can now look for people who match Barbara and at least Cheryl OR Don which proves a common ancestor between the three individuals.  Matching Barbara, Cheryl AND Don would be even better.

The gold standard for DNA matching, called triangulation, that proves a particular segment to a specific ancestor is as follows.

  • All (at least 2) people match you on the same segment.
  • Those people also match each other on the same segment.
  • Meaning, at least three people with a known common ancestral line must match on the same segment.

The key word here is “on the same segment”.

The next thing to do is to find out which of Barbara’s, Cheryl’s and Don’s matches are “in common with” each other.  This means Barbara, Cheryl and Don all share a matching segment with these other people, but without additional analysis, we can’t determine whether they share a match on the same segment or not.

I ran Barbara “in common with” Cheryl and you can see that the first two people returned on that match list were me and Don because matches are listed in the order of the largest cM of shared data first.  The “in common with” tool is the blue crossed arrows, below.

Two cousins ICW

Next I ran Barbara in common with Don.

There were a total of 43 people in common with Cheryl and 49 with Don.

I downloaded the matching individuals (download link at the bottom right of the match page) and sorted them in a spreadsheet to see who matches whom. Here’s what the first part of my spreadsheet looks like (sorted in chromosome and segment order.)  I colorized the rows by cousin for easier visualization.

Two cousins match example

We have 92 total matching individuals in common with Barbara and Cheryl and then Barbara and Don.  A total of 19 people are listed as matching BOTH Cheryl and Don (for a total of 38 rows in the spreadsheet), so that means that there are 54 people who are in common with either Barbara and Cheryl or Barbara and Don, but not in common with all 3, Barbara AND Cheryl AND Don.  This illustrates how differently siblings inherit DNA from their parents and how it affects matches another generation later.

In Common With Matches To both Don and Cheryl To Cheryl only or Don only, but not both
Barbara 19 (38 rows of 92) 54

Clearly, the people who match all three individuals, Barbara, Cheryl and Don are likely the closest relatives.

So let’s focus on those closest matching people.  If you were utilizing only one cousin here, you would simply utilize every “in common with” match between two individuals and move forward.  Because I have siblings here, and because I don’t want to deal with 72 different people, I’m using the fact that they are siblings to focus my efforts on the most closely related matches – people who match Barbara AND both siblings.  You could also limit your focus by something like a common ancestral surname between all match members.

The next step is for each tester, meaning Barbara, Cheryl and Don, to compare each individual on the common match list to their DNA.  This means that Barbara, Cheryl and Don all three will compare to all 18 individuals.  We now have only 18 matching people, instead of 19, because I removed my own matches, since mine are a subset of Barbara’s.  Checking to see how each of our testers matches each common matching person is the only way to determine that there is a three (or 4) way triangulation that will confirm a common ancestor.

There are two ways to do this at Family Tree DNA.

1. You can, 5 matches at a time, compare in the chromosome browser, then download only the matching segments to a spreadsheet for those 5 individuals. This means 4 sets of matches for each of three people.

Two cousins browser download

2. You can download Barbara, Cheryl and Don’s entire segment match list and then eliminate the matches that aren’t relevant to the discussion – meaning everyone except the 18 common matches between the three people.

The download option for the entire segment match list for the person whose kit you are looking at is shown at the top of the chromosome browser, to the right.  Downloading the currently showing individuals matching segments is shown at the top of the chromosome browser, to the left.

Because we can only push 5 people at a time to the chromosome browser, in this case, it will be easier to simply download all of the matches for each of the three individuals and then put them into a common spreadsheet and sort by the names we determined match in common between all three cousins.

I downloaded all of the matches for Barbara, Cheryl and Don, colorized them and then sorted them in the spreadsheet by the name of who they matched.  I then searched for the names of the 18 individuals who matched Barbara, Cheryl and Don, and copy/pasted them into a separate spreadsheet.

I could then sort the 18 matching individuals results by chromosome and start and end location.

two cousin matches

Barbara’s DNA matches are white rows, Cheryl’s are pink and Don’s are blue.

The segments where Barbara, Don and Cheryl all match more than one other person on an overlapping area of their DNA segments are colorized green.  This means that 4 or more people match on that same identical segment, the three known cousins and at least one other person.

The segments where at least Barbara and either Don or Cheryl (but not both) match at least one other person are colorized yellow. This means that least three people match on that same segment.

Since the gold standard of triangulation is 3 individuals matching on the same segment, both the yellow and green segments contain matches that fall into this category and are triangulated.  All of those segments match at least two of the cousins, who match each other, plus in some cases, additional people too.

Let’s walk through one triangulation sequence.

In the green cluster, above, you can see that Barbara, Cheryl and Don all match Arthur on overlapping portions of the same segment.  The overlapping portion between all 3 individuals and Arthur runs from 49,854,186 to 53,551,492.  In addition, both Don and Cheryl match Tiffany on part of that same segment and Barbara matches Dean on part as well.  These segments aren’t exactly the same for any of the cousins, with different amounts of matching DNA as reflected in the different cM and SNP values.

So, who is triangulated based on just this one green cluster?  Barbara, Cheryl, Don and Arthur are triangulated to a common ancestor.  We know that common ancestor is either the common ancestor of Cheryl, Don and Barbara – Hiram Ferverda and Evaline Miller – or upstream of that couple.

Tiffany is triangulated to both Cheryl and Don, but since Cheryl and Don are siblings, that’s irrelevant at this point – meaning we can’t tell if that match is IBS by chance or real because there is no additional match – at least not in this cluster.

In total, there are 19 green clusters (triangulated to at least 4 people) and 12 yellow clusters (triangulated to at least 3 people.)

In other words, the DNA that came from Hiram Ferverda and Evaline Miller is present in these matching people as well.  The million dollar question, is, of course, which upstream ancestor did it come from?  We genealogists are never satisfied, are we?  Every answer just leads to more questions.

Before we begin looking at the DNA results and discussing what they mean, I want to share with you the family tree of Hiram Ferverda and Evaline Miller, because the DNA of the people who match Don, Cheryl and Barbara had to come from these people as well.  This chart shows 7 generations back from Barbara, Cheryl and Don.  The common ancestors of the people with whom they triangulate are likely to be within this timeframe.

two cousins fan chart

The colorized ancestors above are the ancestors who contributed the X chromosome to both John Ferverda, Barbara’s father and Roscoe Ferverda, Cheryl and Don’s father.

In my working example, below, I’m utilizing the matches on chromosome 14 because chromosome 14 includes examples of a couple of interesting features.

Two cousins chr 14

Let’s look at the first green grouping.  All three cousins match to SB and then Barbara matches also to Constance and William, our Lentz cousin on part of that overlapping segment as well.  This suggests that this grouping might come from the Lentz side of the Miller tree, although we’ll see something else in a minute that might give us pause to reflect.  So just hold that thought.  Regardless, it does tell us that these individuals do share a common ancestor and it’s on the Miller side, not the Ferverda side.

The second green grouping is larger and includes larger segments as well, which are more reliably used, although the smaller green cluster clearly meets and exceeds the triangulation requirement of 3 matching individuals on the same segment.

This larger green cluster is actually quite interesting, because there are a total of 4 individuals, Ellen, Arthur, Eric and Tiffany who are all triangulated on this same segment with Don, Cheryl and Barbara.  So, not only are they triangulated to Don, Cheryl and Barbara, but also to each other.  These 7 people all share a common ancestor.

The yellow grouping shows an area where Eric matches Barbara and Don plus Arthur as well, but not Cheryl.  We don’t know anything about Arthur or Eric’s genealogy, so we don’t know if this is Miller or Ferverda DNA, at least not yet.  We’ll learn more about Arthur and Eric in a minute, even without their genealogy!

There are a couple of other areas on other chromosomes that are of interest too.

On this cluster on chromosome 12, we find a known Miller cousin, Rex, 2nd cousin to Barbara, Cheryl and Don.  Because Rex also descends from the parents of Evaline Miller, we know that this segment shared with Rex has to be Miller DNA, not Ferverda DNA.

Two cousins chr 12

On this segment of chromosome 3, below, we see that Barbara, Cheryl and Don match Herbert, another known Miller cousin, plus Dee and Constance in much smaller amounts on the same segment.  This tells us that this segment is descended from our common ancestor with Herbert.

Two cousins chr 3

Barbara, Don and Cheryl’s common ancestor with Herbert is Daniel Miller and Elizabeth Ulrich (Ullery), which makes them third cousins once removed – except – Herbert got a second dose of Miller DNA because Daniel Miller’s son, Isaac, married his first cousin who was also a Miller and shared grandparents with him.  So Herbert, genetically, is closer than he would appear since he received the double dose of Miller DNA three generations upstream.

Gotta love these close knit families.  The Millers were Brethren.  These double doses of family DNA often carry forward by matching downstream when they might otherwise not be expected do so.  That’s the upside of these endogamous groups.  Now, here’s the downside.

Two cousins chr 7

See the segments with the words problem written to the right?  Do you recognize what the problem is?  You’ll notice that in the matching group we have BOTH cousin Herbert who is a Miller (and not a Lentz) and cousin William who is a Lentz (and not a Miller.)

This is a very common situation in endogamous communities.

To make matters worse, we are dealing with very small segments here, where we often see confusion.  However, let’s look at the possibilities.

We do have triangulation, so one of three things has happened here.

First, the Brethren are an endogamous population that intermarried nearly exclusively within their faith.  The Lentz and Miller families were both Brethren.

Here are our possibilities.

  1. Our Lentz cousin has some Miller in one of his lines. This is entirely possible since he has a “short” pedigree chart and his families are living in the same Brethren communities as the other Lentz and Miller families.
  2. Our Miller cousin has some Lentz in one of his lines. That is less likely, because his genealogy is pretty well fleshed out, although certainly possible because, once again, the families were living within close proximity and attending the same churches, etc.
  3. This segment is truly a population based segment and will be found in people descending from that same base population. If this is the case, we still received it from one of our ancestors who came from that population, but since the Lentz and Miller lines may have both carried this same segment, we can’t tell who it came from. In other words, their common ancestor is further back in time than the Lentz and Miller families found in the US.

This segment cannot be IBS by chance because it does triangulate with the three cousins, Barbara, Don and Cheryl.  The definition of IBS by chance shows us that chance segments would not phase (or match with) with a parent.  If Don, Cheryl and Barbara all three carry this matching segment, it’s because their fathers both received it from their grandparents who were the common ancestor of Don, Cheryl and Barbara.

Neither Cheryl, Don nor Barbara can phase directly to their parents, who are deceased, so in this case, matching against first cousins is the best substitute we have.  We know that common DNA between the first cousins had to come from their father’s, who were brothers.  This in essence virtually phases Barbara, Don and Cheryl to their father’s on these matching segments.  Not ideal, by any means, but even partial parental phasing is better than no phasing at all.

A third match, Dean, shows Miller in his family tree, but I could not connect his Miller line to the Johann Michael Miller ancestral line, from which our Miller line descends – so Dean is not a known cousin.  Sometimes a common surname, even if found in the same geographic location, is not proof that the DNA connection is through that line.  It’s easy to make that assumption, but it’s an assumption that is just waiting to bite you.  Don’t do it!

Because of our known, proven DNA and genealogy matches to Herbert, we can attribute all of the segments where Herbert triangulates with either Barbara and Cheryl or Barbara and Don as Miller for all people involved.  This means that this common DNA descends either from Daniel Miller and Elizabeth Ulrich or Daniel’s father Philip Jacob Miller and Magdalene, surname unknown.

Why have I listed two couples?  Because, remember, Herbert has a double dose of Miller DNA from cousins and we don’t know which segment Barbara inherited, one from Daniel/Elizabeth or one from Philip Jacob/Magdalene (or some of each.)  If the segment is from Daniel/Elizabeth, it could have come from either the Ulrich or Miller side.  If it came from Daniel, then it also came from his father and mother, Philip Jacob/Magdalena and could either be Miller or Magdalena’s unknown line.

Herbert triangulate

Because of our known, proven DNA and genealogy matches to Rex, we can attribute all of the segments where Rex triangulates with either Barbara and Cheryl or Barbara and Don as Miller for all people involved.  Their common ancestor is John David Miller and Margaret Lentz, so their shared DNA could be either Lentz or Miller and is likely some of each.

Rex triangulate

For segments where there is no triangulation, but Barbara matches either Herbert or Rex, I still note that segment as Miller on my spreadsheet, since they are proven cousins, but I just omit the triangulation note.

For Barbara, that’s a total of 51 segments of her DNA that we can now assign to a Miller ancestral couple.

Furthermore, every segment that Barbara matches with either Cheryl or Don is now confirmed to be from her father’s side of the family, not her mother’s.  While we don’t have Barbara’s parents available for testing, this is a pseudo way to phase your results to determine matches from one parents’ side of the family.  For Barbara, that’s a total of 91 segments, some of them quite large.  For example, roughly half of chromosome 13 matched with Don.

Just as a matter of interest, within those 91 segments that Barbara matches with either Don or Cheryl, a total of only 7 segments matched exactly between all 3 individuals in terms of start and end location, cMs and SNPs.  While you might expect a number of small segments to match exactly, these weren’t all small.  In fact, most weren’t small and some were quite large.

Exactly matching DNA segments between Barbara and Cheryl and Barbara and Don.

Chromosome Matching cM Matching SNPs
1 8.65 1189
1 7.01 1150
8 27.79 7279
10 20.78 5141
12 27.68 6046
14 2.11 700
14 49.47 9032

This means that these segments were not divided at all in a total of 5 DNA transmission events.

  • Hiram to John
  • Hiram to Roscoe
  • John to Barbara
  • Roscoe to Cheryl
  • Roscoe to Don

Additionally, I carry two of these exact segments as well, so those two segments survived 6 transmission events.

Clearly these segments are what we would term “sticky” because they certainly are not following the statistical average of dividing the DNA in half (by 50%) in each transmission event.

There is one more thing we can tell from matching.

Both Barbara and Cheryl match with SB on the X chromosome on the same segments.

Two cousins X

This is particularly interesting because of the special inheritance path of the X chromosome.  We know that SB must be related on Evaline Miller’s side of the family, because John and Roscoe Ferverda did not receive an X chromosome from their father.  So Barbara, Cheryl and Don have to have received it from Evaline.  Unfortunately, SB listed no genealogy on Family Tree DNA, but based on the X chromosome inheritance path, I can tell you that SB is either descended from John David Miller and Margaret Lentz, or from the Schaeffer, Lentz or Moselman lines colored pink or blue, below.

Two cousins X fan

At this point, I made a chart of how the matches grouped with each other on each of the green clusters.

Just one cousin chart

I intended to create a nice chart in Excel or Word, but with all of the various colors of ink involved, I didn’t think I could find enough color differentiation so we’ll just have to suffer with my hand-made chart.  There are subtle color differences here – a different color or marker type for each of the 19 green clusters.

What I did was to look at each of the green DNA spreadsheet groupings and create a colorized chart, by group, for each grouping.  So everyone in the first cluster had their X in the boxes of who they matches in the same color, say blue pen.  The second group, orange marker, and so forth.  That way I can see who was orange or yellow or blue and if those groups tend to cluster together.

Remember Arthur and Eric from above, whose genealogy we knew nothing about.  You can see, for example, that Arthur matches in various groups with lots of people, and most often, Tiffany.  Arthur and Eric also match in multiple groups that include each other and Rex, a known Miller descendant, so we can attribute both Arthur and Eric’s DNA matches to the Miller side of the tree.  Keep in mind, all of these people also match with Barbara, Cheryl and Don.

Tiffany clusters with Arthur and Sarah and Eric in multiple groups and with Constance, David, Ellen, Leland and Rex in at least one other cluster.  So another Miller side person.

On chromosome 14, Eric, Ellen, Arthur and Tiffany were all triangulated on the same segment with Don, Cheryl and Barbara, so we know those 7 individuals unquestionably share a common ancestor.

Let’s look at SB again, our X match.  Since SB’s X connection can’t come from the Miller side, given the X inheritance path, and SB also matches with our Lentz cousin, it’s likely that SB is related through the Lentz lines.

Normally, when doing this matching relationship chart, you tend to see two distinct groupings, a mother’s side and a father’s side.  In other words, there will be some groups that absolutely don’t overlap with the others.  That’s not the case here.

So, by now you might be wondering what happened to the Ferverda side of the family?  I was secretly hoping to find a closet Ferverda relative in this exercise, and I thought we might have, actually.  Notice that Harold has no clustering at all, but he clearly matches Barbara, Cheryl and Don – but doesn’t cluster with any other Miller or Lentz cousins.  Therefore, he could be from the Ferverda side of the family, but since he provided no genealogy information or surnames at Family Tree DNA, I can’t easily tell.

However, I am not entirely without recourse.  I checked Harold “in common with” Barbara and discovered that he matches both Rex, our Miller cousin and William, our Lentz cousin, so even though Harold did not triangulate with William and/or Rex on any segments with both Barbara and/or Cheryl/Don, those Miller/Lentz matches certainly suggest descent from this line.  I’ll be sending him an e-mail!

So, there are no Ferverda cousins represented in these matches.

I decided to check one more thing, now that I know that all of these matches are on the Miller side and that we have 3 known, proven genealogical cousins, Rex, Herbert and William.  I wanted to see how many of our individuals who match Barbara, Cheryl and Don also match one of the known cousins.  I selected Barbara as the base match kit to use, since we know they all matched Barbara, Cheryl and Don, and then I ran “in common with” for each one of them with Barbara, with the following results.  A few did match one of the Miller or Lentz cousins, but fewer than I expected.miller matches chart

*However, we had a surprise.  Dean matched another Miller male individual whose line is proven to descend through two children of Philip Jacob Miller and Magdalena, surname unknown.  Another first cousin marriage.  Another cousin discovered!

Furthermore, I noticed yet another individual, Doug, in Barbara’s match list and in common with 6 of the matches as well.  Looking at Doug’s pedigree chart, not only is he a Miller descendant, he also descends from two of the Miller wives lines too.  Another cousin confirmed!

But why no Ferverda matches?

Recent immigrants.

The Ferverda side of the family immediately jumps the pond to Holland, with Hiram himself being an immigrant as a young teen in the 1860s.  There are few Ferverda (Fervida, Ferwerda) descendants here in the US to test, and many are Brethren or Mennonite.  Few people in the Netherlands have participated in DNA testing.

The converse of that, Evaline Miller’s lines have all been in the US since the early/mid-1700s, so there are lots of descendants.  Oh, the difference about a hundred years and 5 or 6 generations makes in the number of descendants who might be available to test.  This situation, unfortunately, created a very lopsided chart without the division I’m used to seeing.  On the other hand, thank goodness Evaline’s line and Hiram’s line are very distinct!

At this point, if you’re doing this “one cousin” exercise, you’ll need to do a few things.

1.  Check each of the matching individuals to see if they have uploaded or created a pedigree chart at Family Tree DNA. If they do, their pedigree icon will be green, shown below. If so, click on the icon and search for every surname (and variant) associated with your known common lines with your cousin.

2.  Check to see if these people entered a list of surnames, even if they don’t have a pedigree chart. The surnames are listed in the furthest right column. If you have entered your surnames, any that match yours will be bolded. Beware of variant spellings.

two cousins pedigree and surnames

You can see above that I am the only one of the matches shown with a pedigree chart icon, shown in green, and the common surnames are bolded at right.

3.  If your matches don’t have a pedigree chart, write to them and tell them you have a common ancestor and give them a list of your ancestors in your direct line. Please, PLEASE include the name on the kit that you match. Many people manage multiple kits and will ignore requests with only partial information.

4.  If you have additional cousins to test, do so. I’m sure you can see how valuable additional cousins DNA would be.

5.  Be sure to check your matches by “ancestral surname” to be sure that you haven’t missed any cousins who have already tested. The ancestral surname search box can be seen above the “known relationship” heading in the graphic above.

6.  If you haven’t done so, enter your surnames under the “Manage Personal Information” tab under “My Account” at Family Tree DNA. Then click on the genealogy tab, then Surnames.

Two cousins genealogy settings

7.  From your main personal page, of course, you can upload your Gedcom file by clicking on “My Family Tree.”

8.  Run “in common with” for each of the common matches of your two cousins and look for common matching names between them.  Those matching “in common with” names serve as a hint as to shared ancestry.  Your answer may be hiding in your cousins’ trees!

Utilize all of these tools to help your search.

Summary

Not bad for thinking we couldn’t do anything with our DNA matches because we had “just one cousin” to work with, even though I cheated and used siblings.

What, exactly, did we manage to do?

  • I attributed 91 segments of Barbara’s DNA to her father’s side of the tree.
  • I filled in 51 segments of Barbara’s DNA to ancestral couples.
  • I found 5 confirmed genealogy/DNA cousins.
  • I found 16 people whose genealogy is unknown, but who triangulate with Barbara, Cheryl and Don.  We know for sure which side of the tree these people match on – all Millers.
  • I can tell the X match which lines they descend from, even if they don’t know.
  • I can do one more very cool thing.  Utilizing the Lazarus utility at GedMatch, I can now recreate at least a partial autosomal DNA file for both John and Roscoe Ferverda, the fathers of our testers.  Join me in a couple days and we’ll see how that works!

This same process works between any two people who know how they are related and their common ancestor.  It’s a great way to find cousins you didn’t know you had, or you didn’t know have DNA tested, and how they are related to you and each other.

Some people get very discouraged when even thinking about working with endogamous populations, or cousin marriages.  One of the reasons I used this particular example is that I wanted to illustrate that while these situations are challenging from time to time, they are far from hopeless – so don’t let that deter you.

In fact, of the 5 confirmed cousins discovered during this process, some in unexpected ways, at least 3 and possibly 4 are through multiple lines.  Some of these matches are probably thanks to endogamy.

Happy hunting!


A Study Utilizing Small Segment Matching

$
0
0

There has been quite a bit of discussion in the last several weeks, both pro and con, about how to use small matching DNA segments in genetic genealogy.  A couple of people are even of the opinion that small segments can’t be used at all, ever.  Others are less certain and many of us are working our way through various scenarios.  Evidence certainly exists that these segments can be utilized.

I’ve been writing foundation articles, in preparation for this article, for several weeks now.  Recently, I wrote about how phasing works and determining IBD versus IBS matches and included guidelines for telling the difference between the different kinds of matches.  If you haven’t read that article, it’s essential to understanding this article, so now would be a good time to read or review that article.

I followed that with a step by step article, Demystifying Autosomal DNA Matching, on how to do phasing and matching in combination with the guidelines about how to determine IBD (identical by descent) versus IBS (identical by chance) and identical by population matches when evaluating your own matches.

Now that we understand IBS, IBD, Phasing and how matching actually works on a case by case basis, let’s look at applying those same matching and IBS vs IBD guidelines to small data segments as well.

A Little History

So those of you who haven’t been following the discussion on various blogs and social media don’t feel like you’ve been dropped into the middle of a conversation with no context, let me catch you up.

On Thanksgiving Day, I published an article about identifying one of my ancestors, after many years of trying, Sarah Hickerson.

That article spurred debate, which is just fine when the debate is about the science, but it subsequently devolved into something less pleasant.  There are some individuals with very strong opinions that utilizing small segments of DNA data can “never be done.”

I do not agree with that position.  In fact, I strongly disagree and there are multiple cases with evidence to support small segments being both accurate and useful in specific types of genealogical situations.  We’ll take a look at several.

I do agree that looking at small segment data out of context is useless.  To the best of my knowledge, no genealogist begins with their smallest segments and tries to assemble them, working from the bottom up.  We all begin with the largest segments, because they are the most useful and the closest connections in our tree, and work our way down.  Generally, we only work with small segments when we have to – and there are times that’s all we have.  So we need to establish guidelines and ways to know if those small segments are reliable or not.  In other words, how can we draw conclusions and how much confidence can we put in those conclusions?

Ultimately, whether you choose to use or work with small segment data will be your own decision, based on your own circumstances.  I simply wanted to understand what is possible and what is reasonable, both for my own genealogy and for my readers.

In my projects, I haven’t been using small segment data out of context, or randomly.  In other words, I don’t just pick any two small segment matches and infer or decide that they are valid matches.  Fortunately, by utilizing the IBD vs IBS guidelines, we have tools to differentiate IBD (Identical by Descent) segments from IBS (Identical by State) by chance segments and IBD/IBS by population for matching segments, both large and small.

Studying small segment data is the key to determining exactly how small segments can reasonably be utilized.  This topic probably isn’t black or white, but shades of gray – and assuming the position that something can’t be done simply assures that it won’t be.

I would strongly encourage those involved and interested in this type of research to retain those small segments, work with them and begin to look for patterns.  The only way we, as a community, are ever going to figure out how to work with small segments successfully and reliably is to, well, work with them.

Discussing the science and scenarios surrounding the usage of small data segments in various different situations is critical to seeing our way through the forest.  If the answers were cast in concrete about how to do this, we wouldn’t be working through this publicly today.

Negative personal comments and inferences have no place in the scientific community.  It discourages others from participating, and serves to stifle research and cooperation, not encourage it.  I hope that civil scientific discussions and comparisons involving small segment data can move forward, with decorum, because they are critically needed in order to enhance our understanding, under varying circumstances, of how to utilize small segment data.  As Judy Russell said, disagreeing doesn’t have to be disagreeable.

Two bloggers, Blaine Bettinger and CeCe Moore wrote articles following my Hickerson article.  Blaine subsequently wrote a second article here.  Felix Immanuel wrote articles here and here.

A few others have weighed in, in writing, as well although most commentary has been on Facebook.  Israel Pickholtz, a professional genealogist and genetic consultant, stated on his blog, All My Foreparents, the following:

It is my nature to distrust rules that put everything into a single category and that’s how I feel about small segments. Sometimes they are meaningful and useful, sometimes not.

When I reconstructed my father’s DNA using Lazerus (described last week in Genes From My Father), I happily accepted all small segments of whatever size because those small segments were in the DNA of at least one of his children and at least one of his brother/sister/first cousin. If I have a particular small segment, I must have received it from my parents. If my father’s brother (or sister) has it as well, then it is eminently clear to me that I got it from my father and that it came to him and his brother from my grandfather. And it is not reasonable to say that a sliver of that small segment might have come from my mother, because my father’s people share it.

After seeing Israel’s commentary about Lazarus, I reconstructed the genome of both Roscoe and John Ferverda, brothers, which includes both large and small segments.  Working with the Ferverda DNA further, I wrote an article, Just One Cousin, about matching between two siblings and a first cousin, which includes lots of small data segments, some of which were proven to triangulate, meaning they are genuine, and some which did not.  There are lots more examples in the demystifying article, as well.

What Not To Do 

Before we begin, I want to make it very clear that am not now, and never have, advocated that people utilize small data segments out of context of larger matching segments and/or at least suspected matching genealogy.  For example, I have never implied or even hinted that anyone should go to GedMatch, do a “one to many” compare at 1 cM and then contact people informing them that they are related.  Anyone who has extrapolated what I’ve written to mean that either simply did not understand or intentionally misinterpreted the articles.

Sarah Hickerson Revisited

If I thought Sarah Hickerson caused me a lot of heartburn in the decades before I found her, little did I know how much heartburn that discovery would cause.

Let’s go back to the Sarah Hickerson article that started the uproar over whether small data segments are useful at all.

In that article, I found I was a member of a new Ancestry DNA Circle for Charles Hickerson and Mary Lytle, the parents of Sarah Hickerson.

Ancestry Hickerson match

Because there are no tools at Ancestry to prove DNA connections, I hurried over to Family Tree DNA looking for any matches to Hickersons for myself and for my Vannoy cousins who also (potentially) descended from this couple.  Much to my delight, I found  several matches to Hickersons, in fact, more than 20 – a total of 614 rows of spreadsheet matches when I included all of my Vannoy cousins who potentially descend from this couple to their Hickerson matches.  There were 64 matching clusters of segments, both small and large.  Some matches were as large as 20cM with 6000 SNPs and more than 20 were over 10cM with from 1500 to 6000 SNPs.  There were also hundreds of small segments that matched (and triangulated) as well.

By the time I added in a few more Vannoy cousins that we’ve since recruited, the spreadsheet is now up to 1093 rows and we have 52 Vannoy-Hickerson TRIANGULATED CLUSTERS utilizing only Family Tree DNA tools.

Triangulated DNA, found in 3 or more people at the same location who share a common ancestor is proven to be from that ancestor (or ancestral couple.)  This is the commonly accepted gold standard of autosomal DNA triangulation within the industry.

Here’s just one example of a cluster of three people.  Charlene and Buster are known (proven, triangulated) cousins and Barbara is a descendant of Charles Hickerson and Mary Lytle.

example triang

What more could you want?

Yes, I called this a match.  As far as I’m concerned, it’s a confirmed ancestor.  How much more confirmed can you get?

Some clusters have as many as 25 confirmed triangulated members.

chr 13 group

Others took issue with this conclusion because it included small segment data.  This seems like the perfect opportunity in which to take a look at how small segments do, or don’t stand up to scrutiny.  So, let’s do just that.  I also did the same type of matching comparison in a situation with 2 siblings and a known cousin, here.

To Trash…or Not To Trash

Some genetic genealogists discard small segments entirely, generally under either 5 or 7cM, which I find unfortunate for several reasons.

  1. If a person doesn’t work with small segments, they really can’t comment on the lack of results, and they’ll never have a success because the small segments will have been discarded.
  2. If a person doesn’t work with small segments, they will never notice any trends or matches that may have implications for their ancestry.
  3. If a person doesn’t work with small segments, they can’t contribute to the body of evidence for how to reasonably utilize these segments.
  4. If a person doesn’t work with small segments, they may well be throwing the baby out with the bathwater, but they’ll never know.
  5. They encourage others to do the same.

The Sarah Hickerson article was not meant as a proof article for anything – it was meant to be an article encouraging people to utilize genetic genealogy for not only finding their ancestor and proving known connections, but breaking down brick walls.  It was pointing the way to how I found Sarah Hickerson.  It was one of my 52 Ancestors Series, documenting my ancestors, not one of the specifically educational articles.  This article is different.

If you are only interested in the low hanging fruit, meaning within the past 5 or 6 generations, and only proving your known pedigree, not finding new ancestors beyond that 5-6 generation level, then you can just stop reading now – and you can throw away your small segments.  But if you want more, then keep reading, because we as a community need to work with small segment data in order to establish guidelines that work relative to utilizing small segments and identifying the small segments that can be useful, versus the ones that aren’t.

I do not believe for one minute that small segments are universally useless.  As Israel said, if his family did not receive those segments from a common family member, then where did they all get those matching segments?

In fact, utilizing triangulated and proven DNA relationships within families is how adoptees piece together their family trees, piggybacking off of the work of people with known pedigrees that they match genetically.  My assumption had been that the adoptee community utilized only large DNA segments, because the larger the matching segments, generally the closer in time the genealogy match – and theoretically the easier to find.

However, I discovered that I was wrong, and the adoptee community does in fact utilize small segments as well.  Here’s one of the comments posted on my Chromosome Browser War blog article.

“Thanks for the well thought out article, Roberta, I have something to add from the folks at DNAadoption. Adoptees are not just interested in the large segments, the small segments also build the proof of the numerous lines involved. In addition, the accumulation of surnames from all the matches provides a way to evaluate new lines that join into the tree.”

Diane Harman-Hoog (on behalf of the 6 million adoptees in this country, many of who are looking for information on medical records and family heritage).

Diane isn’t the only person who is working with small segment data.  Tim Janzen works with small segments, in particular on his Mennonite project, and discusses small segments on the ISOGG WIKI Phasing page.  Here is what Tim has to say:

“One advantage of Family Finder is that FF has a 1 cM threshold for matching segments. If a parent and a child both have a matching segment that is in the 2 to 5 cM range and if the number of matching SNPs is 500 or more then there is a reasonably high likelihood that the matching segment is IBD (identical by descent) and not IBS (identical by state).”

The same rules for utilizing larger segment data need to be applied to small segment data to begin with.

Are more guidelines needed for small segments?  I don’t know, but we’ll never know if we don’t work with many individual situations and find the common methods for success and identify any problematic areas.

Why Do Small Segments Matter?

In some cases, especially as we work beyond the 6 generation level, small segments may be all we have left of a specific ancestor.  If we don’t learn to recognize and utilize the small segments available to us, those ancestors, genetically speaking, will be lost to us forever.

As we move back in time, the DNA from more distant ancestors will be divided into smaller and smaller segments, so if we ever want the ability to identify and track those segments back in time to a specific ancestor, we have to learn how to utilize small segment data – and if we have deleted that data, then we can’t use it.

In my case, I have identified all of my 5th generation ancestors except one, and I have a strong lead on her.  In my 6th generation, however, I have lots of walls that need to be broken through – and DNA may be the only way I’ll ever do that.

Let’s take a look at what I can expect when trying to match people who also descend from an ancestor 5 generations back in time.  If they are my same generation, they would be my fourth cousins.

Based on the autosomal statistics chart at ISOGG, 4th cousins, on the average, would expect to share about 13.28 cM of DNA from their common ancestor.  This would not be over the match threshold at FTDNA of approximately 20 cM total, and if those segments were broken into three pieces, for example, that cousin would not show as a match at either FTDNA or 23andMe, based on the vendors’ respective thresholds.

% Shared DNA Expected Shared cM Relationship
0.781% 53.13 Third cousins, common ancestor is 4 generations back in time
0.391% 26.56 Third cousins once removed
20 cm Family Tree DNA total cM Threshold
0.195% 13.28 Fourth cousins, common ancestor is 5 generations back in time
7 cM 23andMe individual segment cM match threshold
0.0977% 6.64 Fourth cousins once removed
0.0488% 3.32 Fifth cousins, common ancestor is 6 generations back in time
0.0244 1.66 Fifth cousins once removed

If you’re lucky, as I was with Hickerson, you’ll match at least some relative who carries that ancestral DNA line above the threshold, and then they’ll match other cousins above the threshold, and you can build a comparison network, linking people together, in that fashion.  And yes you may well have to utilize GedMatch for people testing at various different vendors and for those smaller segment comparisons.

For clarification, I have never “called” a genealogy match without supporting large segment data.  At the vendors, you can’t even see matches if they don’t have larger segments – so there is no way to even know you would match below the threshold.

I do think that we may be able to make calls based on small segments, at least in some instances, in the future.  In fact, we have to figure out how to do this or we will rarely be able to move past the 5th or 6th generation utilizing genetics.

At the 5th generation, or third cousins, one expects to see approximately 26 cM of matching DNA, still over the threshold (if divided correctly), but from that point further back in time, the expected shared amount of DNA is under the current day threshold.  For those who wonder why the vendors state that autosomal matches are reliable to about the 5th or 6th generation, this is the answer.

I do not discount small segments without cause.  In other words, I don’t discount small segments unless there is a reason.  Unless they are positively IBS by chance, meaning false, and I can prove it, I don’t disregard them.  I do label them and make appropriate notes.  You can’t learn from what’s not there.

Let me give you an example.  I have one area of my spreadsheet where I have a whole lot of segments, large and small, labeled Acadian.  Why?  Because the Acadians are so intermarried that I can’t begin to sort out the actual ancestor that DNA came from, at least not yet…so today, I just label them “Acadian.”

This example row is from my master spreadsheet.  I have my Mom’s results in my spreadsheet, so I can see easily if someone matches me and Mom both. My rows are pink.  The match is on Mom’s side, which I’ve color coded purple.  I don’t know which ancestor is the most recent common ancestor, but based on the surnames involved, I know they are Acadian.  In some cases, on Acadian matches, I can tell the MRCA and if so, that field is completed as well.

Me Mom acadian

As a note of interest, I inherited my mother’s segment intact, so there was no 50% division in this generation.

I also have segments labeled Mennonite and Brethren.  Perhaps in the future I’ll sort through these matches and actually be able to assign DNA segments to specific ancestors.  Those segments aren’t useless, they just aren’t yet fully analyzed.  As more people test, hopefully, patterns will emerge in many of these DNA groupings, both small and large.

In fact, I talked about DNA patterns and endogamous populations in my recent article, Just One Cousin.

For me, today, some small segment matches appear to be central European matches.  I say “appear to be,” because they are not triangulated.  For me this is rather boring and nondescript – but if this were my African American client who is trying to figure out which line her European ancestry came from, this could be very important.  Maybe she can map these segments to at least a specific ancestral line, which she would find very exciting.

Learning to use small segments effectively has the potential to benefit the following groups of people:

  • People with colonial ancestry, because all that may be left today of colonial ancestors is small segments.
  • People looking to break down brick walls, not just confirm currently known ancestors.
  • People looking for minority ancestors more than 5 or 6 generations back in their trees.
  • Adoptees – although very clearly, they want to work with the largest matches first.
  • People working with ethnic identification of ancestors, because you will eventually be able to track ethnicity identifying segments back in time to the originating ancestor(s).

Conversely, people from highly endogamous groups may not be helped much, if at all, by small segments because they are so likely to be widely shared within that population as a group from a common ancestor much further back in time.  In fact, the definition of a “small segment” for people with fully endogamous families might be much larger than for someone with no known endogamy.

However, if we can identify segments to specific populations, that may help the future accuracy of ethnicity testing.

Let’s go back and take a look at the Hickerson data using the same format we have been using for the comparisons so far.

Small Segment Examples

These Hickerson/Vannoy examples do not utilize random small segment matches, but are utilizing the same matching rules used for larger matches in conjunction with known, triangulated cousin groups from a known ancestor.  Many cousins, including 2 brothers and their uncle all carry this same DNA.  Like in Israel’s case, where did they get that same DNA if not from a common ancestor?

In the following examples, I want to stress that all of the people involved DO HAVE LARGER SEGMENT MATCHES on other chromosomes, which is how we knew they matched in the first place, so we aren’t trying to prove they are a match.  We know they are.  Our goal is to determine if small segments are useful in the same situation, proving matches, as with larger segments.  In other words, do the rules hold true?  And how do we work with the data?  Could we utilize these small segment matches if we didn’t have larger matching segments, and if so, how reliable would they be?

There is a difference between a single match and a triangulated group:

  • Matches between two people are suggestive of a common ancestor but could be IBS by chance or population..
  • Multiple matches, such as with the 6 different Hickersons who descend from Charles Hickerson and Mary Lytle, both in the Ancestry DNA Circle and at Family Tree DNA, are extremely suggestive of a specific common ancestor.
  • Only triangulated groups are proof of a common ancestor, unless the people are  closely related known relatives.

In our Hickerson/Vannoy study, all participants match at least to one other (but not to all other) group members at Family Tree DNA which means they match over the FTDNA threshold of approximately 20 cM total and at least one segment over 7.7cM and 500 SNPs or more.

In the example below, from the Hickerson article, the known Vannoy cousins are on the left side and the Hickerson matches to the Vannoy cousins are across the top.  We have several more now, but this gives you an idea of how the matching stacked up initially.  The two green individuals were proven descendants from Charles Hickerson and Mary Lytle.

vannoy hickerson higginson matrix

The goal here is to see how small data segments stack up in a situation where the relationship is distant.  Can small segments be utilized to prove triangulation?  This is slightly different than in the Just One Cousin article, where the relationship between the individuals was close and previously known.  We can contrast the results of that close relationship and small segments with this more distant connection and small segments.

Sarah Hickerson and Daniel Vannoy

The Vannoy project has a group of about a dozen cousins who descend from Elijah Vannoy who have worked together to discover the identify of Elijah’s parents.  Elijah’s father is one of 4 Vannoy men, all sons of the same man, found in Wilkes County, NC. in the late 1700s.  Elijah Vannoy is 5 generations upstream from me.

What kind of evidence do we have?  In the paper genealogy world, I have ruled out one candidate via a Bible record, and probably a second via census and tax records, but we have little information about the third and fourth candidates – in spite of thoroughly perusing all existent records.  So, if we’re ever going to solve the mystery, short of that much-wished-for Vannoy Bible showing up on e-Bay, it’s going to have to be via genetic genealogy.

In addition to the dozen or so Vannoy cousins who have DNA tested, we found 6 individuals who descend from Sarah Hickerson’s parents, Charles Hickerson and Mary Lytle who match various Vannoy cousins.  Additionally, those cousins match another 21 individuals who carry the Hickerson or derivative surnames, but since we have not proven their Hickerson lineage on paper, I have not utilized any of those additional matches in this analysis.  Of those 26 total matches, at Family Tree DNA, one Hickerson individual matches 3 Vannoy cousins, nine Hickerson descendants match 2 Vannoy cousins and sixteen Hickerson descendants match 1 Vannoy cousin.

Our group of Vannoy cousins matching to the 6 Charles Hickerson/Mary Lytle descendants contains over 60 different clusters of matching DNA data across the 22 chromosomes.  Those 6 individuals are included in 43 different triangulated groups, proving the entire triangulation group shares a common ancestor.  And that is BEFORE we add any GedMatch information.

If that sounds like a lot, it’s not.  Another recent article found 31 clusters among siblings and their first cousin, so 60 clusters among a dozen known Vannoy cousins and half a dozen potential Hickerson cousins isn’t unusual at all.

To be very clear, Sarah Hickerson and Daniel Vannoy were not “declared” to be the parents of Elijah Vannoy, born in 1784, based on small segment matches alone.  Larger segment matches were involved, which is how we saw the matches in the first place.  Furthermore, the matches triangulated.  However, small segments certainly are involved and are more prevalent, of course, than large segments.  Some cousins are only connected by small segments.  Are they valid, and how do we tell?  Sometimes it’s all we have.

Let me give you the classic example of when small segments are needed.

We have four people.  Person A and B are known Vannoy cousins and person C and D are potential Hickerson cousins.  Potential means, in this case, potential cousins to the Vannoys.  The Hickersons already know they both descend from Charles Hickerson and Mary Lytle.

  • Person A matches person C on chromosome 1 over the matching threshold.
  • Person B matches person D on chromosome 2 over the matching threshold.

Both Vannoy cousins match Hickerson cousins, but not the same cousin and not on the same segments at the vendor.  If these were same segment matches, there would be no question because they would be triangulated, but they aren’t.

So, what do we do?  We don’t have access to see if person C and D match each other, and even if we did, they don’t match on the same segments where they match persons A and B, because if they did we’d see them as a match too when we view A and B.

If person A and B don’t match each other at the vendor, we’re flat out of luck and have to move this entire operation to GedMatch, assuming all 4 people have or are willing to download their data.

a and b nomatch

If person A and B match each other at the vendor, we can see their small segment data as compared to each other and to persons C and D, respectively which then gives us the ability to see if A matches C on the same small segment as B matches D.

a and b match

If we are lucky, they will all show a common match on a small segment – meaning that A will match B on a small segment of chromosome 3, for example, and A will match C on that same segment.  In a perfect world, B will also match D on that same segment, and you will have 4 way triangulation – but I’m happy with the required 3 way match to triangulate.

This is exactly what happened in the article, Be Still My H(e)art.  As you can see, three people match on chromosomes 1 and 8, below – two of whom are proven cousins and the third was the wife surname candidate line.

Younger Hart 1-8

The example I showed of chromosome 2 in the Hickerson article was where all participants of the 5 individuals shown on the chromosome browser were matching to the Vannoy participant.  I thought it was a good visual example.  It was just one example of the 60+ clusters of cousin matches between the dozen Vannoy cousins and 6 Hickerson descendants.

This example was criticized by some because it was a small segment match.  I should probably have utilized chromosome 15 or searched for a better long segment example, but the point in my article was only to show how people that match stack up together on the chromosome browser – nothing more.   Here’s the entire chromosome, for clarity.

hickerson vannoy chr 2

Certainly, I don’t want to mislead anyone, including myself.  Furthermore, I dislike being publicly characterized as “wrong” and worse yet, labeled “irresponsible,” so I decided to delve into the depths of the data and work through several different examples to see if small segment data matching holds in various situations.  Let’s see what we found.

Chromosome 15

I selected chromosome 15 to work with because it is a region where a lot of Vannoy descendants match – and because it is a relatively large segment.  If the Hickersons do match the Vannoys, there’s a fairly good change they might match on at least part of that segment.  In other words, it appears to be my best bet due to sheer size and the number of Elijah Vannoy’s descendants who carry this segment.  In addition to the 6 individuals above who matched on chromosome 15, here are an additional 4.  As you can see, chromosome 15 has a lot of potential.

Chrom 15 Vannoy

The spreadsheet below shows the sections of chromosome 15 where cousins match.  Green individuals in the Match column are descendants of Charles Hickerson and Mary Lytle, the parents of Sarah Hickerson.  The balance are Vannoys who match on chromosome 15.

chr 15 matches ftdna v4

As you can see, there are several segments that are quite large, shown in yellow, but there are also many that are under the threshold of 7cM, which are all  segments that would be deleted if you are deleting small segments.  Please also note that if you were deleting small segments, all of the Hickerson matches would be gone from chromosome 15.

Those of you with an eagle eye will already notice that we have two separate segments that have triangulated between the Vannoy cousins and the Hickerson descendants, noted in the left column by yellow and beige.  So really, we could stop right here, because we’ve proven the relationship, but there’s a lot more to learn, so let’s go on.

You Can’t Use What You Can’t See

I need to point something out at this point that is extremely important.

The only reason we see any segment data below the match threshold is because once you match someone on a larger segment at Family Tree DNA, over the threshold, you also get to view the small segment data down to 1cM for your match with that person. 

What this means is that if one person or two people match a Hickerson descendant, for example you will see the small segment data for their individual matches, but not for anyone that doesn’t match the participant over the matching threshold.

What that means in the spreadsheet above, is that the only Hickerson that matches more than one Vannoy (on this segment) is Barbara – so we can see her segment data (down to 1cM ) as compared to Polly and Buster, but not to anyone else.

If we could see the smaller segment data of the other participants as compared to the Hickerson participants, even though they don’t match on a larger segment over the matching threshold, there could potentially be a lot of small segment data that would match – and therefore triangulate on this segment.

This is the perfect example of why I’ve suggested to Family Tree DNA that within projects or in individuals situations, that we be allowed to reduce the match threshold – especially when a specific family line match is suspected.

This is also one of the reasons why people turn to GedMatch, and we’ll do that as well.

What this means, relative to the spreadsheet is that it is, unfortunately, woefully incomplete – and it’s not apples to apples because in some cases we have data under the match threshold, and in some, we don’t.  So, matches DO count, but nonmatches where small segment data is not available do NOT count as a non-match, or as disproof.  It’s only negative proof IF you have the data AND it doesn’t match.

The Vannoys match and triangulate on many segments, so those are irrelevant to this discussion other than when they match to Hickerson DNA.  William (H), descends from two sons of Charles Hickerson and Mary Lytle.  Unfortunately, he only matches one Vannoy, so we can only see his small segments for that one Vannoy individual, William (V).  We don’t know what we are missing as compared to the rest of the Vannoy cousins.

To see William (H)’s and William (V)’s DNA as compared to the rest of the Vannoy cousins, we had to move to GedMatch.

Matching Options

Since we are working with segments that are proven to be Vannoy, and we are trying to prove/disprove if Daniel Vannoy and Sarah Hickerson are the parents of Elijah through multiple Hickerson matches, there are only a few matching options, which are:

  1. The Hickerson individuals will not triangulate with any of the Vannoy DNA, on chromosome 15 or on other chromosomes, meaning that Sarah Hickerson is probably not the mother of Elijah Vannoy, or the common ancestor is too far back in time to discern that match at vendor thresholds.
  2. The Hickerson individuals will not triangulate on this segment, but do triangulate on other segments, meaning that this segment came entirely from the Vannoy side of the family and not the Hickerson side of the family. Therefore, if chromosome 15 does not triangulate, we need to look at other chromosomes.
  3. The Hickerson individuals triangulate with the Vannoy individuals, confirming that Sarah Hickerson is the mother of Elijah Vannoy, or that there is a different common unknown ancestor someplace upstream of several Hickersons and Vannoys.

All of the Vannoy cousins descend from Elijah Vannoy and Lois McNiel, except one, William (V), who descends from the proven son of Sarah Hickerson and Daniel Vannoy, so he would be expected to match at least some Hickerson descendants.  The 6 Hickerson cousins descend from Charles Hickerson and Mary Lytle, Sarah’s parents.

hickerson vannoy pedigree

William (H), the Hickerson cousin who descends from David, brother to Sarah Hickerson, is descended through two of David Hickerson’s sons.

I decided to utilize the same segment “mapping comparison” technique with a spreadsheet that I utilized in the phasing article, because it’s easy to see and visualize.

I have created a matching spreadsheet and labeled the locations on the spreadsheet from 25-100 based on the beginning of the start location of the cluster of matches and the end location of the cluster.

Each individual being compared on the spreadsheet below has a column across the top.  On the chart below, all Hickerson individuals are to the right and are shown with their cells highlighted yellow in the top row.

Below, the entire colorized chart of chromosome 15 is shown, beginning with location 25 and ending with 100, in the left hand column, the area of the Vannoy overlap.  Remember, you can double click on the graphics to enlarge.  The columns in this spreadsheet are not fully expanded below, but they are in the individual examples.

entire chr 15 match ss v4

I am going to step through this spreadsheet, and point out several aspects.

First, I selected Buster, the individual in the group to begin the comparison, because he was one of the closest to the common ancestor, Elijah Vannoy, genealogically, at 4 generations.  So he is the person at Family Tree DNA that everyone is initially compared against.

Everyone who matches Buster has their matching segments shown in blue.  Buster is shown furthest left.

When participants match someone other than Buster, who they match on that segment is typed into their column.  You can tell who Buster matches because their columns are blue on matching locations.  Here’s an example.

Me Buster match

You can see that in my column, it’s blue on all segments which means I match Buster on this entire region.  In addition, there are names of Carl, Dean, William Gedmatch and Billie Gedmatch typed into the cell in the first row which means at that location, in addition to Buster, I also match Carl and Dean at Family Tree DNA and William (descended from the son of Daniel Vannoy and Sarah Hickerson) at Gedmatch and Billie (a Hickerson) at Gedmatch.  Their name is typed into my column, and mine into theirs.  Please note that I did not run everyone against everyone at GedMatch.  I only needed enough data to prove the point and running many comparisons is a long, arduous process even when GedMatch isn’t experiencing problems.

On cells that aren’t colorized blue, the person doesn’t match Buster, but may still match other Vannoy cousin segments.  For example, Dean, below, matches Buster on location 25-29, along with some other cousins.  However, he does not match Buster on location 30 where he instead matches Harold and Carl who also don’t match Buster at that location. Harold, Carl and Dean do, however, all descend from the same son of Elijah so they may well be sharing DNA from a Vannoy wife at this location, especially since no one who doesn’t share that specific wife’s line matches those three at this location.

Me Buster Dean match

Remember, we are not working with random small data segments, but with a proven matching segment to a common Vannoy ancestor, with a group of descendants from a possible/probable Hickerson ancestor that we are trying to prove/disprove.  In other words, you would expect either a lot of Hickerson matches on the same segments, if Hickerson is indeed a Vannoy ancestral family, or virtually none of them to match, if not.

The next thing I’d like to point out is that these are small segments of people who also have larger matching segments, many of whom do triangulate on larger segments on other chromosomes.  What we are trying to discern is whether small segment matches can be utilized by employing the same matching criteria as large segment matching.  In other words, is small segment data valid and useful if it meets the criteria for an IBD match?

For example, let’s look at Daniel.  Daniel’s segments on chromosome 15, were it not for the fact that he matches on larger segments on other chromosomes, would not be shown as matches, because they are not individually over the match threshold.

Look at Daniel’s column for Polly and Warren.

Daniel matches 2

The segments in red show a triangulated group where Daniel and Warren, or Daniel, Warren and Polly match.  The segments where all 3 match are triangulated.

This proves, unquestionably, that small segments DO match utilizing the normal prescribed IBD matching criteria.  This spreadsheet, just for chromosome 15, is full of these examples.

Is there any reason to think that these triangulated matches are not identical by descent?  If they are not IBD, how do all of these people match the same DNA? Chance alone?  How would that be possible?  Two people, yes, maybe, but 3 or more?  In some cases, 5 or 6 on the same segment?  That is simply not possible, or we have disproven the entire foundation that autosomal DNA matching is based upon.

The question will soon be asked if small segments that triangulate can be useful when there are no larger matching segments to put the match over the initial vendor threshold.

Triangulated Groups

As you can see, most of the people and segments on the spreadsheet, certainly the Elijah descendants, are heavily triangulated, meaning that three or more people match each other on the same locations.  Most of this matching is over the vendor threshold at Family Tree DNA.

You can see that Buster, Me, Dean, Carl and Harold all match each other on the same segments, on the left half of the spreadsheet where our names are in each other’s columns.

triangulated groups

Remember when I said that the spreadsheet was incomplete?  This is an example.  David and Warren don’t match each other at a high enough total of segments to get them over the matching threshold when compared to each other, so we can’t see their small segment data as compared to each other.  David matches Buster, but Warren doesn’t, so I can’t even see them both in relationship to a common match.  There are several people who fall into this category.

Let’s select one individual to use as an example.

I’ve chosen the Vannoy cousin, William(V), because his kit has been uploaded to Gedmatch, he has Vannoy matches and because William is proven to descend from Sarah Hickerson and Daniel Vannoy through their son Joel – so we expect some Hickerson DNA to match William(V).

If William (V) matches the Hickersons on the same DNA locations as he matches to Elijah’s descendants, then that proves that Elijah’s descendant’s DNA in that location is Hickerson DNA.

At GedMatch, I compared William(V) with me and then with Dean using a “one to one” comparison at a low threshold, simply because I wanted as much data as I could get.  Family Tree DNA allows for 1 cM and I did the same, allowing 100 SNPs at GedMatch.  Family Tree DNA’s lowest SNP threshold is 500.

In case you were wondering, even though I did lower the GedMatch threshold below the FTDNA minimum, there were 45 segments that were above 1cM and above 500 SNPs when matching me to William(V), which would have been above the lowest match threshold at FTDNA (assuming we were over the initial match threshold.)  In other words, had we not been below the original match threshold (20cM total, one segment over 7.7cM), these segments would have been included at FTDNA as small segments.  As you can see in the chart below, many triangulated.

I colorized the GedMatch matches, where there were no FTDNA matches, in dark red text.  This illustrates graphically just how much is missed when the small segments are ignored in cases with known or probable cousins.  In the green area, the entry that says “Me GedMatch” could not be colorized red (because you can’t colorize only part of the text of a cell) so I added the Gedmatch designation to differentiate between a match through FTDNA and one from GedMatch.  I did the same with all Gedmatch matches, whether colorized or not.

Let’s take a look and see how small segments from GedMatch affect our Hickerson matching.  Note that in the green area, William (V) matches William (H), the Hickerson descendant, and William (V) matches to me and Dean as well.  This triangulates William (V)’s Hickerson DNA and proves that Elijah’s descendants DNA includes proven Hickerson segments.

William (V) gedmatch matches v2

In this next example, I matched William (H), the Hickerson cousin (with no Vannoy heritage) against both Buster and me.

William (H) gedmatch me buster

Without Gedmatch data, only two segments of chromosome 15 are triangulated between Vannoy and Hickerson cousins, because we can’t see the small data segments of the rest of the cousins who don’t match over the threshold.

You can see here that nearly the entire chromosome is triangulated using small segments.  In the chart below, you can see both William(V) and William (H) as they match various Vannoy cousins.  Both triangulate with me.

William V and William H

I did the same thing with the Hickerson descendant, Billie, as compared to both me and Dean, with the same type of results.

The next question would be if chromosome 15 is a pileup area where I have a lot of IBS matches that are really population based matches.  It does not appear to be.  I have identified an area of my chromosomes that may be a pileup area, but chromosome 15 does not carry any of those characteristics.

So by utilizing the small segments at GedMatch for chromosome 15 that we can’t otherwise see, we can triangulate at least some of the Hickerson matches.  I can’t complete this chart, because several individuals have not uploaded to GedMatch.

Why would the Hickerson descendant match so many of the Vannoy segments on chromosome 15?  Because this is not a random sample.  This is a proven Vannoy segment and we are trying to see which parts of this segment are from a potential Hickerson mother or the Vannoy father.  If from the Hickerson mother, then this level of matching is not unexpected.  In fact, it would be expected.  Since we cheated and saw that chromosome 15 was already triangulated at Family Tree DNA, we already knew what to expect.

In the spreadsheet below, I’ve added the 2 GedMatch comparisons, William (V) to me and Dean, and William (H) to me and Buster.  You can see the segments that triangulate, on the left.  We could also build “triangulated groups,” like GedMatch does.  I started to do this, but then stopped because I realized most cells would be colored and you’d have a hard time seeing the individual triangulated segments.  I shifted to triangulating only the individuals who triangulate directly with the Hickerson descendant, William(H), shown in green.  GedMatch data is shown in red.

chr 15 with gedmatch

I would like to make three points.

1.  This still is not a complete spreadsheet where everyone is compared to everyone.  This was selectively compared for two known Hickerson cousins, William (V) who descends from both Vannoys and Hickersos and William (H) who descends only from Hickersons.

2. There are 25 individually triangulated segments to the Hickerson descendant on just this chromosome to the various Vannoy cousins.  That’s proof times 25 to just one Hickerson cousin.

2.  I would NEVER suggest that you select one set of small segments and base a decision on that alone.  This entire exercise has assembled cumulative evidence.  By the same token, if the rules for segment matching hold up under the worst circumstances, where we have an unknown but suspected relationship and the small segments appear to continue to follow the triangulation rules, they could be expected to remain true in much more favorable circumstances.

Might any of these people have random DNA matches that are truly IBS by chance on chromosome 15?  Of course, but the matching rules, just like for larger segments, eliminates them.  According to triangulation rules, if they are IBS by chance, they won’t triangulate.  If they do triangulate, that would confirm that they received the same DNA from a common ancestor.

If this is not true, and they did not receive their common DNA from a common ancestor, then it disproves the fundamental matching rule upon which all autosomal DNA genetic genealogy is based and we all need to throw in the towel and just go and do something else.

Is there some grey area someplace?  I would presume so,  but at this point, I don’t know how to discern or define it, if there is.  I’ve done three in-depth studies on three different families over the past 6 weeks or so, and I’ve yet to find an area (except for endogamous populations that have matches by population) where the guidelines are problematic.  Other researchers may certainly make different discoveries as they do the same kind of studies.  There is always more to be discovered, so we need to keep an open mind.

In this situation, it helps a lot that the Hickerson/Vannoy descendants match and triangulate on larger segments on other chromosomes.  This study was specifically to see if smaller segments would triangulate and obey the rules. We were fortunate to have such a large, apparently “sticky” segment of Vannoy DNA on chromosome 15 to work with.

Does small segment matching matter in most cases, especially when you have larger segments to utilize?  Probably not. Use the largest segments first.  But in some cases, like where you are trying to prove an ancestor who was born in the 1700s, you may desperately need that small segment data in order to triangulate between three people.

Why is this important – critically important?  Because if small segments obey all of the triangulation rules when larger segments are available to “prove” the match, then there is no reason that they couldn’t be utilized, using the same rules of IBD/IBS, when larger segments are not available.  We saw this in Just One Cousin as well.

However, in terms of proof of concept, I don’t know what better proof could possibly be offered, within the standard genetic genealogy proofs where IBD/IBS guidelines are utilized as described in the Phasing article.  Additional examples of small segment proof by triangulation are offered in Just One Cousin, Lazarus – Putting Humpty Dumpty Together Again, and in Demystifying Autosomal DNA Matching.

Raising Elijah Vannoy and Sarah Hickerson from the Dead

As I thought more about this situation, I realized that I was doing an awful lot of spreadsheet heavy lifting when a tool might already be available.  In fact, Israel’s mention of Lazarus made me wonder if there was a way to apply this tool to the situation at hand.

I decided to take a look at the Lazarus tool and here is what the intro said:

Generate ‘pseudo-DNA kits’ based on segments in common with your matches. These ‘pseudo-DNA kits’ can then be used as a surrogate for a common ancestor in other tests on this site. Segments are included for every combination where a match occurs between a kit in group1 and group2.

It’s obvious from further instructions that this is really meant for a parent or grandparent, but the technique should work just the same for more distant relatives.

I decided to try it first just with the descendants of Elijah Vannoy.  At first, I thought that recreated Elijah would include the following DNA:

  • DNA segments from Elijah Vannoy
  • DNA segments from Elijah Vannoy’s wife, Lois McNiel
  • DNA segments that match from Elijah’s descendants spouse’s lines when individuals come from the same descendant line. This means that if three people descend from Joel Vannoy and Phoebe Crumley, Elijah’s son and his wife, that they would match on some DNA from Phoebe, and that there was no way to subtract Phoebe’s DNA.

After working with the Lazarus tool, I realized this is not the case because Lazarus is designed to utilize a group of direct descendants and then compare the DNA of that group to a second group of know relatives, but not descendants.

In other words, if you have a grandson of a man, and his brother.  The DNA shared by the brother and the grandson HAS to be the DNA contributed to that grandson by his grandfather, from their common ancestor, the great grandfather.  So, in our situation above, Phoebe’s DNA is excluded.

The chart below shows the inheritance path for Lazarus matching.

Lazarus inheritance

Because Lazarus is comparing the DNA of Son Doe with Brother Doe – that eliminates any DNA from the brother’s wives, Sarah Spoon or Mary – because those lines are not shared between Brother Doe and Son Doe.  The only shared ancestors that can contribute DNA to both are Father Doe and Methusaleh Fisher.

The Lazarus instructions allow you to enter the direct descendants of the person/couple that you are reconstructing, then a second set of instructions asks for remaining relatives not directly descended, like siblings, parents, cousins, etc. In other words, those that should share DNA through the common ancestor of the person you are recreating.

To recreate Elijah, I entered all of the Vannoy cousins and then entered William (V) as a sibling since he is the proven son of Daniel Vannoy and Sarah Hickerson.

Here is what Lazarus produced.

lazarus elijah 1

Lazarus includes segments of 4cM and 500 SNPs.

The first thing I thought was, “Holy Moly, what happened to chromosome 15?”  I went back and looked, and sure enough, while almost all of the Elijah descendants do match on chromosome 15, William (V), kit 156020, does not match above the Lazarus threshold I selected.  So chromosome 15 is not included.  Finding additional people who are known to be from this Vannoy line and adding them to the “nondescendant” group would probably result in a more complete Elijah.

lazarus elijah 2

Next, to recreate Sarah Hickerson, I added all of the Vannoy cousins plus William (V) as descendants of Sarah Hickerson and then I added just the one Hickerson descendant, William, as a sibling.  William’s ancestor is proven to be the sibling of Sarah.

I didn’t know quite what to expect.

Clearly if the DNA from the Hickerson descendant didn’t match or triangulate with DNA from any of the Vannoy cousins at this higher level, then Sarah Hickerson wasn’t likely Elijah’s mother.  I wanted to see matching, but more, I wanted to see triangulation.

lazarus elijah 3

I was stunned.  Every kit except two had matches, some of significant size.

lazarus elijah 4

lazarus elijah 5 v2

Please note that locations on chromosomes 3, 4 and 13, above, are triangulated in addition to matching between two individuals, which constitutes proof of a common ancestor.  Please also note that if you were throwing away segments below 7cM, you would lose all of the triangulated matches and all but two matches altogether.

Clearly, comparing the Vannoy DNA with the Hickerson DNA produced a significant number of matches including three triangulated segments.

lazarus elijah 6

Where Are We?

I never have, and I never would recommend attempting to utilize random small match segments out of context.  By out of context, I mean simply looking at all of your 1cM segments and suggesting that they are all relevant to your genealogy.  Nope, never have.  Never would.

There is no question that many small segments are IBS by chance or identical by population.  Furthermore, working with small segments in endogamous populations may not be fruitful.

Those are the caveats.  Small segments in the right circumstances are useful.  And we’ve seen several examples of the right circumstances.

Over the past few weeks, we have identified guidelines and tools to work with small segments, and they are the same tools and guidelines we utilize to work with larger segments as well.  The difference is size.  When working with large segments, the fact that they are large serves an a filter for us and we don’t question their authenticity.  With all small segments, we must do the matching and analysis work to prove validity.  Probably not worthwhile if you have larger segments for the same group of people.

Working with the Vannoy data on chromosome 15 is not random, nor is the family from an endogamous population.  That segment was proven to be Vannoy prior to attempts to confirm or disprove the Hickerson connection.  And we’ve gone beyond just matching, we’ve proven the ancestral link by triangulation, including small segments.  We’ve now proven the Hickerson connection about 7 ways to Sunday.  Ok, maybe 7 is an exaggeration, but here is the evidence summed up for the Vannoy/Hickerson study from multiple vendors and tools:

  • Ancestry DNA Circle indicating that multiple Hickerson descendants match me and some that don’t match me, match each other. Not proof, but certainly suggestive of a common ancestor.
  • A total of 26 Hickerson or derivative family name matches to Vannoy cousins at Family Tree DNA. Not proof, but again, very suggestive.
  • 6 Charles Hickerson/Mary Lytle descendants match to Vannoy cousins at Family Tree DNA. Extremely suggestive, needs triangulation.
  • Triangulation of segments between Vannoy and Hickerson cousins at Family Tree DNA. Proof, but in this study we were only looking to determine whether small segment matches constituted proof.
  • Triangulation of multiple Hickerson/Vannoy cousins on chromosome 15 at GedMatch utilizing small segments and one to one matching. More proof.
  • Lazarus, at higher thresholds than the triangulation matching, when creating Sarah Hickerson, still matched 19 segments and triangulated three for a total of 73.2cM when comparing the Hickerson descendant against the Vannoy cousins. Further proof.

So, can small segment matching data be useful? Is there any reason NOT to accept this evidence as valid?

With proper usage, small segment data certainly looks to provide value by judiciously applying exactly the same rules that apply to all DNA matching.  The difference of course being that you don’t really have to think about utilizing those tools with large segment matches.  It’s pretty well a given that a 20cM match is valid, but you can never assume anything about those small segment matches without supporting evidence. So are larger segments easier to use?  Absolutely.

Does that automatically make small segments invalid?  Absolutely not.

In some cases, especially when attempting to break down brick walls more than 5 or 6 generations in the past, small segment data may be all we have available.  We must use it effectively.  How small is too small?  I don’t know.  It appears that size is really not a factor if you strictly adhere to the IBD/IBS guidelines, but at some point, I would think the segments would be so small that just about everyone would match everyone because we are all humans – so the ultimate identical by population scenario.

Segments that don’t match an individual and either or both parents, assuming you have both parents to test, can safely be disregarded unless they are large and then a look at the raw data is in order to see if there is a problem in that area.  These are IBS by chance.  IBS segments by chance also won’t triangulate further up the tree.  They can’t, because they don’t match your parents so they cannot come from an ancestor.  If they don’t come from an ancestor, they can’t possibly match two other people whose DNA comes from that ancestor on that segment.

If both parents aren’t available, or your small segments do match with your parents, I would suggest that you retain your small segments and map them.

You can’t recognize patterns if the data isn’t present and you won’t be able to find that proverbial needle in the haystack that we are all looking for.

Based on what we’ve seen in multiple case studies, I would conclude that small segment data is certainly valid and can play a valid role in a situation where there is a known or suspected relationship.

I would agree that attempting to utilize small segment data outside the context of a larger data match is not optimal, at least not today, although I wish the vendors would provide a way for us to selectively lower our thresholds.  A larger segment match can point the way to smaller segment matches between multiple people that can be triangulated.  In some situations, like the person A, B, C, D Hickerson-Vannoy situation I described earlier in this article, I would like to be able to drop the match threshold to reveal the small segment data when other matches are suggestive of a family relationship.

In the Hickerson situation, having the ability to drop the matching thresholds would have been the key to positively confirming this relationship within the vendor’s data base and not having to utilize third party tools like GedMatch – which require the cooperation of all parties involved to download their raw data files.  Not everyone transferred their data to Gedmatch in my Vannoy group, but enough did that we were able to do what we needed to do.  That isn’t always the case.  In fact, I have an nearly identical situation in another line but my two matches at Ancestry have declined to download their data to Gedmatch.

This not the first time that small segment data has played a successful role in finding genealogy solutions, or confirming what we thought we knew – although in all cases to date, larger segments matched as well – and those larger segment matches were key and what pointed me to the potential match that ultimately involved the usage of the small segments for triangulation.

Using larger data segments as pointers probably won’t be the case forever, especially if we can gain confidence that we can reliably utilize small segments, at least in certain situations.  Specifically, a small segment match may be nothing, but a small segment triangulated match in the context of a genealogical situation seems to abide by all of the genetic genealogy DNA rules.

In fact, a situation just arose in the past couple weeks that does not include larger segments matching at a vendor.

Let’s close this article by discussing this recent scenario.

The Adoptee

An adoptee approached me with matching data from GedMatch which included matches to me, Dean, Carl and Harold on chromosome 15, on segments that overlap, as follows.

adoptee chr 15

On the spreadsheet above, sent to me by the adoptee, we can see some matches but not all matches. I ran the balance of these 4 people at GedMatch and below is the matching chart for the segment of chromosome 15 where the adoptee matches the 4 Vannoy cousins plus William(H), the Hickerson cousin.

  Me Carl Dean Harold Adoptee
Me NA FTDNA FTDNA GedMatch GedMatch
Carl FTDNA NA FTDNA FTDNA GedMatch
Dean FTDNA FTDNA NA FTDNA GedMatch
Harold GedMatch FTDNA FTDNA NA GedMatch
Adoptee GedMatch GedMatch GedMatch GedMatch NA
William (H) GedMatch GedMatch GedMatch GedMatch GedMatch

I decided to take the easy route and just utilize Lazarus again, so I added all of the known Vannoy and Hickerson cousins I utilized in earlier Lazarus calculations at Gedmatch as siblings to our adoptee.  This means that each kit will be compared to the adoptees DNA and matching segments will be reported.  At a threshold of 300 SNPs and 4cM, our adoptee matches at 140cM of common DNA between the various cousins.

adoptee vannoy match

Please note that in addition to matching several of the cousins, our adoptee also triangulates on chromosomes 1, 11, 15, 18, 19 and 21.  The triangulation on chromosome 21 is to two proven Hickerson descendants, so he matches on this line as well.

I reduced the threshold to 4cM and 200 SNPs to see what kind of difference that would make.

adoptee vannoy match low threshold

Our adoptee picked up another triangulation on chromosome 1 and added additional cousins in the chromosome 15 “sticky Vannoy” cluster and the chromosome 18 cluster.

Given what we just showed about chromosome 15, and the discussions about IBD and IBS guidelines and small matching segments, what conclusions would you draw and what would you do?

  1. Tell the adoptee this is invalid because there are no qualifying large match segments that match at the vendors.
  2. Tell the adoptee to throw all of those small segments away, or at least all of the ones below 7cM because they are only small matching segments and utilizing small matching segments is only a folly and the adoptee is only seeing what he wants to see – even though the Vannoy cousins with whom he triangulates are proven, triangulated cousins.
  3. Check to see if the adoptee also matches the other cousins involved, although he does clearly already exceeds the triangulation criteria to declare a common ancestor of 3 proven cousins on a matching segment. This is actually what I did utilizing Lazarus and you just saw the outcome.

If this is a valid match, based on who he does and doesn’t match in terms of the rest of the family, you could very well narrow his line substantially – perhaps by utilizing the various Vannoy wives’ DNA, to an ancestral couple.  Given that our adoptee matches both the Vannoys and the Hickersons, I suspect he is somehow descended from Daniel Vannoy and Sarah Hickerson.

In Conclusion

What is the acceptable level to utilize small segments in a known or suspected match situation?

Rather than look for a magic threshold number, we are much better served to look at reliable methods to determine the difference between DNA passed from our ancestors to us, IBD, and matches by chance.  This helps us to establish the reliability of DNA segments in individual situations we are likely to encounter in our genealogy.  In other words, rather that throw the entire pile of wheat away because there is some percentage of chaff in the wheat, let’s figure out how to sort the wheat from the chaff.

Fortunately, both parental phasing and triangulation eliminate the identical by chance segments.

Clearly, the smaller the segments, even in a known match situation, the more likely they are identical by population, given that they triangulate.  In fact, this is exactly how the Neanderthal and Denisovan genomes have been reconstructed.

Furthermore, given that the Anzick DNA sample is over 12,000 years old, Identical by population must be how Anzick is matching to contemporary humans, because at least some of these people do clearly share a common ancestor with Anzick at some point, long ago – more than 12,000 years ago.  In my case, at least some of the Anzick segments triangulate with my mother’s DNA, so they are not IBS by chance.  That only leaves identical by population or identical by descent, meaning within a genealogical timeframe, and we know that isn’t possible.

There are yet other situations where small segment matches are not IBS by chance nor identical by population.  For example, I have a very hard time believing that the adoptee situation is nothing but chance.  It’s not a folly.  It’s identical by descent as proven by triangulation with 10 different cousins – all on segments below the vendor matching thresholds.

In fact, it’s impossible to match the Vannoy cousins, who are already triangulated individually, by chance.  While the adoptee match is not over the vendor threshold, the segments are not terribly small and they do all triangulate with multiple individuals who also triangulate with larger segments, at the vendors and on different chromosomes.

This adoptee triangulated match, even without the Hickerson-Vannoy study disproves the blanket statement that small segments below 5cM cannot be used for genealogy.  All of these segments are 7.1cM or below and most are below 5.

This small segment match between my mother and her first cousins also disproves that segments under 5cM can never be used for genealogy.

Two cousins combined

This small segment passed from my mother to me disproves that statement too – clearly matching with our cousin, Cheryl.  If I did not receive this from my mother, and she from her parent, then how do we match a common cousin???

me mother small seg

More small segment proof, below, between my mother and her second cousin when Lazarus was reconstructing my mother’s father.

2nd cousin lazarus match

And this Vannoy Hickerson 4 cousin triangulated segment also disproves that 5cM and below cannot be used for genealogy.

vannoy hickerson triang

Where did these small segments come from if not a common ancestor, either one or several generations ago?  If you look at the small segment I inherited from my mother and say, “well, of course that’s valid, you got it from your mother” then the same logic has to apply that she inherited it from her parent.  The same logic then applies that the same small segment, when shared by my mother’s cousin, also came from the their common grandparents.  One cannot be true without the others being true.  It’s the same DNA. I got it from my mother.  And it’s only a 1.46cM segment, shown in the examples above.

Here are my observations and conclusions:

  • As proven with hundreds of examples in this and other articles cited, small segments can be and are inherited from our ancestors and can be utilized for genetic genealogy.
  • There is no line in the sand at 7cM or 5cM at which a segment is viable and useful at 5.1cM and not at 4.9cM.
  • All small segment matches need to be evaluated utilizing the guidelines set forth for IBD versus IBS by chance versus identical by population set forth in the articles titled How Phasing Works and Determining IBD Versus IBS Matches and Demystifying Autosomal DNA Matching.
  • When given a choice, large segment matches are always easier to use because they are seldom IBS by chance and most often IBD.
  • Small segment matches are more likely to be IBS by chance than larger matches, which is why we need to judiciously apply the IBD/IBS Guidelines when attempting to utilize small segment matches.
  • All DNA matches, not just small segments, must be triangulated to prove a common ancestor, unless they are known close relatives, like siblings, first cousins, etc.
  • When working in genetic genealogy, always glean the information from larger matches and assemble that information.  However, when the time comes that you need those small segments because you are working 5, 6 or 7 generations back in time, remember that tools and guidelines exist to use small segments reliably.
  • Do not attempt to use small segments out of context.  This means that if you were to look only at your 1cM matches to unknown people, and you have the ability to triangulate against your parents, most would prove to be IBS by chance.  This is the basis of the argument for why some people delete their small segments.  However, by utilizing parental phasing, phasing against known family members (like uncles, aunts and first cousins) and triangulation, you can identify and salvage the useable small segments – and these segments may be the only remnants of your ancestors more than 5 or 6 generations back that you’ll ever have to work with.  You do not have to throw all of them away simply because some or many small segments, out of context, are IBS by chance.  It doesn’t hurt anything to leave them just sit in your spreadsheet untouched until the day that you need them.

Ultimately, the decision is yours whether you will use small segments or not – and either decision is fine.  However, don’t make the decision based on the belief that small segments under some magic number, like 5cM or 7cM are universally useless.  They aren’t.

Whether small segments are too much work and effort in your individual situation depends on your personal goals for genetic genealogy and on factors like whether or not you descend from an endogamous population.  People’s individual goals and circumstances vary widely.  Some people test at Ancestry and are happy with inferential matching circles and nothing more.  Some people want to wring every tidbit possible out of genealogy, genetic or otherwise.

I hope everyone will begin to look at how they can use small segment data reliably instead of simply discarding all the small segments on the premise that all small segment data is useless because some small segments are not useful.  All unstudied and discarded data is indeed useless, so discarding becomes a self-fulfilling prophecy.

But by far, the worst outcome of throwing perfectly good data away is that you’ll never know what genetic secrets it held for you about your ancestors.  Maybe the DNA of your own Sarah Hickerson is lurking there, just waiting for the right circumstances to be found.


Autosomal DNA 2015 – Which Test is the Best?

$
0
0

One of the questions most often asked today is which autosomal DNA test, or testing company, is the best, meaning Ancestry, 23andMe or Family Tree DNA.

The answer is often that it varies depending on your goals, individual priorities and budget.  As with all things, circumstances with the vendors change over time.  They offer new products, change features and overall, sometimes their actions and choices make them more or less valuable and attractive to the consumer.

This article reflects my opinions about what is good, and bad, at each vendor, today, in February 2015, and what they do best and worst.  I am reviewing them in alphabetical order.

23andMe

Best Feature

  • Ability to download matching information about who your matches match that you match as well, along with common matching DNA segments, allowing direct triangulation.

23andme best feature

In the example above, you can select the profile of any person you match and match  against the profile of anyone else you match, showing you the common DNA segments of all parties.

Good Features

  • Chromosome Browser
  • Ethnicity feature tends to report minority Native and African when other companies sometimes fail to do so.
  • Ethnicity painted on chromosome segments.
  • Matching names provided in order of frequency found – of course this assumes that the matches have entered a list of family surnames, which isn’t often the case.
  • Y and mitochondrial DNA haplogroup estimate provided.

Not So Good

  • Trees – were horrible before. 23andMe has recently partnered with MyHeritage which will require a subscription if your tree is larger than 250 individuals. The jury is still out on this but the initial release has been rocky and appears untested.
  • Most of their customers are not genealogists and are not interested or know little about their genealogy. Fortunately, serious genealogists often test with multiple companies so you’re likely to catch them at either Family Tree DNA or at Ancestry.
  • Very low match response rate to inquiries.  Positive response is required to see matching DNA segments.
  • Must communicate through internal message system.
  • Unfriendly website – difficult to find information.
  • Big Pharm alliances, contracts and medical patents – and your DNA is included one way or another, individually or aggregated, depending on the level of your authorization.
  • Corporate focus is on medical and not genealogical.
  • Customer support is poor, slow and often never replies.
  • Limit of roughly 1000 matches, at which point your matches begin to be trimmed. You can retain more if you have established communications with people. I have over 1200 matches today, but I don’t know how many I have lost. This can make your effective matching threshold much higher than their published number by virtue of the fact that your smallest matches are forever being trimmed after you reach the 1000 match threshold.
  • Spit kit versus swab kit.
  • Cannot adjust matching threshold.
  • V4 chip precludes data transfer to Family Tree DNA
  • Test not available worldwide, meaning data base is not worldwide.  Also not available in NY or MD.

Worst Feature

  • Horribly cumbersome and confusing multiple introductory and authorization/acceptance hurdles cause many people to not contact, communicate with and authorize sharing with most of their matches. I wrote about this here.

 

Ancestry.com

Best Feature

  • The shakey leaf hints that show you who, of your DNA matches, also share a common ancestor in your pedigree chart. This drastically reduces the amount of initial footwork you need to do.

shakey leaf

Good Features

  • The size of their data base increases likelihood of matching.
  • DNA Circles provides additional evidence of ancestral connection.
  • They are a genealogy, not a medically focused company.
  • Provides list and links to matching surnames on matches trees, even when no common ancestor is identified.
  • Clean, easy to use interface, although major changes have been announced and I have no idea whether that will be a positive or negative

Not So Good

  • Some people have private trees which means they can see your match information, including a common ancestor if there is one, but you cannot see theirs.
  • Ancestry ethnicity sometimes finds minority amounts of admixture, but can also be significantly incorrect on majority ancestry, so it’s difficult to have confidence in the consistency of results.
  • Subscription required (starting at $49) to see matches/circle members which may not be fully understood before testing by consumers. In my case, I have a full subscription, so it’s a moot point, but that is not the case with everyone and it can be an unwelcome surprise.
  • Ancestry’s consent allows them to sell anonymized results to buyers, including Big Pharm, should they choose to do so. As of October 2014 when I visited Ancestry as part of DNA Day, they stated that they had not sold any DNA data at that time.
  • Communication is only through internal message system.
  • Spit kit versus swab kit.
  • Customer service is often uneducated about genetic genealogy in general, although they are responsive.
  • Combination of matching and Circles leads people to believe that these are confirmed genetic matches to that particular line, even though Ancestry states otherwise, if one reads the text.
  • DNA is an auxiliary tool and not a primary or priority corporate focus.
  • Corporate history shows lack of commitment to DNA and to clients who tested – meaning their on-again-off-again DNA history the destruction of the Y and mtDNA data bases in October 2013.
  • Academic phasing may have trimmed real matches.
  • Test not available worldwide, meaning data base is not worldwide, although Ancestry has just announced availability in the UK and Ireland.
  • Y and mitochondrial DNA ignored.

Worst Feature

  • No chromosome browser or equivalent type of tool or tools. I can’t state this strongly enough and it is a HUGE negative and requires that you transfer your results to either Family Tree DNA or to Gedmatch where you do have tools.

 

Family Tree DNA

Best Feature

  • Full service genetic genealogy company – focused on genetic genealogy.

ftdna best feature

Good Features

  • Accepts transfers from Ancestry and V3 chip from 23andMe
  • Partnership with National Geographic for research.
  • Chromosome browser which includes in-common-with feature, search by surname and search by ancestral name.
  • Matching Matrix individually and within projects for administrators.
  • Projects and the ability within projects with advanced matching to see everyone you match autosomally within that project.
  • Match names and e-mails provided – not forced to utilize an internal messaging system.
  • Consent signed when ordering test is all that is needed for full matching and all features.
  • Does common surname matching with all matches – bolding the results.
  • Matching attempts to take highly endogamous populations into consideration.
  • Includes access to other genetic genealogy tools like various levels of Y and mtDNA tests.
  • Data base includes results for all tests, in one place, and resulting matches show Y and mtDNA haplogroups if that test has also been taken.
  • Searches can include multiple types of test results, like everyone who matches both the mtDNA and the Family Finder test.
  • Archives DNA for 25 years, allowing upgrades to be done on order without re-swabbing if DNA is adequate and viable.
  • Testing performed in in-house lab.
  • Project administrator liaison provided.
  • Educational webinars for general genetic genealogy education and new product/feature releases. Archived webinars available on demand.
  • Project administrator conference annually for the past decade.
  • New features regularly released.
  • Swab kit versus spit kit.
  • Responsive to customer and project administrator needs and requests.
  • Their customers more likely to be serious genealogists versus someone who tested initially for medical information (at 23andMe before December 2013) or impulse buyers.
  • They do not sell and do not request consent to sell your personal or aggregated data to outside buyers. If your DNA data is ever requested for an academic research project, you will be individually contacted for consent.
  • No subscription that increases actual cost of utilizing the test results.
  • Available worldwide (unless illegal in the location, like France.)

Not So Good

  • Cannot see if your matches also match each other on a specific segments, so cannot directly triangulate.
  • Cannot adjust matching threshold for initial match, but can after initial match.
  • Ethnicity often does not pick up small amounts of minority admixture found by other vendors and at Gedmatch.

Worst Feature

  • Trees are difficult to use.

Recommendations

1. In light of the above, my recommendation for autosomal DNA testing for genealogy if you can take only one test, order the Family Finder test with Family Tree DNA. They are unquestionably committed to genetic genealogy, have the most comprehensive set of tools, including a chromosome browser and other matching tools, and are overall the best company. The Family Finder test costs $99, unless you purchase when it’s on sale or have a coupon. (Current coupon code for $15 off is 15for15.)

2. If you can test with two companies, test with Family Tree DNA and Ancestry.com. You can do this by testing with Ancestry.com and transferring your results to Family Tree DNA,  This approach costs about $187 total: to test at Ancestry ($99), for the first year basic subscription at Ancestry to see all your matching results ($49) if you aren’t already a subscriber, then to transfer the results to Family Tree DNA (free) and unlock the results ($39) unless you find 4 more people to transfer and then the unlock is free.  Note that you will still need to swab to obtain the genealogy benefits of Y and mtDNA testing if you choose to take those tests in addition – and I hope you will because those are very valuable genealogy tools too and not available at the other vendors.

3. In my opinion, 23andMe has become a distant third in DNA testing due to their floundering and lack of commitment in the genealogy market-space, their prohibitively difficult introduction system that requires individual approvals for communicating and then for sharing of DNA (meaning matching) for each person you match, their recent alliance with Big Pharm, and their continuing lack of responsiveness to requests for genealogy enhancements. Lastly, you can no longer transfer your results from 23andMe to Family Tree DNA because 23andMe moved to the v4 chip (in December of 2013) which reduced the number of SNPs tested from about 900,000 to about 600,000, making the results incompatible with Family Tree DNA. However, just because they are third doesn’t mean you shouldn’t test there if you are really serious and want to fish in all of the ponds. It’s just the third choice if you can’t test at all three.

Judy Russell, The Legal Genealogist just wrote an article, 2015, Most bang for the DNA buck, which I suggest you read as well.  She makes some very good points, although our approach is a bit different.  But then again, I’d expect that.  I’ve spent my life doing “analytical” types of things and she has spent her life doing “lawyery” types of things, and there is nothing better than two perspectives to draw from.

The Future

It will be interesting to see what the landscape looks like a year, 2 and 5 years from now.  I think the X-prize (pardon the pun) will go to the company or companies that provide comprehensive tools and make genetic genealogy as easy and productive as possible – for both the beginner and the advanced user.  No small feat – that’s for sure!


Why Autosomal Response Rate REALLY DOES Matter

$
0
0

In my recent article “Autosomal DNA 2015 – Which Test is the Best?,” one of the comparison items between vendors I mentioned is response rate.  Specifically, I said, in reference to 23andMe, “Very low match response rate to inquiries.  Positive response is required to see matching DNA segments.”

This has generated some commentary, but based on the nature of the comments, both in terms of blog comments and private e-mails, I can tell that many people don’t understand why response rate matters at 23andMe.  On the other hand, some regular users of all 3 vendors felt I didn’t go far enough in explaining the difference and why response rate at 23andMe matters so much.

I’m going to see if I can make this issue a bit more clear.  Response rate really does matter and it’s not just whining!

apples oranges

At 23andMe YOU CAN’T SEE MATCH INFORMATION OR DO ANY DNA COMPARISON WITHOUT A POSITIVE RESPONSE FROM THOSE YOU MATCH.  In other words, they must reply in the affirmative – that they want to communicate with you AND that they want to share DNA results.  Otherwise, you can do nothing.

This is a process not required by either Family Tree DNA or Ancestry.  So, out the door, there is a very big difference.

At Family Tree DNA, you can see everything available WITHOUT additional correspondence, so while a response from a match would be nice, it’s not essential to being able to compare their DNA, see who you match in common, see their tree, if posted, find your common surnames, or perform any other function provided by the vendor.

At Ancestry.com, WITH a subscription, you can see your matches, their trees (if not private) and DNA Circles with no additional correspondence.  The only time you need to correspond with someone is if their tree is private or they don’t post a tree.

The operative words here are want and need.  At 23andMe, you absolutely positively NEED a positive response from each and every match (both authorization to communicate AND authorization to share DNA results) BEFORE you can DO anything.

So, comparatively speaking, a low response rate at 23andMe means that you’re only going to see a small fraction of your matches that are showing, while a low response rate at the other vendors is an irritant and comes after you’ve utilized the vendor’s tools and then asked your match for additional information.  In other words, no response at Family Tree DNA or Ancestry is not a barrier to playing.  At 23andMe, you’re dead in the water if your matches don’t respond.

In essence, 23andMe requires three authorizations to be able to see your matches DNA information: the original authorization to test, authorization to communicate and authorization to “share” DNA results.

With both Family Tree DNA and Ancestry, one authorization, when you initially test, is all you need – although the tools and approach of these two vendors are very different as pointed out in the original article.

So, as you can see, the response rate at Family Tree DNA and Ancestry really isn’t essential to utilizing the tools, but it’s another matter entirely at 23andMe – so we’re not comparing apples to apples.

So, let’s look at the real effects of 23andMe’s authorization policy.

At 23andMe

At 23andMe, this is what you get, out of the box.  The person’s account I’m using for this first graphic tested for two purposes and is not interested in genealogical contact, so this is an “untouched” account, except that I’ve redacted the names, if showing, in blue to the left.  Looks good – all those matches, until you realize you can’t DO anything without contacting each and every single match.

23andme untouched

What isn’t obvious is that you can’t COMPARE your DNA or information with any of these people WITHOUT sending an introduction request.  In addition, they ALSO must authorizing DNA sharing.  And by the way, an introduction request and DNA sharing are NOT one and the same thing.  You can see the names of public matches, who have pre-authorized communications, but you cannot compare DNA with them.  You can’t even see the names of other (nonpublic) matches until you send an introduction request to them and they reply in the affirmative.  Those are the accounts above that just say “male” with no blue partially redacted name above them.

If you click on “Send an introduction,” here are your options.

23andMe intro request

You can request an intro and genome sharing in one message, but that doesn’t mean they’ll accept both nor does it mean that someone will send you a request for both.

This is what an introduction request looks like to the receiver.

23andMe contact request

Now, an introduction request only allows you to talk to your match.  If they do not ask for, or authorize genome sharing, next, you have to request to share your DNA results – and they also have to reply in the affirmative to that request too.

Not intuitively obvious you say?  Right!

Here’s the process to request to share genomes.

23andme dna share request

And here’s the reply step to authorize genome sharing.

23andme dna share authorization crop

Is it any wonder the response rate is low?

So, as you can see, just being able to see that you have a match is not the same thing as being able to utilize the information.  With Family Tree DNA and Ancestry, you can immediately utilize the information from all of your matches to the full extent of that vendor’s offerings.

At Family Tree DNA

At Family Tree DNA, here is what you see out the gate (full names redacted.)

Family Tree DNA out the gate

No contact request needed, no separate authorization to share DNA and no subscription required to see your matches, to compare DNA, to see who you match in common, to see their trees (if provided) or to see your matching surnames.  The little dropdown box under each person provides additional options.

You don’t NEED to contact your matches for anything.  You may WANT to contact them for genealogy information, especially if they have not uploaded or created trees.

At Ancestry – WITH Subscription

At Ancestry.com, to see all three available DNA related features, your matches, their trees (if provided and if public) and DNA Circles, you must have a subscription.  Ancestry offers a minimal subscription for $49, per year, for this purpose or a standard subscription covers DNA functionality as well.  You must have a subscription to see your matches trees and your DNA Circles.

Here is what your Ancestry match page looks like.

Ancestry with subscription

You don’t NEED to contact your matches to view results.  You may WANT to contact those you match and if their tree is private, you will have to contact them to request to see the tree or for the identity of your common ancestor if you have a shakey leaf.

Comparative Numbers

So, let’s look at this comparatively, for my accounts at the three vendors.

23andMe Family Tree DNA Ancestry (with subscription)
Total Number of Matches 1373 2100 3950
Number of Matches I can see without special approvals (meaning a match response required) 0 (0%) 2100 (100%) 3950 (100%)
At 10% response rate, number of effective matches 137 (10%) 2100 (100%) 3950 (100%)
At 10% response rate, DNA accounts available to compare DNA 10% or 137  accounts 100% or 2100 accounts 0% (no chromosome browser)

This shows, in black and white, why a low response rate at 23andMe is so devastating.  The percent of people whose DNA you can see equals the response rate at 23andMe.  So if you have 1000 matches at 23andMe, but you only have a 10% response rate, it’s the same as having only 100 functional matches – because the rest are entirely unavailable to you – well except for the fact that they sit there and stare at you mockingly.

If one has a 10% response rate at 23andMe, and all of those responses are positive, and all authorize BOTH communication and DNA sharing, you are still only seeing 10% of the matches listed.  So, 1000 matches at 23andMe is not at all the same as 1000 matches at Family Tree DNA or Ancestry.

At Family Tree DNA, all of your match accounts are immediately available to you for viewing, communicating and comparison.

At Ancestry, you can see all of your matches (with a subscription), but you can’t compare the matching DNA because Ancestry offers no chromosome browser.

The Meat

The meat of genetic genealogy is comparing your actual segments to your matches.  So, let’s look at some real numbers.

I send a custom request to each of my matches at 23andMe and have been doing so since the product was introduced.

Looking at my top 100 matches, let’s see how many authorized sharing.

In a way, this is skewing the results, just so you know, because many of these matches are relatives who I recruited to test initially.  Plus I’ve worked on my closest matches at 23andMe much harder than my more distant matches, so this is an absolute BEST CASE scenario for the 23andMe numbers.  My actual response rate is about 10% for all matches.

At 23andMe, of my closest 100 matches, several of which are close family, 22 of my matches are sharing, one has declined and the rest are in limbo where I’ve sent an invitation and they have not responded. It’s interesting to note that of those 100, 23 are “public” which means that the intro step can be skipped, but they still have to be invited to share genomes.

Number of my 100 closest matches I can see:

23andMe Family Tree DNA Ancestry
Number of 100 closest matches I can see 22 (22%) 100 (100%) 100 (100%)
Extrapolated by % to entire match total 302 of 1373 2100 of 2100 3950 of 3950

23andMe said that existing trees would be available until May 1, 2015, but I can find no trees attached to any of my matching 23andMe accounts now, although there never were many.

Number of trees I can see:

23andMe Family Tree DNA Ancestry
Number of trees I can see 0 (0%) 33 (33%) 66 (66%)*
Extrapolated by % to match total 0 of 1373 693 of 2100 2607 of 3950

*The balance of Ancestry trees are 20 matches that have no trees and 14 that have private trees.  Twenty of the 66 have common ancestors, but of those, 6 are private trees.

Number of people with whom I can compare DNA segments in chromosome browser:

23andMe Family Tree DNA Ancestry
Number of people I can compare DNA 22 (22%) 100 (100%) 0 (0%) (no chromosome browser tool)
Extrapolated by % to match total 302 of 1373 2100 of 2100 0 of 3950

I hope these examples help make it clear why response rate really is an important factor – unfortunately – and why a response rate discussion about Family Tree DNA and Ancestry does not have the same meaning as a response rate discussion about 23andMe.

One of the best things 23andMe could do would be to get rid of the convoluted DNA authorization courtship Macarena dance.  There is no dance instructor, people don’t discover that they need to do it until after they test, and many people simply don’t understand, don’t bother or give up.  If 23andMe isn’t going to get rid of it, the LEAST they could do is to make it easy and step you through the process.  I don’t know who benefits from this, but I guarantee you, it’s not the genealogy consumer.

macarena


John Y. Estes (1818-1895), Civil War Soldier, Walked to Texas, Twice, 52 Ancestors #64

$
0
0

John Y Estes

John Y. Estes, whose photo we believe is shown above, started out years ago with a question, one that is probably answered now, but every time we think we answer one question about him, another dozen take its place.

Let’s start from the beginning.  When I first saw John’s name, I immediately noticed the Y.  Two things occurred to me…first, that’s someone’s last name and second, that’s shouldn’t be too difficult to find.  Y is not like S that would include something like Smith and takes up 10% of the alphabet.  Famous last words, or first thoughts, because assuredly, that second thought was NOT true.

Now don’t laugh, but one time I was at one of those fortune telling places.  The fortune teller asked me if I had any more questions.  I said yes, and asked her about John Y. Estes’s middle name.  She said something like Yarborough or maybe Yancy.  She wasn’t right about anything else either.

Nope, never let it be said that genealogists are a desperate group!

John Y. Estes was born on December 29, 1818, in Halifax County, Virginia to John R. Estes and his wife, Nancy Ann Moore.  Hmmmm, that middle initial R. might be someone’s last name….never mind….

We know that John R. Estes and his wife, Nancy Ann Moore, along with five if not six children made the long wagon journey from Halifax County, Virginia to Claiborne County, TN. sometime between 1818 and 1826 when John R. Estes had a land survey in Claiborne County.  The 1820 census doesn’t exist for Claiborne County and John appears to be gone from Halifax by then, so we’re out of luck knowing where John R. was in 1820.

In the 1830 census, John R. Estes was living in Claiborne County in the vicinity of Estes Holler, shown below.

Estes Holler 2

How do I know that?  Because these families have all become very familiar to me over my 30+ years of research.  John is living beside William Cunningham, who, in 1871 signed as a character witness for John R. Estes.  And six houses away we find John Campbell, the grandfather of Ruthy Dodson who likely raised her after her mother, Elizabeth Campbell died.  Rutha Dodson was the future wife of John Y. Estes.  And next door to John Campbell lived Mercurious Cook whose son’s widow John R. Estes would marry in another 40 years – but that is a story for a different day.

In the early 1830s, John R. Estes took his family to live in Grainger County for a short time.  Nancy Ann Moore’s two uncles, Rice and Mackness Moore lived there, Rice being a Methodist minister.  John R. Estes’s daughter, Lucy, married in Grainger County in 1833.  By 1835, John was back in Claiborne County when Temperance married Adam Clouse, so they didn’t stay long in Grainger County.

For the most part, John Y. Estes grew up in or near Estes Holler, below, from the cemetery, which, of course, is why it’s called Estes Holler today.

estes holler 5

By 1840, John Y. was probably courting the lovely Ruthy Dodson, likely at her grandfather’s house.  John Campbell had died in 1838, but his widow Jenny Dobkins Campbell didn’t die until between 1850 and 1860, so she would have still been living on the old home place, on Little Sycamore Road, below, when young John Y. Estes came to call.

Campbell house

We don’t find John R. Estes in the 1840 census, but by 1841, John R. Estes had to be living someplace in the vicinity because both his sons Jechonias and John Y. Estes married local gals.

On January 3, 1841, John Y. Estes married Ruthy Dodson, less than a week after his 23rd birthday.

John Y Estes Rutha Dodson marriage

Ruthy Dodson’s mother, Elizabeth Campbell died before Elizabeth’s father, John Campbell, did in 1838.  After John’s death, a guardian was appointed for Elizabeth’s children to function on behalf of their financial interests in his estate.

In the 1830 census, the John Campbell household has small children, so it’s very likely that the grandparents, John and Jenny Dobkins Campbell were raising Elizabeth Campbell’s children she had with her husband, Lazarus Dodson.

On September 5th, 1842, John Y. Estes signed a receipt for receiving part of Ruthy’s inheritance.  This seems to have been paid yearly, at least until the children reached the age of majority.

“John Y. Estes rect. dated 5th Sept. 1842, $54.35. Ditto rents for the year 1841, $1.50. Ditto order for what ballence may be in my hands as guardean, amt. $56.61.”

By 1850, we find John Y. Estes living in Estes Holler along with the rest of the Estes clan.  John is listed as a laborer, age 30, Ruthy as age 25 and Lazarus as age 2.

Given that John and Ruthy were married in 1841 and their oldest child in 1850 is only 2, this suggests that John and Ruthy had already buried several children.  If they had one child per year and the child died at or shortly after birth, they could have buried as many as six children in this time.  The Upper Estes cemetery, as well as the Venable Cemetery at the end of the road have many, many unmarked graves.  The Upper Estes Cemetery was within view of the John Y. Estes home place.

Upper Estes Cemetery

Furthermore, we know that John Y. Estes was living on this land, even though we find very few records of John Y. Estes in official county documents.

This land was originally granted to William Devenport and would eventually, in part, become the property of Rutha Estes, John Y.’s wife – but that wouldn’t happen for another 30 years.

William Devenport, April 17, 1850 – James McNeil trustee to William S. McVey, Districts 6 and 8, 475 acres, Buzzard’s Rock Knob – corner of grant to James M. Patterson, from Devenport’s spring, grant to Drewry Gibson, 50 acres #14072, line of Drewry Gibson, crossing Gibson’s branch, S with John Dobkins grant owned at present by Leander and Greenberry Cloud near N.S. McNeil’s line crossing Gibson’s branch on top of Middle Ridge, Planks fence of old Wier place, John Mason’s corner and line, Cunningham’s line, Devenport-Lanham’s corner, Weatherman’s spring, middle ridge – all of above contained in grant 16628 from the St. of Tennessee to William Devenport.

Second tract – 130 acres of land on the S. Side of Wallen’ ridge, corner of D. Gibson’s 50 acres tract #14072, Houston’s line, NW of Devenport’s line, Harkins corner, large rock on top of knob called Buzzard’s Rock, Harkins corner, Abel Lanham’s corner, Henderson’s line, 100 acre tract of WH Jennings, Bise’s corner, top of Wallen Ridge at Bise’s stake corner of Hardy tract, Henderson’s corner, the above contained in grant 27438 St. of Tn. to Devenport.

Also a 25 acre tract known as the Weatherman place.

1851 – William Devenport tax sale to William McVey – bid July 7, 1851 at courthouse, land in the 8th district, but due to a change in the lines now in the 6th district living near the lines of the 6th and 8th, sold for the taxes of 1845 and 1846, $16.77, 200 acres.

Tract 1 – S side Wallen Ridge near Little Sycamore adjacent lands of William Houston, Mordica Cunningham on the South, Samuel Harkins on the North, on NE Cunningham, William Houston’s, the land commonly known as the Weatherman place where William Devenport and John Estes now live.  Census records show that this is John Y. Estes, not John R. Estes that lives beside William Devenport.

So, in 1851, William Devenport is losing his land and apparently, neither he nor John Estes can do anything about it.  John is not bidding on the land.  William S. McVey purchased this land and in 1852, William McVey also purchased a very large tract of land granted to William Estes, John’s brother, which John Y. Estes witnessed.

By 1876, this same land is being conveyed by Henry Sharp to W.H. Cunningham.  How do we know this is the same land that is where John Y. Estes lived?  Metes and bounds are included, it states that is was William Devenport’s and it says that is where David A. King lived when he died.  The Reverend David A. King, a Methodist minister fought for the Union in the Civil War, died in 1873 and is buried in the Upper Estes Cemetery.  His daughter, Elizabeth married the son of John Y. Estes, George Buchanan Estes, in 1878.  I wonder if the old Reverend rolled over in his grave to have his daughter marry the son of a Confederate.  Yes, the secret is out, John Y. Estes was a Confederate.

David King

1876, Mar 30 – Henry Sharp of Campbell Co., TN to W.H. Cunningham of Claiborne for $400, 2 tracts of land in Claiborne on the waters of Little Sycamore Creek on the South side of Wallen’s Ridge adj the land of William Houston, decd and constitute the farm on which David A. King lived at the time of his death, one part is an entry made by William Devenport and bounded as follows: Beginning at a hickory stump on a red bank in Houston’s line thence north 9 deg west with Hentins? Line 94 poles to the Buzzard Rock on the top of Wallen’s Ridge thence with the top of Wallen’s ridge 240 poles to a chestnut oak and when redused to a strait line is south 60 deg west 234 poles then south 75? Deg east on Houston’s line 34 poles to a stake in the other line of Houston’s then with the same north 70 deg east 93.75 poles to a double chestnut and gum on a spur at Houston’s corner thence with lines of Houston’s land south 390 deg east 43 poles to a maple at the branch then east 62 poles to a hickory stump then with lines of Houston’s land south 30 east 43 poles to a maple at a branch then east 62 poles to a hickory stump then north 62 poles to a large white oak corner then east 9 poles to the beginning containing 90 acres more or less.

This land would eventually be owned by Rutha Estes, the wife of John Y. Estes.

The second parcel bounded by…Houston’s line, Devenport’s grant line, 25 acres.  Witness JW Bois, WW Greer.

This was a very, very indirect “round the mountain” way to track John Y. Estes, but it worked.  However, we’re getting ahead of ourselves, so let’s go back before the Civil War.

On March 8, 1856, in the court records, we show that John Y. Estes had an account in the estate of Thomas Baker – in other words, he owed Thomas money.

In the 1860 census, John and Rutha have four more children, although with a gap of 4 years between Lazarus and Elizabeth, it looks like they lost at least one more child.

John Y Estes 1860

Interestingly, John Y. Estes is a shoemaker.  John is shown as owning no land, but he does have a personal estate of $173, which isn’t exactly trivial.

I think in 1860 that John Y. Estes is not living in Estes Holler.  He is living beside carpenters, stage drivers, a wagon maker, a wagoner and a carriage maker who was quite wealthy.  That sounds suspiciously like he was living in town which would have been Tazewell.

The Civil War

Shortly after 1860, life would change dramatically for the Estes family.  Tensions were escalating towards the Civil War, and in 1861, they erupted when initially 4, then 7, then 11 states seceded from the Union, forming the Confederacy.  Tennessee did secede, but not initially.  Claiborne County was badly torn between the North and South, the blue and grey – and families were torn apart as different brothers and sons joined opposite sides.  Loyalties were divided and family members fought against one another.

In 1862, at the height of the Civil War, Confederate troops occupied Tazewell as part of the greater struggle for the strategic Cumberland Gap. When the Confederates evacuated the town in November of that year, a fire followed, destroying much of Tazewell.  In essence, anyone who could leave, did, because Tazewell was a target of continuous raids for food and supplies.

We know by 1870, positively, from the census, that the John Y. Estes family is back in Estes Holler.  We also know from family stories about the Civil War that they spent the majority of the War in Estes Holler.

But what we didn’t know was something far, far more important.

Aunt Margaret told me that while the war was over, it was really never resolved in Claiborne County.  The Crazy Aunts used to tell stories of the men in Claiborne County wearing their Civil War uniforms once again, on Memorial Day, and head for town to “refight” the war, as long as there were any veterans left to do so.  I suspect that most of the fighting was verbal and in the form of relived memories, but assuredly, not all, especially if region’s notorious moonshine was involved….and you know it was!

The aunts, Margaret and Minnie, lived in Estes Holler as a child, and while I knew none of my direct Estes ancestors had served in the Civil War, obviously some people from that area had.  Just a couple years ago, I decided to look for Estes men in Claiborne County, TN to see if any of them had fought in the Civil War at www.fold3.com.  Was I ever in for the surprise of my life.

My great-great-grandfather, John Y. Estes served in the Civil War – but for which side?

John Y Estes reference slip

Look what that says.  Confederate.

John’s service records are confusing, to say the least.  There are documents in his file from both sides, it seems.  How can that be?  Let’s start with the basics.

The Civil War began in earnest in April, 1861 when confederate forces bombarded the Union controlled Fort Sumter, SC in Charleston Harbor.

Many people who lived in Claiborne County fought for the North and joined the Union troops, but not all.  The Civil War was a source of dissention within and between families in Claiborne County.  Few people there held slaves, so slavery was not a driving force.  By searching for his unit, I confirmed that John Y. Estes had joined the Confederate Army, but I was stunned.  All of my other family members in my various lines fought for the Union – including the families from that area.

The history of Carter’s Tennessee Cavalry Regiment F, formed in Claiborne County shows that it was formed on August 10, 1862 by Captain R. Frank Fulkerson who lived near John Y. Estes in the 1860 census.  There is no existing muster roll, although I recreated one as best I could from the various men’s service records in his unit.  Reading John’s record, along with the other men’s records in his unit, (along with regimental and other histories,)  is also how I reconstructed where that unit was, when, and what they were doing.

We don’t know when John enlisted, although it was likely when the unit was formed, nor do we know if he ever applied for a pension.  John would have been 44 years old in 1862, so no spring chicken.  His daughter, Nancy Jane has been born in November of 1861.  He had a wife and 6 children at home ranging in age from Lazarus born in 1848, so 13, to newborn.  His wife probably wanted to kill him for enlisting and save the Union Forces the trouble.

What we do know is that on March 20, 1865, in Louisville, KY, John Y. Estes signed the following allegiance document.  I later discovered that he had been captured and this was one way men obtained their freedom. This document tells us that he had dark skin, dark hair and dark eyes and was 5 feet 7 inches tall, just slightly taller than me. Information I didn’t have before.  If you look closely at John’s picture at the beginning of this article, he may have been mixed-race.

John Y. Estes allegiance

And look, we also have his signature.

So, how did John Y. Estes get to Louisville, KY in 1865 from Claiborne County?  To answer that question, I tracked the activities of his unit.  That was much easier said than done.

Here’s what we know about the activities of Carter’s Tennessee Cavalry Regiment.

Prior to the organization of the regiment, the battalion had been operating in the neighborhood of Cumberland Gap and Big Creek Gaps, at present day LaFollette, TN, about 33 miles distant from each other, along the line of the railroad.

When the regiment was organized it was assigned to Brigadier General John Pegram’s Cavalry Brigade in Lieutenant General E. Kirby Smith’s Department. This brigade was composed of Howard’s Alabama Regiment, 2nd (Ashby’s), 4th (Starnes’), I. E. Carter’s Tennessee Cavalry Regiment, and Marshall’s Battery.

Prior to the Battle of Murfreesboro, on December 29, 1862, Carter’s Regiment joined Brigadier General Joseph Wheeler’s Brigade, and participated in his raid around the Federal Army from Jefferson Springs to LaVergue, to Nolensville, to Murfreesboro, TN. The unit was engaged on December 31 along the Murfreesboro Pike.

Following this battle, the regiment returned to Pegram’s Brigade, in the Department of East Tennessee under Brigadier General D. S. Donelson.

With Pegram’s Brigade, the regiment took part in operations in Lincoln, Boyle and Garrard Counties of Kentucky, and was engaged March 30, 1863 at the junction of the Stanford and Crab Orchard Roads where it was under the command of Colonel Scott, of the 1st Louisiana Regiment. General Pegram’s comment on this operation is interesting: “For Colonel Scott’s operations, I refer you to the accompanying report. Touching this curious document I have only to say that I cannot but admire the ingenuity with which Colonel Scott has attempted to account for disobedience of orders and dilatoriness of action which it is my sincere belief lost us the fight.” Colonel Carter reported five officers and 32 men as casualties in this operation.

It was not a good day to be a Confederate soldier.  John saw his comrades die. It probably wasn’t the first time, and it certainly wouldn’t be the last.

On April 25, 1863, Colonel J. I. Morrison was reported in command of the brigade, now listed as composed of 1st Georgia, 1st and 2nd Tennessee Regiments, 12th and 16th Cavalry Battalions, and Huwald’s Battery. The brigade was at Albany, Kentucky on May 1; at Travisville, Fentress County, Kentucky on May 2.

On July 23, the Chief of Staff, at Knoxville, ordered Colonel Scott, then commanding the brigade, to send 300 horses of 1st (Carter’s) Regiment to Loudon, Tennessee.

On July 31, Pegram’s Brigade, consisting of 1st and 6th Georgia Regiments, 7th North Carolina Battalion, 1st Tennessee Regiment, Rucker’s Legion, and Huwald’s Battery was reported at Ebenezer.

From December of 1862 to August of 1863, John Y. Estes’s unit covered over 1000 miles and marched from East Tennessee, near the Cumberland Gap to central Tennessee to Kentucky, back to central Tennessee and then back to the Cumberland Gap.

John Y Estes civil war map

On August 15, Carter’s Regiment was reported as operating near Clinton and participated in the fighting around Cumberland Gap.  This fighting occurred on the land previously owned by John Y. Estes’s wife’s father, Lazarus Dodson.  The photo below is on Tipprell Road, on Lazarus’s land, looking North towards Cumberland Gap.

dodson land tipprell road

This is where Lazarus Dodson’s father, Lazarus Dodson’s Revolutionary War marker stands today, in the Cottrell Cemetery, below, now on land owned by Lincoln Memorial University.  This photo is standing in the cemetery, looking North towards the mountains and Cumberland Gap.

Cottrell cem looking north

This map shows LMU complex, the location of the cemetery with the upper red arrow and the location of the Dodson homestead with the lower arrow.  You can see the now abandoned road that used to connect the homestead with the cemetery.

Dodson homestead Cottrell Cem

The map below shows the larger area.  It’s probably a mile between the Dodson homestead and the LMU campus across the back way and maybe two and a half miles to Cumberland Gap, up Tipprell Road from the Dodson home.

Cumberland Gap Dodson homestead

This Civil War map shows where the troops camped, at Camp Cottrell, at Butcher Springs.  Lazarus Dodson had sold this land in 1861 to David Cottrell whose residence is marked on the map.  That was the old Lazarus Dodson homestead.  The main road, now called Tipprell Road, was called Gap Creek Road at the time.  It connects the valley, passes Butcher Springs and continues up to Cumberland Gap along the creek and now the railroad as well.  The road heading to the right above the Cottrell homestead used to go up to the cemetery, but is no longer a road today.

camp cottrell civil war map

This photo shows that area today.  It’s flat, so perfect for camping.  Butcher Springs is to the right in this photo, below, just out of sight.

DSCF9016

This is me standing in the Cottrell Cemetery.

Me in Cottrell Cemetery

Butcher springs would be behind me in the valley to the right.  On the Civil War map, Patterson’s Smith shop would be the cluster of buildings where you can see the church, to the left in the picture, in the distance, across the road.

Cumberland Gap was captured by the Federal troops on September 9, 1863, but the Confederate regiment had escaped up the valley before the surrender, and on September 11, Colonel Carter was reported in command of the brigade near Lee Courthouse.  Lee Courthouse is present day Jonesville, VA, about 35 miles from Cumberland Gap.  I’ve added Estes Holler here for context.

John Y Estes Cumberland Gap Lee Courthouse

On September 18, Carter’s Regiment was driven from the ford above Kingsport, TN after a severe fight.  This fight was only 7 days later and Kingsport was another 45 miles distant over rough, mountainous terrain.

John Y Estes Jonesville Kingsport

Somewhere about this time, the regiment was assigned to Brigadier General John S. Williams’ Cavalry Brigade, composed of the 16th Georgia Battalion, 4th Kentucky Regiment, 10th Kentucky Battalion, May’s Kentucky Regiment, 1st Tennessee and 64th Virginia Regiments, which on October 31, 1863 was reported at Saltville, Virginia, 60 miles northeast of Kingsport, TN.

The unit received orders to proceed to Dalton, GA, but despite these orders, Carter’s Regiment was reported near Rogersville on November 1, in Williams’ Brigade, with Colonel H. L. Gutner commanding.

Rogersville was back, through Kingsport, about 90 miles “down the valley,” so to speak.

John Y Estes Rogersville Saltville

In the meantime, Captain Van Dyke’s Company “C” had returned from Mississippi, and on November 24, 1863 was at Charleston, Tennessee with Colonel John C. Carter’s 38th Tennessee Infantry Regiment. Charleston was 145 miles from Rogersville.

John Y Estes Rogersville Charleston

Colonel Carter highly commended Captain Van Dyke and his 44 men for the part they played in helping his forces to evacuate Charleston without being captured.  On April 16, 1864, the regiment was transferred to Vaughn’s Brigade, of Brigadier General J. C. Vaughn’s Division, and reported 248 men present. It remained in this brigade until the end of the war.

By May of 1864, the majority of the fighting had shifted to Virginia.  Between mid-April and May, John Y. Estes’s unit traveled almost 400 miles, from Charleston, TN to the Lynchburg, VA region.

John Y Estes Charleton Lynchburg

The Civil War was becoming a series of constant battles which were referenced as the Campaign in the Valley of Virginia which lasted from May-July of 1864 as shown on this map by Hal Jespersen.

Shenandoah Valley Campaign 1864

As part of Vaughn’s Brigade, the regiment moved into Virginia in early 1864, fought at the Battle of Piedmont, New Hope Church, and in the subsequent campaign in the Valley of Virginia under General Early.

Germanna Ford

This drawing from Harper’s Weekly shows the troops crossing at Germanna Ford during the Battle of New Hope Church, also called the Mine Run Campaign.

Mine-Run

This drawing shows the “Army of the Potomac at Mine-Run, General Warren’s Troops attacking.”

Battle of Piedmont

This is the location, today, of the Battle of Piedmont.  This battlefield looked very different when John Y. Estes stood here on June 5th, 1864.  There were men, horses and blood all over this battlefield.  After severe fighting, the Confederates lost, badly.

It was this point, nearing the end of this chapter of the war, that John Y. Estes entered the hospital on June 12th.  But, that doesn’t mean he was done…the worst, perhaps, was yet to follow.  What happened next?  There has to be more.

Hmmm, let’s check the 1890 Civil War veterans census.  Nope, nothing there.

Well, let’s look under Eastice.  His folder says that name was used as well.

John Y Estes private

Well, Glory Be, look what we’ve found.  His index packet, indeed, under Eastice.

John Y Estes absent

This regimental return of October 1864 says that he was an absent enlisted man accounted for, “Without Cane Valley of Va. Aug. 28.”  That’s odd phrasing.  Does it mean “without leave?”  But it says he is accounted for?

John Y Estes deserter

Uh-oh, this doesn’t look good.  Now he’s on the list of deserters as of March 18, 1865.  It says he was released north of the Ohio River.  That goes along with the “Oath of Allegiance” document that he signed on March the 20th.

John Y Estes POW

Wikipedia says that during the Civil War, prisoners of War were often released upon taking at “oath of allegiance.”  General Sherman was known to ship people to Louisville and those who signed were freed, north of the Ohio, and those who didn’t remained in prison.

This documents John Y’s oath of allegiance, and the faint writing says that his name also appears as John Y. Estus.  How many ways can you spell Estes?  I checked and there are no additional records under Estus – at least none that are indexed yet.

John Y Estes transfer

This document says that he was a Prisoner of War, but this kind of Prisoner of War was a Rebel Deserter.  He was apparently “caught” on March 6th, 1865, send to Chattanooga, then to Louisville apparently in late March where he was taken across the Ohio River.  I’m thinking John Y. considered this a very bad month.

John Y Estes desertion info

This page gives us a little more info.  Apparently he deserted at Staunton, Va. on June 30 of 1864, just days after his hospitalization and release.  Where was he between June 30, 1864 and March 6 of 1865?  And where was he captured?  The first document says that in October of 1864, he was accounted for which I would interpret to mean that they knew where he was and whatever the situation, was OK.  Nothing confusing about these records….

John Y Estes medical

Well here is at least part of the answer.  On June the 12, 1864 he was hospitalized and had a partial anchyloses of his knee.  On June the 19th he was sent to a convalescent camp.  The 30th of the same month, he was reported as having deserted at Staunton.

What they don’t say here is that Staunton was devastated by the Union in June of 1864 – everything was burned including shops, factories mills and miles of railroad tracks were destroyed.  If that is where he was convalescing, it’s no wonder he deserted, or simply left.

He was accounted for in October, but sometime between then and March 1865, he apparently deserted for real, or he already had in October.  I wonder if he simply went home, or attempted to go home.  Where was he when he was caught, or deserted?  If you are a Confederate deserter, and the Union forces “catch” you, do they still hold you prisoner?  Maybe the Confederates only thought he deserted and he was in fact captured?  But the Union paperwork indicates he was listed as a Rebel deserter.  So many questions.

Ankylosis or anchylosis is a stiffness of a joint due to abnormal adhesion and rigidity of the bones of the joint, which may be the result of injury or disease, sometimes resulting from malnutrition. The rigidity may be complete or partial and may be due to inflammation of the tendons or muscular structures outside the joint or of the tissues of the joint itself.  Sometimes the bones fuse together.  This disease is considered a severe functional limitation.

So here is what we know about John Y. Estes and the Civil War.  He probably joined when the regiment was formed on August 10, 1862, although he may have been participating in the unofficial unit since 1861.  The Fulkerson’s in Tazewell, his near neighbors, were instrumental in raising Confederate volunteers in Claiborne County.  John Y. Estes fought and served until he was either injured or a previous condition became so serious in 1864 that he could not function, although he participated in some of the worst fighting and most brutal battles of the war.  John is reported to have been admitted to the hospital in Charlottesville, VA on June the 12th, transferred to a convalescent camp on June 19th, and deserted at Staunton, Va. on June the  30th.  In October, 1864 records say he was accounted for, but absent.  By March 6th of 1865, he was in prison, captured as a deserter, transferred to Chattanooga, signed the allegiance oath and by the end of March, had been taken to Louisville before being deposited on the north side of the Ohio River, having agreed to stay there for the duration of the war.

He didn’t have long to wait.  General Lee surrendered at the Battle of Appomattox Court House on April 9th, 1865.  But then John probably had to walk home on that injured leg.

That leg apparently didn’t slow him down much.  John Y. Estes eventually walked to Texas, not once, but twice, according to the family, which means he walked back to Tennessee once too.  The family said one leg was shorter than the other and he walked with a cane or walking stick.  It’s about 950 miles from Estes Holler in Claiborne County, Tennessee to Montague County, Texas.  I surely want to know why he walked back from Texas to Tennessee.  After making the initial journey, on foot, taking months, what could be that important in Tennessee?  Was he hoping to convince his wife to relocate with him?  Even then, land and other legal transactions could be handled from afar, so it must have been an intensely personal reason.  Maybe he only decided to return to Texas, forever, after he had returned to Tennessee.

I have to wonder how John’s Civil War allegiance and subsequent desertion, if that is actually what it was, affected John himself and the way that the people in Claiborne County viewed him.  He went back home and lived for several years.  His neighbor in Estes Holler, David King, fought for the North.  So did his sister’s husbands and children.  I’m betting holidays were tough and there was no small talk at the table.  Maybe there were no family gatherings because of these polarized allegiances.  They would have been extremely awkward and difficult.  Maybe John was quietly ostracized.  Maybe that’s part of why he eventually left for Texas.

On October 5, 1865, just six months after being released on the north side of the Ohio River, John Y. Estes did a very unusual thing.  He deeded his property, mostly kitchen items and livestock, to his son Lazarus who was about 17 years old and lived in the family home.

Transcribed from book Y, pages 286 and 287, Claiborne County, Tennessee, by Roberta Estes.

Deed of Gift From John Eastis to Lazarus Eastis :

State of Tennessee, Claiborne County. Personally appeared before me J. I. Hollingsworth, clerk of the county court of the said county, J. R. Eastis and Sallie Bartlett, with whom I am personally aquainted, and after being duly sworn depose and say that they heard John Y. Eastis acknowledge the written deed of conveyance, for the purpose therein contained upon the day it being dated. Given under my hand at office in Taswell this 9th day of October, 1865. J. I. Hollingsworth, clerk. Know all men by these presents that I, John Eastis of the County of Claiborne, State of Tennessee in consideration of the natural love and affection which I feel for, my son, Lazarus and also for divers good cause and consideration, I the said John Eastis, hereunto moving, have given, granted and confirmed by these presents, do give, grant and confirm unto said Lazarus Eastis all and singularly, the six head of sheep, one horse, fourteen head of hogs, one cow and calf, two yearlings, the crop of corn that is on hand, and all the fodder, and all the household and kitchen furniture, to have and to hold and enjoy the same to the only proper use, benefit and behoof of the said Lazarus Eastis, his heirs and assigns, forever and I the said John Eastis for myself and my heirs, executors, and administrators all and singular the said goods unto the said Lazarus Eastis, his heirs and assigns, against myself and against all and every person, or persons, whatever shall and will warrant and forever defend by these presents in witness whereof, I have hereunto set my hand and seal this 5th day of October 1865.  John Y. Eastis.

ATTEST: John R. Eastis, Sallie Bartlett. I certify this deed of gift was filed in my office, October 9, 1865 at 12:00 and registered the 10th day of the same month. E. Goin, register for Claiborne County. [ stamped on page 58 ].

John R. Estes is the father of John Y. Estes who would have been close to 80 years old at that time.

Is this somehow in conjunction with or a result of the Civil War?  Did it take him that long to find his way back to Claiborne County?  Was he angry with his wife?  Lazarus was only a teenager and didn’t live in his own home, and wouldn’t for another 18 months.

The verbiage in this transaction, “hereunto moving” does not mean that John was literally moving, but refers to what motivated him or moved him to make this transaction.  So, in this context, love and affection for his son “moved” John to convey this property.  Of course, this begs the question, “what about your wife?”  Rutha would be the person to use all of that kitchen gear to prepare meals for the entire family.  What about Rutha?

In the 1870 census, John is shown with his wife and family, with another baby, Rutha, named after his wife, born in 1867. John and his wife, Ruthy Dodson, would have one more child, John Ragan (or Reagan or Regan) Estes, born in March of 1871.

We know that in 1879, John Y. Estes was in Claiborne County, but whether he was “back” from Texas or whether he had not yet left, we don’t know.  On June 20, 1879, John Y. Estes signs an agreement granting James Bolton and William Parks permission to make a road across his land in order to enable Bolton and Parks to have access to their own land that they had just purchased from Lazarus Estes, John Y’s son.  This is the last document that John Y. signs in Tennessee.  And actually, it’s the only deed, ever.

Deed records show no evidence of John Y. Estes ever owning land or a conveyance to or from John Y. Estes.  My suspicion is that John was buying this land “on time” and when he failed to pay, the transaction was simply null and void and the deed never filed.  It’s still odd that he would sign to grant access on land he did not officially own.  This is very likely the same land that Rutha would eventually own in her own name.  Sometimes truth is stranger than fiction.

We know that by June of 1880 when the census was taken, John Y. Estes is living in Texas and his wife Rutha, is shown in Claiborne County as divorced, although no divorce papers have been found.  Maybe divorce was less formal then.  Given the distance involved, about 900 miles, and give that John could probably not walk more than 8 or 10 miles a day, the walk to Texas likely took someplace between 95 and 120 days, or 3 to 4 months, if he walked consistently every day and didn’t hitch rides.  So John likely left Claiborne County not long after the signing of the 1879 deed.  In fact, that might have been the last bit of business he took care of before departing.

The family in Texas tells the story that John Y. was wounded in the leg as a young man, although they don’t say how, and that one leg was shorter than the other.  He walked with a stick.  It causes me to wonder if the injury was truly when he was a child or if it was a result of his time in the Civil War, or maybe some of each.  It’s a wonder they would have accepted him as a soldier if he was disabled and his military battle history certainly doesn’t suggest a disability.  Maybe they were desperate or maybe the old injury got much worse during his military service – or maybe the injury occurred during one of the Civil War battles.  John was hospitalized and I find it difficult to believe he would have been hospitalized for an old injury.

During John’s absence, Claiborne County was not immune to the effects of the war.  In fact, they were right in the middle of the war, time and time again, and without a man in the household, Rutha and the family weref even more vulnerable.

During the Civil War, soldiers from both sides came through Estes Holler and took everything they could find: food, animals, anything of value. They didn’t hurt anyone that we know about, but the people hid as best they could. Adults and children both were frightened, as renegade troops were very dangerous.  Elizabeth Estes, born in 1851, was the second oldest (living) child of John Y. Estes and Rutha Dodson.  After the soldiers took all the family had, the 4 smaller children were hungry and crying. The baby had no milk. Elizabeth was angry, not only at what they had done, but the way they had been humiliated. She was a strong and determined young woman, about age 14 or 15, and she knew the soldiers were camping up on the hillside. She snuck into the camp of the soldiers that night, past the sentries, and stole their milk cow back. She took the cow’s bell off and the cow just followed her home. I don’t know if it’s true or not, but another story adds that she went back the second night and took their one horse back too. That one horse was all the family had to plow and earn their living.

Today, not one family member knew that John Y. Estes had served in the Civil War, not even the Crazy Aunts.  Given the way his service ended, it’s probably not something he talked about.  He would have been considered a traitor by both sides.  He didn’t claim his service on the 1890 veterans census either.  It seems a shame to have served for most of the war, in many battles, and survived, only to have had something go wrong in the end that seems to be medically related.  The term “deserter” is so harsh, and while I’m sure it technically applies, I have to wonder at the circumstances.  During the Revolutionary War, men “deserted” regularly to go home and tend the fields for a bit, showing back up a month or two later.  No one seemed to think much of it then.  That’s very likely what happened to John when he supposedly deserted in June of 1864, right after his injury.  He probably just left and went home.

I’m sure there is more to this story, much more, and we’ll never know those missing pieces.  And it’s a chapter, a very important chapter in the life of John Y. Estes and who he was.  It’s very ironic that none of his descendants alive today knew about his Civil War Service.

The Walk to Texas

Initially, I had no idea John Y. Estes ever left Claiborne County.

When I first visited Claiborne County, I did what all genealogists do – I went to the library.  I had called the library and the librarians seemed friendly enough, and they told me they had these wonderful things called “vertical files.”  I didn’t know what that was, so the nice lady sighed and said, “family files.”  Now, that I understood.

The first day I arrived in town, I went straight to the library.  I looked through the books and the family histories that had been contributed.  Most of those were for the “upstanding families” whose members had been judges and public officials.  That would not be my family.  In fact, there was very little for my family.  I was sorely disappointed.  Those promising vertical files either held little or there were none for my surnames.

I had packed up and was leaving, walking past the shelves that held so much disappointment, when one of the files literally fell off the shelf and about three feet onto the floor.  I was no place close to it, so it was prepared to fall with no help from a human, but the librarians looked up at me, and then down at the file on the floor, with great disdain and disgust.  They, obviously, felt I was careless and had knocked the file onto the floor.

I had no problem picking the file up, but I wished they hadn’t been so put out with me.  The file hit sideways and all of the papers fanned across the floor.  Most of them weren’t stapled together, so I was trying to make sure that I put them back in the file in order that they had come out, without mixing things up.  I have no idea the surname on the file.  I had already checked all of mine.  But as I was gathering those papers back into the file, a familiar name crossed my vision, Vannoy, then another, and then Estes.  I stopped and actually looked at the papers in the file.

I was holding a story about John Y. Estes, written by a Vannoy who had moved to Texas.  I put my bag and purse down, and sat down – on the floor – in the aisle way – oblivious to the librarians and their stares, now glares.  I read all three pages of the story and sat in stunned disbelief.  This had to be the wrong man. It was in the wrong family file.  Otherwise, someone would have told me….wouldn’t they?

My family didn’t go to Texas.  Did they?

This story says John Y. Estes walked to Texas, not once, but twice.  This man injured his leg somehow as a child and walked with a limp, one leg being shorter than the other. He walked with a cane or a stick, and still, he walked to Texas, twice, and back to Tennessee once.  This man had tenacity.  Of course, when I was reading this, I didn’t realize he had also fought through the Civil War with this lifelong challenge. I wouldn’t know that piece of the puzzle for another 30 years. I hesitate to call it a disability, because John Y. apparently didn’t treat it as such.  In fact, it just might have saved his life in the Civil War.

Fannie Ann Estes, John’s grand-daughter, said that John Y. brought a skin cancer medicine from Tennessee and sold it in Texas.  He traveled throughout north Texas selling his remedy and established a relationship with William Boren, a merchant that sold goods on both sides of the Red River throughout the Red River Valley.  This was also the location where the Chisolm Trail crossed from Texas into Oklahoma, so comparatively speaking, it received a lot of traffic.

So John Y. Estes was either a snake-oil salesman or a genius on top of being a shoemaker, according to the census, a Civil War veteran and a former Prisoner of War.  This man was certainly full of surprises.  What a great plot for a book!

His grandchildren said that as an old man, they remember him being short and fat.  Hardly a fitting legacy.  Thankfully, one person remembered more and wrote it down.

To the onlooker, it appears that John Y. Estes basically left his family in Claiborne County, TN and absconded to Texas.  But looking at what happened next, his children apparently did not seem to hold a grudge against him for leaving their mother….in fact, John Y. Estes seemed to be more leading the way than abandoning the family.

It’s clear from Rutha’s 1880 census designation as divorced that she viewed the relationship as over.  She never intended to leave Claiborne County, nor did she.  But that didn’t stop her relatives from going to Texas – and they all settled together, including her husband.  Many are buried in the same cemetery.

William Campbell, Ruthy’s uncle, and his family were in Texas by 1870. Barney J. Jennings married Emily Estes, daughter of Jechonias Estes, and they went to Montague Co., TX, as well.

Many of John Y’s children, in fact all of them except Lazarus, eventually moved to Texas, including brave Elizabeth who married William George Vannoy.  She left with William Buchanan Estes and Elizabeth King in 1893, in a wagon train.

Children

The following children were born to John Y. and Ruthy Dodson Estes:

  • Lazarus Estes, born in May 1848 in Claiborne Co., died in July of 1918 in Claiborne Co., married Elizabeth Ann Vannoy.  Both buried in the Pleasant View Cemetery.
  • Elizabeth Ann Estes, born July 11, 1851 in Claiborne Co., died July 7, 1946 at Nocona, Montague Co., Texas.  On September 11, 1870, she married William George Vannoy, brother to Lazarus’s wife and son of Joel Vannoy and Phebe Crumley.  They settled in Belcherville, TX in 1893 and her husband was buried in the Boren Cemetery in Nocona on Sept. 12, 1895, only seven days before her father died and was buried in the same cemetery.  I wonder what killed both men.  This must have been a devastating week for Elizabeth.  She spent most of her life in Texas as a widow – more than 50 years.

Elizabeth Estes Vannoy

Elizabeth Estes Vannoy’s 95th birthday. She liked to sit on an old seat out under a tree.  Elizabeth is buried in the Nocona Cemetery, not with her husband.

Elizabeth Estes Vannoy stone

  • Margaret Melvina Estes, born July 19, 1854 in Claiborne Co., died April 7, 1888 in Claiborne Co., buried in Pleasant View Cemetery.  Never married and no children.
  • George Buchanan Estes, born December 17, 1855 in Claiborne Co., died July 1, 1948 at Nocona, Texas, buried at Temple, Cotton Co., Oklahoma. In 1878 he married Elizabeth King, daughter of David King, in Claiborne Co. She died in 1920 and is buried at Temple, Oklahoma.

George Buchanan Estes and Wanda Hibdon

George Buchanan Estes and granddaughter Wanda Hibdon Russell in 1945.

  • Martha Geneva J. Estes, born October 6, 1859 in Claiborne Co., died April 9, 1888, buried in Cook Cemetery on Estes Road. She married Thomas Daniel Ausban in Claiborne Co. April 17, 1884.  It’s not believed that she had any surviving children.
  • Nancy J. Estes, born November 1861 in Claiborne Co., died at Terral, Jefferson County, Oklahoma in 1951, married a Montgomery.  Buried in the Terral cemetery.  No children.

Nancy Jane Estes Montgomery

  • Rutha Estes, born January 7, 1868 in Claiborne Co., died at Terral, Jefferson Co., Oklahoma in 1957.  She married Thomas Vannoy in 1902 in Claiborne County, or at least she took the license to marry him.  They may have never actually married, as she never used the Vannoy surname, nor is she ever found living with him.  She married William H. Sweatman after 1920 in Texas or Oklahoma and is buried in the Terral Cemetery.  No children.

Ruthie Estes Sweatman

  • John Reagan Estes, born March 25, 1871 in Claiborne Co., died July 8, 1960 in Jefferson Co., Oklahoma. On April 10, 1891 he married Docia Neil Johnson, daughter of William Johnson and Jinsey Nervesta King in Claiborne Co., She was born November 7, 1872 in Claiborne Co. and died August 30, 1957 in Jefferson Co.  John and Docia are both buried at Terral, Oklahoma.

The Texas family provides this information about John Regan Estes.

John Regan Estes grew to manhood in Claiborne Co. Tennessee, he received his schooling on the old split log seats and was taught to the “tune of a hickory stick”. On April 9, 1891 he married Docia Neil Johnson in Tazewell, with Rev. Bill Cook, the old family preacher, reading the vows. John and Docia were wed on horseback. A daughter, Fannie Ann, was born to them on May 4, 1892 at Tazewell.

In 1893, John Regan Estes had the ambition to go west. On the first day of November 1893, he stepped off the train at Belcherville, Texas. He was accompanied by his brother, George Buchanan Estes and family, Clabe Bartlett, and Lewis Taylor Nunn. He worked on the Silverstein ranch until January 1894.

He saved his money and sent it back to Docia and on February 9, 1894, Docia and Fannie, aged 20 months, arrived at the train station in Belcherville. At this time, they went to Oscar, Indian Territory. He located on a farm in the Oscar area and lived there until moving to the Fleetwood community in 1901. John’s farm was located on the Red River across from Red River Crossing where the Chisholm Trail crossed into Oklahoma. He had a shop near his barn and shod horses, sharpened plows, and did other metal work for the community.

Cousin Gib’s grandmother, granddaughter of John Y. Estes through John Reagan Estes told of life in Texas when they first arrived:

Fannie wrote about the Estes family living conditions at the time that Lula was born. She said that they lived in an old log house at the end of Ketchum Bluff, this is the area where the road going south from Oscar, Oklahoma makes a turn along a high rock formation an goes to where, at a later time, there was a toll bridge built going into Texas.

Ketchum Bluff map

Courtesy Butch Bridges

Note that the old trestle of the toll bridge can still be seen on the shore of Ketchum Bluff in the aerial photo, below, about one fourth of the way from the right hand side, directly across from the sand bar.  The bend in the river at the turn is in the lower left hand corner of the photo.  The bluff, of course, lies along the river.

Ketchum Bluff aerial

Courtesy Butch Bridges

 

Lula was born January 29, 1899 and Fannie said that it was extremely cold and they had snow on the ground for about six weeks. The sun would come out about noon each day for a little while and then it would cloud up again and snow all night. She said that their father would cut wood all day and carry it into the house. He did not have any gloves and his hands would crack open and bleed and hurt so bad that at night he would sit by the fire and cry from the pain.

In 1901, John got the farm a little farther west of here, just east of Fleetwood, and that is where Lula grew up.

The Estes family had moved to Indian Territory in 1894 and Oklahoma did not become a state until 1907. During this time it was pretty much every man for himself and gunfights were common. John Reagan worked as a farmer, blacksmith, farrier and lawman. The family remembers him wearing a gun.

Once, a man named Joe Barnes sent word to John that he was coming to kill him. John only had a black powder shotgun and he told Barnes to stop and to not come any closer. Barnes kept coming and John blew him full of birdshot. John had a bullet hole in his stomach and would tell the grandchildren that he had two navels.

John Reagan Estes circa 1905

John Reagan Estes about 1905.

John Reagan Estes family 1905

John Reagan Estes and family in 1905.

John Reagan Estes

John Reagan Estes in 1943.

Uncle George said that John R. Estes came to visit in the 1940s in Claiborne County Tennessee and that he was extremely tall and had very long eyebrows.

John Reagan Estes stone

The Texas family members, tell another secret too, that John Y. Estes had another family in Texas, but a search of marriage records produced nothing.  However, when I visited, I realized that the location where John lived was on the Choctaw land.  Perhaps he did have a second family without benefit of a legal marriage.  Laws and customs on Indian lands on the Texas/Oklahoma border were quite different than back in “civilized, orderly” Tennessee.  Furthermore, Indian tribes were considered sovereign Nations.  We will probably never know the details unless another family member steps forward.

John Y. Estes died on September 19, 1895 and is buried in the Boren cemetery, northeast of Ringgold, Texas.

Old Time Texas

In 2005, I visited my cousin, Gib, in Texas.  Gib had come back to Claiborne County, TN the year before and had visited Estes Holler.  Now, I was visiting Texas to retrace the steps of my great-grandfather, John Y. Estes.

Gib gave me a great piece of advice before I set out on my great adventure to Texas.

We went to see the movie “Open Range” starring Kevin Costner and Robert Duvall. The setting for the movie is 1882 and they are “free grazing” a herd of cattle on the open range as they are moving toward market. They pass through a little town, cross a river, and are tending their herd.

John Y. Estes was in Montague County Texas in 1880. The Chisholm Cattle trail came right through the little town of Red River Station which was two miles south of the Red River. From the information that I have, the movie town was exactly like what Red River Station was like in 1882. I really got intrigued with the movie by imagining John Y. being in a place just like that. This was where he would have been at that time because Nocona and Belcherville were not founded until 1887 when the MKT railroad came through going from east to west. Ringgold was not founded until 1892 when the Rock Island railroad was built going south to north and crossed the MKT at the site of Ringgold.

Of course no good western movie would be worth the price of admission without a good gun battle. They had one and people were killed. The next thing that grabbed me was the burial scene. They dug graves out on top of a hill and hauled the wooden caskets out in a wagon. This setting was just like what I found at Boren cemetery.

Another thing that caught my attention was the heavy rain storm that they experienced at the little town. Red River Station was pretty much wiped out by a Tornado in the late 1880’s and all the business moved to Belcherville and Nocona.

Anyway, go see the movie and imagine John Y. being one of the residents of the little town and then visualize all of our relatives crossing the Red River on horseback as they did in the movie. The River depth shown is also accurate of Red River. Later, John Reagan Estes owned the land on the Oklahoma side and the Campbells and Vannoys owned ranches on the Texas side.

Go see where John Y. lived in 1882, let your imagination run wild and enjoy it.

 I agree 100% with Gib’s recommendation.

The Chisolm Trail

The Chisolm Trail cut through the Estes land.

Chisholm Trail

Not far from Ryan is one of the cuts in a creek bank  worn by the pounding of thousands of hoofs when the Chisholm Trail was noted for its cattle drives from Texas to Wichita, Kansas.

This map shows Ryan and Terral, OK, and the ghost location of Fleetwood.  All that is left today is a store full of bullet holes and a cemetery.

Fleetwood OK

According to Gib, that cut is still visible on the Estes property. Although highway U.S. 81 mostly follows the route of the old Chisolm Trail, at times Engineers had to diverge from the trail itself in the interest of safety, mileage and economy. The original route crosses a cow lot owned by a man who probably knows more about that trail than anyone in this area. ( Note: the worn cattle trail rut up the hill was just west of the Estes cow lot. ) The location is about three miles east of Fleetwood.

The Chisolm Trail crossed the Red River at Red River Station.  On the Oklahoma side, or Indian Territory at that time, this was at Fleetwood and a marker has been placed today.  On the map below, you can see the balloon of the marker at Fleetwood and below the Red River, Red River Station Road.

Red River Station

Turning on the satellite image, here’s that part of the Red River near Station Road where the cattle would have crossed into Oklahoma.  Apparently, this is the area where the Estes land was located.  I thought sure I’d still be able to see the Chisolm trail today, but I can’t.

Red River Chisolm Crossing

There was a large dugout in the side of the hill where the Estes family lived while their house was being built.

dugout house

You really have to want to visit the Boren Cemetery.  It’s nearly impossible to find, to begin with, and after you to locate it, getting to it through 3 or 4 farm gates is another problem entirely.  And then there’s the issue of wild hogs – and they are not friendly.  In fact, they’re pretty testy – and they aren’t looking to you to feed them, but are looking at you as food.  I fully understand why people here carry guns – plural.

The Boren Cemetery

Boren cemetery crop

The Boren cemetery isn’t far from the Chisolm Trail and not far from where the Estes land was located.  On the map below, you can see the cemetery, marked by the red balloon, and you can also see the Red River Station Road to the right and Fleetwood on the Oklahoma side of the border.

Boren Cem near Red River Station

The Boren Cemetery is located in rolling Texas hill country – and sometimes those rolls are a bit steep.

Gib says to me, “It’s over there somewhere.”

Boren cemetery approach

Ok, Texas is a mighty big place and I don’t SEE anything that looks like a cemetery.

Gib had obtained directions and he and his wife had come out once already and scouted the area.  His wife opted not to come a second time.  That should have been a clue.

Gib had called the local farmer, so he had the lock combinations to the several gates we encountered.

Eventually, we entered a field and started driving across the field, then up the hill, then Gib’s 4 wheel drive vehicle bottomed out.  We were on foot from here on.

Gib forgot to mention about the snakes to me.  Those would be rattlesnakes.  Now, I have snake-boots at home, but those boots at home weren’t helping me one bit here.  I was not to be deterred.  Gib was wearing cowboy boots and walked in front of me.

???????????????????????????????

We found the path that led up to the cemetery,

We had to crawl under the barbed wire fence, or climb over it – because there was no gate.  By now, I could feel the rivulets of sweat running down my back.  Gib, the consummate Texas cowboy, was entirely unphased.  They make ‘em tough down there – I’m telling ya!

Boren cemetery cactus

And if the barbed wire doesn’t get you, the cactus will.  Yes, that’s a bone.  I don’t know is the answer to your next question.  Just don’t ask.

Boren cemetery stones

It’s kind of rough country here, with the stones scattered in no order, graves dug where there were no rocks to interfere with the shovels.  At home on the Indiana farm where I grew up, we would have called this scrub, scratch or hard-scrabble.  Here, it is normal.  But that’s why they need a lot of it to make a living.

???????????????????????????????

This stone in front is the marker for John Y. Estes.  It’s beside a Campbell and Vannoy marker, in fact, John’s son-in-law who was buried just a week before John was.  Did John stand at his son-in-law’s grave just a week before he would be buried beside him?  John’s marker is actually very unique, as gravestones go – and the only one here like it.  In fact, it’s the only one I’ve ever seen like it.

???????????????????????????????

John’s stone was cast in concrete and then the information was drawn in the wet concrete with some kind of object – freestyle.  This tickled Gib a great deal because he had spent many years of his life working in the concrete business – so this somehow seemed fitting.

???????????????????????????????

Tracking John Y. Without GPS

So now we’ve followed John Y. Estes across half of the United States.  While his son, Lazarus likely never ranged further than Knoxville, John Y. Estes not only was very widely traveled, the biggest part was on foot – at least the Tennessee to Texas to Tennessee to Texas part – and probably much of the Civil War part too.

Let’s look at where John Y. Estes was and when.  I can’t keep track.

Location Date
Halifax Co., VA 1818 – birth location
Claiborne Co., TN 1820s, 1840-1870s
Grainger Co., TN 1830s
Tazewell, Claiborne Co., TN 1860
Claiborne County, TN Aug. 10, 1862 – Confederate Unit Formed
Murfeesboro, TN Dec. 29, 1862 – Civil War battle
Murfeesboro Pike, TN Dec 31, 1862 – Civil War battle
Stanford and Crab Orchard Road, KY March 30, 1863 – Civil War battle
Albany, KY May 1, 1863 – Civil War battle
Travisville, Fentress Co., KY May 2, 1863 – Civil War battle
Ebenezer, TN July 31, 1863 – Civil War activity
Clinton, TN August 15, 1863 – Civil War activity
Cumberland Gap, TN August 15, 1863 – Sept. 1863 – Civil War activity
Lee County, VA Courthouse Sept. 18, 1863 – the North took the Gap – Civil War battle
Kingsport, TN Sept. 18, 1863 – Civil War battle
Saltville, VA Oct. 31, 1863 – Civil War battle
Rogersville, TN Nov. 1, 1863 – Civil War battle
Charleston, TN Nov. 24, 1863 – Civil War battle
Battle of New Hope Church, Orange Co., VA Nov 27 – Dec. 2, 1863
Valley of Virginia Campaigns, Shenandoah Valley, VA May-July, 1864
Battle of Piedmont, Augusta Co., VA June 5, 1864
Charlottesville, VA June 12, 1864 – hospital
Stanton, VA June 30, 1864 – deserted
Chattanooga, TN March 6, 1865 – POW
Louisville, KY March 20, 1865 – POW signed oath of allegiance – released north of the Ohio
Claiborne Co., TN 1865-1879
Nocona, TX 1880-1895

I would have loved to sit for a day and talk to this man.  What stories he had to tell.

The John Y. Part of Me

I have to tell you, this man had hootspa.  He was tenacious.  He walked to Texas, twice, using a cane or stick to walk, more than 900 miles each way, when he was 61 years of age.  And it didn’t kill him.  I can’t even begin to imagine this trip, once, let alone once there, walking back to Tennessee and then back to Texas, again.  In essence, just one of those trips took 3-4 months.  Three of them probably took more than year of his life.

The concept of that just baffles me. What could be that alluring about Texas?  And why go back to Tennessee once you had arrived in Texas?

But then again, I’m not so terribly different in some ways.  And sometimes things I do baffle others.

In the 1980s, I decided to retrace the Trail of Tears, in honor of my Native American ancestors and in protest of the atrocities that befell them.  I walked part of the trail, but that’s a lot easier said than done for various reasons – not the least of which is that the trail isn’t (or wasn’t then) marked and segments are lost or missing in many places.  In the 1980s and 1990s, I had completed the segment through Tennessee and Kentucky, into Illinois.  In 2005, I completed the section between southern Illinois and Tahlequah, Oklahoma, the home of the western Cherokee nation today, where the Cherokee settled. Altogether, this trek took me over 20 years because I had to make it in segments.  In 2005, I picked up where I had left off in Illinois and within a couple days, found myself at the location where the Native people crossed the Mississippi..

Trail of Tears State Park

I walked part of that as well, on both sides of the river, but given that I was traveling alone, I had to walk back to my car and then drive to the next segment to walk.  Take my word for it, the state of Missouri goes on forever!

Trail of Tears Crossing

I was a lot younger then that John Y. was when he walked to Texas, and he walked the entire distance, not just a few miles or a day here and there.

One of the most unforgetable stops on that journey was the Trail of Tears State Park in Missouri, just across the border from Illinois where the Cherokee spent a horrific winter, starving and freezing to death, and waiting for the ice to melt so they could cross the Mississippi.  It took eleven weeks to cover 60 miles and the Native people suffered terribly, horrifically – the local people refusing to help them with food.  Within days, there was no wildlife left to hunt.

Trail of Tears at Mississippi

This is on the Missouri side of the River, looking across the river at the land where more than 15,000 Native people camped, and waited, with no food and only light blankets in one of the worst winters recorded.  Weakened from starvation, people froze to death nightly.  The dead couldn’t even be buried, their bodies left in the snow.  There were no reports of cannibalism, but that level of desperation would not have surprised me.

The Trail of Tears as a whole, but in particular, this segment was a unfathomable act of inhumane genocide – torture, hour by hour, day by day, as you watched those you love starve and freeze, as you were doing so yourself.  One can feel their aching spirts as you stand on the land, even yet today.  Some were so devastated that they never spoke again in their lifetimes.  Their torture and grief is unfathomable and the depth of that black hole remains both tangible and palpable today.  There simply are no words.

My final destination in 2005, 125 years after John Y. Estes walked to Texas?  Texas.  Why?  To find John Y. Estes’s grave.  I never, at that time, realized the parallels.  But then, I didn’t really know the rest of the story.  Today, I find the parallels mind-boggling.

What of John Y. Estes do I have in me?  Do I carry his tenacity?  My mother would assuredly have voted in the affirmative, and she would not have meant that as a compliment!  I, on the other hand, am quite proud of that trait.

Sometimes it’s difficult to answer these kinds of questions – meaning how much of one particular ancestor’s DNA you carry.  One reason is that generational DNA is often measure in couples.  By this, I mean that if I compare myself to another individual who descends from John Y. Estes, like cousin Buster for example, the DNA that Buster and I share will not be just the DNA of John Y., but also the DNA of John Y’s wife, Rutha Dodson.

The only way to avoid this “spousal contamination,” and I mean that only in the nicest of ways, is by comparing the DNA of descendants of John Y. to someone who only descends from the Estes side, not the Dodson side.  What this really means is that the comparison has to be against someone who descended from John R. Estes, the father of John Y. Estes (or another Estes whose ancestor is upstream of John Y. Estes and who doesn’t share other family lines.)  Unfortunately, this means that it pushes the relationship back another generation, which means that less DNA will be shared between the cousins.

The cousins I have to work with are as follows, at least at Family Tree DNA.

Estes descent chart

In order for the closest descendants of John Y. Estes to be compared to a descendant of John R. Estes, I utilized the chromosome browser at Family Tree DNA.  Garmon is descended from John R. Estes, so carries none of Rutha’s DNA.  Therefore, any DNA that John Y’s descendants share with Garmon had to come from the Estes side of the house.

The chromosome browser graphic below shows the chromosome of Garmon, with the following individuals with matching DNA displayed as follows:

  • Me – Orange
  • Iona – Blue
  • David – Green
  • Buster – Magenta

On chromosome 1, Buster and Iona match Garmon, but I don’t and neither does David.  This is clearly John Y. Estes’s DNA, but I don’t carry it.

On chromosome 7 there is a small segment shared by everyone except David.

On chromosome 10, there is another small segment shared by me, David and Garmon.

Part of chromosome 13 is shared by Garmon, Iona and David.

To me, the most interesting part of this equation is that chromosome 19 holds a fairly large segment shared by everyone except Buster.

Garmon chromosome

So, let’s answer the question of how much of John Y’s DNA I carry.  I downloaded the segment chart that accompanies the chromosome browser and used that information to triangulate my matches – meaning that I noted when I matched two other cousins.  Not all matches are triangulated, proving a common Estes ancestor, but some are.  I then checked those cousin’s accounts to be sure they did, indeed, match each other on those segments – which is the criteria for triangulation.

This chart shows all of my matches to Garmon, which, precluding a second line or matches by chance, would all be John Y.’s DNA.

Garmon Roberta DNA matches

As we know, the only way to actually prove that these segments descend from John Y. is through triangulation but how can I triangulate more DNA to John Y. Estes?

The answer is the Lazarus tool at GedMatch, a tool built to reassemble or recreate our ancestors from their descendants – to reassemble their scattered DNA.

First, Lazarus allows you to enter up to 10 direct descendants and up to 100 “other relatives,” which means brothers, cousins, descendants of those people, but not someone who descends from the same spouse as John Y. Estes’s wife, Rutha Dodson.  If he had two wives and you were comparing children from both spouses against each other, then the criteria would be a bit different.

In other words, we’re only utilizing direct Estes line descendants, upstream of John Y. Estes.

I selected 4cM and 300 SNPs as my match criteria.

I have a total of 7 descendants and 4 other relatives, not all of whom have tested at Family Tree DNA.

I was pleased to note after running Lazarus at GedMatch that we had a total of 513.9 cM of John Y. Estes’s DNA reconstructed through his descendants and his other relatives.  In essence, that’s approximately 7.6% of John’s DNA that we’ve recovered.  Not bad for someone who was born 197 years ago.

The Lazarus tool matched my DNA with other Estes relatives, but NOT descendants of John Y. Estes.  I inherited the following segments directly from John Y. Estes.  Several of these segments were triangulated with 2 or more relatives.

John Y. Estes reconstruct DNA matches

Of these, only two, on chromosomes 9 and 19, are partial matches to the original list from Family Tree DNA. While, at first glance this looks unusual, it isn’t.  Both of the matches at Family Tree DNA over the threshold selected at GedMatch are included.  The lower segment matches were not “seen” at Gedmatch.  This is one reason why I utilize both tools when possible.  GedMatch allows you to utilize people’s results who tested at a different company, and Family Tree DNA allows you to easily pick up those common small segments.

If all of these segments are from John (and not from a secondary unknown shared line or identical by chance,) then I carry 156.6 cM of John Y. Estes’ DNA that I can map.  Given that John is my great-great-grandfather, I would be expected to carry about 6.25% of his DNA.  Of that amount, I’ve been able to tentatively identify about 2.3%, so if the right people were to test, I should be able to identify about another 3.95%.  So, in rough numbers, I’ve identified around one third of the DNA that I inherited from John Y. Estes utilizing 7 descendants and 4 other relatives.

So, now if I could just figure out which one of these genes is the “walk to Texas” and wanderlust gene, we’d be all set.  If I received that from any ancestor, it’s very likely to be from John Y. Estes, the only man I’ve ever know who walked to Texas, even once.

Red river aerial

Aerial view of the Red River, Texas on the right, Oklahoma on the left.

Acknowledgements:  A special thank you to cousin Gib, who supplied most of the Texas information and a lot of camaraderie over the years.



Help System at Family Tree DNA Updated Today

$
0
0

family tree dna logoGroup Administrators received this e-mail on April 1st. Presuming it’s not an April Fools Joke (it’s not), the new system is being implemented today, April 2, 2015 and it looks like it should help a lot with timely responses.

Family Tree DNA writes:

After reviewing several options to speed up e-mail response times and improve overall Customer Support efficiency, we are are putting a new system in place as of 10 am Central Time tomorrow, 2 April, 2015.

You’ll want to pay close attention because this is radically different from what you’re used to doing, but it’s for a good reason.

Until now, many of the emails would come to the general email boxes info@ familytreedna.com or helpdesk@familytreedna.com, uncategorized, adding the extra step of having to sort and triage emails, which takes a lot of time since they can vary from something as simple as changing a SNP order to more detailed analysis. Long emails with no kit number and an uninformative subject line complicate the process and require time to determine what the real question is.

With the new system, the above email addresses will be deactivated. Incoming emails will receive an automated, but politely worded reply directing the customer – or you – to the re-vamped Contact Us form. This form will require a kit number, unless you’re a new customer, and will require the submitter to choose a subject category.

All questions will be directed through a “Contact Us” form so that they are already categorized and can be routed to the appropriate customer service representatives.

ftdna support page

Again, please note: info@familytreedna.com and helpdesk@familytreedna.com will no longer be monitored. Emails sent to either of those addresses will get an auto-response directing the sender to the “Contact Us” form here: https://www.familytreedna.com/contact.aspx#contactForm

For future reference, a link to the Contact Us form is conveniently located at the bottom of the www.familytreedna.com home page under the “About” column.

ftdna support page 2

There will still be a “Group Projects” category, and for administrators, the email address groups@familytreedna.com will still be active, so if you’re writing with questions or situations regarding your project, or if you’re writing on behalf of project members, please use either of those options. The groups@familytreedna.com address is a priority category and is monitored regularly.

Please reserve messages to groups@ftdna.com for those that are time-sensitive, urgent, or simply cannot be answered by a CSR. Over the past few months so many routine – though still important – emails have been going to that email address that your Group Project Manager/Liaison cannot answer them all and still perform other duties such as advocating for you with management on important issues regarding group administration.

Important information for you to know and to share with your group members:

  • It’s important to choose the category closest to the topic of your email.
  • Kit numbers are mandatory.There will be an “I don’t have a kit number yet” box to click for new customers. Existing customers can recover their kit number from the “Lost Your Password” link on the login page.
  • Submitting multiple tickets from the same email address about the same issue before having received a reply groups those issues together and puts them at the bottom of the queue. (That will not be the case with groups@familytreedna.com, since admins write about different projects and discrete issues from the same email address. Still, please keep in mind our service hours and use your best judgement when submitting a second email about the same issue.)
  • Contact form submissions do not generate a confirmation email at this time. Your submission will be acknowledged on the page and you’ll be given a request ID number.

Thank you for your support and understanding as we work to bring you and your project members better service.


Proving Your Tree

$
0
0

With all the recent discussion about Ancestry’s new “New Ancestor Discovery” feature rollout, and some wrong individuals being assigned as my ancestors, some people have been asking the question, “How do you know your tree is right?”  In other words, how do I know those ancestors are not my genetic ancestors?  As they correctly pointed out, NPEs and adoptions do occur.

And they are right, absolutely right.  It’s a legitimate question, one that every one of us needs to answer for our own trees.

I answered their question briefly by saying that I have a combination of both paper genealogy and DNA for all ancestors through the 6th generation, which is true, but I want to share more than that.  Plus, I wanted to take the time to really evaluate every single line individually to be absolutely positive of what I was saying, and to weigh the evidence.  All too often, it’s not a handy-dandy yes or no, it’s shades of grey.

It’s important for all of us to treat this, the study of our ancestors, like a big mystery with clues for us to find and decipher.

In some cases, there isn’t much mystery.  For example, unless you’re an adoptee, you probably know your grandparents and their birth and death information is relatively easy to obtain.  First, you’re a family member, and second, relatively complete records exist in the past century.  There are lots of sources – birth and death certificates, obituaries, tombstones still remain, hopefully houses with Bibles haven’t all burned, etc.

But as you move back in time, there are fewer sources available, fewer records, if any, exist and eventually, you’re so far back that there is no “institutional memory” in the form of Aunt Marybelle’s or Uncle Jehosiphat’s stories.

Before DNA, we spent a lot of time compiling information about our families, fitting the pieces together, assembling old wills and estate distributions to figure out who the children were, and so forth.  But we had no avenue to verify for example, that William Jr. was really the biological son of William Sr.  Nor did we have the tools to figure out that William Sr. and his wife had taken a child to raise on a wagon train whose parents had died, and that child really wasn’t the biological child of either William or his wife.  None of that existed before, but does now, at least in certain circumstances.

One of the things people, for some reason, believe is that they are going to take a DNA test and somehow, with the wave of a magic wand, or maybe the click of a leaf, their ancestry is going to be revealed to them.  Needless to say, that’s not how it works.

What we do is continue to use a variety of types of DNA testing to prove various lines of our ancestry – and sometimes disprove them – in conjunction with other types of traditional records.  By now, you’ve probably all heard the story of my brother, who I searched for, for years, only to discover he was not my biological brother.  For me, there is peace in knowing and I love my brother regardless.  I’m so glad I found him before he passed away – regardless of the DNA results.  But before DNA, we would never have been able to know, for sure.  What we believed with all of our hearts was not the truth.  The DNA results were undeniable.

When I started working with DNA for genealogy, I was simply curious.  I did not set forth a goal to “prove my lines,” nor, for a long time, did I really think about that.  I was always just excited when someone from one of my ancestral lines would test, because their mitochondrial or Y results were relevant to my ancestors too – assuming we connected in the correct fashion.  I cherished the ability to discover that my ancestors in that line were from the British Isles, Africa, Scandinavia or were Native American, for example.  Mitochondrial and Y results allow us to extend what we know about that ancestral line back in time, beyond the time of surnames.  These tests help us to answer the question, for each ancestral line, “where did I come from?”  Because, after all, “I” am the combination of all of my ancestors.

In my article, The DNA Pedigree Chart – Mining for Ancestors, I talk about how to create pedigree charts that include Y and mtDNA for each ancestral line.  Obviously, I can’t test for all of these myself.

DNA Pedigree

This is part of the answer to how I know that some parts of my tree are correct.

For example, let’s say my Estes cousin, Buster, tests to represent my Estes Y line, and he matches many Estes men, all the way back to Abraham Estes, the immigrant into Virginia.  That unquestionably proves the Estes line he carries is the ancestral Estes line.  However, since I don’t carry the Estes Y chromosome, I have to utilize autosomal DNA to prove that I am related to Buster and other Estes descendants on the Estes side.  Those two pieces of information combined prove that my Estes line is confirmed back beyond the 6th generation – even though I don’t carry the Estes Y chromosome and I have no one  in my immediate family to “sit proxy” for me.

Why am I focused on the 6th generation?

When Ancestry rolled their new feature that gives people “New Ancestors,” they graciously gave me two that were not only wrong – I can’t figure out any connection at all.

I wrote about this in the article, “Ancestry DNA Gave Me A New DNA Ancestor – And It’s Wrong.”

In order for Diedamia Lyon and John Curnutte, spouses, to be my ancestors, they would have been born in about the 6th generation, given their birth dates, and reproduced in the 5th generation.  The problem is that I have my tree documented solidly back through both of those generations, and John and Diedamia are not my ancestors.

This caused someone to ask how I knew that an NPE hadn’t happened and that one of my ancestral couples, who I believe are my ancestors, aren’t really – and Diedamia Lyon and John Curnutte are instead – or at least John.  Like, perhaps a baby swap, or a wagon train parental death/child adoption or some other form of NPE (nonparental event.)

Good questions.  I want to know the answer too, for my own benefit.

In order to begin to address this, I looked at the years John and Diedamia were born and the locations where they are found.  Diedamia Lyon was born in Wilkes County in 1804 and she and John Curnutte married in 1822 in Lawrence County, KY, according to the Ancestry story attached to this couple, and she died there in 1866.  I can’t vouch for any of this, because it’s taken from Ancestry’s compiled trees.  What I can tell you is that I have no family at all in or near Lawrence County Kentucky, not in this or any other timeframe.

I do have family in Wilkes County, however, which is where I began the comparative search.  Both John Curnutte and Diedamia’s parents came from Virginia and I have lots of ancestry there, including several unknown lines – but not in any generation where John and Diedamia could be my ancestors.  However, these common locations would be the most likely places for an adoption, in whatever form, to have occurred – if one did.

So, let’s take this one parent’s side at a time and look at the proofs I have and how I know, beyond a doubt, that these two people are not my ancestors.

new ancestor mother tree

I’ve divided my ancestors into my mother’s side and my father’s side and created a proof table for each one in the 6th generation.  The Proof column, in this case, means proof that Diedamia Lyon and John Curnutte cannot replace these ancestors in my tree, confirming that these are my ancestors and John and Diedamia are not.

Let’s look at my mother’s side first.  It’s easy.  Hendrick Jans Ferverda, born in the Netherlands about 1806, so about the same time as John and Diedamia, was not in this country at that time.  We have documentary proof from the Netherlands.  We have further evidence of when his son did immigrate in the 1860s.  So, Diedamia and John cannot be clandestine ancestors, replacing Hendrick Jans Ferverda and his wife, Lijsbert Baukes Camstra in my tree.  They weren’t even on the same continent when the begetting would have occurred.

As we assemble the proof for each ancestor, we consider birth and death years and locations, whatever documentation we have, and DNA evidence.

Ancestor Birth/Death Location Facts Proof
Hendrick Jans Ferverda 1806-1874 Born and died in the Netherlands Proof from documents in Leeuwarden and Blija, Netherlands Not in the US at the time
Lijsbert Baukes Camstra 1806-1856 Born and died in the Netherlands Proof from documents in Leeuwarden and Blija, Netherlands Not in the US at the time
Harmen Gerrits de Jong 1803-1866 Born and died in the Netherlands Proof from documents from Oosterlittens and Leeuwarden, Netherlands Not in the US at the time
Angenietje Houtsma 1802-1866 Born and died in the Netherlands Proof from documents from Leeuwarden, Netherlands Not in the US at this time
David Miller 1781-1851 Born Washington Co., MD, died Elkhart Co., Indiana Marriage documents in Warren Co., Ohio, estate in  Elkhart Co., Indiana Miller Y DNA from this line matches to other sons’ descendants of Johann Michael Miller b 1692, autosomal matches to several Miller descendants including mother’s first and second cousins.
Catharina Schaeffer Circa 1775 – 1826 Born Berks Co, PA, died Montgomery Co., PA Marriage document 1799 Berks Co., Marriage document 1805 Warren Co., Ohio Schaeffer males have tested Y and autosomal.  They match the Schaeffer Y upstream of Catharina’s father and match cousins autosomally.
Jacob Lentz 1783-1870 Born in Germany, died in Montgomery Co., Ohio Citizenship papers and census show birth, tombstone and estate papers show death Multiple males have tested Y DNA and they match each other.  They also match other Lentz men, but we can’t figure the common ancestor in Germany.  The Y testers and other cousins match mother autosomally.  Ancestry DNA Circle.
Frederica Moselman 1788-1863 Born in Wurttemburg, Germany, died Montgomery Co., Indiana Was married before immigration Born in Germany, not in US at the time.  Ancestry DNA Circle.
Honore Lore 1766 – 1834 Born in New England during Acadian removal, died Quebec, Canada in Acadian community Church records Y DNA of descendant matches Y DNA of other Lore males upstream of Honore, autosomal DNA matches mother.
Marie Lafaille 1767-1836 Born in New England during Acadian removal, died in Quebec Canada in Acadian community Church records including marriage to Honore Lore in 1789 in Canada Not in right place, married to Honore, autosomal DNA of descendants matches both Lafaille and Lore family members.
Joseph Hill 1790-1871 Born Barrington, NH, died 1871, Lake Co., Ill Hill family from NH and Vermont where he is first found in records, death records in Illinois Autosomal DNA matches with other descendant of Joseph and his parents.  His father is Ancestry DNA Circle.
Nabby Hall 1792-1874 Birth record in Mansfield City, CT town records, death record in Lake Co., Illinois Family moved to Addison Co., VT where children were born and where they are in the records, ancestor daughter’s birth Autosomal DNA matches with other descendants of Nabby and her parents, Gershom Hall and Dorcas Richardson.
Phillip Jacob Kirsch 1806-1880 Born Fussgoenheim, Germany died Ripley Co., Indiana Church birth records, death recorded in cemetery records Not in US at the time
Katharina Lemmert 1807-1889 Born Mutterstadt, Germany, died Aurora, Indiana Church birth records and death recorded in cemetery records Not in US at the time
George Drechsel 1823-1908 Born Speichersdorf, Germany, died Aurora, Indiana German church birth records, death recorded in cemetery records Not in US at the time
Barbara Mehlheimer 1823-1906 Born Goppmansbuhl, Germany, died Aurora, Indiana Germany church birth records, death recorded in cemetery records Not in US at the time

I don’t think there is any doubt whatsoever in any of my mother’s lines that Diedamia Lyons and John Curnutte whose families were from from VA, NC and KY can possibly be substituted for any of these ancestors.

Now let’s move to my father’s side of the family, who were indeed from VA and NC.

new ancestor father tree

In the chart below, I’ve starred the ancestors who I feel have a weak or unknown parental connection, meaning with their parents, based on the facts.  In many cases, this is an unknown mother or unknown mother’s surname or lack of solid DNA proof.  My goal for each ancestor is to have both the genealogical and the DNA proof, supporting each other.

For example, let’s look at Nancy Ann Moore.  Nancy is starred because her mother’s surname is unknown.  This means I can’t prove or disprove any ancestral line through her mother, Lucy.  In other words, while it’s clear that John and Diedamia cannot replace John R. Estes and Nancy Ann Moore as ancestors, one of them might be related to Nancy’s mother.  Therefore, based on the evidence, we do have proof that John and Diedamia are not clandestine ancestors in place of John and Nancy, but what we can’t know is if they are related upstream to Nancy’s mother.

Ancestor Birth/Death Location Facts Proof
John R. Estes 1787-1885 Born Halifax Co., Va, died Claiborne Co., TN Birth and death from War of 1812 pension app Estes Y DNA proven beyond John R. Estes, autosomal DNA from descendants and other Estes descendants triangulate.  Ancestry DNA Circle.
Nancy Ann Moore* Circs 1785-1860/1870 Born Halifax Co., VA, died Claiborne Co. TN Marriage doc in Halifax Co in 1811, husband’s War of 1812 pension app Moore DNA tested to Nancy’s grandfather’s generation, descendants match other Moore testers autosomally, Nancy’s mother’s surname unknown.  Ancestry DNA Circle.
Lazarus Dodson* 1795-1861 Parents living in Hawkins Co in 1795, Pulaski Co., KY death records Father-in-law John Campbell estate records for marriage to Elizabeth Dodson Y DNA beyond Lazarus, many Dodson autosomal matches, Lazarus’s mother’s surname unknown.
Elizabeth Campbell C1802-1827/1830 Parents living in Claiborne County TN per tax and court records, death in Claiborne per her children’s guardian records Her father, John Campbell’s estate records regarding her children, guardianship settlement Campbell DNA from this line matches Campbell clan DNA, autosomal matches to many Campbell cousins.  Her parents are Ancestry DNA Circles.
Elijah Vannoy 1784-c1850 Parents lives Wilkes Co at that time, death from Hancock Co. TN census Elijah found in Wilkes Co deed records in 1807, in Claiborne court records by 1812 Vannoy Y DNA from his line matches lines earlier than Elijah, autosomal DNA matches cousins.  Son is Ancestry DNA Circle.
Lois McNiel c1786-c1839 Parents living in Wilkes at time of her birth per tax and deed records, died before census in Claiborne Co., TN Parents also moved to Claiborne Co., TN, family history records Elijah’s wife as Lois McNiel Y DNA matches back to Rev. George, 2 generations beyond Lois, autosomal matches Lois’ descendants as well.  Son is Ancestry DNA Circle.
William Crumley III* 1785/1789 – 1852/1860 Born where parents lived Frederick Co., VA proven by 1789 tax list, death in Appanoose Co., Iowa by census Was in Lee Co by 1820 census, marriage documents in 1807 in Greene Co., TN Crumley DNA from this line proves back to James, 3 gen earlier, autosomal matches to cousins, William’s mother unknown.  Daughter is Ancestry DNA Circle.
Lydia Brown* 1787/1790-1830/1849 Born where parents lived in Montgomery Co., VA, death by census in Lee Co., VA and husband’s remarriage Married in 1807 in Greene Co., TN, in Lee Co. Va by 1820, in 1830 census, 1850 census shows husband has been married within the year to new wife Brown Y DNA confirms Jotham and matches other Browns without common ancestor identified, autosomal DNA matches to cousins, Lydia’s mother surname unknown.
Henry Bolton* 1759-1846 Born England, died Giles Co., VA Birth location unproven except by family stories, marriage records, death by local documents and census Bolton DNA confirms Henry and there are other matches but common ancestor unproven.  No Y matches to Curnutte or Lyons. Many descendants autosomal match but cannot go beyond Henry with proofs.  Ancestry DNA Circle.
Nancy Mann* c 1780/1783 – 1841 Born where family lived, Botetourt Co., VA, died Giles Co, VA Birth from census and inferred from marriage document 1799, death from family Bible Parents are unconfirmed but believed to be James Mann and Mary Cantrell.  Not Y DNA confirmed to Mann line.  No known Manns from this direct line have tested.  Autosomal matches to James Mann’s brother Moses.  Ancestry DNA Circle.
William Herrell* 1789/1790 – 1859 Born in NC, probably Wilkes Co, died in Hancock Co., TN Birth from War of 1812 pension and bounty land apps, death from his wife on pension app Herrell confirmed back to John, William’s father on Y, match Y cousins autosomally, mother’s surname unknown.
Mary McDowell* 1785- after 1872 Born where in Wilkes Co., NC where parents lived at the time per tax records, died Hancock Co., TN Marriage in 1809 in Wilkes Co., lived in Claiborne & Hancock, death per pension docs and census McDowell Y DNA proven to Michael, her father, via multiple lines, autosomal matches to cousins, mother’s surname unknown.
Fairwick Claxton 1799/1800 – 1874 Birth in Russell Co., VA by census in location where parents lived, death Hancock Co., TN according to his mother’s War of 1812 pension app after his father’s death, death by chancery suit Born in Russell Co., VA, lived in Claiborne which became Hancock Co., TN entire life, chancery suit provides significant info, plus census. Claxton/Clarkson DNA proven to James via Y with additional matches from NC with earlier unidentified common ancestor, autosomal matches between entire group of cousins.  Ancestry DNA Circle.
Agnes Muncy* 1803 – after 1880 Born where Lee Co., VA parents lived according to tax and deed records, dead via census Hancock Co., TN Census and chancery suit show family in Hancock Co., TN Muncy Y DNA confirmed beyond Agnes, cousins matching autosomally.  Would like additional triangulated matches.  Ancestry DNA Circle.
Charles Speak 1804 – 1840/1850 Born Washington Co., VA where parents lived according to tax and deed records, death by census Lee Co., VA Marriage in 1823 in Washington Co., VA, later records in Lee Co., VA having to do with Speaks church Speak Y DNA confirmed back to Gisburn, England, many autosomal matches in this line.  Parents are Ancestry DNA Circles.
Ann McKee* 1804/1805 – 1840/1850 Birth in Washington Co., VA where parents lived according to father’s will, death from census Lee Co., VA Married in 1823 Montgomery Co., VA, moved to Lee Co., VA, her father’s will names her as daughter Have not found McKee Y to test, but match several McKee descendants on autosomal.  Ann’s mother’s surname is unknown.  Father Andrew was Ancestry DNA Circle, but disappeared.

As you can clearly see, there is no question that Diedamia Lyon and John Curnutte aren’t my ancestors.  There is no place for them to be born in 1801/1804, replacing two people here.  Plus there is no Canutte Y DNA matching downstream anyplace, nor any Lyon or Canutte matching at all that I can discover at Family Tree DNA where I can search for ancestral surnames among my matches.  At Ancestry, the only Curnutte surname DNA matches I have are the two individuals that are in the Curnutte “New Ancestor” circle.  Lyon is a more common surname, but nothing connecting matching people, the Lyon surname and any common ancestor or location – other than the two people who also match Curnutte.

I am 100% positive, bet on it and take it to the bank positive, that Diedamia Lyon and John Curnutte are not my ancestors.  And anyone who knows me knows that I never, ever, bet unless I know it’s a sure thing.  So, if I ever say to you, “wanna bet,” think twice.  I wound up with a nice piece of jewelry because my husband hadn’t learned that yet.  Not once, but twice.  Unfortunately, he has learned now:)

However, that doesn’t mean that I don’t share DNA with the descendants of Diedamia Lyon and John Curnutte.  One of two scenarios can be happening.

1. I do share DNA with two of the Lyon/Canutte descendants, but that DNA could be from two different, unidentified, lines, neither of which are John Curnutte and Diedamia Lyon. It just so happens that the two people I share DNA with happen to share the Lyon/Curnutte line between them. Therefore, the leap of faith has been made that I too share those ancestors. A triangulation tool would answer this question, because if I don’t match my two matches on the same segment, there is no proof of the same ancestor.  Lack of a triangulated match doesn’t mean that I don’t share those ancestors either. In other words, it’s not negative proof.  Lack of a triangulated match wouldn’t mean I don’t want to see this information.  I do. I just want to know how strong the evidence is, or isn’t. Without analysis tools, we’re left to flop around in the dark.

2. I share DNA with two of the Lyon/Canutte descendants because there is a common ancestor upstream of EITHER John Canutte or Diedamia Lyon whose DNA comes through that couple to their children who match me. If this is the case, then the common ancestor is most likely in one of the lines that are starred above where the parents are unknown.  If Ancestry provided chromosome matching and triangulation tools, I could see who else I match on that segment and perhaps find some common genealogy between others who match me (and my matches) on that same segment.

Summary

So, the answer to the question, “How do you know your tree is right?” is threefold.

First, genealogically, I’m a terribly anal, er, I mean thorough, researcher.  If you have any doubt, please feel free to read my 52 ancestor series and you can see for yourself the kind of in-depth research I do.

This isn’t to say everything is perfect or that I can’t make mistakes.  I clearly can, do and have.  But for the most part, my trees are solid and I know when they aren’t, where and why.  Plus, I’ve been doing this now for 37 years.  Experience is a wonderful teacher, so long as you learn and don’t just make the same mistakes over and over again.

And, yes, thank you, I did start when I was quite young – barely of age.

Secondly, I have been triangulating my autosomal DNA for several years now, proving segments through both known and previously unknown cousins to specific ancestral lines, and specific common ancestors.  But, I have to be able to see where we match to utilize those tools, and we can’t do that at Ancestry where it’s genetic genealogy wearing blinders.  I’m very thankful for GedMatch so I can compare DNA with the Ancestry cousins who will download their results.  If my two matches who descend from John and Diedamia downloaded their results to GedMatch, then I could see WHERE I match them and I might have that segment already mapped to a specific family line.  That would help immensely tracking backwards and finding the common link with my matches.

Third, I have been utilizing Y and mtDNA where possible and appropriate to learn about, prove and confirm various lines for nearly 15 years.

Often, I use combinations of these tools, like in my Buster example where Buster proved the Estes Y in my line, and I proved my relationship to Buster through autosomal DNA.

These combinations are powerful tools to prove, or disprove, family lines.

And now that you know how to do this, you can prove each one of your ancestral lines too!


DNA Testing Strategy for Adoptees and People with Uncertain Parentage

$
0
0

Adoptees aren’t the only people who don’t know who their parents are.  There are many people who don’t know the identity of one of their two parents…and it’s not always the father.  Just this week, I had someone who needed to determine which of two sisters was her mother.  Still, the “who’s your Daddy” crowd, aside from adoptees, is by far the largest.

The DNA testing strategy for both of these groups of people is the same, with slight modifications for male or female. Let’s take a look.

Males have three kinds of DNA that can be tested and then compared to other participants’ results.  The tests for these three kinds of DNA provide different kinds of information which is useful in different ways.  For example, Y DNA testing may give you a surname, if you’re a male, but the other two types of tests can’t do that, at least not directly.

Females only have two of those kinds of DNA that can be tested.  Females don’t have a Y chromosome, which is what makes males male genetically.

adopted pedigree

If you look at this pedigree chart, you can see that the Y chromosome, in blue, is passed from the father to the son, but not to daughters.  It’s passed intact, meaning there is no admixture from the mother, who doesn’t have a Y chromosome, because she is female.  The Y chromosome is what makes males male.

The second type of DNA testing is mitochondrial, represented by the red circles.  It is passed from the mother to all of her children, of both genders, intact – meaning her mitochondrial DNA is not admixed with the mtDNA of the father.  Woman pass their mtDNA on to their children, men don’t.

Therefore when you test either the Y or the mtDNA, you get a direct line view right down that branch of the family tree – and only that direct line on that branch of the tree.  Since there is no admixture from spouses in any generation, you will match someone exactly or closely (allowing for an occasional mutation or two) from generations ago.  Now, that’s the good and the bad news – and where genealogical sleuthing comes into play.

On the chart above, the third kind of DNA testing, autosomal DNA, tests your DNA from all of your ancestors, meaning all of those boxes with no color, not just the blue and red ones, but it does include the blue and red ancestors too.  However, autosomal DNA (unlike Y and mtDNA) is diluted by half in each generation, because you get half of your autosomal DNA from each parent, so only half of the parents DNA gets passed on to each child.

Let’s look at how these three kinds of DNA can help you identify your family members.

Y DNA

Since the Y DNA typically follows the paternal surname, it can be extremely helpful for males who are searching for their genetic surname.  For example, if your biological father’s surname is Estes, assuming he is not himself adopted or the product of a nonpaternal event (NPE) which I like to refer to as undocumented adoptions, his DNA will match that of the Estes ancestral line.  So, if you’re a male, an extremely important test will be the Y DNA test from Family Tree DNA, the only testing company to offer this test.

Let’s say that you have no idea who your bio-father is, but when your results come back you see a preponderance of Estes men whom you match, as well as your highest and closest matches being Estes.

By highest, I mean on the highest panel you tested – in this case 111 markers.  And by closest, I mean with the smallest genetic distance, or number of mutations difference.  On the chart below, this person matches only Estes males at 111 markers, and one with only 1 mutation difference (Genetic Distance.)  Please noted that I’ve redacted first names.

Hint for Mr. Hilbert, below – there is a really good chance that you’re genetically Estes on the direct paternal side – that blue line.

Estes match ex

The next step will be to see which Estes line you match the most closely and begin to work from there genealogically.  In this case, that would be the first match with only one difference.  Does your match have a tree online?  In this case, they do – as noted by the pedigree chart icon.  Contact this person.  Where did their ancestors live?  Where did their descendants move to?  Where were you born?  How do the dots connect?

The good news is, looking at their DNA results, you can see that your closest match has also tested autosomally, indicated by the FF icon, so you can check to see if you also match them on the Family Finder test utilizing the Advanced Matching Tool.  That will help determine how close or distantly related you are to the tester themselves.  This gives you an idea how far back in their tree you would have to look for a common ancestor.

Another benefit is that your haplogroup identifies your deep ancestral clan, for lack of a better word.  In other words, you’ll know if your paternal ancestor was European, Asian, Native American or African – and that can be a hugely important piece of information.  Contrary to what seems intuitive, the ethnicity of your paternal (or any) ancestor is not always what seems evident by looking in the mirror today.

Y DNA – What to order:  From Family Tree DNA, the 111 marker Y DNA test.  This is for males only.  Family Tree DNA is the only testing company to provide this testing.  Can you order fewer markers, like 37 or 67?  Yes, but it won’t provide you with as much information or resolution as ordering 111 markers.  You can upgrade later, but you’ll curse yourself for that second wait.

FTDNA Y

Mitochondrial DNA

Males and females both can test for mitochondrial DNA.  Matches point to a common ancestor directly up the matrilineal side of your family – your mother, her mother, her mother – those red circles on the chart.  These matches are more difficult to work with genealogically, because the surnames change in every generation.  Occasionally, you’ll see a common “most distant ancestor” between mitochondrial DNA matches.

Your mitochondrial DNA is compared at three levels, but the most accurate and detailed is the full sequence level which tests all 16,569 locations on your mitochondria.  The series of mutations that you have forms a genetic signature, which is then compared to others.  The people you match the most closely at the full sequence level are the people with whom you are most likely to be genealogically related to a relevant timeframe.

You also receive your haplogroup designation with mitochondrial DNA testing which will place you within an ethnic group, and may also provide more assistance in terms of where your ancestors may have come from.  For example, if your haplogroup is European and you match only people from Norway….that’s a really big hint.

Using the Advanced Matching Tool, you can also compare your results to mitochondrial matches who have taken the autosomal Family Finder test to see if you happen to match on both tests.  Again, that’s not a guarantee you’re a close relative on the mitochondrial side, but it’s a darned good hint and a place to begin your research.

Mitochondrial DNA – What to Order:  From Family Tree DNA, the mitochondrial full sequence test.  This is for males and females both.  Family Tree DNA is the only company that provides this testing.

FTDNA mtDNA

Autosomal DNA

Y and mitochondrial DNA tests one line, and only one line – and shoots like a laser beam right down that line, telling you about the recent and deep history of that particular lineage.  In other words, those tests are deep and not wide.  They can tell you nothing about any of your other ancestors – the ones with no color on the pedigree chart diagram – because you don’t inherit either Y or mtDNA from those ancestors.

Autosomal DNA, on the other hand tends to be wide but not deep.  By this I mean that autosomal DNA shows you matches to ancestors on all of your lines – but only detects relationships back a few generations.  Since each child in each generation received half of their DNA from each parent – in essence, the DNA of each ancestor is cut in half (roughly) in each generation.  Therefore, you carry 50% of the DNA of your parents, approximately 25% of each grandparent, 12.5% of the DNA of each great-grandparent, and so forth.  By the time you’re back to the 4th great-grandparents, you carry only about 1% of the DNA or each of your 64 direct ancestors in that generation.

What this means is that the DNA testing can locate common segments between you and your genetic cousins that are the same, and if you share the same ancestors,  you can prove that this DNA in fact comes from a specific ancestor.  The more closely you are related, the more DNA you will share.

Another benefit that autosomal testing provides is an ethnicity prediction.  Are these predictions 100% accurate?  Absolutely not!  Are they generally good in terms of identifying the four major ethnic groups; African, European, Asian and Native American?  Yes, so long at the DNA amounts you carry of those groups aren’t tiny.  So you’ll learn your major ethnicity groups.  You never know, there may be a surprise waiting for you.

FTDNA myOrigins

The three vendors who provide autosomal DNA testing and matching all provide ethnicity estimates as well, and they aren’t going to agree 100%.  That’s the good news and often makes things even more interesting.  The screen shot below is the same person at Ancestry as the person above at Family Tree DNA.

Ancestry ethnicity

If you’re very lucky, you’ll test and find an immediate close match – maybe even a parent, sibling or half-sibling.  It does happen, but don’t count on it.  I don’t want you to be disappointed when it doesn’t happen.  Just remember, after you test, your DNA is fishing for you 24X7, every single hour of every single day.

If you’re lucky, you may find a close relative, like an uncle or first cousin.  You share a common grandparent with a first cousin, and that’s pretty easy to narrow down.  Here’s an example of matching from Family Tree DNA.

FTDNA close match

If you’re less lucky, you’ll match distantly with many people, but by using their trees, you’ll be able to find common ancestors and then work your way forward, based on how closely you match these individuals, to the current.

Is that a sometimes long process?  Yes.  Can it be done?  Absolutely.

If you are one of the “lottery winner” lucky ones, you’ll have a close match and you won’t need to do the in-depth genealogy sleuthing.  If you are aren’t quite as lucky, there are people and resources to help you, along with educational resources.  www.dnaadoption.com provides tools and education to teach you how to utilize autosomal DNA tools and results.

Of course, you won’t know how lucky or unlucky you are unless you test.  Your answer, or pieces of your answer, may be waiting for you.

Unlike Y and mtDNA testing, Family Tree DNA is not the only company to provide autosomal of testing, although they do provide autosomal DNA testing through their Family Finder test.

There are two additional companies that provide this type of testing as well, 23andMe and Ancestry.com.  You should absolutely test with all three companies, or make sure your results are in all three data bases.  That way you are fishing in all of the available ponds directly.

If you have to choose between testing companies and only utilize one, it would be a very difficult choice.  All three have pros and cons.  I wrote about that here.  The only thing I would add to what I had to say in the comparison article is that Family Tree DNA is the only one of the three that is not trying to obtain your consent to sell your DNA out the back door to other entities.  They don’t sell your DNA, period.  You don’t have to grant that consent to either Ancestry or 23andMe, but be careful not to click on anything you don’t fully understand.

Family Tree DNA accepts transfers of autosomal data into their data base from Ancestry.  They also accept transfers from 23andMe if you tested before December of 2013 when 23andMe reduced the number of locations they test on their V4 chip

Autosomal DNA:  What to Order

Ancestry.com’s DNA product at http://www.ancestry.com – they only have one and it’s an autosomal DNA test

23andMe’s DNA product at http://www.23andMe.com – they only have one and it’s an autosomal DNA test

Family Tree DNA – either transfer your data from Ancestry or 23andMe (if you tested before December 2013), or order the Family Finder test. My personal preference is to simply test at Family Tree DNA to eliminate any possibility of a file transfer issue.

FTDNA FF

Third Party Autosomal Tools

The last part of your testing strategy will be to utilize various third party tools to help you find matches, evaluate and analyze results.

GedMatch

At GedMatch, the first thing you’ll need to do is to download your raw autosomal data file from either Ancestry or Family Tree DNA and upload the file to www.gedmatch.com.  You can also download your results from 23andMe, but I prefer to utilize the files from either of the other two vendors, given a choice, because they cover about 200,000 additional DNA locations that 23andMe does not.

Ancestry.com provides you with no tools to do comparisons between your DNA and your matches.  In other words, no chromosome browser or even information like how much DNA you share.  I wrote about that extensively in this article, and I don’t want to belabor the point here, other than to say that GedMatch levels the playing field and allows you to eliminate any of the artificial barriers put in place by the vendors.  Jim Bartlett just wrote a great article about the various reasons why you’d want to upload your data to Gedmatch.

GedMatch provides you with many tools to show to whom you are related, and how.  Used in conjunction with pedigree charts, it is an invaluable tool.  Now, if we could just convince everyone to upload their files.  Obviously, not everyone does, so you’ll still need to work with your matches individually at each of the vendors and at GedMatch.

GedMatch is funded by donations or an inexpensive monthly subscription for the more advanced tools.

DNAGEDCOM.com

Another donation based site is http://www.dnagedcom.com which offers you a wide range of analytical tools to assist with making sense of your matches and their trees.  DNAGEDCOM works closely with the adoption community and focuses on the types of solutions they need to solve their unique types of genealogy puzzles.  While everyone else is starting in the present and working their way back, adoptees are starting with the older generations and piecing them together to come forward to present.  Their tools aren’t just for adoptees though.  Tools such as the Autosomal DNA Segment Analyzer are great for anyone.  Visit the site and take a look.

Third Party Y and Mitochondrial Tools – YSearch and MitoSearch

Both www.ysearch.org and www.mitosearch.org are free data bases maintained separately from Family Tree DNA, but as a courtesy by Family Tree DNA.  Ysearch shows only a maximum of 100 markers for Y DNA and Mitosearch doesn’t show the coding region of the mitochondrial DNA, but they do allow users to provide their actual marker values for direct comparison, in addition to other tools.

Furthermore, some people who tested at other firms, when other companies were doing Y and mtDNA testing, have entered their results here, so you may match with people who aren’t matches at Family Tree DNA.  Those other data bases no longer exist, so Ysearch or Mitosearch is the only place you have a prayer of matching anyone who tested elsewhere.

You can also adjust the match threshold so that you can see more distant matches than at Family Tree DNA.  You can download your results to Ysearch and Mitosearch from the bottom of your Family Tree DNA matches page.

Mitosearch upload

Answer the questions at Mito or Ysearch, and then click “Save Information.”  When you receive the “500” message that an error has occurred at the end of the process, simply close the window.  Your data has been added to the data base and you can obtain your ID number by simply going back to your match page at Family Tree DNA and clicking on the “Upload to Ysearch” or Mitosearch link again on the bottom of your matches page.  At that point, your Y or mitosearch ID will be displayed.  Just click on “Search for Genetic Matches” to continue matching.

Get Going!

Now that you have a plan, place your orders and in another 6 to 8 weeks, you’ll either solve the quandry or at least begin to answer your questions.  Twenty years ago you couldn’t have begun to unravel your parentage using DNA.  Now, it’s commonplace.  Your adventure starts today.

Oh, and congratulations, you’ve just become a DNA detective!

I wish you success on your journey – answers, cousins, siblings and most importantly, your genetic family.  Hopefully, one day it will be you writing to me telling me how wonderful it was to meet your genetic family for the first time, and what an amazing experience it was to look across the dinner table and see someone who looks like you.


Father’s Day DNA Sale

$
0
0

Need a gift for Dad?  Family Tree DNA is offering their Family Finder test for $89 between now and midnight the 21st.  That’s $10 off and is good for either new customers or upgrades and for either males or females.

FTDNA Father's Day

Give Dad the gift of cousins, DNA matches and an ethnicity estimate.  That should give you plenty to discuss over dinner for the next several weeks!

PS – you can order more than one test.  No limit.  The more cousins you test, the easier your genealogy becomes!  I go to family reunions with DNA kits in my bag.  Seriously!  And I have them swab right there at the picnic table!

Click here to order.


The 1 Million Mark and Effective Matches

$
0
0

one million

Last week, Ancestry.com announced the millionth customer in their autosomal data base.  On January 18th, 23andMe did the same.  I don’t have exact numbers from Family Tree DNA, but they can’t be terribly far behind.  So, let’s look at the effectiveness of these matches at the roughly 1 million mark between the various vendors.

comparison chart

Black bold highlights the vendor’s positive aspects and red bold notes the drawbacks and places where each vendor could stand improvement.  I’ve underlined the two red issues I feel are the most serious.

*1 – Both 23andMe and Ancestry provide communications with others whom you match through internal message systems.  However, you have to request permission at 23andMe with anyone you match to communicate with them, and then additionally to share their DNA.  The 23andMe the 1404 number is how many people I match and the 162 number is the number of people that have accepted communications from me.  Not all of those 162 are sharing DNA.

*2 – At 23andMe, this would be the number of people sharing DNA results with me.  Ancestry has no tools that allow comparison of DNA segments.  At Family Tree DNA this would be all of my matches.

*3 – 23andMe cuts your matches off at 1000 unless you are communicating with your matches or you have an outstanding “introduction sent” request.  Of the 1404 people I match, 138 are sharing genomes, 24 have accepted communications but have not shared genomes, and 12 have declined.  The balance of my 1404 are either those to whom I’ve requested an introduction and they haven’t replied at all or some that I haven’t gotten around to inviting yet.  Ironically, my last of 1404 matches (in percentage of shared DNA order) is my known cousin who would have been purged had we not been sharing genomes.  You don’t have to send introductory invitations to those you match at either Family Tree DNA nor Ancestry and neither of those companies have an arbitrary cutoff, although Ancestry.com did a massive match purge when they implemented phasing.

*4 – At 23andMe, I can request to communicate with all 1404 people I match.  Of those, 162 have agreed to communicate or share genomes.  I can only communicate with those 162 people.  That doesn’t compare very well to either 1040 nor 5481 – and it shows how much genealogical benefit I’ve derived from 23andMe as compared to both Ancestry and Family Tree DNA.

*5 – At Ancestry, a minimum level subscription is required at $49 per year to see matching trees.  Not all participants have trees uploaded, and many trees aren’t public, so are not available for tree matching.  Otherwise, all trees connected to DNA results are included in matching function.

*6 – At Family Tree DNA, testers are encouraged to upload GEDCOM files or create trees in their account, and matching surname hints are given, but no actual ancestor matching in trees is performed.  Each participant must look at the tree of their matches, if provided.

*7 – 23andMe no longer hosts family trees on their site.  They have entered into collaboration with subscription service, MyHeritage.  Family Tree DNA is the only one of the vendors who hosts their own trees and does not require an additional subscription for that service, or for tree matching.

*8 – I have fewer matches at Family Tree DNA now than I did in November of 2014 when I had 1875 matches.  I have submitted a query to Family Tree DNA about this discrepancy but have not yet received a reply.

Commentary

The disparity between the 23andMe and Ancestry match numbers, since both vendors have 1 million autosomal results in their data bases, is suggestive of how many matches may have been pared from my match list at 23andMe.

The number of effective matches that can be usefully utilized, and how they can be utilized, are quite a bit different than the total number of matches implies without further analysis.

Both Family Tree DNA and Ancestry have unique strong points that make them stand out as vendors.

23andMe, since I can only work with or communicate with about 10% of my matches, is the least useful, for me, for genealogy.  I found their health services, which 23andMe is no longer allowed to offer following a dust-up with the FDA, very beneficial.

The tree matches and DNA Circles at Ancestry are very useful, but the fact that Ancestry provides absolutely no tools such as a chromosome browser or the other comparison tools that both 23andMe and Family Tree DNA provide makes Ancestry’s tree matches terribly frustrating eye candy in the candy shop behind a hermetically sealed window we can’t get through.  Tree matches and Circles are suggestive of an ancestral connection, but without comparison and triangulation tools, your match to an individual could be through a different, potentially unknown, line, and you have no tools at Ancestry to confirm or deny.  People are left to assume that the tree matches and Circles are proof, and unfortunately, they do in droves.

Thankfully, Family Tree DNA accepts transfers from Ancestry, V3 chip transfers from 23andMe (not the V4 chip since Dec. 2013) and GedMatch accepts files from all 3 vendors.  Those are the only avenues to actually compare the DNA of those who tested at Ancestry to triangulate and prove ancestral matches.

The great news in all of this is that more than 1 million people have tested, and probably more than two million in total – although there is clearly some overlap between vendors.  With every person that tests and that we match in one place or another, it increases our odds as genealogists to confirm our genealogy or break through those pesky brick walls.

Footnote:  The prices for the tests are the same, at $99, unless a sale is taking place at one of the vendors.  Both 23andMe and Ancestry also sell the aggregated anonymized DNA data for other purposes.  Both 23andMe and Ancestry will request that you sign (digitally authorize by clicking a box) an informed consent agreement for your non-anonymized (or less anonymized) data to be utilized or sold as well.  Family Tree DNA is the only one of these three firms that does not sell your DNA data in any form.


Family Tree DNA New Privacy Settings

$
0
0

As you may or may not have noticed, Family Tree DNA recently implemented more options in the privacy and sharing section of everyone’s personal DNA page.  That’s both the good news and the bad news.

Recent queries from group participants as to why their results were not showing in projects after they joined sent me on a quest to find out why.  The answer is that the new privacy and security settings at Family Tree DNA now default to a setting on new kit purchases that causes new participants results to not show in projects.  Another symptom is that as a project administrator, you’ll be able to see the participants results in your project, but you won’t be able to see their results in other projects they have joined when trying to help them with something like understanding haplogroup project grouping assignments.

In today’s more litigious society, giving people these types of options is not only a good thing, it’s necessary.  Now the bad news.  In the past, when you joined a project, your DNA results were automatically being shared on the project page, if the project had a public page.  That was the point of joining a project and is what everyone has come to expect.

privacy and sharing not sharing

Please note that people who were already clients when these new options were added, so who had already joined projects and were sharing, were not set to the default of not sharing, and were set to the value of sharing.  So if you were previously in a project and your results were being displayed, they still are.  This only affects new kit purchases.  Based on a kit I purchased on March 31, 2015, this new feature was implemented sometime after the middle of February and before the end of March, but I don’t know exactly when.

As more and more people purchase these kits with the default option set to not sharing, more and more administrators are finding themselves being asked why results are not showing up in projects…and asking themselves this same question.  The answer is, of course, that the defaults are now set for not sharing – but no one knows that.  The participants are not ASKED this question and they have no idea THAT this is happening, that there is a problem…or that they need to DO anything to rectify the situation.

Furthermore, most administrators aren’t aware of this either.  What this means, is that kits purchased since this change was made are NOT SHARING, but no one is aware of that until they stumble over it by accident.

Therefore, as interested parties and project administrators, we need to inform our participants of this default selection and that it needs to be changed.  Please feel free to share this article to accomplish this goal.

I very much hope that Family Tree DNA will implement a stepped process with options and educational “balloon boxes” so that both new participants and people whose results are now set to “not share” will be able to make selection choices when they set their account up or when they join projects.  Testers need to understand what they are being asked to select, why, and how their selections will affect their results and experience, both today and into the future.  Defaulting to not sharing is counter-productive and I fear that new testers will inadvertently be eliminated from project matching and grouping when that wasn’t their intention at all.

So, let’s take a look at the newest Family Tree DNA privacy and sharing options and how they affect participants, projects and project administrators.

Privacy and Sharing

You reach the privacy and sharing options by clicking on the “Manage Personal Information” link in the “Your Account” box to the left of your personal page at Family Tree DNA.

privacy and sharing

By clicking on the orange link, you’ll see the following Account Settings.

privacy and sharing profile

While you’re here, you may want to update your profile information.

On all selections, don’t forget to click on SAVE, or it won’t.

privacy and sharing save

Now, let’s move on to the privacy and sharing tab, to the far right of the options on the tab at the top.  Privacy and sharing options are divided into three sections.

privacy and sharing tab2

The selections greyed out on the right are the current default settings when you purchase a new kit.  There are no instructions or step-through dialogue boxes to help participants understand how these selections will affect who can see their results, and how that will affect their experience with DNA testing.

Needless to say, the power of DNA testing is sharing ancestral and genealogical information.  Otherwise, there is truly no reason to test.  Family Tree DNA has recently implemented changes which allow participants to select various levels of sharing.

Unfortunately, the default settings are in essence “off” for project sharing, once someone joins a project, which creates a great deal of confusion for participants and project administrators alike.

Participants presume their results are being shared, just the like results of the people they match.  Project administrators have no idea that the participants results aren’t being displayed in the projects, and when they discover that little tidbit, they have no idea why the results aren’t being displayed – because they always were before.

The Privacy and Sharing options are divided into three sections, My Profile, My DNA Results and Account Access

Let’s look at these one section at a time.

My Profile

Who can view my Most Distant Ancestor?

Default Setting:  Only You

This means that no one you match can see your most distant ancestor.

Options:  Share my Most Distant Ancestor with other people in projects that I’ve joined.

Creating an exception.

It appears that you can select to share within all projects (that you’ve joined), but elect to omit some projects, or you can select to not share with all projects, but to elect to share with only select projects.

privacy and sharing most distant ancestor

Note that I manage several kits with the same surname.  The default for both existing and new accounts is “only you”.  I checked and the most distant ancestor does show in both projects and matching when the “only you” selection is selected.  I suspect this is a bug, but currently, it’s how this option is functioning.  If this options starts functioning as it appears that it is supposed to, all of a sudden, your most distant ancestor information may disappear.  If so, this is why and this option needs to be changed to “share with other people in projects.”

Of course, this entire question presumes you’ve entered your most distant ancestor information.

Please enter your most distant ancestor for both your male paternal (father’s surname) line and your matrilineal (mother’s mother’s mother’s) line on the Genealogy Tab, under Most Distant Ancestors, shown below.

privacy and sharing most distant ancestor setup

If you don’t enter this information, your “Most Distant Ancestor” won’t be listed in projects, example below, so if other people from this line are looking to see if their line has tested, that information won’t be available to them.

privacy and sharing project

Furthermore, if your information isn’t there, it can’t and won’t be displayed to your matches.  You certainly want that information from your matches, so be sure to provide it for your matches to see as well.  In the example below, the first person did not complete this information, but the second person did.  As it turns out, they both descend from the same ancestor, but the person matching them can’t tell, because one person doesn’t have their Most Distant Ancestor listed.

privacy and sharing match

Who can see me in project member lists?

Default:  Project Administrators

Options:

privacy and sharing project member list

This selection works in tandem with how the project administrators of various projects you may have joined choose to implement the project display.  In other words, if the project isn’t public, then the “anyone” option is meaningless, because the public won’t be able to see the project at all.

hap q front page

Fortunately, most projects are publicly displayed.

The next question about this option is what, exactly, and where is a project member list?

When you visit any project, you will see a front page.  On that page, you will see several options relating to that project.  In the Kvochick project, there are 5 members.  If you click on the 5 members, that should display the list of the names of project members.

kvochick dna page

The default setting is only for project administrators to see the names.  In this case, your name would not appear in this list if clicked on by anyone other than the project administrator.

The second option would be for project members only, and the third option would be for the general public.

Please note that as of the writing of this article, I tested several projects and none had clickable numbers, so this option does not appear to be implemented at this time.

My DNA Results

Who can view my ethnic breakdown in myOrigins?

Default:  Project Administrators

Options:

privacy and sharing myorigins

Your two options are to share with your matches, or not share with your matches.  Do not share is the default.

Here is an example of people who are sharing ethnic results in myOrigins.  If you are not sharing, your name would not appear on this list for your matches on the bottom left.

privacy and sharing myorigins example

Lastly, the only ethnicity that is shared with your matches is an ethnicity they have as well.  In this case, the participant only has European ethnicity, so that is the only portion of his matches ethnicity that is shown to him.

Who can view my DNA results in group projects?

This new option is the one causing havoc with administrators and projects.

Default:  Make my mtDNA and Y-DNA private.  It will only be shown to people in my project.

Options:  Make my mtDNA and Y-DNA public.

privacy and sharing group projects

I strongly, strongly suggest that you make this selection public.  Let me give you an example of why.

Let’s say I’m a female, and I want to know if my paternal line has tested.  I would check the appropriate surname project.

In this case, let’s say I’m looking to see if any descendants of John Harrold (Herrell, Harrell, Harrald) who died in 1825 in Wilkes County, NC have tested.

When people share their results, you will be able to find out if your line has tested.

You can see in the example below that my Harrold line is group 7 in the Harrell project, so I now know my line has tested, and I can see my haplogroup designation and Y markers for John’s line.

privacy and sharing Harrell

If none of these John Harrold descendants had elected to share, then I would never be able to find this information.  If you’re looking for any of your ancestral surnames, you won’t be able to find those lines either – if the people who test don’t share.  If people who are looking to test don’t see their ancestral line, they will think there is no one to compare to, and they may be discouraged from testing.  This is certainly not what we want.

The problem today is that people who purchase tests don’t know they aren’t sharing – they assume they are.  Before these new privacy options became available, by default, if you joined a project, you WERE sharing.  Now, new participants aren’t sharing – even though they joined the project – unless they change their options.

Furthermore, if you are a project member, let’s say of the Harrell project, and one of the administrators is trying to help you understand your results in a haplogroup project, the Harrell administrator can’t see your results in the haplogroup project either – so we can’t help you.

PLEASE, PLEASE MAKE THE PROJECT RESULTS PUBLIC UNLESS YOU HAVE A COMPELLING REASON NOT TO DO SO. 

To not share this information defeats the entire purpose of DNA testing.

The most information that any project at Family Tree DNA can reveal is the kit number, surname (only) of tester, paternal (or maternal) most distant ancestor name, country of origin, haplogroup and the DNA markers (Y 12-111 and mtDNA HVR1 and HVR2 only) for which the individual has tested.  Below, a sample project is shown with the maximum amount of information categories shown (except I’ve truncated the markers shown to the right for space reasons.)

privacy and sharing most shown

To review the project setting, by default, only project members who are signed into their account and looking at the project can view your data.  Anyone who is not a project member and not signed into their account cannot see your data in the project

If you select public, anyone looking at the public project page can see your results, like the example above – assuming that the project itself is public.  This is only valid for Y and mtDNA HVR1 and HVR2 data, as mitochondrial DNA coding region and autosomal DNA results are never displayed publicly.

Who can view my mtDNA Coding Region mutations?

Default:  Only you.

Options:

Privacy and sharing mtDNA coding

If you have tested at the mitochondrial full sequence level, you will have tested the full HVR1, HVR2 and coding regions.  While the HVR1 and HVR2 regions are not currently known to reveal medical conditions, the coding region has the potential to carry some medical information.  Therefore, your coding region is NEVER displayed publicly, in a project.  Displaying the coding region is not an option.  If you elect to share your coding region mutations privately, that is up to you.

However, in order for mitochondrial DNA project administrators to correctly group you in mitochondrial DNA projects, they must be able to see your coding region results to know where your mutations fall.

Therefore, you can authorize project administrators to view the coding region results, by project.  In the example above, the individual is only a member of one project.  In order to authorize the Estes project administrator to view the coding region, click the box and then Save.

Account Access

How much access to Project Administrators have to my account?

Default:  Limited

Options:

privacy and sharing project admin access

What do the various authorization levels allow?  Here’s the list.

privacy and sharing admin access

If you have given an administrator full access to your account, which means you have given them your kit number and password, they have full access to everything and that supercedes these options.

Who has full access to my account?

Default:  Only You

Options:  Give the administrator your kit number and password.

privacy and sharing admin full access

Obviously, if you have privately e-mailed your kit number and password to an admin or anyone, Family Tree DNA has no way of knowing or tracking that.

Genealogy Tab

You will find a few more options that affect how your Family Tree is displayed on the Genealogy tab.

privacy and sharing genealogy tab

If you have uploaded a GEDCOM file or completed a family tree online at Family Tree DNA, who can be seen in your tree, and by whom, is controlled by this setting.

Having an entirely private tree is the same as having no tree and is not useful to anyone, so I really have no idea why someone would do this.

Of course, you can always see which of your matches has a tree available and can click on the pedigree icon to view your matches tree, if they authorize matches to view their tree.  On the example below of a Y DNA matching page, the first two participants do have a family tree, as indicated by the little blue pedigree icon, and the third individual does not.

privacy and sharing pedigree

I encourage everyone to either upload your GEDCOM file or create a family tree online at Family Tree DNA.  You can do either by clicking on the Family Tree Link on your myFTDNA menu at the top left of your personal page.

privacy and sharing upload gedcom

Including a family tree makes finding a common ancestor so much easier.  Genetic genealogy is all about sharing and collaboration – and finding those ancestors!

Public Search

Family Tree DNA recently implemented a public search function that allows public searches of online trees and GEDCOMS.

Why would someone search like this?  To see if people from their genealogocal lines have tested.  In other words, people wondering if they should test.  Allowing your tree to be seen publicly is in essence, cousin bait – of course you want them to test – the more the merrier and the better chance you have of breaking down those brick walls.

privacy and sharing search box

Below is an example of how your tree privacy selection, made under the Genealogy Tab above, impacts what can be seen by a public search.

privacy and sharing search

As an example, I did a public search for my ancestor, Jotham Brown.  Sure enough, there are several people at Family Tree DNA who have good ole Jotham in their trees.  That’s great – because it means I have a chance of matching some of them using the Family Finder test.

In the results above, you can see the three options for how trees are listed:

  • Entirely private such that you need to test and will only see the tree if you match
  • Public tree noted by the name of the owner
  • Tree included but noted as private member – which just means the name of the tree owner is not displayed

You can see the actual trees of both the public and private trees that are shown with clickable links.  You cannot see the tree of the private family tree with no link.

Clicking on the trees shows you the following example, depending on the tree display options you’ve selected.  The tree below has selected to mask living people and people deceased within a hundred years.

privacy and sharing tree2

Both trees labeled with a source and private member trees are shown, but with the privacy screening you’ve selected.  The only difference I’ve been able to find between those two options is that the source tree name is given for the public trees, and is not for the private member trees.  However, there is no contact information for the public trees (or any trees), so this is not a way to contact other genealogists.  You can only contact them if you have a match through DNA testing.

The third option is that completely private trees are only shown to matches.  These are noted as a private family tree and the searcher is instructed to purchase a Family Finder test to see if they match.  That is, after all, the goal!!!

privacy and sharing search2

Hopefully this search function will encourage more people to test.  After all, other people who descend from their ancestor are in the data base!

Summary

Privacy settings have changed and we have to figure out the best way to work with the new features.

Let’s make sure our new participants understand their settings and what needs to be changed in order to have their results displayed in the manner they desire.

As always, the way to obtain the best genetic genealogy experience is by sharing.  That’s what collaborative research and crowd-sourcing is all about.  Everyone shares individually and the power of the group is what gives genetic genealogy its awesome results.

So, the 4 key elements for successful sharing are to:

  • Set your project sharing status to public, not private.
  • Enter your most distant ancestor information
  • Share your most distant ancestor information with matches and projects
  • Upload your GEDCOM file or create a family tree at Family Tree DNA

Autosomal Matchmaking Vendor Comparison

$
0
0

Matchmaker, matchmaker…make me a match.

Indeed, matching is what autosomal DNA for genetic genealogy is all about.  Let’s take a look at the difference between matching at the various vendors and how it affects us as genetic genealogists.

Harold is my third cousin.  We have been genealogy research partners now for about 20 years on our family lines.  Fortunately, both Harold and I have encouraged our cousins and family members to test their DNA – at all 3 testing companies.  We’ve uploaded the results to GedMatch and we’ve matched, compared and triangulated until we’re blue in the face.

Hey, it keeps us off the streets:)

What this does, however, is gives us a very firm foundation to compare results at the different companies and with different tools.

Today, I’m going to take a look at how the matches differ at the different companies and at GedMatch when comparing the same people – and how it affects us as genealogists.

First, the matching thresholds aren’t the same, but we can compensate for that and we can see how the threshold differences affect our actual matches.

The following table shows the vendor autosomal matching thresholds.

Vendor Autosomal Matching Thresholds

23andMe

At 23and Me, Harold and I share a total of 133.8 cM of DNA and 21,031 SNPs spread across 6 different segments on 5 chromosomes.

Harold Me 23andMe

Family Tree DNA

At Family Tree DNA, Harold and I share 152.44 cM of DNA with 35,774 shared SNPs.

FTDNA Harold Me

Family Tree DNA reports much smaller matching segments than 23andMe and by process of inference, Ancestry.  The chart below shows Harold matching to me at Family Tree DNA.  The green overlay highlights the segments that 23andMe shows for Harold and I as matches.  The non-highlighted rows are shown at Family Tree DNA, but not at 23andMe.

Harold Me FTDNA 23andMe

Family Tree DNA does us the HUGE favor of providing all the actual matching DNA segments over 500 SNPs in length as long as we match first on a larger segment.  The other vendors remove these.

Ancestry

Utilizing a new private tool currently in beta test, Harold and I share 113.92 cM of DNA at Ancestry.  Of course, there is no segment data, so all we have is a total, which is certainly more than we had before.

Ancestry runs their customer’s DNA through a phasing process that eliminates many segments before they do matching.  Therefore, the significantly smaller cM total on Ancestry is a result of their phasing and matching routines.

However, by comparing the Ancestry total to the 23andMe total, which is the next most restrictive result, we can see the difference.

23andMe’s total is 133.8, so the difference between the 23andMe and the Ancestry match is 19.88 cM.  If you look at the 23andMe matches, you’ll notice that the two smallest segment matches are 10.4cM and 12.8cM and together they total 23.2 cM, with is just slightly more than the 19.88 we’re looking for.

Harold Me 23andMe Ancestry

You may have noticed already that begin and end segments and matches between vendors even on the same chromosome do vary some.  These two red segments, above, are the most likely candidates to be the missing Ancestry segments, in part, because they are the smallest and their total is near to the 19.88.

GedMatch

At GedMatch, comparing Harold and I at the default of 700 SNPs and 5cM, which is equivalent to the 23andMe threshold, gives us the following:

Harold Me GedMatch at 23andMe Threshold

Next, I ran GedMatch at 500 SNPs and 1cM which is the equivalent of the FTDNA threshold after you have an initial match.

Harold Me GedMatch at FTDNA threshold

Vendor Summary

I’ve put together a vendor summary of our findings.

vendor match summary2

There’s quite a difference between vendors.  More than I expected.

Comparing the Vendors

Given that the GedMatch comparison using the FTDNA thresholds is the most generous in terms of matching segments, let’s compare the three vendors matching segments against the GedMatch matching segments.  Because start and end segments aren’t exactly the same, if any portion of the vendor’s match falls into the GedMatch match segment, I’ve counted it as a match, so in favor of the vendor.

The chart below utilizes the GedMatch to FTDNA matching segments as the foundation, and I’m comparing other vendors’ matches to the GedMatch results.

All Vendor Match Comparison

For purposes of this comparison, ignore WHICH (start, end, cM) column is colored.  I’ve just selected 3 columns and assigned one to color per vendor.  If that segment row is found in that vendor’s comparison, it’s highlighted in that vendor’s color.  So, for the first row, only FTDNA reported chromosome 1, from 44,938,970 to 47,788,153 as a match.  So, therefore, their cell in that row is the only one colored with their color, green.  Looking down to chromosome 5, you can see that both FTDNA and 23andMe show those segments as matches.  Only four chromosome segments are matches using the inferred Ancestry results based on their total cM information.

How Does This Affect Matching

When Ancestry introduced their phasing, as you might recall, a great many matches disappeared.  In essence, what Ancestry has done is relieved you of the problem of figuring out which matches are “solid” by not giving you any option to work with the raw data.

One of the comments that Ancestry has made is that few people who match in a DNA Circles match on the same segments.  In other words, they don’t triangulate, which means that Ancestry is telling us we don’t need to bother with triangulation because it won’t work anyway.  Their commentary becomes more understandable if you eliminate anything but large segments.  Most people who are distantly related are NOT going to match on large segments, and an entire group is not going to match on the same large segment, which is why we desperately need those smaller segments too – along with the raw data to compare.

Of course, because Ancestry provides us with no tools, we can’t see how we match our matches.

The best we can do is to download Ancestry raw data results to either or both Family Tree DNA and GedMatch – but we’ll never see what matches we are missing at Ancestry, which is really sad.

I ran my matches at both Family Tree DNA and at GedMatch for the two segments that Ancestry has apparently removed.

Yes, I have quite a few matches on those segments.  But not beyond what would be expected in terms of the number of people in the data base that I’m being compared to.  I do have some regions that are clearly from endogamous populations, and those areas have pages and pages of matches.  These two segments aren’t like that.

At GedMatch, I ran a triangulation report of that segment of chromosome 5 where I match others at both 23andMe and Family Tree DNA.  And for the really sad part – look at all those A kits, meaning Ancestry – more than half.  Those aren’t small segment matches either.  One triangulation group that includes an Ancestry kit is 14.7cM.  I’m missing those matches at Ancestry unless I happen to match these people on a larger segment that hasn’t been removed by Ancestry’s phasing.

GedMatch Triangulation Chr 5

I decided to check the second segment that Ancestry has removed that shows as a match through23andMe, Family Tree DNA and GedMatch – on chromosome 18.  There are fewer matches on that segment of chromosome 18, not more – so it’s not a pileup area either.  It does triangulate with other people who descend from a common Vannoy ancestor who are not close relatives.

At Family Tree DNA, here are my matches to 5 known Vannoy cousins on chromosome 5 at the FTDNA default threshold.  As you can see, I match two cousins, so we have a triangulation group of 3.

FTDNA Chr 5

Look what happens below, in terms of matching, when the match threshold is lowered.  In addition to several other matches on other chromosomes, I’ve picked up another match on that segment of chromosome 5, which serves to increase that triangulation group to four people on that segment.

FTDNA chr 5 at 1cM

I checked, and indeed, the green, blue and orange cousins do match each other on this segment as well.  Chromosome 18 triangulated too, but with different cousins matching the base person.   The orange cousin is in both triangulation groups.

FTDNA chr 18

Ancestry apparently discarded both of those segments on chromosome 5 and on 18.  Ancestry claims that seldom do people in their DNA Circles match each other on the same segments.  That’s probably true if you’re measuring only very large segments, but we can see from these examples that these are neither pileup regions nor nonmatching segments.  They triangulate between cousins, so they are valid identical by descent matches.

Discussion

I ran this little test as an experiment, but I must admit, I was stunned at the disparity in the matching of the vendors.  There has been a great deal of discussion surrounding the merits of Ancestry’s phasing.  Ancestry claims they are removing non-genealogical matches, as in IBS matches by population in pileup regions.

Based on what we’re seeing above, assuming the inferred discarded segments are accurate (without additional tools, inference is as good as it gets), they’ve pruned the tree too deeply.  That’s really not apparent when you look at your matches at Ancestry for three reasons:

  • Their data base is very large, so you still have a lot of matches
  • You can’t see your segment information
  • The Ancestry matches you do have are only the strongest – so you, proportionally, will have more “solid” matches at Ancestry than at other vendors – which makes people happy who don’t understand the behind the scenes ramifications of what they AREN’T getting and that those matches are not proven to that ancestor – nor is there any way to prove the data without a chromosome browser type of tool.

The smaller the matches reported by the vendors, the further out in time it moves the bar to finding your ancestors – which is why Family Tree DNA has a larger threshold, but still reports the small matching segments.

Let me say that again, in another way.

If you used a hypothetical matching threshold of 50cM for the smallest matching segment, you’re only going to get matches to about second cousins or closer.  Harold and I wouldn’t match with our largest segment being 47cM and we very clearly share a common ancestor. You’d have very few matches (if any) BUT they would all be very solid.  You’d be able to figure out quickly how you are related.  But how would this be useful to genealogy?  You likely already know those people. So this approach is very accurate, but also very restrictive, providing no opportunity to break down those distant brick walls.

If you move the threshold out to Ancestry levels, you’re going to get more matches, but fewer further back in time because the DNA from each contributing ancestor is reduced in each generation.  The majority of your matches will be beyond the 2nd cousin level, because you have a LOT more matches with each generation you go back in time.  Still, your matches will still probably be within a few generations.

At Ancestry, I have only one 3rd cousin DNA tree match, meaning a common ancestor has been identified with that person, about thirty 4th cousins, about a hundred 5th cousins and about thirty distant cousins.  So, you can see that 5th cousins are probably your most likely match and it falls off quickly after that.

If you move the matching threshold out even further, by making it smaller, you’ll have even more matches but many will be distant.  A greater percentage will be identical by chance and identical by population, but you will have some valid matches in those smaller segments.  The caveat is of course that you would have to work to sort the wheat from the chaff, by using triangulation methods.  The common ancestor will likely not be evident and may not be identifiable.  Conversely, the common ancestor may be identifidable…and that may be just what you need to break down that long standing brick wall.  I’ve done that twice now, once on my Younger/Hart line, confirming a wife’s rumored maiden name and one in my Vannoy line, confirming Elijah’s parents through matches to his mother’s Hickerson line.

But, if you don’t have those smaller segments to work with, along with tools, you will NEVER be able to find those elusive distant ancestors using DNA.

The great irony in all of this is that while I was working with the matches to chromosome 5 for this article, I noticed a couple of new matches I hadn’t seen before.  These matches also triangulate, but are from a female line, and now I know that at least part of that segment comes from the Crumley maternal line that married into the Vannoy line.  So, of you think for one minute that these smaller segments aren’t useful or important, think again.

So, the bottom line here is that if you’re interested in the immediate gratification aspect, with no work, but also no ability to utilize DNA segments to find distant ancestors, Ancestry is the one.  Their strong suit is their tree matching and many people are perfectly happy to never go beyond that – replete with incorrect assumptions that this means the ancestral genetic relationship is “proven.”

I currently have about 5400 total matches at Ancestry.  Of those, the day I did this comparison, 152 people matched my DNA and we have a tree match as well, meaning a common ancestor in my tree and their tree has been identified.  Of course, that does not assure that particular ancestor is how our DNA matches, and we can’t confirm that without a chromosome browser.  Still having those matches and matching trees, along with Circles is a wonderful first step.  It’s “feel good” stuff and who doesn’t like feel good.

If you’re interested in the vendor that gives you the most DNA segments to work with along with the tools to do it and therefore the most opportunity, Family Tree DNA, hands down, is the one.  Less feel good but way more potential.

23andMe is someplace in the middle – not easy or intuitive with a difficult communication process resulting in very few people who actually share their matching DNA with you, no feel good stuff, but they have a great matching tool that shows you not only who you match, but who your matches match in common with you as well.

I wish we could combine the best parts of all 3 vendors.  I wrote in detail about the autosomal offerings of all three vendors here.  Today, the best alternative is to test with all three.

Regardless, everyone who tests with any of the 3 vendors (or all of the three vendors) should upload their results to GedMatch where additional tools are provided that aren’t available at any vendor.  Another benefit of GedMatch is that the people there tend to be more serious about genetic genealogy.  The down side is that percentagewise, few people actually do upload their files, so you do still need to test at all of the vendors to achieve maximum matching and benefit from their individual strengths.

Additional tools are also available at www.dnagedcom.com where you will find analysis tools that utilize the matches found at the vendors (via downloads) but provide analysis and display in different ways.

Gedmatch, which works with your raw data and provides comparisons to others, and DNAGedcom.com which downloads your actual match information from the vendors are the great equalizers between vendors today, as much as possible given the vendor matching threshold limits in place internally.  No matter what, the third party tools can’t get more than the vendors give you.

What’s the bottom line?  Fish in all of the ponds, but understand the wide variance in the boundaries and the limitations of each pool.  There is more difference between vendors in ways that might not be initially apparent.



Autosomal DNA Testing 101 – What Now?

$
0
0

When I first started this blog, my goal was to provide explanations and examples of genetic genealogy topics so that there would be fewer questions and easier answers.

That sounded like a great idea, but the reality of the situation is that the consumer market for autosomal DNA testing has exploded – meaning more and more consumers with more and more questions.  Compounding that situation, the consumers who purchase these tests today, especially on impulse, and mostly I’m referring to Ancestry.com here, often have absolutely no idea what to expect or even what they want except that Ancestry will find their ancestors for them.  That’s because that’s what Ancestry tells them in their advertising.

So, in the big picture, the questions and inquiries that experienced people are currently receiving are becoming less specific and more general and often exhibit a lack of understanding of what DNA testing can do.  It’s frustrating to parties on both sides of the fence, but I’m glad people are asking because it means they are interested and willing to learn.

Rather than approach this topic from a technical perspective of how to work with autosomal DNA, I’d like to talk about what can be done with autosomal DNA testing from a newbie perspective.  The person who just got their results back and are saying to themselves, “OK, now what can I do with this?”

However, there is lots “how to” information in this article for everyone if you click on the links.  If nothing else, this gives you a tool to send to those overly excited newbies who are starry eyed but have no clue how to proceed.  Remember, you were once new too!

This is part 1 of a two part series.  The second part will focus on how to make contact with your matches successfully.  But now, let’s pretend it’s day 1 and you just got your autosomal test results back.

Why Did You Test?

The first question to ask yourself is why did you test in the first place?  If your answer is “because Ancestry had a sale,” that’s fine, but then you’ll need to read all four options to know what you can do with autosomal DNA.

1.  I want to meet other people I’m related to.

Ok, but the first thing here you’re going to have to define is the word “related.”  You are likely related to everyone on your match list.  I said likely, because there may be some people there whose DNA simply matches yours by chance.  For the most part, and especially for those people who are your closest matches, you’re related somehow. The challenge, of course, is to figure out how – meaning through which ancestor.  This is the genealogy jigsaw puzzle of you!

All three of the major vendors, Family Tree DNA, Ancestry and 23andMe show you your closest matches first on your match list.

autosomal 101 FTDNA

Do you want to meet your DNA cousins only if you can identify a common ancestor?  Do you want to work with them on genealogy? The answers to these questions will help sort through the rest of what to do and how.

If your goal is to contact your matches, then Family Tree DNA is the easiest, as they provide you with the e-mail addresses of your matches by clicking on the little envelope for each match on your match page, shown above.

Ancestry is second easiest, but forces you to use their internal message system which often doesn’t deliver the messages.  (Do not send more than 30 in one day or Ancestry will blacklist your messages and block your communications, thinking you are a spammer.)

23andMe is the most difficult as you have to request permission to communicate with each match and also to share DNA and if your match authorizes communication, then you can communicate through 23andMe’s message system.  Sound cumbersome?  It is and the response rate is low.

Confirming Genealogy

Let’s look at another reason for testing.

2.  I want to confirm my genealogy is correct – meaning that my great-grandfather really is my great-grandfather and so forth on up the line.

Well, you’re in luck, especially if some of your cousins, known or otherwise, have tested.  Confirming your genealogy is easier done in closer generations than more distant ones and the more cousins from various lines that have tested, the better.  That’s because you will share more of your DNA with relatives when you have a close common ancestor.

Autosomal DNA is divided approximately in half in each generation, when the child receives half of their DNA from each parent – so the closer your cousin, the more likely you are to share more DNA with them.  The more DNA you share, the more likely you are to be able to identify which ancestor it comes from.  And if a match matches you and your proven cousin both on the same segment, that identifies positively which line that match comes from.  That three way matching is called triangulation.

Let’s talk about the word “confirm.”  Herein lies a challenge, because DNA does have the absolute ability to confirm ancestors, as noted above.  DNA also has the ability to give you hints that go towards a “preponderance of evidence.”  DNA, can also lead you astray if you draw erroneous conclusions – and one vendor provides a tool (or tools) that encourages overstepping conclusions.  Let’s look at each circumstance.

Proof Positive through Triangulation

Just what it says – absolutely unquestionable proof that a particular ancestor is your ancestor.  If you match two other people who also descend from your common ancestors, Joe and Jane Doe, on the same segment of DNA, that is confirmation that you share that ancestor and that segment of your DNA is considered proven to that ancestral line.  This requires two things.  First, that your DNA matches on the same segment AND that you have identified the same ancestors, Joe and Jane Doe, genealogically in your trees.

Now, you probably can’t tell which side of the couple, Jane or Joe, the DNA is from unless you also match two people on just Jane’s side of the family or just Joe’s on that same segment.

One caveat here – counting you and your parent as two of the three people doesn’t work because you and your parents are too close in the tree.  By three people, that would preferably be three people who descend from that couple through three different children.

Here’s an example.

JohnDoe

It would also ideally be more than three people, but three is the minimum to form a triangulation group.  In the real world, these matches might not start and end of the same segments as in the example above, but the overlapping portion should be significant

The example above is proof positive, because the three people descend from the same ancestor, through different children, and match on the same chromosome in the same locations.

This technique is called triangulation.

Now for the bad news – you can’t do this at Ancestry.com, because they don’t provide you with any of the segment information in the last 5 columns.  Ancestry has no chromosome browser, which is the tool that shows you where on your DNA you match your cousins.

Family Tree DNA’s chromosome display tool that is part of their chromosome browser is shown below.

Two cousins browser

On the example above, you can see that Barbara Jean Long, the black background person on the chromosome graphic, is being compared to her two first cousins, the blue and orange on the chromosome graphic.

You can download the information from Family Tree DNA or 23andMe in spreadsheet format, or you can display the information graphically, like in the example above.  You can see the “stacked” locations where both the cousins match the black background person they are being compared to.  You can also see that there are some locations where only one of the cousins matches the background person, like on chromosome 20.  And of course, some locations where neither cousin matches the background person, like on chromosome 21.

If you download that data, the information gives you the locations where the people being compared match the person they are being compared against.

Two cousins combined

The chart above is the download of part of chromosome 1 for Barbara, Cheryl and Donald, siblings who are Barbara’s first cousins.

The areas where the 3 people overlap, or triangulate, are colored in green on the spreadsheet, while the rows entirely in pink or blue do not triangulate – meaning Barbara matches either one cousin or the other, but not both.  Keep in mind that this example only proves their common ancestral couple, which in this case are common grandparents – but the technique is the same no matter which common ancestor you are trying to prove.

This bring us to our next topic, that of close relatives.

Close Relative Matches

I previously said that you can’t use you and a close relative to prove a distant ancestor.  But that’s not necessarily true when the relationship you are trying to prove is closer in time.  The chart below shows the relationships of the example above.

Miller Ferverda chart

In the case shown above, two first cousins who are siblings, Cheryl and Don, are being compared to their common first cousin, Barbara.  Their fathers were siblings and their common ancestors were their grandparents.  This is not 6 generations up a tree where matching is iffy.  You can be expected to match closely with your first cousins where you may not match with more distant cousins, because you simply didn’t inherit any of the same DNA from your distant common ancestor.  You should be sharing about 12.5% of your DNA with first cousins, and if you have first cousins that you’re not matching, that might signal that an undocumented adoption has occurred in one line or the other.

In a case like this, if you and a first cousin match, that suffices to prove a close connection.  If you don’t match, it suffices to raise questions.  A lot of questions.  Big ugly questions.  The next thing to do is to see if any other known cousins have tested and who they match – or don’t match.

For example, if Barbara Ferverda was not the child of John Ferverda, she would not match either Cheryl nor Don, and we’d know there was a problem.  If Cheryl and Don match other Ferverda or Miller relatives and Barbara didn’t, then we’d know the genetic break in the line was on Barbara’s side and not on Cheryl/Don’s side.

This same technique is also how we know which “side” matches are on.  If an unknown match matches both Barbara and Cheryl, for example, it’s a good bet that their common ancestor is someplace in the Miller/Ferverda line.  If they also match another Miller on the same segment, then the common ancestor has been narrowed to the Miller side of the Miller/Ferverda couple.

Unfortunately, not all DNA results are as definitive or easy to prove as these.  Let’s look at some of the more “squishy” results.

Preponderance of Evidence through Aggregated Data

In regular genealogy, there are a range of proofs.  There is direct evidence that someone is the child of an ancestor.  That would be a will, for example, that names a daughter and her husband and maybe even tells where they moved to.  This would be your lucky day!

Think of that will as equivalent to triangulated proof of a common ancestor.  There is just no arguing with the evidence.

If you’re not that lucky, you have to piece the shreds of indirect evidence together to make a story.  In the genealogy world, this is called preponderance of evidence, and I am always, always much less comfortable with this type of evidence than I am with solid proof.

There are various flavors of pieces of evidence in the DNA world. Sometimes we have hints of relationships without proof.

The most common is when you have matches with a group of people who share the same surname, but you can’t get back far enough to find a common ancestor.  Is this a probable match?  Yes?  Guaranteed?  No.  Have I seen them fall apart and the actual match be on another entirely unrelated line?  Yes.  See why I call these squishy?

Ancestry takes this one step further with their DNA Circles.  For a DNA Circle to be created, you must match DNA with someone in the Circle AND everyone in the Circle must match DNA with someone else in the Circle AND everyone in the Circle must have a common ancestor in their tree.  Circles begin with a minimum of three people.  Generally, the more people who match AND have the same ancestor, the stronger the likelihood that you would be able to confirmation the common ancestor of the group as your ancestor too – if you had a chromosome browser type of tool.  Still, Circles alone are not and never will be, proof.  Circles are great hints and along with other research, can confirm genealogical research.  For example, my paper genealogy says I descend from Henry Bolton, and I find myself in Henry Bolton’s tree, matching several other Bolton descendants through Henry’s other children.  Those multiple connections pretty well confirms the paper trail is accurate and no undocumented adoptions have occurred in my line.

Now, the bad news….Circles is predicated upon matching of trees.  If there is a common misconception out there that is replicated in these trees, then people who match will be shown in a Circle predicated on bad information.  And, there is no way to know.  However, people interpret the existence of a DNA Circle as proof positive and that it confirms the tree.  Membership in a DNA Circle is absolutely NOT proof of any kind, let alone proof positive – except that your DNA matches the people who you are connected to by lines and their DNA matches the people they are connected to by lines.  You can see my connections in orange below, and the background connections in light grey.

circle henry bolton matches2

This is an example of my Henry Bolton Circle.  I match 5 different people’s DNA (the orange lines) who also show Henry Bolton as their ancestor.  This does NOT mean the match is on the same segment, so it is NOT triangulated.  This is a grouping of data where multiple people match each other, not a genetic triangulation group where everyone matches on the same segment.  In fact there are cases that I have found where the person I match in a circle is through a different line entirely, so in that case, the presumption of which common ancestor our common DNA is from is incorrect.

I want to be very clear, there is nothing wrong with DNA Circles, so far as they go.  The consumer needs to understand what Circles are really saying – and what they can’t and don’t say.  DNA Circles are another important tool in our arsenal.  We just have to be careful not to assume, or presume, more than is there.  Presuming that we match someone in the Circle because we share Henry Bolton’s DNA may in fact be inaccurate.  We may match on a completely unrelated line – but because we do match and share a common ancestor in our tree – we both find ourselves in the Henry Bolton Circle.

Are you reading those squishy words?  Presume – it’s related to the word assume…right???  And keep in mind that Circles are created based in part on those wonderfully accurate Ancestry trees.  Are you feeling good about this preponderance of evidence yet?

However, in my case, I’ve done due diligence with the genealogy and I have all of my proof ducks in a row.  The fact that I do match so many Bolton descendants confirms my work, along with the fact that at the other vendors and at GedMatch, I  have triangulated my matches and proven the Bolton DNA.  So, this circle is valid but the only proof I have is not found at Ancestry or because I’m a Circle member, but by triangulation and aggregated data using other vendor’s tools.

This next screen shot is of an exact triangulated match using GedMatch’s triangulation tool.  Each line shows me matching two cousins, along with the start and stop segments.  This just happens to be the Ferverda example.  So, I match six people, all on the same segment, all with a known common ancestor.  This is proof positive.  Not all “matching” is nearly so definitive.

Gedmatch triangulation

Sometimes the matches aren’t so neat and tidy. That’s when we move to using aggregated data.

Aggregated Data – What’s That?

Aggregated data is a term I’ve come up with because there isn’t any term to fit in today’s genetic genealogy vocabulary.  In essence, aggregated data is when a group of people (who may or may not know who their common ancestor is) match on common segments of data, but not necessarily on the same segments, or not all of the same segments.  When you have an entire group of these people, they form a stair step “right shift” kind of graph.

The interesting part of this is that by utilizing aggregated data and looking not only at who we match, but who our matches match that share a common ancestor, we can gain insight and hints.  Finding a common ancestor is of course a huge benefit in this type of situation because then you’ve identified at least a DNA “line” for the entire group.

If we were to utilize the triangulation tools at Gedmatch and look at my closest triangulated matches, they would look something like this, where the segments that I match with each person (or in this case, two people) shift some to the right.  What you are seeing is the start and stop match locations, with graphing.  Therefore, I match all of these people that have a common ancestor.

Each match overlaps the one above and below to come extent – and often by a lot.  These are known as triangulation groups (TG).

However, the top match and the bottom match do not overlap, so they don’t triangulate with each other.  They are still valid triangulated matches to me and you can expect to see this kind of matching when using aggregated data.

Understand that when you see your triangulation groups at GedMatch, your mother’s side and your father’s side will be intermixed. In this case, I know the common ancestor and I know many of these testers, so I’m positive that this is a valid grouping (plus, they all match my Mom too – the best test of all.)

gedmatch triang group

Here’s another example only showing three matches.  All three are triangulated to me through the same ancestor, but the locations of the top and bottom matches don’t overlap with each other.  Both overlap the one in the middle in part.

gedmatch overlap

New Ancestor Discoveries – Not Evidence at All

Let’s look at the third reason for DNA testing.

3.  I want to find new ancestors.

Discovering brand new ancestors is a bit tougher.

There are two ways to discover new ancestors.  The first is through triangulation combined with traditional genealogy.  I have done this, but in these cases, I did have a clue as to what I was looking for.  In other words, the new ancestor I discovered was actually confirming a wife’s surname or identifying the parents of an ancestor from several potential candidate couples.

The second way to potentially discover a new ancestor is Ancestry’s New Ancestor Discoveries, NADs, which is really a somewhat misleading name.  What Ancestry has determined is that you match a group of people who share a common ancestor – and Ancestry’s leap of faith is that you share that ancestor do too.  While that may not be correct, what IS very relevant is that you do match this group of people who DO share a common lineage and there is an important hint there for you someplace!  But don’t just accept Ancestry’s discovery as your new ancestor – because there is a good chance it isn’t.  Let’s take a look.

Ancestral Lines Through Triangulation

Let’s go back to the John Doe example.

JohnDoe

Let’s take the worst case scenario.  You’re an adopted and have no information.  But you match an entire group of people in a triangulated group who DO know the identity of their common ancestor.

Does this mean that John Doe is your ancestor?  No.  John Doe could be your ancestor, or he could be the brother of your ancestor, or the uncle of your ancestor.  What this does tell you is that either John Doe is your ancestor, some of John Doe’s ancestors are your ancestors, or you are extremely unlucky and you are matching this entire group by chance.  The larger the segment, the less likely your match will be by chance.  Over 10 cM you’re pretty safe on an individual match and I think you’re safe with triangulated groups well below 10 cM.

Ancestry’s New Ancestor Discoveries

You can make this same type of discovery at Ancestry, but it’s not nearly as easy as Ancestry implies in their ads and you have no segment data to work with, just their match, shown below.

Larimer NAD

“Just take the test and we’ll find your ancestors,” the ad says.  Well, yes and no and “it depends.”

Ancestry went out on a limb a few months ago, right about April Fools Day, and frankly, they fell off the end of the branch by claiming that New Ancestor Discoveries are your missing ancestors found.  While that is clearly an overly optimistic marketing statement, the concept of matching you with people you match who all share a common ancestor is sound – it was the implementation and hyper-marketing that was flawed.

The premise here is that if you match people in a Circle that have a common ancestor, that you too might, please note the word might, share that ancestor – even if that person is not in your tree.  In other words, even if you don’t know who they are.  Just like the John Doe triangulation example above.

Here is my connection to the Larimer DNA Circle, even though I don’t know of a Larimer ancestor.

Larimer NAD circle

Now, the problem is that you might be related to an ancestor on one side upstream several generations, but it’s manifesting itself as a match to that particular couple because several people of that couple’s descendants have tested.  I’ve shown an example of how this might work below.

common unknown ancestor

In this example, you can see that your true common ancestor is unknown to both groups of people, but it’s not Mary Johnson and John Jones, or in my case, not John and Jane Larimer.

However, three descendants of Mary Johnson and John Jones tested, and you match all three.  If you also showed Mary Johnson and John Jones in your tree, then you’d be in a Circle with them at Ancestry.  However, since Mary Johnson and John Jones are NOT your ancestors, they are not in your tree.  Since you match three of their descendants, Ancestry concludes that indeed, Mary Johnson and John Jones must also be your ancestors.

While NADs are inaccurate about half the time, the fact that you do share DNA with the people in this group is important, because someplace, upstream, it’s likely that you share a common ancestor.  It’s also possible that you match these three people through unconnected ancestors upstream and it’s a fluke that they all three also descend from this couple.  And yes, that does happen, especially when all of the people involved have ancestors from the same region.

The first day that Ancestry rolled the New Ancestor Discoveries, I was assigned a couple that could not possibly be my ancestors.  I called them Bad NADs.

In my experience, there are more erroneous NADs out there than good ones.  I knew my original one was bad, as I had proof positive because I have triangulated my other lines.  Then, one day, my bad NAD was gone and now, a few weeks later, I have another assigned NAD couple that I have not been able to prove or disprove – the Larimers.  Truthfully, after the bad NAD fiasco, I haven’t spent a lot of time or effort because without tools, there is no place to go with this unless the people I match will download their results to GedMatch.  I’m hoping that a new tool to be released soon will help.

Here’s how NADs could be useful.  Let’s say that my Larimer matches download to GedMatch and I discover that they also match a triangulated group from my McDowell line.  Well, guess what – my Michael’s McDowell’s wife is unknown.  Might she be a Larimer?  Michael’s mother is also unknown.  Might she be a Larimer?  It gives me a line and a place to begin to work, especially if they share any common geography with my ancestors.

Even if the NADs aren’t my direct ancestors, this is still useful information, because somehow, I probably do connect to these people, even though my hands are somewhat tied.  However, labeling them New Ancestor Discoveries encourages people to jump to highly incorrect conclusions.  This isn’t even in the preponderance of evidence category, let alone proof.  It’s information that you can potentially use with other DNA tools (at GedMatch) and old fashioned genealogy to work on proving a connection to this line.  Nothing more.

So what is the net-net of this? Circles can count in the preponderance of evidence, especially in conjunction with other evidence, but NADs don’t.  Neither are proof.  If we were able to work with the segment data and compare it, we might very well be able to determine more, but Ancestry does not provide a chromosome browser, so we can’t.

Ancestor Chromosome Mapping

4.  I want to map my chromosomes to my ancestors so that I know which of my DNA I inherited from each ancestor.

If this is your DNA testing goal, you certainly did not start by testing with Ancestry.com, because they don’t have any tools to help you do this.  This tends to be a goal that people develop after they really understand what autosomal DNA testing can do for them.  In order to map your genome, you have to have access to segment information and you have to triangulate, or prove, the segments to each ancestor.  So count Ancestry out unless you can talk your matches into downloading their raw data files to either GedMatch or Family Tree DNA.  You’ll be testing with both Family Tree DNA and 23andMe and downloading your match information to a spreadsheet and utilizing the tools at www.gedmatch.com and www.dnagedcom.com.

Just so you get an idea of how much fun this can be, here’s my genome mapped to ancestors a few months ago.  I have more mapped now, but haven’t redone my map utilizing Kitty Cooper’s Tools.

Roberta's ancestor map2

Tips and Tricks for Contact Success

Regardless of which of these goals you had when you tested, or have since developed, now that you know what you can do – most of the options are going to require you to do something – often contacting your matches.

One thing that doesn’t happen is that your new genealogy is not delivered to you gift wrapped and all you have to do is open the box, untie the bow around the scroll, and roll it down the hallway.  That only happens on the genealogy TV shows:)

So join me in a few days for part two of Autosomal DNA Testing 101 – Tips and Tricks for Contact Success.


Autosomal DNA Testing 101 – Tips and Tricks for Contact Success

$
0
0

contact

In the first part of this two part series, Autosomal DNA Testing 101 – What Now?, we talked about the different kinds of things you can do when you receive your autosomal DNA test results from either Family Tree DNA, Ancestry or 23andMe.  There are, in general, 4 types of goals that people have when they test their autosomal DNA – if they have any specific goals:

  1. I want to meet people I’m related to.
  2. I want to confirm my genealogy is correct.
  3. I want to find new ancestors.
  4. I want to map my chromosomes to my ancestors.

Regardless of which of these goals you had when you tested, or have since developed, now that you know what is possible – most of the options are going to require you to do something – often contacting your matches.

One thing that doesn’t happen is that your new genealogy is not delivered to you gift wrapped and all you have to do is open the box, untie the bow around the scroll, and roll it down the hallway.  That only happens on the genealogy TV shows:)

Because of the different ways the various vendors have implemented their DNA matching software, there are different reasons why you might want to contact your matches.

23andMe

At 23andMe, you cannot send messages to your matches or share your matching DNA segments unless you obtain permission from each match to first communicate with them and then to share matching DNA segments, which can be one or two separate permissions.  23andMe has an internal messaging system that facilitates you sending a permission request to your matches.  Personalized messages work best.  If permission is granted, you can then begin a dialogue about common ancestors and how you might match that person.

Family Tree DNA

At Family Tree DNA, you are provided with the e-mail address of everyone that you match within each person’s privacy selections.  Participants can upload their GEDCOM files, create family trees and enter a list of ancestral surnames.  You can search by current or ancestral surname.  The most common reason to contact someone you match at Family Tree DNA is if you are a match to them and they have not uploaded or created a family tree.

Ancestry

Ancestry also uses an internal messaging system.  The most common reason to contact a DNA match at Ancestry is if you match someone, and especially if you share a shakey leaf hint with them, meaning you have a common ancestor in your trees – but your match’s tree is private and you can’t see who that common ancestor might be.

GedMatch

If you upload your results to www.gedmatch.com, a free (donation based) site, you can then match your results to people who tested at all 3 companies – if they also have uploaded their results.  People provide their e-mails when uploading and configuring their accounts at GedMatch.  People who use GedMatch are often the most excited and “into” autosomal DNA matching and therefore, the most likely to contact matches.

Regardless of where you are matching, it’s important to make that first communication attempt count.  At 23andMe, if your match declines contact, you can’t communicate with them.  If they don’t reply, you can delete that first contact attempt and try again, but your attempts are limited – so you really do have to make them count.

Here are some helpful hints and approaches that do and don’t work well.  Your goal is to obtain a helpful response, so you want to position yourself in the best possible light to get that response.

A faux pas may kill your chances, so let’s start out with what not to do, and why, then we’ll look at how to make your communications a winner!

Don’t!

  • Don’t send group e-mails to everyone you match saying, “Hi, we all match. Can you tell me how?” Guess what? You won’t get many or any replies and you’ll have irritated all of your matches in one fell swoop. This is considered DNA spamming. Think about what you are writing before you press that send button.
  • Don’t say things like this to people: “Hi, I’m guessing (or hoping) that you’ve mapped your (or your cousin’s) chromosomes and you can just tell me how we are related.” When I told this person I have not mapped my cousin’s chromosomes – they had the bad judgment to ask me when I might get around to it.
  • Don’t provide just a few surnames and ask if they are related. Most of your matches will be more than 2 or 3 generations back in the tree, so the answer is likely going to be “no,” or no answer at all.
  • Don’t offer to send them an ancestry invite. That means they have to sort through your entire tree to find a match, AND Ancestry will attach your tree to their list forever. Give the e-mail recipient something to work with in the e-mail itself. Don’t make your problem their problem or they won’t reply. The more work they have to do to reply, the less chance they will.
  • Don’t send multiple e-mails with dribs and drabs of information in each one. If you have something to share, put it together logically and concisely in an e-mail and send one.
  • Don’t assume that someone of a different ethnicity isn’t related to you.
  • Don’t assume a particular surname is indicative of a person’s entire genealogical background.
  • Don’t convey an entitlement attitude. Remember, you are asking them to take a few minutes of their time to help you.
  • Don’t assume that all of your matches are from the US, or that English is their primary language – so use full state names and locations. The good news is that more and more people are testing from around the world.
  • Don’t send messages in all caps.
  • Don’t send messages with misspellings, incorrect grammar or abbreviated texting language. Translated, this means your phone or i-pad with autocorrect is probably not a good idea.
  • Don’t send the entire request in the title of the message. Yes, people do this.
  • Don’t send a message with a title like “hi there.”  It’s likely to go to the spam folder or be over looked or ignored.  Instead, title each message with the name of the test, the testing company and whose DNA you are writing about.  In other words, something like this: “Autosomal DNA Match at Family Tree DNA to John Doe”
  • Don’t just send the “canned” request message at 23andMe. Send a personal note. If you have an online tree, include that link. If you notice you have ancestors from the same part of the world, or country, tell them. If you match their DNA, tell them. Some people send match requests because they notice a common surname. In other words, try to find some common ground to start a conversation.
  • Don’t dash off a hurried, half-baked, partially complete message.  It shows and will be reflected in the responses you do, and don’t, receive.
  • Don’t expect others to do your work for you.  Recently, I received a match contact and when I asked the sender for the name of the person they matched, they told me they couldn’t remember, they had sent out a “mass mailing,” and asked me to check my kits and see if there were matches to them.  Seriously?  They also didn’t tell me the testing company name, nor the test type.  Three e-mails later, I still don’t know the name of the person they matched.  Guess what.  Delete!  Make it easy for your matches to help you and don’t waste their time by only providing partial information.

Do!

  • Read your matches profiles if they have provided one. It shows you took the time to read what they provided, and may give you some common ground out the door. “I see we both have ancestors from the Netherlands,” is a good icebreaker, for example.
  • Address the e-mail to the person using their name if it’s available. In other words, begin, “Hi Joe” not just “Hi.” Do not assume a gender. Names can be deceiving. My name is not deceiving, Roberta, but I can’t tell you how many e-mails I receive to Robert or “Mr. Estes.” This tells me they didn’t pay attention.
  • Do use capital letters and punctuation.  Otherwise, you’re telling the person on the receiving end they aren’t important enough to bother with – and they will likely treat your request in kind.
  • Enter information about yourself in your profile at the vendors, including your country of origin.
  • Upload a photo of yourself into your profile at the vendor so that people can see you. This makes you seem more like a real person and they may look at you for family resemblance. Probably shouldn’t upload a photo that might be controversial or off-putting if your goal is to maximize response.
  • Link your tree to your DNA results (Ancestry) or upload a Gedcom file (Family Tree DNA.) 23andMe is more challenging since their collaboration with My Heritage which is a subscription service. Most people simply put a link to their public tree someplace in their profile information at 23andMe.
  • Provide your name and kit number or other identifying information in all correspondence – including the first e-mail.
  • Include kit numbers (GedMatch) and/or names (Family Tree DNA) that you’re matching. Many people manage multiple kits for family members and if they have to go and look for you in their kits’ matches, they won’t.  Don’t make the recipient have to guess at any part of the equation.  Say something like this, “Hi, I match John Smith’s autosomal DNA test at Family Tree DNA and you are his e-mail contact…”
  • Tell them where you tested and where you are matching them. “Hi, I tested at Ancestry and downloaded by kit to GedMatch where I’m kit number A100007. I’m matching kit F9141, Jane Doe, where you are listed as the contact.”  Be sure to get the name of the testing company right.  Today, someone told me the test was through “Family Search,” who, of course, does not do DNA testing.
  • If you are matching on a Y or mitochondrial DNA test, tell them at what level you’re matching.  Otherwise, they have to search through each level to find you.  On mitochondrial DNA, if you and they both tested to the full sequence level, but you’re only matching on the HVR1 level, it’s not nearly as compelling or interesting as if you match at the full sequence level with no mutations difference.  So, tell them, “I’m a match to John Doe at Family Tree DNA at the full mitochondrial level, with no mutations difference.  Maybe we can find our common ancestor.  My direct mitochondrial line is….”
  • If you are matching at GedMatch and you lowered the match threshold from the default, tell them. Better yet, don’t lower the threshold, at least not for initial comparisons.
  • Make replying to your query as easy as possible. You stand a much better chance of getting a reply. The more work you make them do, the less chance you’ll get a reply.
  • Include your full name and e-mail address if you are using Ancestry’s or 23andMe’s message systems.
  • Get your facts straight. I recently received an e-mail from someone who told me that we matched on 21% of our DNA and one segment. I knew that was impossible because 21% is in the half sibling range and if you’re a half sibling – you will match on a whole lot more than one segment. If you don’t pay attention and get your facts straight, it’s less likely that the person you are contacting will take you seriously.
  • Accept contact requests if you tested at 23andMe and receive a contact or sharing request, and be sure to share genomes so that you can see how you match and use their comparison tools like their Family Inheritance: Advanced.
  • Include a very brief, maybe two sentence summary about yourself in contact requests. Something like. “It appears we may match on my father’s side which is primarily from Appalachia, which means they were Scotch-Irish and British before that” or “My maternal heritage is from Scandinavia, so the names may not look familiar to you. My mother’s family is from the area near Stockholm.” Do not tell them your life story or ramble. You’ll lose them.
  • Send a pedigree chart (preferably with an index) in pdf format if you’re using e-mail or a link to a tree. I have a pedigree chart for my mother’s side and my father’s side. I can tell which side they match because my mother has tested as well. One of the best tools I have ever received with a query is shown below. It was sent as a spreadsheet, which made it incredibly easy for me to sort, but wouldn’t work for everyone. It could be sent as a pdf file as well, and is very easy to scan for surnames and locations. I immediately liked this person and absolutely knew they were serious and we stood a chance of making a genealogy connection.  (Click on the image to make larger in a separate window.)

ancestor spreadsheet

  • Take the time to learn about autosomal DNA, matching and what it means. Aside from the many articles on this blog which you can find by using the key search word “autosomal,” here are four additional resources for you:

Genetic Genealogy: The Basics and Beyond by Emily Aulicino
NextGen Genealogy: The DNA Connection by David Dowell
DNA Adoption’s classes
Beginners Guide to Genetic Genealogy by Kelly Wheaton

  • If you’re adopted or searching for an unknown parents or grandparent, visit www.dnaadoption.com.
  • If you have a blog or genealogy webpage, include that information, maybe below your signature.
  • If you’re serious about maximizing your opportunities for success with genetic genealogy, you’ll want to test at all 3 companies, Family Tree DNA (Y, mtDNA and autosomal), Ancestry and 23andMe. Family Tree DNA facilitates reduced cost file transfers from Ancestry and from 23andMe if you tested before Dec. 2013 (when 23andMe changed their chip.) They all have their strong and weak points – but the bottom line is that you’ll want to fish in all three ponds. You’ll also want to download your results from one of those companies, preferably Family Tree DNA or Ancestry, to www.gedmatch.com, a site that facilitates comparison of data from the various companies and provides some great tools. GedMatch is a contribution site, so don’t forget to donate. Some of their Tier 1 tools require a minimal subscription of $10 a month, which is well worth it if you are serious. Ask your matches if they have downloaded their data to GedMatch and provide your kit number there.
  • Be courteous and gracious. Say please and thank you. You’d be amazed how many people say neither.
  • Share this article with eager newbies who need a little direction. Most newbies aren’t going to find this article before shooting off that e-mail in their initial excitement to an entire group of matches. By helping them to better focus their efforts, you’ll be helping yourself too. Most newbies have no idea what they’ve just gotten themselves into!

Acknowledgements:  Thanks to contributors in the ISOGG Facebook group for helping to flesh out these tips for success.


Ethnicity Testing and Results

$
0
0

I have written repeatedly about ethnicity results as part of the autosomal test offerings of the major DNA testing companies, but I still receive lots of questions about which ethnicity test is best, which is the most accurate, etc.  Take a look at “Ethnicity Percentages – Second Generation Report Card” for a detailed analysis and comparison.

First, let’s clarify which testing companies we are talking about.  They are:

Let’s make this answer unmistakable.

  1. Some of the companies are somewhat better than others relative to ethnicity – but not a lot.
  2. These tests are reasonably reliable when it comes to a continent level test – meaning African, European, Asian and sometimes, Native American.
  3. These tests are great at detecting ancestry over 25% – but if you know who your grandparents are – you already have that information.
  4. The usefulness of these tests for accurately providing ethnicity information diminishes as the percentage of that minority admixture declines.  Said another way – as your percentage of a particular ethnicity decreases, so does the testing companies’ ability to find it.
  5. Intra-continental results, meaning within Europe, for example, are speculative, at best.  Do not expect them to align with your known genealogy.  They likely won’t – and if they do at one vendor – they won’t at others.  Which one is “right”?  Who knows – maybe all of them when you consider population movement, migration and assimilation.
  6. As the vendors add to and improve their data bases, reference populations and analysis tools, your results change. I discussed how vendors determine your ethnicity percentages in the article, “Determining Ethnicity Percentages.”
  7. Sometimes unexpected results, especially continent level results, are a factor of ancient population mixing and migrations, not recent admixture – and it’s impossible to tell the difference. For example, the Celts, from the Germanic area of Europe also settled in the British Isles. Attila the Hun and his army, from Asia, invaded and settled in what is today, Germany, as well as other parts of Eastern Europe.
  8. Ethnicity tests are unreliable in consistently detecting minority admixture. Minority in this context means a small amount, generally less than 5%.  It does not refer to any specific ethnicity. Having said that, there are very few reference data base entries for Native American populations.  Most are from from Canada and South America.

In the context of ethnicity, what does unreliable mean?

Unreliable means that the results are not consistent and often not reproducible across platforms, especially in terms of minority admixture.  For example, a German/Hungarian family member shows Native American admixture at low percentages, around 3%, at some, but not all, vendors.  His European family history does not reflect Native heritage and in fact, precludes it.  However, his results likely reflect Native American from a common underlying ancestral population, the Yamnaya, between the Asian people who settled Hungary and parts of Germany and also contributed to the Native American population.

Unreliable can also mean that different vendors, measuring different parts of your DNA, can assign results to different regions.  For example, if you carry Celtic ancestry, would you be surprised to see Germanic results and think they are “wrong?”  Speaking of Celts, they didn’t just stay put in one region within Europe either.  And who were the Celts and where did they ‘come from’ before they were Celts.  All of this current and ancient admixture is carried in your DNA.  Teasing it out and the meaning it carries is the challenge.

Unreliable may also mean that the tests often do not reflect what is “known” in terms of family history.  I put the word “known” in quotes here, because oral history does not constitute “known” and it’s certainly not proof.  For the most part, documented genealogy does constitute “known” but you can never “know” about an undocumented adoption, also referred to as a “nonparental event” or NPE.  Yes, that’s when one or both parents are not who you think they are based on traditional information.  With the advent of DNA testing, NPEs can, in some instances, be discovered.

So, the end result is that you receive very interesting information about your genetic history that often does not correlate with what you expected – and you are left scratching your head.

However, in some cases, if you’re looking for something specific – like a small amount of Native American or African ancestry, you, indeed, can confirm it through your DNA – and can confirm your family history.  One thing is for sure, if you don’t test, you will never know.

Minority Admixture

Let’s take a look at how ethnicity estimates work relative to minority admixture.

In terms of minority admixture, I’m referring to admixture that is several generations back in your tree.  It’s often revealed in oral history, but unproven, and people turn to genetic genealogy to prove those stories.

In my case, I have several documented Native American lines and a few that are not documented.  All of these results are too far back in time, the 1600s and 1700s, to realistically be “found” in autosomal admixture tests consistently.  I also have a small amount of African admixture.  I know which line this comes from, but I don’t know which ancestor, exactly.  I have worked through these small percentages systematically and documented the process in the series titled, “The Autosomal Me.”  This is not an easy or quick process – and if quick and easy is the type of answer you’re seeking – then working further, beyond what the testing companies give you, with small amounts of admixture, is probably not for you.

Let’s look at what you can expect in terms of inheritance admixture.  You receive 50% of your DNA from each parent, and so forth, until eventually you receive very little DNA (or none) from your ancestors from many generations back in your tree.

Ethnicity DNA table

Let’s put this in perspective.  The first US census was taken in 1790, so your ancestors born in 1770 should be included in the 1790 census, probably as a child, and in following censuses as an adult.  You carry less than 1% of this ancestor’s DNA.

The first detailed census listing all family members was taken in 1850, so most of your ancestors that contributed more than 1% of your DNA would be found on that or subsequent detailed census forms.

These are often not the “mysterious” ancestors that we seek.  These ancestors, whose DNA we receive in amounts over 1%, are the ones we can more easily track through traditional means.

The reason the column of DNA percentages is labeled “approximate” is because, other than your parents, you don’t receive exactly half of your ancestor’s DNA.  DNA is not divided exactly in half and passed on to subsequence generations, except for what you receive from your parents.  Therefore, you can have more or less of any one ancestor’s individual DNA that would be predicted by the chart, above.  Eventually, as you continue to move further out in your tree, you may carry none of a specific ancestor’s DNA or it is in such small pieces that it is not detected by autosomal DNA testing.

The Vendors

At least two of the three major vendors have made changes of some sort this year in their calculations or underlying data bases.  Generally, they don’t tell us, and we discover the change by noticing a difference when we look at our results.

Historically, Ancestry has been the worst, with widely diverging estimates, especially within continents.  However, their current version is picking up both my Native and African.  However, with their history of inconsistency and wildly inaccurate results, it’s hard to have much confidence, even when the current results seem more reasonable and in line with other vendors.  I’ve adopted a reserved “wait and see” position with Ancestry relative to ethnicity.

Family Tree DNA’s Family Finder product is in the middle with consistent results, but they don’t report less than 1% admixture which is often where those distant ancestors’ minority ethnicity would be found, if at all.  However, Family Tree DNA does provide Y and mitochondrial mapping comparisons, and ethnicity comparisons to your matches that are not provided by other vendors.

Ethnicity DNA matches

In this view, you can see the matching ethnicity percentages for those whom you match autosomally.

23andMe is currently best in terms of minority ethnicity detection, in part, because they report amounts less than 1%, have a speculative view, which is preferred by most genetic genealogists and because they paint your ethnicity on your chromosomes, shown below.  You can see that both chromosome 1 and 2 show Native segments.

Ethnicity 23andMe chromosome

So, looking at minority admixture only – let’s take a look at today’s vendor results as compared to the same vendors in May 2014.

Ethnicity 2014-2015 compare

The Rest of the Story

Keep in mind, we’re only discussing ethnicity here – and there is a lot more to autosomal DNA testing than ethnicity – for example – matching to cousins, tools, such as a chromosome browser (or lack thereof), trees, ease of use and ability to contact your matches.  Please see “Autosomal DNA 2015 – Which Test is the Best?”  Unless ethnicity is absolutely the ONLY reason you are DNA testing, then you need to consider the rest of the story.

And speaking of the rest of the story, National Geographic has been pretty much omitted from this discussion because they have just announced a new upgrade, “Geno 2.0: Next Generation,” to their offering, which promises to be a better biogeographical tool.  I hope so – as National Geographic is in a unique position to evaluate populations with their focus on sample collection from what is left of unique and sometimes isolated populations.  We don’t have much information on the new product yet, and of course, no results because the new test won’t be released until in September, 2015.  So the jury is out on this one.  Stay tuned.

GedMatch – Not A Vendor, But a Great Toolbox

Finally, most people who are interested in ethnicity test at one (or all) of the companies, utilize the rest of the tools offered by that company, then download their results to www.gedmatch.com, a donation based site, and make use of the numerous contributed admixture tools there.

Ethnicity GedMatch

GedMatch offers lots of options and several tools that provide a wide range of focus.  For example, some tools are specifically written for European, African, Asian or even comparison against ancient DNA results.

Ethnicity ancient admixture

Conclusion

So what is the net-net of this discussion?

  1. There is a lot more to autosomal DNA testing than just ethnicity – so take everything into consideration.
  2. Ethnicity determination is still an infant and emerging field – with all vendors making relatively regular updates and changes. You cannot take minority results to the bank without additional and confirming research, often outside of genetic genealogy. However, mitochondrial or Y DNA testing, available only through Family Tree DNA, can positively confirm Native or minority ancestry in the lines available for testing. You can create a DNA Pedigree Chart to help identify or eliminate Native lines.
  3. If the ancestors you seek are more than a few generations removed, you may not carry enough of their ethnic DNA to be identified.
  4. Your “100% Cherokee” ancestor was likely already admixed – and so their descendants may carry even less Native DNA than anticipated.
  5. You cannot prove a negative using autosomal DNA (but you can with both Y and mitochondrial DNA). In other words, a negative autosomal ethnicity result alone, meaning no Native heritage, does NOT mean your ancestors were not Native. It MIGHT mean they weren’t Native. It also might mean that they were either very admixed or the Native ancestry is too far back in your tree to be found with today’s technology. Again, mitochondrial and Y DNA testing provide confirmed ancestry identification for the lines they represent. Y is the male paternal (surname) line and mitochondrial is the matrilineal line of both males and females – the mother’s, mother’s, mother’s line, on up the tree until you run out of mothers.
  6. It is very unlikely that you will be able to find your tribe, although it is occasionally possible. If a company says they can do this, take that claim with a very big grain of salt. Your internal neon warning sign should be flashing about now.
  7. If you’re considering purchasing an ethnicity test from a company other than the four I mentioned – well, just don’t.  Many use very obsolete technology and oversell what they can reliably provide.  They don’t have any better reference populations available to them than the major companies and Nat Geo, and let’s just say there are ways to “suggest” people are Native when they aren’t. Here are two examples of accidental ways people think they are Native or related – so just imagine what kind of damage could be done by a company that was intentionally providing “marginal” or misleading information to people who don’t have the experience to know that because they “match” someone who has a Native ancestor doesn’t mean they share that same Native ancestor – or any connection to that tribe. So, stay with the known companies if you’re going to engage in ethnicity testing. We may not like everything about the products offered by these companies, but we know and understand them.

My Recommendation

By all means, test.

Test with all three companies, 23andMe, Family Tree DNA and Ancestry – then download your results from either Family Tree DNA or Ancestry (who test more markers than 23andMe) to GedMatch and utilize their ethnicity tools.  When I’m looking for minority admixture, I tend to look for consistent trends – not just at results from any one vendor or source.

If you have already tested at Ancestry, or you tested at 23andMe on the V3 chip, prior to December 2013, you can download your raw data file to Family Tree DNA and pay just $39.  Family Tree DNA will process your raw data within a couple days and you will then see your myOrigins ethnicity results as interpreted by their software.  Of course, that’s in addition to having access to Family Tree DNA’s other autosomal features, functions and tools.  The transfer price of $39 is significantly less expensive than retesting.

Just understand that what you receive from these companies in terms of ethnicity is reflective of both contemporary and ancient admixture – from all of your ancestral lines.  This field is in its infancy – your results will change from time to time as we learn – and the only part of ethnicity that is cast in concrete is probably your majority ancestry which you can likely discern by looking in the mirror.  The rest – well – it’s a mystery and an adventure.  Welcome aboard to the miraculous mysterious journey of you, as viewed through the DNA of your ancestors!


Autosomal DNA Matching Confidence Spectrum

$
0
0

Are you confused about DNA matches and what they mean…different kinds of matches…from different vendors and combined results between vendors.  Do you feel like lions and tigers and bears…oh my?  You’re not alone.

As the vendors add more tools, I’ve noticed recently that along with those tools has come a significant amount of confusion surrounding matches and what they mean.  Add to this issue confusion about the terminology being used within the industry to describe various kinds of matches.  Combined, we now have a verbiage or terminology issue and we have confusion regarding the actual matches and what they mean.  So, as people talk, what they mean, what they are trying to communicate and what they do say can be interpreted quite widely.  Is it any wonder so many people are confused?

I reached out within the community to others who I know are working with autosomal results on a daily basis and often engaged in pioneering research to see how they are categorizing these results and how they are referring to them.

I want to thank Jim Bartlett, Blaine Bettinger, Tim Janzen and David Pike (in surname alphabetical order) for their input and discussion about these topics.  I hope that this article goes a long way towards sorting through the various kinds of matches and what they can and do mean to genetic genealogists – and what they are being called.  To be clear, the article is mine and I have quoted them specifically when applicable.

But first, let’s talk about goals.

Goals

One thing that has become apparent over the past few months is that your goals may well affect how you interpret data.  For example, if you are an adoptee, you’re going to be looking first at your closest matches and your largest segments.  Distant matches and small segments are irrelevant at least until you work with the big pieces.  The theory of low hanging fruit, of course.

If your goal is to verify and generally validate your existing genealogy, you may be perfectly happy with Ancestry’s Circles.  Ancestry Circles aren’t proof, as many people think, but if you’re looking for low hanging fruit and “probably” versus “positively,” Ancestry Circles may be the answer for you.

If you didn’t stop reading after the last sentence, then I’m guessing that “probably” isn’t your style.

If your goal is to prove each ancestor and/or map their segments to your DNA, you’re not going to be at all happy with Ancestry’s lack of segment data – so your confidence and happiness level is going to be greatly different than someone who is just looking to find themselves in circles with other descendants of the same ancestor and go merrily on their way.

If you have already connected the dots on most of your ancestry for the past 4 or 5 generations, and you’re working primarily with colonial ancestors and those born before 1700, you may be profoundly interested in small segment data, while someone else decides to eliminate that same data on their spreadsheet to eliminate clutter.  One person’s clutter is another’s goldmine.

While, technically, the different types of tests and matches carry a different technical confidence level, your personal confidence ranking will be influenced by your own goals and by some secondary factors like how many other people match on a particular segment.

Let’s start by talking about the different kinds of matching.  I’ve been working with my Crumley line, so I’ll be utilizing examples from that project.

Individual Matching, Group Matching and Triangulation

There is a difference between individual matching, group matching and triangulation.  In fact, there is a whole spectrum of matching to be considered.

Individual Matching

Individual matching is when someone matches you.

confidence individual match

That’s great, but one match out of context generally isn’t worth much.  There’s that word, generally, because if there is one thing that is almost always true, it’s that there is an exception to every rule and that exception often has to do with context.  For example, if you’re looking for parents and siblings, then one match is all you need.

If this match happens to be to my first cousin, that alone confirms several things for me, assuming there is not a secondary relationship.  First, it confirms my relationship with my parent and my parent’s descent from their parents, since I couldn’t be matching my first cousin (at first cousin level) if all of the lines between me and the cousin weren’t intact.

confidence cousins

However, if the match is to someone I don’t know, and it’s not a close relative, like the 2nd to 4th cousins shown in the match above, then it’s meaningless without additional information.  Most of your matches will be more distant.  Let’s face it, you have a lot more distant cousins than close cousins.  Many ancestors, especially before about 1900, were indeed, prolific, at least by today’s standards.

So, at this point, your match list looks like this:

confidence match list

Bridget looks pretty lonely.  Let’s see what we can do about that.

Matching Additional People

The first question is “do you share a common ancestor with that individual?”  If yes, then that is a really big hint – but it’s not proof of anything – unless they are a close relative match like we discussed above.

Why isn’t a single match enough for proof?

You could be related to this person through more than one ancestral line – and that happens far more than I initially thought.  I did an analysis some time back and discovered that about 15% of the time, I can confirm a secondary genealogical line that is not related to the first line in my tree.  There were another 7% that were probable – meaning that I can’t identify a second common ancestor with certainty, but the surname and location is the same and a connection is likely.  Another 8% were from endogamous lines, like Acadians, so I’m sure there are multiple lines involved.  And of those matches (minus the Acadians), about 10% look to have 3 genealogical lines, not just two.  The message here – never assume.

When you find one match and identify one common genealogical line, you can’t assume that is how you are genetically related on the segment in question.

Ideally, at this point, you will find a third person who shares the common ancestor and their DNA matches, or triangulates, between you and your original match to prove the connection.  But, circumstances are not always ideal.

What is Triangualtion?

Triangulation on the continuum of confidence is the highest confidence level achievable, outside of close relative matching which is evident by itself without triangulation.

Triangulation is when you match two people who share a common ancestor and all three of you match each other on that same segment.  This means that segment descended to all three of you from that common ancestor.

This is what a match group would look like if Jerry matches both John and Bridget.

confidence example 1 match group

Example 1 – Match Group

The classic definition of triangulation is when three people, A, B and C all match each other on the same segment and share a known, identifiable common ancestor.  Above, we only have two.  We don’t know yet if John matches Bridget.

A matches B
A matches C
B matches C

This is what an exact triangulation group would look like between Jerry, John and Bridget.  Most triangulation matches aren’t exact, meaning the start and/or end segment might be different, but some are exact.

confidence example 2 triangulation group

Example 2 – Triangulation Group

It’s not always possible to prove all three.  Sometimes you can see that Jerry matches Bridget and Jerry matches John, but you have no access to John or Bridget’s kits to verify that they also match each other.  If you are at Family Tree DNA, you can run the ICW (in common with) tool to see if John and Bridget do match each other – but that tool does not confirm that they match on the same segment.

If the individuals involved have uploaded their kits to GedMatch, you have the ability to triangulate because you can see the kit numbers of your matches and you can then run them against each other to verify that they do indeed match each other as well.  Not everyone uploads their kits to GedMatch, so you may wind up with a hybrid combination of triangulated groups (like example 2, above) and matching groups (like example 1, above) on your own personal spreadsheet.

Matching groups (that are not triangulated) are referred to by different names within the community.  Tim Janzen refers to them as clusters of cousins, Blaine as pseudo triangulation and I have called them triangulation groups in the past if any three within the group are proven to be triangulated. Be careful when you’re discussing this, because matching groups are often misstated as triangulated groups.  You’ll want to clarify.

Creating a Match List

Sometimes triangulation options aren’t available to us.  For example, at Family Tree DNA, we can see who matches us, and we can see if they match each other utilizing the ICW tool, but we can’t see specifically where they match each other.  This is considered a match group.  This type of matching is also where a great deal of confusion is introduced because these people do match each other, but they are NOT (yet) triangulated.

What we know is that all of these people are on YOUR match list, but we don’t know that they are on each other’s match lists.  They could be matching you on different sides of your DNA or, if smaller segments, they might be IBC (identical by chance.)

You can run the ICW (in common with) tool at Family Tree DNA for every match you have.  The ICW tool is a good way to see who matches both people in question.  Hopefully, some of your matches will have uploaded trees and you can peruse for common ancestors.

The ICW tool is the little crossed arrows and it shows you who you and that person also match in common.

confidence match list ftdna

You can run the ICW tool in conjunction with the ancestral surname in question, showing only individuals who you have matches in common with who have the Crumley surname (for example) in their ancestral surname list.  This is a huge timesaver and narrows your scope of search immediately.  By clicking on the ICW tool for Ms. Bridget,  you see the list, below of those who match both the person whose account we are signed into and Ms. Bridget, below.

confidence icw ftdna

Another way to find common matches to any individual is to search by either the current surname or ancestral surnames.  The ancestral surname search checks the surnames entered by other participants and shows them in the results box.

In the example above, all of these individuals have Crumley listed in their surnames.  You can see that I’ve sorted by ancestral surname – as Crumley is in that search box.

Now, your match lists looks like this relative to the Crumley line.  Some people included trees and you can find your common ancestor on their tree, or through communications with them directly.  In other cases, no tree but the common surname appears in the surname match list.  You may want to note those results on your match list as well.

confidence match list 2

Of course, the next step is to compare these individuals in a matrix to see who matches who and the chromosome browser to see where they match you, which we’ll discuss momentarily.

Group Matching

The next type of matching is when you have a group of people who match each other, but not necessarily on the same segment of DNA.  These matching groups are very important, especially when you know there is a shared ancestor involved – but they don’t indicate that the people share the same segment, nor that all (or any) of their shared segments are from this particular ancestor.  Triangulation is the only thing that accomplishes proof positive.

This ICW matrix shows some of the Crumley participants who have tested and who matches whom.

confidence icw grid

You can display this grid by matching total cM or by known relationship (assuming the individuals have entered this information) or by predicted relationship range.  The total cMs shared is more important for me in evaluating how closely this person might be related to the other individual.

The Chromosome Browser

The chromosome browser at Family Tree DNA shows matches from the perspective of any one individual.  This means that the background display of the 22 Chromosomes (plus X) is the person all of the matches are comparing against. If you’re signed in to your account, then you are the black background chromosomes, and everyone is being compared against your DNA.  I’m only showing the first 6 chromosomes below.

confidence chromosome browser

You can see where up to 5 individuals match the person you’re comparing them to.  In this case, it looks like they may share a common segment on chromosome 2 among several descendants.  Of course, you’d need to check each of these individuals to insure that they match each other on this same segment to confirm that indeed, it did come from a common ancestor.  That’s triangulation.

When you see a grouping of matches of individuals known to descend from a common ancestor on the same chromosome, it’s very likely that you have a match group (cluster of cousins, pseudo triangulation group) and they will all match each other on that same segment if you have the opportunity to triangulate them, but it’s not absolute.

For example, below we have a reconstructed chromosome 8 of James Crumley, the common ancestor of a large group of people shown based on matches.  In other words, each colored segment represents a match between two people.  I have a lot more confidence in the matches shown with the arrows than the single or less frequent matches.

confidence chromosome 8 match group'

This pseudo triangulation is really very important, because it’s not just a match, and it’s not triangulation.  The more people you have that match you on this segment and that have the same ancestor, the more likely that this segment will triangulate.  This is also where much of the confusion is coming from, because matching groups of multiple descendants on the same segments almost always do triangulate so they have been being called triangulation groups, even when they have not all been triangulated to each other.  Very occasionally, you will find a group of several people with a common ancestor who triangulate to each other on this common segment, except one of a group doesn’t triangulate to one other, but otherwise, they all triangulate to others.

confidence triangulation issue

This situation has to be an error of some sort, because if all of these people match each other, including B, then B really must match D.  Our group discussed this, and Jim Bartlett pointed out that these problem matches are often near the vendor matching threshold (or your threshold if you’re using GedMatch) and if the threshold is lowered a bit, they continue to match.  They may also be a marginal match on the edge, so to speak or they may have a read error at a critical location in their kit.

What “in common with” matching does is to increase your confidence that these are indeed ancestral matches, a cousin cluster, but it’s not yet triangulation.

Ancestry Matches

Ancestry has added another level of matching into the mix.  The difference is, of course, that you can’t see any segment data at all, at Ancestry, so you don’t have anything other than the fact that you do match the other person and if you have a shakey leaf hint, you also share a common ancestor in your trees.

confidence ancestry matches

When three people match each other on any segment (meaning this does not infer a common segment match) and also share a common ancestor in a tree, they qualify to be a DNA Circle.  However, there is other criteria that is weighted and not every group of 3 individuals who match and share an ancestor becomes a DNA Circle.  However, many do and many Circles have significantly more than three individuals.

confidence Phoebe Crumley circle

This DNA Circle is for Phebe Crumley, one of my Crumley ancestors.  In this grouping, I match one close family group of 5 people, and one individual, Alyssa, all of whom share Phebe Crumley in their trees.  As luck would have it, the family group has also tested at Family Tree DNA and has downloaded their results to GedMatch, but as it stands here at Ancestry, with DNA Circle data only…the only thing I can do is to add them to my match list.

confidence match list 3

In case you’re wondering, the reason I only added three of the 5 family members of the Abija group to my match list is because two are children of one of the members and their Crumley DNA is represented through their parent.

While a small DNA Circle like Phebe Crumley’s can be incorrect, because the individuals can indeed be sharing the DNA of a different ancestor, a larger group gives you more confidence that the relationship to that group of people is actually through the common ancestor whose circle you are a member of.  In the example Circle shown below, I match 6 individuals out of a total of 21 individuals who are all interrelated and share Henry Bolton in their tree.

Confidence Henry Bolton circle

New Ancestor Discoveries

Ancestry introduced New Ancestor Discoveries (NADs) a few months ago.  This tool is, unfortunately, misnamed – and although this is a good concept for finding people whose DNA you share, but whose tree you don’t – it’s not mature yet.

The name causes people to misinterpret the “ancestors” given to them as genuinely theirs.  So far, I’ve had a total of 11 NADS and most have been easily proven false.

Here’s how NADs work.  Let’s say there is a DNA Circle, John Doe, of 3 people and you match two of them.  The assumption is that John Doe is also your ancestor because you share the DNA of his descendants.  This is a critically flawed assumption.  For example, in one case, my ancestors sister’s husband is shown as my “new ancestor discovery” because I share DNA with his descendants (through his wife, my ancestor’s sister.)  Like I said, not mature yet.

I have discussed this repeatedly, so let’s just suffice it to say for this discussion, that there is absolutely no confidence in NADs and they aren’t relevant.

Shared Matches

Ancestry recently added a Shared Matches function.

For each person that you match at Ancestry, that is a 4th cousin or closer and who has a high confidence match ranking, you can click on shared matches to see who you and they both match in common.

confidence ancestry shared matches

This does NOT mean you match these people through the same ancestor.  This does NOT mean you match them on the same segment.  I wrote about how I’ve used this tool, but without additional data, like segment data, you can’t do much more with this.

What I have done is to build a grid similar to the Family Tree DNA matrix where I’ve attempted to see who matches whom and if there is someone(s) within that group that I can identify as specifically descending from the same ancestor.  This is, unfortunately, extremely high maintenance for a very low return.  I might add someone to my match list if they matched a group (or circle) or people that match me, whose common ancestor I can clearly identify.

Shared Matches are the lowest item on the confidence chart – which is not to say they are useless.  They can provide hints that you can follow up on with more precise tools.

Let’s move to the highest confidence tool, triangulation groups.

Triangulation Groups

Of course, the next step, either at 23andMe, Family Tree DNA, through GedMatch, or some combination of each, is to compare the actual segments of the individuals involved.  This means, especially at Ancestry where you have no tools, that you need to develop a successful begging technique to convince your matches to download their data to GedMatch or Family Tree DNA, or both.  Most people don’t, but some will and that may be the someone you need.

You have three triangulation options:

  1. If you are working with the Family Inheritance Advanced at 23andMe, you can compare each of your matches with each other. I would still invite my matches to download to GedMatch so you can compare them with people who did not test at 23andMe.
  2. If you are working with a group of people at Family Tree DNA, you can ask them to run themselves against each other to see if they also match on the same segment that they both match you on. If you are a project administrator on a project where they are all members, you can do this cross-check matching yourself. You can also ask them to download their results to GedMatch.
  3. If your matches will download their results to GedMatch, you can run each individual against any other individual to confirm their common segment matches with you and with each other.

In reality, you will likely wind up with a mixture of matches on your match list and not everyone will upload to GedMatch.

Confirming that segments create a three way match when you share a common ancestor constitutes proof that you share that common ancestor and that particular DNA has been passed down from that ancestor to you.

confidence match list 4

I’ve built this confidence table relative to matches first found at Family Tree DNA, adding matches from Ancestry and following them to GedMatch.  Fortunately, the Abija group has tested at all 3 companies and also uploaded their results to GedMatch.  Some of my favorite cousins!

Spectrum of Confidence

Blaine Bettinger built this slide that sums up the tools and where they fall on the confidence range alone, without considerations of your goals and technical factors such as segment size.  Thanks Blaine for allowing me to share it here.

confidence level Blaine

These tools and techniques fall onto a spectrum of confidence, which I’ve tried to put into perspective, below.

confidence level highest to lowest

I really debated how to best show these.  Unfortunately, there is almost always some level of judgment involved. In some cases, like triangulation at the 3 vendors, the highest level is equivalent, but in other cases, like the medium range, it really is a spectrum from lowest to highest within that grouping.

Now, let’s take a look at our matches that we’ve added to our match list in confidence order.

confidence match list 5

As you would expect, those who triangulated with each other using some chromosome browser and share a common ancestor are the highest confidence matches – those 5 with a red Y.  These are followed by matches who match me and each other but not on the same segment (or at least we don’t know that), so they don’t triangulate, at least not yet.

I didn’t include any low confidence matches in this table, but of the lowest ones that are included, the shakey leaf matches at Ancestry that won’t answer inquiries and the matches at FTDNA who do share a common surname but didn’t download their information to be triangulated are the least confident of the group.  However, even those lower confidence matches on this chart are medium, meaning at Ancestry they are in a Circle and at FTDNA, they do match and share a common surname.  At Family Tree DNA, they may eventually fall into a triangulation group of other descendants who triangulate.

Caveats

As always, there are some gotchas.  As someone said in something I read recently, “autosomal DNA is messy.”

Endogamy

Endogamous populations are just a mess.  The problem is that literally, everyone is related to everyone, because the founder population DNA has just been passed around and around for generations with little or no new DNA being introduced.

Therefore, people who descend from endogamous populations often show to be much more closely related than they are in a genealogical timeframe.

Secondly, we have the issue pointed out by David Pike, and that is when you really don’t know where a particular segment came from, because the segment matches both the parents, or in some cases, multiple grandparents.  So, which grandparent did that actual segment that descended to the grandchild descend from?

For people who are from the same core population on both parent’s side, close matches are often your only “sure thing” and beyond that, hopefully you have your parents (at least one parent) available to match against, because that’s the only way of even beginning to sort into family groups.  This is known as phasing against your parents and while it’s a great tool for everyone to use – it’s essential to people who descend from endogamous groups. Endogamy makes genetic genealogy difficult.

In other cases, where you do have endogamy in your line, but only in one of your lines, endogamy can actually help you, because you will immediately know based on who those people match in addition to you (preferably on the same segment) which group they descend from.  I can’t tell you how many rows I have on my spreadsheet that are labeled with the word “Acadian,” “Brethren” and “Mennonite.”  I note the common ancestor we can find, but in reality, who knows which upstream ancestor in the endogamous population the DNA originated with.

Now, the bad news is that Ancestry runs a routine that removes DNA that they feel is too matchy in your results, and most of my Acadian matches disappeared when Ancestry implemented their form of population based phasing.

Identical by Population

There is sometimes a fine line between a match that’s from an ancestor one generation further back than you can go, and a match from generations ago via DNA found at a comparatively high percentage in a particular population.  You can’t tell the difference.  All you know is that you can’t assign that segment to an ancestor, and you may know it does phase against a parent, so it’s valid, meaning not IBC or identical by chance.

Yes, identical by population segment matching is a distinct problem with endogamy, but it can also be problematic with people from the same region of the world but not members of endogamous populations.  Endogamy is a term for the timeframe we’re familiar with.  We don’t know what happened before we know what happened.

From time to time, you’ll begin to see something “odd” happened where a group of segments that you already have triangulated to one ancestor will then begin to triangulate to a second ancestor.  I’m not talking about the normal two groups for every address – one from your Mom’s side and one from your Dad’s.  I’m talking, for example, when my Mom’s DNA in a particular area begins to triangulate to one ancestral group from Germany and one from France.  These clearly aren’t the same ancestors, and we know that one particular “spot” or segment range that I received from her DNA can only come from one ancestor.  But these segment matches look to be breaking that rule.

I created the example below to illustrate this phenomenon.  Notice that the top and bottom 3 all match nicely to me and to each other and share a common ancestor, although not the same common ancestor for the two groups.  However, the range significantly overlaps.  And then there is the match to Mary Ann in the middle whose common ancestor to me is unknown.

confidence IBP example

Generally, we see these on smaller segment groups, and this is indicative that you may be seeing an identical by population group.  Many people lump these IBP (identical by population) groups in with IBC, identical by chance, but they aren’t.  The difference is that the DNA in an IBP group truly is coming from your ancestors – it’s just that two distinct groups of ancestors have the same DNA because at some point, they shared a common ancestor.  This is the issue that “academic phasing” (as opposed to parental phasing) is trying to address.  This is what Ancestry calls “pileup areas” and attempts to weed out of your results.  It’s difficult to determine where the legitimate mathematical line is relative to genealogically useful matches versus ones that aren’t.  And as far as I’m concerned, knowing that my match is “European” or “Native” or “African” even if I can’t go any further is still useful.

Think about this, if every European has between 1 and 4% Neanderthal DNA from just a few Neanderthal individuals that lived more than 20,000 years ago in Europe – why wouldn’t we occasionally trip over some common DNA from long ago that found its way into two different family lines.

When I find these multiple groupings, which is actually relatively rare, I note them and just keep on matching and triangulating, although I don’t use these segments to draw any conclusions until a much larger triangulated segment match with an identified ancestor comes into play.  Confidence increases with larger segments.

This multiple grouping phenomenon is a hint of a story I don’t know – and may never know.  Just because I don’t quite know how to interpret it today doesn’t mean it isn’t valid.  In time, maybe its full story will be revealed.

ROH – Runs of Homozygosity

Autosomal DNA tests test someplace over 500,000 locations, depending on the vendor you select.  At each of those locations, you find a value of either T, A, C or G, representing a specific nucleotide.  Sometimes, you find runs of the same nucleotide, so you will find an entire group of all T, for example.  If either of your parents have all Ts in the same location, then you will match anyone with any combination of T and anything else.

confidence homozygosity example

In the example above, you can see that you inherited T from both your Mom and Dad.  Endogamy maybe?

Sally, although she will technically show as a match, doesn’t really “match” you.  It’s just a fluke that her DNA matches your DNA by hopping back and forth between her Mom’s and Dad’s DNA.  This is not a match my descent, but by chance, or IBC (identical by chance.)  There is no way for you to know this, except by also comparing your results to Sally’s parents – another example of parental phasing.  You won’t match Sally’s parents on this segment, so the segment is IBC.

Now let’s look at Joe.  Joe matches you legitimately, but you can’t tell by just looking at this whether Joe matches you on your Mom’s or Dad’s side.  Unfortunately, because no one’s DNA comes with a zipper or two sides of the street labeled Mom and Dad – the only way to determine how Joe matches you is to either phase against Joe’s parents or see who else Joe matches that you match, preferable on the same segment – in other words – create either a match or ICW group, or triangulation.

Segment Size

Everyone is in agreement about one thing.  Large segments are never IBC, identical by chance.  And I hate to use words like never, so today, interpret never to mean “not yet found.”  I’ve seen that large segment number be defined both 13cM and 15cM and “almost never” over 10cM.  There is currently discussion surrounding the X chromosome and false positives at about this threshold, but the jury is still out on this one.

Most medium segments hold true too.  Medium segment matches to multiple people with the same ancestors almost always hold true.  In fact, I don’t personally know of one that didn’t, but that isn’t to say it hasn’t happened.

By medium segments, most people say 7cM and above.  Some say 5cM and above with multiple matching individuals.

As the segment size decreases, the confidence level decreases too, but can be increased by either multiple matches on that segment from a common proven ancestor or, of course, triangulation.  Phasing against your parent also assures that the match is not IBD.  As you can see, there are tools and techniques to increase your confidence when dealing with small segments, and to eliminate IBC segments.

The issue of small segments, how and when they can be utilized is still unresolved.  Some people simply delete them.  I feel that is throwing the baby away with the bathwater and small segments that triangulate from a common ancestor and that don’t find themselves in the middle of a pileup region that is identical by population or that is known to be overly matchy (near the center of chromosome 6, for example) can be utilized.  In some cases, these segments are proven because that same small segment section is also proven against matches that are much larger in a few descendants.

Tim Janzen says that he is more inclined to look at the number of SNPs instead of the segment size, and his comfort number is 500 SNPs or above.

The flip side of this is, as David Pike mentioned, that the fewer locations you have in a row, the greater the chance that you can randomly match, or that you can have runs of heterozygosity.

No one in our discussion group felt that all small segments were useless, although the jury is still out in terms of consensus about what exactly defines a small segment and when they are legitimate and/or useful.  Everyone of us wants to work towards answers, because for those of us who are dealing with colonial ancestors and have already picked the available low hanging fruit, those tantalizing small segments may be all that is left of the ancestor we so desperately need to identify.

For example, I put together this chart detailing my matching DNA by generation. Interesting, I did a similar chart originally almost exactly three years ago and although it has seemed slow day by day, I made a lot of progress when a couple of brick walls fell, in particular, my Dutch wall thanks to Yvette Hoitink.

If you look at the green group of numbers, that is the amount of shared DNA to be expected at each level.  The number of shared cMs drops dramatically between the 5th and 6th generation from 13 cM which would be considered a reasonable matching level (according to the above discussion) at the 5th generation, and 3.32 cM at the 6th generation level, which is a small segment by anyone’s definition.

confidence segment size vs generation

The 6th generation was born roughly in 1760, and if you look to the white grouping to the right of the green group, you can see that my percentage of known ancestors is 84% in the 5th generation, 80% in the 6th generation, but drops quickly after that to 39, 22 and 3%, respectively.  So, the exact place where I need the most help is also the exact place where the expected amount of DNA drops from 13 to 3.32 cM.  This means, that if anyone ever wants to solve those genealogical puzzles in that timeframe utilizing genetic genealogy, we had better figure out how to utilize those small segments effectively – because it may well be all we have except for the occasional larger sticky segment that is passed intact from an ancestor many generations past.

From my perspective, it’s a crying shame that Ancestry gives us no segment data and it’s sad that 23andMe only gives us 5cM and above.  It’s a blessing that we can select our own threshold at GedMatch.  I’m extremely grateful that FTDNA shows us the small segment matches to 1cM and 500 SNPs if we also match on 20cM total and at least one segment over 7cM.  That’s a good compromise, because small segments are more likely to be legitimate if we have a legitimate match on a larger segment and a known ancestor.  We already discussed that the larger the matching segment, the more likely it is to be valid. I would like to see Family Tree DNA lower the matching threshold within projects.  Surname projects imply that a group of people will be expected to match, so I’d really like to be able to see those lower threshold matches.

I’m hopeful that Family Tree DNA will continue to provide small segment information to us.  People who don’t want to learn how to use or be bothered with small segments don’t have to.  Delete is perfectly legitimate option, but without the data, those of us who are interested in researching how to best utilize these segments, can’t.  And when we don’t have data to use, we all lose.  So, thank you Family Tree DNA.

Coming Full Circle

This discussion brings us full circle once again to goals.

Goals change over time.

My initial reason for testing, the first day an autosomal test could be ordered, was to see if my half-brother was my half-brother.  Obviously for that, I didn’t need matching to other people or triangulation.  The answer was either yes or no, we do match at the half-sibling level, or we don’t.

He wasn’t.  But by then, he was terminally ill, and I never told him.  It certainly explained why I wasn’t a transplant match for him.

My next goal, almost immediately, was to determine which if either my brother or I were the child of my father.  For that, we did need matching to other people, and preferably close cousins – the closer the better.  Autosomal DNA testing was new at that time, and I had to recruit cousins.  Bless those who took pity on me and tested, because I was truly desperate to know.

Suffice it to say that the wait was a roller coaster ride of emotion.

If I was not my father’s child, I had just done 30+ years of someone else’s genealogy – not a revelation I relished, at all.

I was my father’s child.  My brother wasn’t.  I was glad I never told him the first part, because I didn’t have to tell him this part either.

My goal at that point changed to more of a general interest nature as more cousins tested and we matched, verifying different lineages that has been unable to be verified by Y or mtDNA testing.

Then one day, something magical happened.

One of my Y lines, Marcus Younger, whose Y line is a result of a NPE, nonparental event, or said differently, an undocumented adoption, received amazing information.  The paternal Younger family line we believed Marcus descended from, he didn’t.  However, autosomal DNA confirmed that even though he is not the paternal child of that line, he is still autosomally related to that line, sharing a common ancestor – suggesting that he may have been born of a Younger female and given that surname, while carrying the Y DNA of his biological father, who remains unidentified.

Amazingly, the next day, a match popped up that matched me and another Younger relative.  This match descended not from the Younger line, but from Marcus Younger’s wife’s alleged surname family.  I suddenly realized that not only was autosomal DNA interesting for confirming your tree – it could also be used to break down long-standing brick walls.  That’s where I’ve been focused ever since.

That’s a very different goal from where I began, and my current goal utilizes the tools in a very different way than my earlier goals.  Confidence levels matter now, a great deal, where that first day, all I wanted was a yes or no.

Today, my goal, other than breaking down brick walls, is for genetic genealogy to become automated and much easier but without taking away our options or keeping us so “safe” that we have no tools (Ancestry).

The process that will allow us to refine genetic genealogy and group individuals and matches utilizing trees on our desktops will ultimately be the key to unraveling those distant connections.  The data is there, we just have to learn how to use it most effectively, and the key, other than software, is collaboration with many cousins.

Aside from science and technology, the other wonderful aspect of autosomal DNA testing is that is has the potential to unite and often, reunite families who didn’t even know they were families.  I’ve seen this over and over now and I still marvel at this miracle given to us by our ancestors – their DNA.

So, regardless of where you fall on the goals and matching confidence spectrum in terms of genetic genealogy, keep encouraging others to test and keep reaching out and sharing – because it takes a village to recreate an ancestor!  No one can do it alone, and the more people who test and share, the better all of our chances become to achieve whatever genetic genealogy goals we have.


Get Your Ducks in a Row – Time May Be Shorter Than You Think

$
0
0

ducks

Helen Rutledge is my cousin.  She and I have been sniffing around the same records in the same counties for many years now.  I only wish we had met earlier so we could have shared more of the chase.

Helen is no “spring chicken” as we say on the farm.  In fact, Helen has continued to research far into her golden years – being in her 90s now.  Want to hear the great irony? Helen has no children to leave her work to – but this does not deter her.  Helen is the aunt that every one of us wants to have in our family.

Recently Helen sent me an e-mail that both saddened me and inspired me, and with her permission, I’m sharing it with you.  I have omitted some of the more personal portions.

After 13 days in the hospital I returned to long term nursing care. I brought my computer and genealogy records from Assisted Living to my new level of care. However, now instead of researching, I am organizing my research to leave for my nephew and some research archives. I have been forewarned in the most urgent way that there may not be time to think about how I will do this when the research is done. Well, we all know research is never finished.

Keep urging perseveration of research on your blog. It is as important as the research itself. Answers are no good if I am the only one who knows the answer to the puzzle…I must share it with others whether they give me credit or not. I thank you for alerting me to that truth and God for allowing me extended days to get my records in order as a gift to other researchers. Oh, the many little tidbits I have garnered, documented, and put together for those who follow in my footsteps with our family lineage.

Organization is not just entering our data into a genealogy program. It is documenting, making copies of the documents available when possible, and recording the ORDER of our research so those who are not familiar with the records, can follow the generations and become acquainted with their ancestors.

Be honest, say information is not documented, when such is the case, and challenge your readers to find documentation. Try to inspire descendants to fill in the blanks and record those who are yet unborn. While they will miss the thrill of solving the puzzle after years of frustration, they will know the joy of learning who they are.

Thank you, Helen, for your lovely, inspirational message. Sometimes we aren’t fortunate enough to receive a warning.

Another e-mail this week told of another cousin’s husband who died suddenly, with no warning, and he was 30 years younger than Helen.

DNA in Perpetuity

I would add one thing though, and that is to record your user names and passwords – especially relative to DNA accounts and tests and anyplace, like GedMatch, you have uploaded your results.  Your DNA can never, and I repeat, NEVER, be replaced, while genealogy research could be with enough effort.  Don’t let your DNA results become inaccessible.

At Family Tree DNA, you can designate a beneficiary.

On your personal page, under “Your Account” on the left hand side, select “Manage Personal Information.”

ducks2

Then select Beneficiary Information and complete the form which includes your beneficiary’s name, e-mail and phone number.  If you should pass away, this is who Family Tree DNA will allow to access your account.  Other companies, to the best of my knowledge don’t include this information or provide this option, so you’ll need to be sure to leave your account access information available for your family members.

ducks3

If you have not prepared for the inevitable, please take a few minutes to do so.   You can make the DNA arrangements now, and easily.

Remember, at Ancestry, your DNA won’t be available unless your account (subscription and login) remains active, so you’ll need to take how to handle that into consideration.

You might want to download not just your raw data files, but matches as well when possible.

Public Sites

Upload your Y and mitochondrial DNA to sites like http://www.ysearch.org and http://www.mitosearch.org.  Be sure to record the most distant ancestor and enough information to positively identify them, like birth and death dates, locations and spouse’s name.  This is the only way to get your info into a public data base that is accessible without having DNA tested for a match.  You can also enter Y and mito info at http://www.wikitree.com and attach it to the proper ancestor.  This helps others in the future learn about their ancestors.  Be sure to include your full haplogroup in the notes and a link to anything you may have published about that line.

Upload your autosomal results to http://www.gedmatch.com and upload trees where possible.

Don’t put all of your eggs in one basket, because baskets aren’t forever either.  Think about how many genealogy companies have come and gone and what happened to our Y and mitochondrial DNA with both Ancestry and Sorenson (also destroyed by Ancesty).

Genealogy Research

You can take a few minutes to put together a plan for how to preserve and present the balance of your genealogy information.  Preserving and publishing my genealogy research has been on my bucket list for some time now and is the purpose of the 52 Ancestors articles I’ve been writing for the past 18 months.  I’ll write them until every ancestor is covered….or I can no longer write the articles – and I sincerely hope I have the opportunity to finish.  Not just for my own sake, but for the benefit of everyone else who follows.  I hope future researchers make huge breakthroughs and add immensely to what I know today.  My work will at least give them a firm foundation to start from and they won’t have to replow the same ground.

One of the avenues to preserve your work online is a blog.  WordPress offers free blogs and they will be available into perpetuity, whatever that really means.  I am also printing my articles and will be donating them to archival facilities like the Allen County Public Library.  And of course, I’ll have a set of binders for each of my children.

WikiTree is another public resource for your trees, your Y and mtDNA results and additional information, although that’s not the same as offering the detail in an article.

So, however you choose to do whatever you choose to do… just do it.

And do it now.

You may not have an opportunity later.

Time may be shorter than you think.

Get your ducks in a row.


Viewing all 367 articles
Browse latest View live