One of the most persuasive pieces of evidence of the common ancestry of all life on Earth is genetic overlap -- and the fact that the percent overlap gets higher when you compare more recently-diverged species.
What is downright astonishing, though, is that there is genetic overlap between all life on Earth. Yeah, okay, it's easy enough to imagine there being genetic similarity between humans and gorillas, or dogs and foxes, or peaches and plums; but what about more distant relationships? Are there shared genes between humans... and bacteria?
The answer, amazingly, is yes, and the analysis of these universal paralogs was the subject of a fascinating paper in the journal Cell Genomics last week. Pick any two organisms on Earth -- choose them to be as distantly-related to each other as you can, if you like -- and they will still share five groups of genes, used for making the following classes of enzymes:
- aminotransferases
- imidazole-4-carboxamide isomerase
- carbamoyl phosphate synthetases
- aminoacyl-tRNA synthetases
- initiation facter IF2
The first three are connected with amino acid metabolism; the last two, with the process of translation -- which decodes the message in mRNA and uses it to synthesize proteins.
The fact that all life forms on Earth have these five gene groups suggests something wild; that we're looking at genes that were present in LUCA -- the Last Universal Common Ancestor, our single-celled, bacteria-like forebear that lived in the primordial seas an estimated four billion years ago. Since then, two things happened -- the rest of LUCA's genome diverged wildly, under the effects of mutation and selection, so that now we have kitties and kangaroos and kidney beans; and those five gene groups were under such extreme stabilizing selection that they haven't significantly changed, in any of the branches of the tree of life, in millions or billions of generations.
The authors write:
Universal paralog families are an important tool for understanding early evolution from a phylogenetic perspective, offering a unique and valuable form of evidence about molecular evolution prior to the LUCA. The phylogenetic study of ancient life is constrained by several fundamental limitations. Both gene loss across multiple lineages and low levels of conservation in some gene families can obscure the ancient origin of those gene families. Furthermore, in the absence of an extensive diagnostic fossil record, the dependence of molecular phylogenetics on conserved gene sequences means that periods of evolution that predated the emergence of the genetic system cannot be studied. Even so, emerging technologies across a number of areas of computational biology and synthetic biology will expand our ability to reconstruct pre-LUCA evolution using these protein families. As our understanding of the LUCA solidifies, universal paralog protein families will provide an indispensable tool for pushing our understanding of early evolutionary history even further back in time, thereby describing the foundational processes that shaped life as we know it today.
