Author: Marcus W. Feldman, PhD

Insofar as the genetic evolution of modern humans is concerned, large-scale SNP studies of worldwide populations have provided a consistent picture of a migration out of Africa that gave rise to the human populations of the other continents. This migration probably began 60–80 kya, was probably not continuous, and could have split into two divisions during the passage through the Levant en route from east Africa. One division may have moved in a more southerly direction towards south and east Asia, possibly to Australia, and eventually, 15–30 kya, into the Americas. The other division may have “turned left” and moved towards Europe.

In this process, which we call the “serial founder” model of human expansion (refs. 1, 2), migration and demography probably had effects that constrained the subsequent action of natural selection on human genes.
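The loss of diversity implied by a chain of founder events can be sketched numerically. The example below is a minimal illustration, not the analysis used in refs. 1 and 2. It relies on the standard population-genetic result that sampling N diploid founders reduces expected heterozygosity by a factor of 1 − 1/(2N), and it assumes that every founder event has the same size and that mutation, later drift, and admixture can be ignored. The function name and all numerical values are hypothetical.

```python
def heterozygosity_after_founders(h0, founders, events):
    """Expected heterozygosity after 0..events successive founder events.

    Assumption: each event samples `founders` diploid individuals, so the
    expected heterozygosity shrinks by (1 - 1/(2 * founders)) per event.
    Mutation, subsequent drift, and admixture are deliberately ignored.
    """
    factor = 1.0 - 1.0 / (2.0 * founders)
    return [h0 * factor ** k for k in range(events + 1)]


# Hypothetical values chosen only for illustration.
trajectory = heterozygosity_after_founders(h0=0.30, founders=50, events=10)
for k, h in enumerate(trajectory):
    print(f"founder event {k:2d}: expected heterozygosity = {h:.4f}")
```

Under these assumptions diversity declines geometrically with the number of founder events, which is one way in which demography alone can shape the variation on which selection later acts.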

  • Variation in skin pigmentation genes today provides some of the strongest signals of natural selection during this human expansion.
  • It is also likely, however, that immune response genes, e.g., MHC genes, achieved their high levels of polymorphism in response to new pathogens encountered in the great expansion.

Many of the strongest signals of natural selection indicate the importance of the innovations of farming and pastoralism. The gene sequences involved in lactose tolerance and starch metabolism, for example, differ strikingly between groups that adopted dairying or farming, respectively, and hunter-gatherers, who did not.

From the analysis of SNPs, I take home two messages.

  • The first is that although some parts of the genome show clear signals of selection, most of our DNA as surveyed by SNPs does not.
  • The second is that population growth and migration have been major forces in determining the patterns of variation.
  • Indeed, recent analyses of exome sequences confirm that the spectrum of rare allele frequencies is compatible only with recent and rapid population growth (ref. 3).
  • Consistent with the first message, recent analyses of the 1000 Genomes data, that is, data from whole genome sequencing of one thousand human genomes representing Africa (Yoruba), Europe (from Utah), and East Asia (China and Japan), identified only 35 non-synonymous SNPs from 33 genes as having been subject to recent adaptive selection (ref. 4).

The next phase of genomic analysis of humans, complete exome sequencing of large cohorts, or whole genome sequencing of samples from many representative populations, will focus more on two themes.

  • The first will be the role of rare alleles in human phenotypes, especially diseases. The previous phase, GWAS (genome-wide association studies), has been disappointing in revealing genetic “causes” of complex traits.
  • However, my view is that the second theme, the molecular genetics of gene regulation, and interaction of this regulation with the environment, is likely to have bigger payoffs, not only for determination of phenotypes, but also in showing where in the genome the strongest signals of selection lie. As more methylation profiles, small RNA patterns of interference, and other gene-regulatory analyses of whole genomes are completed, both the medical and evolutionary significance of DNA variation will become clearer.

1.  Cavalli-Sforza, L. L., and M. W. Feldman. 2003. The application of molecular genetic approaches to the study of human evolution. Nat. Genet. Suppl. 33: 266–275.

2.  Henn, B. M., L. L. Cavalli-Sforza, and M. W. Feldman. 2012. The great human expansion. Proc. Natl. Acad. Sci. USA 109: 17758–17764.

3.  Keinan, A., and A. G. Clark. 2012. Recent explosive human population growth has resulted in an excess of rare genetic variants. Science 336: 740–743.

4.  Grossman, S. R., K. G. Andersen, I. Shlyakhter, S. Tabrizi, S. Winnicki, A. Yen, D. J. Park, D. Griesemer, E. K. Karlsson, S. H. Wong, M. Cabili, R. A. Adegbola, R. N. K. Bamezai, A. V. S. Hill, F. O. Vannberg, J. L. Rinn, 1000 Genomes Project, E. S. Lander, S. F. Schaffner, and P. C. Sabeti. 2013. Identifying recent adaptations in large-scale genomic data. Cell 152: 703–713.


