The so-called “streetlight effect” has often fettered scientists who study complex hereditary diseases. The term refers to an old joke about a drunk searching for his lost keys under a streetlight. A cop asks, “Are you sure this is where you lost them?” The drunk says, “No, I lost them in the park, but the light is better here.”
For researchers who study the genetic roots of human diseases, most of the light has shone down on the 2 percent of the human genome that includes protein-coding DNA sequences. “That’s fine. Lots of diseases are caused by mutations there, but those mutations are low-hanging fruit,” says University of Toronto (U.T.) professor Brendan Frey, who studies genetic networks. “They’re easy to find because the mutation actually changes one amino acid to another one, and that very much changes the protein.”
The trouble is, many disease-related mutations also happen in noncoding regions of the genome, the parts that do not directly make proteins but that still regulate how genes behave. Scientists have long been aware of how valuable it would be to analyze the other 98 percent, but there has not been a practical way to do it.
Now Frey has developed a “deep-learning” machine algorithm that effectively shines a light on the entire genome. A paper appearing December 18 in Science describes how this algorithm can identify patterns of mutation across coding and noncoding DNA alike. The algorithm can also predict how likely each variant is to contribute to a given disease. “Our method works very differently from existing methods,” says Frey, the study’s lead author. “GWAS-, QTL- and ENCODE-type approaches can’t figure out causal relationships. They can only correlate. Our system can predict whether or not a mutation will cause a change in RNA splicing that could lead to a disease phenotype.”
RNA splicing is one of the major steps in turning genetic blueprints into living organisms. Splicing determines which bits of DNA code get included in the messenger-RNA strings that build proteins. Different configurations yield different proteins. Misregulated splicing contributes to an estimated 15 to 60 percent of human genetic diseases.
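To make the idea of "different configurations yield different proteins" concrete, here is a minimal Python sketch of exon skipping. It is a toy illustration only and has nothing to do with Frey's algorithm: the three exon sequences, the `splice` and `translate` helpers, and the six-entry codon table are all invented for this example, although the codon-to-amino-acid assignments themselves follow the standard genetic code.

```python
# Toy model of alternative splicing. The exon sequences are made up;
# only the codon assignments come from the standard genetic code.
CODON_TABLE = {
    "AUG": "M",  # methionine (start)
    "GCU": "A",  # alanine
    "GGA": "G",  # glycine
    "UUU": "F",  # phenylalanine
    "AAA": "K",  # lysine
    "UAA": "*",  # stop
}

# Hypothetical exons of one gene, already transcribed to RNA.
EXONS = ["AUGGCU", "GGAUUU", "AAAUAA"]


def splice(exons, included):
    """Join the exons whose indices are listed in `included`, in order."""
    return "".join(exons[i] for i in included)


def translate(mrna):
    """Read the mRNA three bases at a time until a stop codon appears."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Same gene, two splicing configurations.
full_protein = translate(splice(EXONS, [0, 1, 2]))   # all exons kept
skipped_protein = translate(splice(EXONS, [0, 2]))   # middle exon skipped

print(full_protein)     # MAGFK
print(skipped_protein)  # MAK
```

In this sketch, leaving out the middle exon shortens the protein from five amino acids to three. Real splicing decisions depend on regulatory sequences, many of them in noncoding DNA, which is why a mutation outside a gene's coding region can still change the protein that gets built.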
The combination of whole-genome analysis and predictive models for RNA splicing makes Frey’s method a major contribution to the field, according to Stephan Sanders, an assistant professor at the University of California, San Francisco, School of Medicine. “I’m looking forward to using this tool in larger data sets and really getting a sense of how important splicing is,” he says. Sanders, who researches the genetic causes of diseases, notes that Frey’s approach complements, rather than replaces, other methods of genetic analysis. “I think any genomist [sic] would agree that noncoding [areas of the genome] are hugely important. This method is a really novel way of getting at that,” he says.
Source: www.scientificamerican.com