Feeds:
Posts
Comments

Posts Tagged ‘RNA’

Long Noncoding RNA Network regulates PTEN Transcription

Author: Larry H Bernstein, MD, FCAP

Scientists Find Surprising New Influence On Cancer Genes

A pseudogene long noncoding RNA networkregulates PTEN transcription and translation in human cells
Per Johnsson, A Ackley, L Vidarsdottir, Weng-Onn Lui, M Corcoran, D Grandér, and KV Morris
a new study led by scientists at The Scripps Research Institute (TSRI) shows how
  • pseudogenes can regulate the activity of a cancer-related gene called PTEN.
The study also shows that pseudogenes can be targeted to control PTEN’s activity.

Mol Cancer. 2011; 10: 38.   Published online 2011 April 13. doi:  10.1186/1476-4598-10-38    PMCID: PMC3098824

New Type of Gene That Regulates Tumour Suppressor PTEN Identified

Feb. 24, 2013 — Researchers at Karolinska Institutet in Sweden have identified a new so-called pseudogene that regulates the tumour-suppressing PTEN gene.
They hope that this pseudogene will be able to control PTEN to

  1. reverse the tumour process,
  2. make the cancer tumour more sensitive to chemotherapy and
  3. prevent the development of resistance.

The findings, which are published in the scientific journal Nature Structural and Molecular Biology, can be of significance in

    • the future development of cancer drugs.

The development of tumours coincides with the activation of several cancer genes as well as the inactivation of other tumour-suppressing genes owing to

  1. damage to the DNA and
  2. to the fact that
    • the cancer cells manage to switch off the transcription of tumour-suppressor genes.

To identify what might be regulating this silencing, the researchers studied PTEN,

    • one of the most commonly inactivated tumour-suppressor genes.

It has long been believed that the switching-off process is irreversible, but the team has now shown that

  • silenced PTEN genes in tumour cells can be ‘rescued’ and
  • re-activated by a ‘pseudogene’,
    • a type of gene that, unlike normal genes,
    • does not encode an entire protein.

“We identified a new non-protein encoding pseudogene, which

  • determines whether the expression of PTEN
    • is to be switched on or off,”

says research team member Per Johnsson, at Karolinska Institutet’s Department of Oncology-Pathology. “What makes this case spectacular is that the gene

  • only produces RNA,
  • the protein’s template.

It is this RNA that, through a sequence of mechanisms,

    • regulates PTEN.

Pseudogenes have been known about for many years, but

  • it was thought that they were only junk material.”

No less than 98 per cent of human DNA consists of non-protein encoding genes (i.e. pseudogenes), and by studying these formerly neglected genes the researchers

  • have begun to understand that they are very important and
    • can have an effect without encoding proteins.

Using model systems, the team has shown that the new pseudogene can

  • control the expression of PTEN and
    • make tumours more responsive to conventional chemotherapy.

Pre Johnssom suggests “we might one day be able to re-programme cancer cells

  • to proliferate less,
  • become more normal, and that
  • resistance to chemotherapy can hopefully be avoided.

“We also believe that our findings can be very important for the future development of cancer drugs.  The human genome conceals no less than 15,000 or so pseudogenes, and it’s not unreasonable to think

  • that many of them are relevant to diseases such as cancer.”

The study was conducted in collaboration with scientists at The Scripps Research Institute, USA, and the University of New South Wales, Australia, and was made possible with

  • grants from the Swedish Childhood Cancer Foundation, the Swedish Cancer Society, the Cancer Research Funds of Radiumhemmet, Karolinska Institutet’s KID programme for doctoral studies, the Swedish Research Council, the Erik and Edith Fernström Foundation for Medical Research, the National Institute of Allergy and Infectious Diseases, the National Cancer Institute and the National Institutes of Health.

The functional role of long non-coding RNA in human carcinomas
EA Gibb, CJ Brown, and WL Lam
Long non-coding RNAs (lncRNAs) are emerging as new players in the cancer paradigm demonstrating potential roles in both oncogenic and tumor suppressive pathways. These novel genes are frequently

    • aberrantly expressed in a variety of human cancers,

however the biological functions of the vast majority remain unknown. Recently, evidence has begun to accumulate describing the molecular mechanisms by which these RNA species function, providing insight into

    • the functional roles they may play in tumorigenesis.

In this review, we highlight the emerging functional role of lncRNAs in human cancer.

One of modern biology’s great surprises was the discovery that the human genome encodes only ~20,000 protein-coding genes, representing <2% of the total genome sequence [1,2]. However, with the advent of

  • tiling resolution genomic microarrays and
  • whole genome and transcriptome sequencing technologies
    • it was determined that at least 90% of the genome is actively transcribed [3,4].

The human transcriptome was found to be more complex than

  • a collection of protein-coding genes and their splice variants; showing
    • extensive antisense,
    • overlapping and non-coding RNA (ncRNA) expression [5-10].

Although initially argued to be spurious transcriptional noise, recent evidence suggests that the proverbial “dark matter” of the genome

  • may play a major biological role in cellular development and metabolism [11-17].

One such player, the newly discovered long non-coding RNA (lncRNA) genes, demonstrate

  1. developmental and tissue specific expression patterns, and
  2. aberrant regulation in a variety of diseases, including cancer [18-27].

NcRNAs are loosely grouped into two major classes based on transcript size; small ncRNAs and lncRNAs [28-30].

  1. Small ncRNAs are represented by a broad range of known and newly discovered RNA species, with many being associated
    • with 5′ or 3′ regions of genes [4,31,32].

This class includes the well-documented miRNAs, RNAs ~22 nucleotides (nt) long involved in the specific regulation of both

  1. protein-coding, and
  2. putatively non-coding genes,
    • by post-transcriptional silencing or infrequently
    • by activation [33-35].

miRNAs serve as major

  1. regulators of gene expression and as
  2. intricate components of the cellular gene expression network [33-38].

Another newly described subclass are the transcription initiation RNAs (tiRNAs), which are

  • the smallest functional RNAs at only 18 nt in length [39,40].
  1. small ncRNAs classes, including miRNAs, have established roles in tumorigenesis, an intriguing association between
  2. the aberrant expression of ncRNA satellite repeats and cancer has been recently demonstrated [41-46].

Types of human non-coding RNAs

In contrast to miRNAs, lncRNAs, the focus of this article, are

    • mRNA-like transcripts ranging in length from 200 nt to ~100 kilobases (kb) lacking significant open reading frames.

Many identified lncRNAs are transcribed by RNA polymerase II (RNA pol II) and are polyadenylated, but this is not a fast rule [47,48].
There are examples of lncRNAs, such as the

  • antisense asOct4-pg5 or the
  • brain-associated BC200,
    • which are functional, but not polyadenylated [49-51].
  1. lncRNA expression levels appear to be lower than protein-coding genes [52-55], and some
  2. lncRNAs are preferentially expressed in specific tissues [21].

Novel lncRNAs may contribute a significant portion of the aforementioned ‘dark matter’ of the human transcriptome [56,57]. In an exciting report
by Kapranov et.al., it was revealed the bulk of the relative mass of RNA in a human cell, exclusive of the ribosomal and mitochondrial RNA,
is represented by non-coding transcripts with no known function
[57].

Like miRNAs and protein-coding genes, some

  • transcriptionally active lncRNA genes display
  • histone H3K4 trimethylation at their 5′-end and
  • histone H3K36 trimethylation in the body of the gene [8,58,59].

The small number of characterized human lncRNAs have been associated with a spectrum of biological processes, for example,

  • epigenetics,
  • alternative splicing,
  • nuclear import,
    1. as structural components,
    2. as precursors to small RNAs and
    3. even as regulators of mRNA decay [4,60-70].

Furthermore, accumulating reports of misregulated lncRNA expression across numerous cancer types suggest that

    • aberrant lncRNA expression may be a major contributor to tumorigenesis [71].

This surge in publications reflects the increasing attention to this subject  and a number of useful lncRNA databases have been created .
In this review we highlight the emerging

    • functional role of aberrant lncRNA expression, including
    • transcribed ultraconserved regions (T-UCRs), within human carcinomas.

Publications describing cancer-associated ncRNAs. Entries are based on a National Library of Medicine Pubmed search using the terms
“ncRNA” or “non-coding RNA” or “noncoding RNA” or non-protein-coding RNA” with cancer and annual (Jan.1-Dec.31) date limitations. …
Publically available long non-coding RNA online databases

The definition ‘non-coding RNA’ is typically used to describe transcripts where

    • sequence analysis has failed to identify an open reading frame.

There are cases where ‘non-coding’ transcripts were found to encode short, functional peptides [72]. Currently, a
universal classification scheme to define lncRNAs does not exist. Terms such as

  • large non-coding RNA,
  • mRNA-like long RNA, and
  • intergenic RNA

all define cellular RNAs, exclusive of rRNAs,

    • greater than 200 nt in length and having no obvious protein-coding capacity [62].

This has led to confusion in the literature as to exactly which transcripts should constitute a lncRNA. One subclass of lncRNAs is called
large or long intergenic ncRNAs (lincRNAs). These lncRNAs are

  1. exclusively intergenic and are
  2. marked by a chromatin signature indicative of transcription [8,58].

RNA species that are bifunctional preclude categorization into either group of

  • protein-coding or
  • ncRNAs as

their transcripts function both at the RNA and protein levels [73].

The term ‘lncRNA‘ is used only to describe transcripts with no protein-coding capacity. In the meantime, and for the purposes of this review,
we will consider lncRNAs as a blanket term to encompass

  1. mRNA-like ncRNAs,
  2. lincRNAs, as well as
  3. antisense and intron-encoded transcripts,
  4. T-UCRs and
  5. transcribed pseudogenes.

Discovery of LncRNAs

The earliest reports describing lncRNA predated the discovery of miRNAs, although the term ‘lncRNA‘ had not been coined at the time .
One of the first lncRNA genes reported was the imprinted H19 gene, which was quickly followed by the discovery of the

  • silencing X-inactive-specific transcript (XIST) lncRNA gene, which
    • plays a critical function in X-chromosome inactivation [74,75].

The discovery of the first miRNA lin-14 dramatically redirected the focus of ncRNA research from long ncRNAs to miRNAs [76], and
the discovery of miRNAs revealed RNA could

  1. regulate gene expression and
  2. entire gene networks could be affected by ncRNA expression and

Within the last decade miRNAs were discovered to be associated with cancer. At the time of this writing there are approximately
1049 human miRNAs described in miRBase V16 [80,81] with the potential of

    • affecting the expression of approximately 60% of protein -coding genes [82,83].

Conversely, the variety and dynamics of lncRNA expression was not to be fully appreciated until the introduction of whole transcriptome sequencing.
With the advent of the FANTOM and ENCODE transcript mapping projects, it was revealed that the mammalian genome is extensively transcribed,
although a large portion of this represented non-coding sequences [3,84]. Coupled with the novel functional annotation of a few lncRNAs, this discovery
promoted research focusing on lncRNA discovery and characterization. Recent reports have described new lncRNA classes such as lincRNAs and T-UCRs [8,58,85].
Current estimates of the lncRNA gene content in the human genome ranges from ~7000 – 23,000 unique lncRNAs, implying this class of ncRNA will
represent an enormous, yet undiscovered, component of normal cellular networks that may be disrupted in cancer biology [62].

Emerging Role of Long Non-Coding RNA in Tumorigenesis

A role for differential lncRNA expression in cancer had been suspected for many years, however, lacked strong supporting evidence [86]. With advancements
in cancer transcriptome profiling and accumulating evidence supporting lncRNA function, a number of differentially expressed lncRNAs have been associated
with cancer. LncRNAs have been implicated to

  • regulate a range of biological functions and
  • the disruption of some of these functions, such as
    • genomic imprinting and transcriptional regulation,
    • plays a critical role in cancer development.

Here we describe some of the better characterized lncRNAs that have been associated with cancer biology.

Human cancer-associated lncRNAs

Imprinted lncRNA genes

Imprinting is a process whereby the copy of a gene inherited from one parent is epigenetically silenced [87,88]. Intriguingly, imprinted regions often
include multiple maternal and paternally expressed genes with a high frequency of ncRNA genes. The imprinted ncRNA genes are implicated in the
imprinting of the region by a variety of mechanisms including

  • enhancer competition and chromatin remodeling [89].

A key feature of cancer is the loss of this imprinting resulting in altered gene expression [90,91]. Two of the best known imprinted genes
are in fact lncRNAs.

H19

The H19 gene encodes a 2.3 kb lncRNA that is expressed exclusively from the maternal allele. H19 and its reciprocally imprinted protein-coding neighbor
the Insulin-Like Growth Factor 2 or IGF2 gene at 11p15.5 were among the first genes, non-coding or otherwise, found to demonstrate genomic imprinting [74,92].

The expression of H19 is high during vertebrate embryo development, but is

  • downregulated in most tissues shortly after birth with the exception of skeletal tissue and cartilage [20,93,94].
  • Loss of imprinting and subsequent strong gene expression has been well-documented in human cancers. Likewise,
  • loss of imprinting at the H19 locus resulted in high H19 expression in cancers of the esophagus, colon, liver, bladder and with hepatic metastases [95-97].

H19 has been implicated as having both oncogenic and tumor suppression properties in cancer. H19 is upregulated in a number of human cancers, including
hepatocellular, bladder and breast carcinomas, suggesting an oncogenic function for this lncRNA [97-99]. In colon cancer H19 was shown to be directly activated
by the oncogenic transcription factor c-Myc, suggesting

  • H19 may be an intermediate functionary between c-Myc and downstream gene expression [98].

Conversely, the tumor suppressor gene and transcriptional activator p53 has been shown to

  • down-regulate H19 expression [100,101].

H19 transcripts also serve as a precursor for miR-675, a miRNA involved in the regulation of developmental genes [102].
miR-675 is processed from the first exon of H19 and functionally

  • downregulates the tumor suppressor gene retinoblastoma (RB1) in human colorectal cancer, further implying an oncogenic role for H19 [103].

There is evidence suggesting H19 may also play a role in tumor suppression [104,105]. Using a mouse model for colorectal cancer, it was shown that
mice lacking H19 manifested an increased polyp count compared to wild-type [106]. Secondly, a mouse teratocarcinoma model demonstrated larger
tumor growth when the embryo lacked H19, and finally in a hepatocarcinoma model, mice developed cancer much earlier when H19 was absent [107].
The discrepancy as to whether H19 has oncogenic or tumor suppressive potential may be due in part to the bifunctional nature of the lncRNA or may
be context dependent. In either case, the precise functional and biological role of H19 remains to be determined.

XIST – X-inactive-specific transcript

The 17 kb lncRNA XIST is arguably an archetype for the study of functional lncRNAs in mammalian cells, having been studied for nearly two decades.
In female cells, the XIST transcript plays a critical role in X-chromosome inactivation by

  • physically coating one of the two X-chromosomes, and is necessary for the
  • cis-inactivation of the over one thousand X-linked genes [75,108-110].

Like the lncRNAs HOTAIR and ANRIL, XIST associates with polycomb-repressor proteins, suggesting

    • a common pathway of inducing silencing utilized by diverse lncRNAs.

Discovery of Molecular Mechanisms of Traditional Chinese Medicinal Formula Si-Wu-Tang Using Gene Expression Microarray and Connectivity Map
by Zhining Wen, Zhijun Wang, Steven Wang, Ranadheer Ravula, Lun Yang, …et al.
PLoS ONE (2011); 6:(3), Publisher: PLoS, Pages: 14    http.//dx.doi.org/10.1371/journal.pone.0018278        PubMed: 21464939
http://dx.plos.org/Wen Z, Wang Z, Wang Z, et al./discovery of molecular mechanisms of traditional chinese medicinal formula…/
To pursue a systematic approach to discovery of mechanisms of action of traditional Chinese medicine (TCM), we used

  • microarrays,
  • bioinformatics and the
  • Connectivity Map (CMAP)
    • to examine TCM-induced changes in gene expression.

We demonstrated that this approach can be used to elucidate new molecular targets using a model TCM herbal formula Si-Wu-Tang (SWT) which is

  • widely used for women’s health.

The human breast cancer MCF-7 cells treated with 0.1 µM estradiol or 2.56 mg/ml of SWT

  • showed dramatic gene expression changes, while
  • no significant change was detected for ferulic acid, a known bioactive compound of SWT.

Pathway analysis using

  • differentially expressed genes related to the treatment effect
  • identified that expression of genes in the nuclear factor erythroid 2-related factor 2 (Nrf2) cytoprotective pathway
  • was most significantly affected by SWT,
    • but not by estradiol or ferulic acid.
  • The Nrf2-regulated genes
    • HMOX1,
    • GCLC,
    • GCLM,
    • SLC7A11 and
    • NQO1 were
  • upreguated by SWT in a dose-dependent manner, which was validated by real-time RT-PCR. Consistently,
  • treatment with SWT and its four herbal ingredients resulted in an 
  • increased antioxidant response element (ARE)-luciferase reporter activity in MCF-7 and HEK293 cells.

Furthermore, the gene expression profile of differentially expressed genes related to SWT treatment was used to compare with those of

  • 1,309 compounds in the CMAP database.

The CMAP profiles of estradiol-treated MCF-7 cells showed an excellent match with SWT treatment,

  • consistent with SWT’s widely claimed use for women’s diseases and indicating a phytoestrogenic effect.

Read Full Post »

RNA Virus Genome as Bacterial Chromosome

Reporter: Larry H Bernstein, MD, FCAP

 

Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome
F Almazan, JM Gonzalez, Z Penzes, A Izeta, E Calvo, J Plana-Duran, and L Enjuanes

PNAS  May 9, 2000; 97(10): 5516–5521.

the application of two strategies,

  • cloning of the cDNAs into a bacterial artificial chromosome and
  • nuclear expression of RNAs that are typically produced within the cytoplasm

is useful for the engineering of large RNA molecules.
A cDNA encoding an infectious coronavirus RNA genome

  • has been cloned as a bacterial artificial chromosome.

The rescued coronavirus

  • conserved all of the genetic markers introduced throughout the sequence and
  • showed a standard mRNA pattern and

the antigenic characteristics expected for the synthetic virus.
The cDNA was transcribed

  • within the nucleus, and
  • the RNA translocated to the cytoplasm.
Interestingly, the recovered virus had
  • essentially the same sequence as the original one, and
      • no splicing was observed.

During the engineering of the infectious cDNA,

  • the spike gene of the virus was replaced by
  • the spike gene of an enteric isolate.

The synthetic virus

  • replicated abundantly in the enteric tract and was fully virulent, demonstrating that
  • the tropism and virulence of the recovered coronavirus can be modified.

the application of two strategies,

  • cloning of the cDNAs into a bacterial artificial chromosome and
  • nuclear expression of RNAs that are typically produced within the cytoplasm,
    • is useful for the engineering of large RNA molecules.

A cDNA encoding an infectious coronavirus RNA genome has been cloned as a bacterial artificial chromosome. The rescued coronavirus

  • conserved all of the genetic markers introduced throughout the sequence and
  • showed a standard mRNA pattern and
  • the antigenic characteristics expected for the synthetic virus.
    • The cDNA was transcribed within the nucleus, and
    • the RNA translocated to the cytoplasm.

Interestingly, the recovered virus had essentially the same sequence as the original one, and no splicing was observed. During the engineering of the infectious cDNA, the spike gene of the virus was replaced by the spike gene of an enteric isolate. The synthetic virus

  • replicated abundantly in the enteric tract and
  • was fully virulent,

demonstrating that the tropism and virulence of the recovered coronavirus can be modified.}
http://www.PNAS.org/Engineering_the_largest_RNAvirus_genome_as_an_infectious_bacterial_artificial_chromosome/

Description: The interaction of mRNA in a cell...

Description: The interaction of mRNA in a cell. Source: http://www.genome.gov/Pages/Hyperion/DIR/VIP/Glossary/Illustration/mrna.shtml (file) License: “All of the illustrations in the Talking Glossary of Genetics are freely available and may be used without special permission.” http://www.genome.gov/page.cfm?pageID=10003803 (Photo credit: Wikipedia)

RNA Protein Virus

RNA Protein Virus (Photo credit: Wikipedia)

This image was created as part of the Philip G...

This image was created as part of the Philip Greenspun illustration project. (Photo credit: Wikipedia)

Read Full Post »

Transcript Dynamics of Proinflammatory Genes

Author: Larry H Bernstein, MD, FCAP

Transcript Dynamics of Proinflammatory Genes Revealed by Sequence Analysis of Subcellular RNA Fractions

DM Bhatt, A Pandya-Jones, Ann-Jay Tong, I Barozzi, MM Lissner, et al.
Cell 2012;150: 279–290

In addition to documenting the subcellular locations of coding and noncoding transcripts, the results provide a high-resolution view of the relationship between
  • defined promoter and chromatin properties and
    • the temporal regulation of diverse classes of coexpressed genes.
The data also reveal a striking accumulation of full-length yet incompletely spliced transcripts in the chromatin fraction, suggesting that
  • splicing often occurs after transcription has been completed,
  • with transcripts retained on the chromatin until fully spliced.
Summary
Macrophages respond to inflammatory stimuli by modulating the expression of hundreds of genes in
  • a defined temporal cascade,
  • with diverse transcriptional and posttranscriptional mechanisms contributing to the regulatory network.
We examined proinflammatory gene regulation in activated macrophages by
  • performing RNA-seq with fractionated chromatin-associated, nucleoplasmic, and cytoplasmic transcripts.
This methodological approach allowed us
  • to separate the synthesis of nascent transcripts from transcript processing and
  • the accumulation of mature mRNAs.
In addition to documenting the subcellular locations of coding and noncoding transcripts,
the results provide a high-resolution view of the relationship between
  • defined promoter and chromatin properties and
  • the temporal regulation of diverse classes of coexpressed genes.
The data also reveal a striking accumulation of full-length yet incompletely spliced transcripts in the chromatin fraction, suggesting that
  • splicing often occurs after transcription has been completed, with transcripts retained on the chromatin until fully spliced.

Two independent experiments were performed with lipid A-stimulated bone marrow-derived macrophages. The two experiments made use of different macrophages prepared from different mice, several months apart.(A) Pearson pair-wise correlation values (R) derived from an analysis of greater than 500 lipid A-induced genes (>5-fold induced) are shown. Each time point from the first experiment, A, was compared to every other time point from the same experiment and from the second experiment, B.(B) Hierarchical clustering of the R-values from panel A was performed. This analysis reveals that, when only induced genes are considered, each time point from each experiment correlates more closely with the corresponding time point from the other experiment than with any of the other time points from either experiment.(C)

This analysis reveals that, when the transcript levels of expressed genes are compared,
  • each time point from a given experiment correlates with the same time point from the independent experiment.
The results reveal close correlations between all time-points from both experiments, presumably because genes that are consistently unexpressed (i.e., not counted in B) are contributing to the high degree of correlation. Nevertheless, the time points of each independent experiment still have the highest degree of correlation with each other.
Hierarchical clustering of the R values from panel D was performed. As with other clusterings, each sample clusters with its cognate time point in the independent experiment
Highlights
► Coding and noncoding transcripts exhibit characteristic subcellular distributions
► The most potently induced genes favor promoters with low CpG content
► Full-length, incompletely spliced transcripts accumulate on the chromatin
► Delayed transcript release may reflect a requirement for the completion of splicing
Eukaryotic transcription overview

Eukaryotic transcription overview (Photo credit: Allen Gathman)

English: Nucleosome structure.

English: Nucleosome structure. (Photo credit: Wikipedia)

Read Full Post »

Zinc-Finger Nucleases (ZFNs) and Transcription Activator–Like Effector Nucleases (TALENs)

Reporter: Larry H Bernstein, MD, FCAP

 

TALENs and ZFNs are associated with different mutation signatures

Y Kim,  J Kweon  & Jin-Soo Kim

Zinc-finger nucleases (ZFNs) and transcription activator–like effector nucleases (TALENs) are of great interest for genome engineering in higher eukaryotic cells and organisms. These enzymes

  1. contain the same FokI nuclease domain and
  2. induce site-specific DNA cleavage.

http://www.nature.com/nmeth/journal/v10/n3/extref/nmeth.2364-S1.pdf

http://www.nature.com/nmeth/journal/v10/n3/full/nmeth.2364.html?WT.ec_id=NMETH-201303

English: Bacterial multi-drug resistance syste...

English: Bacterial multi-drug resistance system: complex of dimeric transcription-activator protein BmrR with bound TPP, untwisting the DNA to position the two promoter sites (top) for transcription. Coordinates from PDB file 1R8E, Brennan lab; displayed in KiNG. (Photo credit: Wikipedia)

                                   

English: Diagram of a typical rAAV vector

English: Diagram of a typical rAAV vector (Photo credit: Wikipedia)

Splicing activation

Splicing activation (Photo credit: Allen Gathman)

Read Full Post »

What about Circular RNAs?

Reporter: Larry H Bernstein, MD, FCAP

Circular RNAs throw genetics for a loop

RNA ‘sponges’ mop up sequences that curb gene expression.
Heidi Ledford       27 Feb 2013

At least some of the loops, described in two papers published this week by Nature1, 2,

  • act as molecular sponges’,
    • binding to and
    • blocking tiny gene modulators called microRNAs.

The researchers suspect that the circular RNAs have many other functions. The molecules comprise “a hidden, parallel universe” of unexplored RNAs, says Nikolaus Rajewsky, the lead author of one of the studies and a systems biologist at the Max Delbrück Center for Molecular Medicine in Berlin. The discovery is yet another a reminder that

  • RNA is much more than a mundane messenger ­between DNA and the proteins it encodes.
1. Memczak , S. et al. Nature 2013         http://dx.doi.org/10.1038/nature11928/
2. Hansen, T. B. et al. Nature 2013        http://dx.doi.org/10.1038/nature11993/
3. Salzman, J., Gawad, C., Wang, P. L., Lacayo, N. & Brown, P. O. 2012        http://www.PLoSONE.org/7/30733
Consensus secondary structure of yjdF RNAs. Th...

Consensus secondary structure of yjdF RNAs. This figure is adapted from a previous publication. (Photo credit: Wikipedia)

RNA

RNA (Photo credit: AJC1)

By Richard Wheeler (Zephyris) 2007. Lambda rep...

By Richard Wheeler (Zephyris) 2007. Lambda repressor protein bound to a lambda operator DNA sequence. From . (Photo credit: Wikipedia)

Consensus secondary structure of mini-ykkC RNA...

Consensus secondary structure of mini-ykkC RNAs. Layout is similar to that used in a previously published drawing. (Photo credit: Wikipedia)

Read Full Post »

When Clinical Application of miRNAs?

Author: Larry H Bernstein, MD, FCAP

Clinical Application of miRNAs Remains a Ways Off
When its time comes, prognostic tests will be first.
Patricia Fitzpatrick Dimond, Ph.D             GEN Insight & Intelligence

It’s still early to tell how well microRNAs (miRNAs) will prove clinically useful.  Preclinical research findings indicate their central role in controlling cellular pathways.
This novel class of nucleotides, about 20–25 nucleotides in length, affects gene expression by interacting with messenger RNAs. But unlike Small Interfering RNA,  siRNAs, miRNAs are encoded in the human genome and function as natural regulators of global gene expression.
Each of the more than 1,500 encoded miRNAs appears to regulate the expression of tens to hundreds of different genes, on-off switches, regulating multiple cellular functions including
  • growth and
  • proliferation.
miRNAs regulate the translation of genes through
  • sequence-specific binding to mRNA.

Depending on the degree of sequence complimentarity, they can inhibit

  • the translation and/or degradation of their target mRNAs.

Because of their role in controlling “suites” of genes and, ultimately, pathway function, these molecules have attracted considerable scientific and investor interest in the control of diseases ranging from cardiovascular diseases to cancer.

miRNAs target numerous biomolecules that play a role in carcinogenesis,
  • functioning as both tumor promoters or suppressors.
Aberrant expression of miRNAs
  • correlates with the development and progression of tumors;

inhibition of their expression can

  1. modulate the cancer phenotype,
  2. suggesting their potential as anticancer drug targets.

Further supporting their potential use as drug targets, miRNA expression profiling in a variety of tissue, cell, and disease types has revealed

  • a “miRNA signature” specific to those cell types or disease states.
Research

Carlo Croce, M.D., director of Human Cancer Genetics at the Ohio State University Comprehensive Cancer Center, and colleagues reported that

  • they identified a 9-miRNA signature that differentiated invasive (IDC) from in situ carcinoma (DCIS).

In studying the global changes of the miRNA repertoire along the transitions defining breast cancer progression, the scientists found that

  1. let-7d, miR-210, and miR-221 were downregulated in the in situ and
  2. upregulated in the invasive transition, thus
  3. featuring an expression reversal along the cancer progression path.
  4. in addition,  miRNAs for overall survival and time to metastasis.
Dr. Croce posed that targeted prognostic tests using miRNA will be available within the next two years.
  • the problem he suggests is validating the signature in a large enough cohort of patients.
They used deep sequencing, an extremely sensitive approach to the determination of miRNAs because you count the molecules. Studies have used microarrays and RT-PCR, and his group used general microarrays and validated RT-PCR.  Their method avoided the possibility of artifacts (by counting).  Sequencing permits counts of molecules to provide good data.

John F. McDonald, Ph.D., CSO Ovarian Cancer Institute, and colleagues at the Georgia Institute of Technology

  1. separately transfected two miRNAs (miR-7 and miR-128) into the ovarian cancer cell line (HEY) and
  2. then monitored global changes in gene expression levels.
  • 20% of the changes in expression patterns of hundreds to thousands of genes
  • could be attributed to direct miRNA–mRNA interactions, but
  • the majority of the changes were indirect,
involving the downstream consequences of miRNA-mediated changes in regulatory gene expression.
The pathways most significantly affected by miR-7 transfection, are involved with
  1. cell adhesion and
  2. developmental networks previously associated with epithelial-mesenchymal transitions and
  3. processes linked with metastasis.

http://www.genengnews.com/insight-and-intelligenceand153/clinical-application-of-mirnas-remains-a-ways-off/77899650/

ATVB in Focus
MicroRNAs 
From Basic Mechanisms to Clinical Application in Cardiovascular Medicine
Christian Weber, Ludwig-Maximilians-Univ and German Centre for Cardiovasc Res, Munich, Germany
Arterioscler Thromb Vasc Biol. 2013;33:168-169.  http://dx.doi.org/10.1161/ATVBAHA.112.300920
MicroRNAs (miRs) are small noncoding RNAs (≈23 nucleotides) that regulate gene expression at a posttranscriptional level by degradation or translational inhibition of target mRNAs. Initially discovered as regulators of development in plants, worms, and fruitflies,
miRs are emerging as
  • pivotal modulators of cardiovascular biology and disease in mice and men.
Besides a cell-specific transcription factor profile,
  • cell-specific miR-regulated gene expression is integral to cell fate and activation decisions.
Thus, the cell types involved in
  • atherosclerosis,
  • vascular disease, and
    • its myocardial sequelae may be
  • differentially regulated by distinct miRs, thereby
    • controlling highly complex processes
      • smooth muscle cell phenotype and
      • inflammatory responses of endothelial cells or macrophages.
The generation of mature miR strands requires several steps of processing of the primary miR gene transcript, including
  • cleavage of the terminal loop of miR-precursors by the RNase III enzyme,Dicer, to produce miR duplexes.
Although either strand of the miR duplex can be stably associated with an Argonaute (Ago) family protein,
  • preferential loading of a specific strand (ie, the guide strand) onto the miR-induced silencing complex (RISC) is common.
The strand that is not loaded into the RISC (ie, the passenger strand or miR*) is typically degraded.3 Strand selection may be tissue-specific, and an accumulation observed for both strands implies that
  • each strand can separately enter the silencing complex.4
Because of the often imperfect complementary binding of the miR seed sequence to the mRNA recognition element,
  • an individual miR can affect the expression of hundreds of target mRNAs.
http://atvb.ahajournals.org/content/33/2/168.extract

Life’s Tiniest Architects Pinpointed by Yale Researchers
If a genome is the blueprint for life, then the chief architects are
  • tiny slices of genetic material that orchestrate how we are assembled and function.
The study pinpoints the molecular regulators of epigenetics — the process by which unchanging genes along our DNA are switched on and off at precisely right time and place.
“Our genome is like a landscape with lakes, mountains, and rivers, but it is not yet a community or a city full of buildings,” said Haifan Lin, director of the Yale Stem Cell Center and senior author of the study. “What this system does is decide where and when to send out the masons, carpenters, and electricians to build a city or a community.”
In the past 20 years, scientists have discovered that some proteins, called epigenetic factors, traverse the static genome and turn the genes on or off. The staggering number of potential combinations of active and inactive genes explains why a relatively small number of genes can carry out such a wide range of functions.
What guides these epigenetic factors to their target? The answer:

  • specialized RNAs called piRNAs.
In the latest study, the Yale team discovered that
  • piRNAs guide epigenetic factors to numerous sites throughout the genome of the fruit fly Drosophila, where
    • these switches  work to turn genes on or off.
The dramatic change in gene expression patterns found illustrated
  • piRNAs key role in coordinating biological activity.
“This is the first major mechanism discovered that controls where epigenetic factors —the gene switches — are to be placed in the genome,” Lin said.
Several types of cancers appeared to be
  • triggered when the wrong kinds of piRNAs guide epigenetic factors to activate the wrong genes.
Blocking the action of these piRNAs should become a new opportunity to treat cancers, Lin said.
Xiao A. Huang and Hang Yin of Yale are co-lead authors of the paper.
The research was funded by a National Institutes of Health Pioneer Award to Haifan Lin and a grant from Connecticut Stem Cell Research Fund to
Lin and former Yale professor and co-author Michael Snyder, now of Stanford University.
English: A diagram showing at which stages in ...

English: A diagram showing at which stages in the DNA-mRNA-protein pathway expression can be controlled. (Photo credit: Wikipedia)

Virus-Encoded microRNAs

Virus-Encoded microRNAs (Photo credit: AJC1)

English: A Tet-ON doxycycline inducible transg...

English: A Tet-ON doxycycline inducible transgene expression system. (Photo credit: Wikipedia)

 

Read Full Post »

microRNA Biomarker

Reporter: Larry H Bernstein, MD, FCAP

MicroRNA Molecule May Serve as Biomarker

miRNA molecule called miR-7 decreased in highly metastatic cancer stem-like cells.
February 18, 2013
Researchers have identified two molecules that could potentially serve as biomarkers in

MicroRNAs are involved in

  • tumor initiation and
  • progression, and
  • may play a role in metastasis, particularly in relation to
  • cancer stem-like cells.
miR-7 is a metastasis

  • suppressor in cancer stem-like cells, and when they
  • increased expression of miR-7 in cancer stem-like cells from
    • it suppressed their metastatic properties.

miR-7 suppressed ………….expression of KLF4.
However, miR-7 significantly suppressed the ability of cancer stem-like cells to metastasize to the brain but not the bone.

A gram illustrating the disctinction between c...

A gram illustrating the disctinction between cancer stem cell targeted (above) and conventional (below) cancer therapies (Photo credit: Wikipedia)

Related articles

 

Read Full Post »

New Insight into How Genes Function

Reporter: Larry H Bernstein, MD, FCAP

 

New Insight into How Genes Function

GENNewsHighlights    Feb 17, 2013

Long segments of noncoding RNA are key to

  • physically manipulating DNA in order to activate certain genes.
  • These noncoding RNA-activators (ncRNA-a) have a crucial role in
An illustration of the central dogma of molecu...

Diagram of a eukaryotic gene

Diagram of a eukaryotic gene (Photo credit: Wikipedia)

This image shows the coding region in a segmen...

This image shows the coding region in a segment of eukaryotic DNA. (Photo credit: Wikipedia)

English: Sporulation involved ncRNA

English: Sporulation involved ncRNA (Photo credit: Wikipedia)

English: Genes required for ectodermal specifi...

English: Genes required for ectodermal specification during early embryogenesis (Photo credit: Wikipedia)

Related articles

Read Full Post »

Targeted Nucleases

Curator: Larry H Bernstein, MD, FCAP

A REVIEW of 3 published works

Targeted nucleases: spreading the joy
Nature Methods 10, 179 (2013)   http://dx.doi.org/10.1038/nmeth.2402

Published online 27 February 2013
New RNA-guided endonucleases (RGENs) are directed to their target sites
  • by a complementary RNA molecule.
In contrast to previous tools,
  • zinc-finger nucleases (ZFNs) and
  • transcription activator–like effector nucleases (TALENs),
the RGEN nuclease component itself does not require re-engineering to
  • target a new sequence.
The ability to manipulate DNA has led to a new genetics.
Professor of Genetics at Washington University’s School of Medicine
Backgrounders – introduction to issues of current interest

Restriction Endonucleases

In 1980, geneticists used the relatively new technique of gene splicing, which we will describe in this chapter, to introduce
  • the human gene that encodes interferon into a bacterial cell’s genome.
Interferon is a rare blood protein that increases human resistance to viral infection, and medical scientists have been interested in its possible usefulness in cancer therapy. Purification of the large amounts of interferon required for clinical testing would have been prohibitively expensive at the time.   Introducing the gene responsible for its production into a bacterial cell made that possible. The cell that had
  • acquired the human interferon gene proceeded to produce interferon at a rapid rate, and to grow and divide.

The  millions of interferon-producing bacteria growing in the culture were all descendants of the cell that had originally received the human interferon gene.

The Advent of Genetic Engineering
The human insulin gene has also been cloned in bacteria, and now insulin can be manufactured at little expense. Furthermore, cloning and related molecular techniques are needed to provide basic information about how genes are put together and regulated.
The essence of genetic engineering is
  • the ability to cut DNA into recognizable pieces and rearrange those pieces in different ways.
In the interferon experiment,
  • a piece of DNA carrying the interferon gene was
  • inserted into a plasmid,which
    • carried the gene into a bacterial cell.
Most other genetic engineering approaches bring the gene of interest into the target cell by first incorporating it into a plasmid or an infective virus.
This cutting is performed
  • by enzymes that recognize and
  • cleave specific sequences of nucleotides in DNA.
Discovery of Restriction Endonucleases
Scientific discoveries often have their origins in seemingly unimportant observations that receive little attention by researchers before their general significance is appreciated. In the case of genetic engineering, the original observation was that bacteria use enzymes to defend themselves against viruses.
Most organisms eventually evolve means of defending themselves from predators and parasites, and bacteria are no exception. Among the natural enemies of bacteria are bacteriophages, viruses that infect bacteria and multiply within them. At some point, they cause the bacterial cells to burst, releasing thousands more viruses.
Some types of bacteria have acquired powerful weapons against these viruses: they contain enzymes called restriction endonucleases
  • that fragment the viral DNA as soon as it enters the bacterial cell.
Many restriction endonucleases recognize
  1. specific nucleotide sequences in a DNA strand,
  2. bind to the DNA at those sequences, and
  3. cleave the DNA at a particular place within the recognition sequence.
Why don’t restriction endonucleases cleave the bacterial cells’ own DNA as well as that of the viruses?
  • bacteria modify their own DNA, using other enzymes known as methylases to add methyl (CH3) groups
  • to some of the nucleotides in the bacterial DNA.
When nucleotides within a restriction endonuclease’s recognition sequence have been methylated,
  • the endonuclease cannot bind to that sequence.
  • the bacterial DNA is protected from being degraded at that site.
  • but viral DNA has not been methylated, and therefore
    • is not protected from enzymatic cleavage.
How Restriction Endonucleases Cut DNA
The sequences recognized by restriction endonucleases are
  • typically four to six nucleotides long, and
  • they are often palindromes.
    • the nucleotides at one end of the recognition sequence are complementary to those at the other end, so that
    • the two strands of the DNA duplex have the same nucleotide sequence running in opposite directions for the length of the recognition sequence.

Two important consequences arise from this arrangement of nucleotides to be discussed.

Biochemistry. 5th edition.
Berg JM, Tymoczko JL, Stryer L.
Section 9.3  Restriction Enzymes: Performing Highly Specific DNA-Cleavage Reactions
Bacteria and archaea have evolved mechanisms to protect themselves from viral infections so that viruses inject their DNA genomes into cells and the viral DNA hijacks the cell’s machinery A major protective strategy for the host is to use restriction endonucleases (restriction enzymes) to degrade the viral DNA. These  particular base sequences the enzymes recognize are called recognition sequences or recognition sites.
  • theycleave that DNA at defined positions.
  • The most well studied class are the so-called type II restriction enzymes.
Restriction endonucleases must show tremendous specificity at two levels.
  • First, they must cleave only DNA molecules that contain recognition sites (hereafter referred to as cognate DNA) without cleaving DNA molecules that lack these sites.
    •  endonucleases must cleave cognate DNA molecules much more than 5000 times as efficiently as they cleave nonspecific sites.
  • Second, restriction enzymes must not degrade the host DNA.

How do these enzymes manage to degrade viral DNA while sparing their own?

The restriction endonuclease EcoRV (from E. coli) cleaves double-stranded viral DNA molecules that contain the sequence 5′-GATATC-3′ but leaves intact host DNA containing hundreds of such sequences. The host DNA is protected by other enzymes called methylases, which methylate adenine bases within host recognition sequences (Figure 9.32). For each restriction endonuclease, the host cell produces a corresponding methylase that marks the host DNA and prevents its degradation.
  • These pairs of enzymes are referred to as restriction-modification systems.
Hydrolysis of a Phosphodiester Bond.
All restriction enzymes catalyze the hydrolysis of DNA phosphodiester bonds, leaving a phosphoryl group attached to the 5′ end. The bond that is cleaved is shown in red.
Mechanism Type 1 (covalent intermediate)
Mechanism Type 2 (direct hydrolysis)
Each postulates a different nucleophile to carry out the attack on the phosphorus. In either case, each reaction takes place by an in-line displacement path:
  • The incoming nucleophile attacks the phosphorus atom, and
  • a pentacoordinate transition state is formed.

This species has a trigonal bipyramidal geometry centered at the phosphorus atom, with

  • the incoming nucleophile at one apex of the two pyramids and the group that is displaced (the leaving group, L) at the other apex.
  • The two mechanisms differ in the number of times the displacement occurs in the course of the reaction.
The incoming nucleophile attacks the phosphorus atom, and
  • a pentacoordinate transition state is formed.
The analysis revealed that the stereochemical configuration at the phosphorus atom was inverted only once with cleavage. This result is consistent with a direct attack of water at phosphorus and
  • rules out the formation of any covalently bound intermediate (Figure 9.35).
Stereochemistry of Cleaved DNA.
Cleavage of DNA by EcoRV endonuclease results in overall inversion of the stereochemical configuration at the phosphorus atom.
9.3.2 Restriction Enzymes Require Magnesium for Catalytic Activity
Restriction endonucleases as well as many other enzymes that act on phosphate-containing substrates require Mg2+ or some other similar divalent cation for activity. What is the function of this metal?
It has been possible to examine the interactions of the magnesium ion when it is bound to the enzyme. Crystals have been produced of EcoRV endonuclease
  • bound to oligonucleotides that contain the appropriate recognition sequences.
These crystals are grown in the absence of magnesium to prevent cleavage; then,
  • when produced, the crystals are soaked in solutions containing the metal.
  • No cleavage takes place, allowing the location of the magnesium ion binding sites to be determined (Figure 9.36).
The magnesium ion was found to be bound to six ligands:
  1. three are water molecules,
  2. two are carboxylates of the enzyme’s aspartate residues, and
  3. one is an oxygen atom of the phosphoryl group at the site of cleavage.
The magnesium ion holds a water molecule in a position from which the water molecule can attack the phosphoryl group and,
  • in conjunction with the aspartate residues,
  • helps polarize the water molecule toward deprotonation.
Cleavage does not take place within these crystals. But a second magnesium ion must be present in an adjacent site for EcoRV endonuclease to cleave its substrate.
Magnesium Ion Binding Site in ECORV Endonuclease. The magnesium ion helps to activate a water molecule and positions it so that it can attack the phosphate.
9.3.3 The Complete Catalytic Apparatus Is Assembled Only Within Complexes of Cognate DNA Molecules, Ensuring Specificity
Specificityis the defining feature of restriction enzymes. The recognition sequences for most restriction endonucleases are inverted repeats.
This arrangement gives the three-dimensional structure of the recognition site
  • a twofold rotational symmetry (Figure 9.37).
The restriction enzymes display a corresponding symmetry to facilitate recognition:
they are dimers whose two subunits are related by twofold rotational symmetry.
  • The matching symmetry of the recognition sequence and the enzyme has been confirmed
  • by the determination of the structure of the complex between EcoRV endonuclease and DNA fragments containing its recognition sequence (Figure 9.38).

The enzyme surrounds the DNA in a tight embrace.

Structure of the Recognition Site of ECORV Endonuclease.
(A) The sequence of the recognition site, which is symmetric around the axis of rotation designated in green.
(B) The inverted repeat within the recognition sequence of EcoRV
 Structure of the ECORV – Cognate DNA Complex.
This view of the structure of EcoRV endonuclease bound to a cognate DNA fragment is down the helical axis of the DNA. The two protein subunits are in yellow and blue, and the DNA backbone is in red.
A unique set of interactions occurs between the enzyme and a cognate DNA sequence. Within the 5′-GATATC-3′ sequence,
the G and A bases at the 5′ end of each strand and their Watson-Crick partners directly contact the enzyme
  • by hydrogen bonding with residues that are located in two loops,
  • one projecting from the surface of each enzyme subunit (Figure 9.39).
The most striking feature of this complex is the distortion of the DNA, which is substantially kinked in the center (Figure 9.40). The central two TA base pairs in the recognition sequence play a key role in producing the kink. They do not make contact with the enzyme but appear to be required because of their ease of distortion. 5′-TA-3′ sequences are known to be among the most easily deformed base pairs.
The distortion of the DNA at this site has severe effects on the specificity of enzyme action.
Hydrogen Bonding Interactions between ECORV Endonuclease and Its DNA Substrate.
One of the DNA-binding loops (in green) of EcoRV endonuclease is shown interacting with the base pairs of its cognate DNA binding site. Key amino acid residues are shown.
Distortion of the Recognition Site.
The DNA is represented as a ball-and-stick model. The path of the DNA helical axis, shown in red, is substantially distorted on binding to the enzyme. For the B form of DNA, the axis is straight (not shown).
Specificity is often determined by an enzyme’s binding affinity for substrates. In regard to EcoRV endonuclease, however, binding studies performed in the absence of magnesium have demonstrated that
  • the enzyme binds to all sequences, both cognate and noncognate, with approximately equal affinity.
  • the structures of complexes formed with noncognate DNA fragments are strikingly different from those formed with cognate DNA:
    • the noncognate DNA conformation is not substantially distorted (Figure 9.41).

This lack of distortion has important consequences with regard to catalysis. No phosphate is positioned sufficiently close to the active-site aspartate residues to complete a magnesium ion binding site (see Figure 9.36). Hence, the nonspecific complexes do not bind the magnesium ion and

  • the complete catalytic apparatus is never assembled.
The distortion of the substrate and the subsequent binding of the magnesium ion account for
  • the catalytic specificity of more than 1,000,000-fold that is observed for EcoRV endonuclease
Nonspecific and Cognate DNA within ECORV Endonuclease.
A comparison of the positions of the nonspecific (orange) and the cognate DNA (red) within EcoRV reveals that,
  • in the nonspecific complex, the DNA backbone is too far from the enzyme
We can now see the role of binding energy in this strategy for attaining catalytic specificity.
In binding to the enzyme, the DNA is distorted in such a way that
  • additional contacts are made between the enzyme and the substrate, increasing the binding energy.
However, this increase is canceled by the energetic cost of distorting the DNA from its relaxed conformation (Figure 9.42). Thus, for EcoRV endonuclease,
there is little difference in binding affinity for cognate and nonspecific DNA fragments.
  • However, the distortion in the cognate complex dramatically affects catalysis by completing the magnesium ion binding site.
  • This example illustrates how enzymes can utilize available binding energy to deform substrates and poise them for chemical transformation.
  • Interactions that take place within the distorted substrate complex
    • stabilize the transition state leading to DNA hydrolysis.
Greater Binding Energy of EcoRV Endonuclease Bound to Cognate Versus Noncognate Dna.
The additional interactions between EcoRV endonuclease and cognate DNA increase the binding energy, which can be used to drive DNA distortions.
The distortion in the DNA explains how methylation blocks catalysis and protects host-cell DNA. When a methyl group is added to the amino group of the adenine nucleotide at the 5′ end of the recognition sequence,
  • the methyl group’s presence precludes the formation of a hydrogen bond between the amino group and the side-chain carbonyl group of asparagine 185 (Figure 9.43).
  • This asparagine residue is closely linked to the other amino acids that form specific contacts with the DNA.
  • The absence of the hydrogen bond disrupts other interactions between the enzyme and the DNA substrate, and
    • the distortion necessary for cleavage will not take place.
Methylation of Adenine.
The methylation of adenine blocks the formation of hydrogen bonds
  • between EcoRV endonuclease and cognate DNA molecules and
  • prevents their hydrolysis.
 9.3.4 Type II Restriction Enzymes Have a Catalytic Core in Common and Are Probably Related by Horizontal Gene Transfer
 Type II restriction enzymes are prevalent in Archaea and Eubacteria. What can we tell of the evolutionary history of these enzymes?
Comparison of the amino acid sequences of a variety of type II restriction endonucleases did not reveal significant sequence similarity between most pairs of enzymes. However, a careful examination of three-dimensional structures, taking into account the location of the active sites, revealed
  • the presence of a core structure conserved in the different enzymes.
  • This structure includes β strands that contain the aspartate (or, in some cases, glutamate) residues forming the magnesium ion binding sites (Figure 9.44).
 A Conserved Structural Core in Type II Restriction Enzymes.
Four conserved structural elements, including the active-site region (in blue), are highlighted in color in these models of a single monomer from each dimeric enzyme.
These observations indicate that many type II restriction enzymes are indeed evolutionary related. Analyses of the sequences in greater detail suggest that bacteria may have obtained genes encoding these enzymes 
  • from other species by horizontal gene transfer, the passing between species of pieces of DNA (such as plasmids) that provide
  • a selective advantage in a particular environment.
For example, EcoRI (from E. coli) and RsrI (from Rhodobacter sphaeroides) are 50% identical in sequence over 266 amino acids, clearly
  • indicative of a close evolutionary relationship.
  • these species of bacteria are not closely related,
  • as is known from sequence comparisons of other genes and other evidence.
Thus, it appears that these species obtained the gene for this restriction endonuclease from a common source
  • more recently than the time of their evolutionary divergence.
  • the gene encoding EcoRI endonuclease uses particular codons to specify given amino acids that are
  • strikingly different from the codons used by most E. coli genes, which
    • suggests that the gene did not originate in E. coli.
  • Horizontal gene transfer may be a relatively common event.
    • genes that inactivate antibiotics are often transferred, leading to the transmission of antibiotic resistance from one species to another.
For restriction-modification systems,
  • protection against viral infections may have favored horizontal gene transfer.
Biochemistry. 5th edition.
Berg JM, Tymoczko JL, Stryer L.
New York: W H Freeman; 2002.
  • Cleavage Is by In-Line Displacement of 3′ Oxygen from Phosphorus by Magnesium-Activated Water
  • Restriction Enzymes Require Magnesium for Catalytic Activity
  • The Complete Catalytic Apparatus Is Assembled Only Within Complexes of Cognate DNA Molecules, Ensuring Specificity
  • Type II Restriction Enzymes Have a Catalytic Core in Common and Are Probably Related by Horizontal Gene Transfer
By Richard Wheeler (Zephyris) 2007. Image of E...

By Richard Wheeler (Zephyris) 2007. Image of EcoRV homodimer in complex with a DNA substrate. From . (Photo credit: Wikipedia)

HindIII restriction endonuclease in complex wi...

HindIII restriction endonuclease in complex with cognate DNA (Photo credit: Wikipedia)

English: 3d surface model of HindIII dimer com...

English: 3d surface model of HindIII dimer complexed with a DNA fragment from PDB 2E52. Ref.: Watanabe, N., Sato, C., Takasaki, Y., Tanaka, I. Crystal structural analysis of HindIII restriction endonuclease in complex with cognate DNA at 2.0 angstrom resolution to be published (Photo credit: Wikipedia)

English: BglII active site containing residues...

English: BglII active site containing residues that coordinate to a metal ion and water molecules including the nucleophilic water that breaks the scissile phosphodiester bond at the recognition site. (Photo credit: Wikipedia)

Related articles

 

Read Full Post »

CRACKING THE CODE OF HUMAN LIFE: Milestones along the Way – Part IIA

Curator: Larry H Bernstein, MD, FCAP

 

This image has an empty alt attribute; its file name is ArticleID-24.png

WordCloud Image Produced by Adam Tubman

Introduction and purpose

This material goes beyond the Initiation Phase of Molecular Biology, Part I.

http://pharmaceuticalintelligence.com/2013/02/08/the-initiation-and-growth-of-molecular-biology-and-genomics/
Part II reviews the Human Genome Project and the decade beyond.

In a three part series:
Part IIA.  CRACKING THE CODE OF HUMAN LIFE: Milestones along the Way
Part IIB.  CRACKING THE CODE OF HUMAN LIFE: The Birth of BioInformatics & Computational Genomics
Part IIC.  CRACKING THE CODE OF HUMAN LIFE: Recent Advances in Genomic Analysis and Disease

Part III will conclude with Ubiquitin, it’s Role in Signaling and Regulatory Control.
Part I reviewed the huge expansion of the biological research enterprise after the Second World War. It concentrated on the

  • discovery of cellular structures,
  • metabolic function, and
  • creation of a new science of Molecular Biology.
  •  

Part II follows the race to delineation of the Human Genome, discovery methods and fundamental genomic patterns that are ancient in both animal and plant speciation. But it explores both the complexity and the systems view of the architecture that underlies and understanding of the genome.

These articles review a web-like connectivity between inter-connected scientific discoveries, as significant findings have led to novel hypotheses and many expectations over the last 75 years. This largely post WWII revolution has driven our understanding of biological and medical processes at an exponential pace owing to successive discoveries of

  • chemical structure,
  • the basic building blocks of DNA  and proteins,
  • nucleotide and protein-protein interactions,
  • protein folding, allostericity,
  • genomic structure,
  • DNA replication,
  • nuclear polyribosome interaction, and
  • metabolic control.

In addition, the emergence of methods for

  • copying,
  • removal,
  • insertion,
  • improvements in structural analysis
  • developments in applied mathematics that have transformed the research framework.

Part IIA:

CRACKING THE CODE OF HUMAN LIFE:

Milestones along the Way

A NOVA interview with Francis Collins (NHGRI) (FC), J. Craig Venter (CELERA)(JCV), and Eric Lander (EL).
RK: For the past ten years, scientists all over the world have been painstakingly trying to read the tiny instructions buried inside our DNA. And now, finally, the “Human Genome” has been decoded.
EL: The genome is a storybook that’s been edited for a couple billion years.
The following will address the odd similarity of genes between man and yeast

EL: In the nucleus of your cell the DNA molecule resides that is about 10 angstroms wide curled up, but the amount of curling is limited by the negative charges that repel one another, but there are folds upon folds. If the DNA is stretched the length of the DNA would be thousands of feet.
EL: We have known for 2000 years that your kids look a lot like you. Well it’s because you must pass them instructions that give them the eyes, the hair color, and the nose shape they have. RK: Cracking the code of those minuscule differences in DNA that influence health and illness is what the Human Genome Project is all about. Since 1990, scientists all over the world have been involved in the effort to read all three billion As, Ts, Gs, and Cs of human DNA.  It took 10 years to find the one genetic mistake that causes cystic fibrosis. Another 10 years to find the gene for Huntington’s disease. Fifteen years to find one of the genes that increase the risk for breast cancer. One letter at a time, painfully slowly…     And then came the revolution. In the last ten years the entire process has been computerized. The computations can do a thousand every second and that has made all the difference. EL: This is basically a parts list with a lot of parts. If you take an airplane, a Boeing 777, I think it has like 100,000 parts. If I gave you a parts list for the Boeing 777 in one sense you’d know 100,000 components, screws and wires and rudders and things like that.  But you wouldn’t know how to put it together, or why it flies. We now have a parts list, and that’s not enough to understand why it flies.

The Human Genome

The Human Genome (Photo credit: dullhunk)

A Quest For Clarity

Tracy Vence is a senior editor of Genome Technology
Tracy Vence @GenomeTechMag
Projects supported by the US National Institutes of Health will have produced 68,000 total human genomes — around 18,000 of those whole human genomes — through the end of this year, National Human Genome Research Institute estimates indicate. And in his book, The Creative Destruction of Medicine, the Scripps Research Institute’s Eric Topol projects that 1 million human genomes will have been sequenced by 2013 and 5 million by 2014.
Daniel MacArthur, a group leader in Massachusetts General Hospital’s Analytic and Translational Genetics Unit estimates that “From a capacity perspective … millions of genomes are not that far off. If you look at the rate that we’re scaling, we can certainly achieve that.”    The prospect of so many genomes has brought clinical interpretation into focus. But there is an important distinction to be made between the interpretation of an apparently healthy person’s genome and that of an individual who is already affected by a disease.
In an April Science Translational Medicine paper, Johns Hopkins University School of Medicine‘s Nicholas Roberts and his colleagues reported that personal genome sequences for healthy monozygotic twin pairs are not predictive of significant risk for 24 different diseases in those individuals. The researchers concluded that whole-genome sequencing was not likely to be clinically useful. Ambiguities have clouded even the most targeted interpretation efforts.

  • Technological challenges,
  • meager sample sizes,
  • a need for increased,
  • fail-safe automation and most important
  • a lack of community-wide standards for the task.

have hampered researchers’ attempts to reliably interpret the clinical significance of genomic variation.

How signals from the cell surface affect transcription of genes in the nucleus.
 

James Darnell, Jr., MD, Astor Professor, Rockefeller
After graduation from Washington University School of Medicine he worked with Francois Jacob at the Pasteur Institute in Paris and served as Vice President for Academic Affairs at Rockefeller in 1990-91. He is the coauthor with S.E. Luria of General Virology and the founding author with Harvey Lodish and David Baltimore of Molecular Cell Biology, now in its sixth edition. His book RNA, Life’s Indispensable Molecule was published in July 2011 by Cold Spring Harbor Laboratory Press. A member of the National Academy of Sciences since 1973, recipient of  numerous awards, including the 2003 National Medal of Science, the 2002 Albert Lasker Award.
Using interferon as a model cytokine, the Darnell group discovered that cell transcription was quickly changed by binding of cytokines to the cell surface. The bound interferon led to the tyrosine phosphorylation of latent cytoplasmic proteins now called STATs (signal transducers and activators of transcription) that dimerize by

  • reciprocal phosphotyrosine-SH2 interchange.
  • accumulate in the nucleus,
  • bind DNA and drive transcription.

This pathway has proved to be of wide importance with seven STATs now known in mammals that take part in a wide variety of developmental and homeostatic events in all multicellular animals. Crystallographic analysis defined functional domains in the STATs, and current attention is focused on two areas:

  • how the STATs complete their cycle of  activation and inactivation, which requires regulated tyrosine dephosphorylation; and how
  • persistent activation of STAT3 that occurs in a high proportion of many human cancers contributes to blocking apoptosis in cancer cells.

Current efforts are devoted to inhibiting STAT3 with modified peptides that can enter cells.

Cell cycle regulation and the cellular response to genotoxic stress

Stephen J Elledge, PhD, Gregor Mendel Professor of Genetics and Medicine, Investigator, Howard Hughes Medical Institute, Harvard Medical School
As a postdoctoral fellow at Stanford working on eukaryotic homologous recombination, he serendipitously found a family of genes known as ribonucleotide reductases. He subsequently showed that

  • these genes are activated by DNA damage and
  • could serve as tools to help scientists dissect the signaling pathways
  • through which cells sense and respond to DNA damage and replication stress.

At Baylor College of Medicine he made a second major breakthrough with the discovery of the cyclin-dependent kinase 2 gene (Cdk2), which

  • controls the G1-to-S cell cycle transition,
  • an entry checkpoint for the cell proliferation cycle and
  • a critical regulatory step in tumorigenesis.

From there, using a novel “two-hybrid” cloning method he developed, Elledge and Wade Harper, PhD, proceeded to

  • isolate several members of the Cdk2-inhibitory family.

Their discoveries included the p21 and p57 genes, mutations in the latter (responsible for Beckwith-Wiedemann syndrome), characterized by somatic overgrowth and increased cancer risk. Elledge is also recognized for his work in understanding

  • proteome remodeling through ubiquitin-mediated proteolysis.
  • they identified F-box proteins that regulate protein degradation in the cell by
  1. binding to specific target protein sequences and then
  2. marking them with ubiquitin for destruction by the cell’s proteasome machinery.

This breakthrough resulted in

  • the elucidation of the cullin ubiquitin ligase family,
  • which controls regulated protein stability in eukaryotes.

nature10774-f5.2  nature10774-f3.2   ubiquitin structures  Rn1  Rn2

Elledge’s recent research has focused on the cellular mechanisms underlying DNA damage detection and cancer using genetic technologies. In collaboration with Cold Spring Harbor Laboratory researcher Gregory Hannon, PhD, Elledge has generated complete human and mouse short hairpin RNA (shRNA) libraries for genome-wide loss-of-function studies. Their efforts have led to

  • the identification of a number of tumor suppressor proteins
  • genes upon which cancer cells uniquely depend for survival.

This work led to the development of the “non-oncogene addiction” concept. This is noted as follows:

  • proteome remodeling through ubiquitin-mediated proteolysis
  • F-box proteins regulate protein degradation in the cell by binding to specific target protein sequences
  • and then marking them with ubiquitin for destruction by the cell’s proteasome machinery
  • elucidation of the cullin ubiquitin ligase family, which controls regulated protein stability in eukaryotes

Playing the dual roles of inventor and investigator, Elledge developed original techniques to define

  • what drives the cell cycle and
  • how cells respond to DNA damage.

By using these tools, he and his colleagues have identified multiple genes involved in cell-cycle regulation.

Elledge’s work has earned him many awards, including a 2001 Paul Marks Prize for Cancer Research and a 2003 election to the National Academy of Sciences. In his Inaugural Article (1), published in this issue of PNAS, Elledge and his colleagues describe the function of Fbw7, a protein involved in controlling cell proliferation (see below). Elledge studied the error-prone DNA repair mechanism in E-Coli (Escherichia coli) called SOS mutagenesis for his PhD thesis at MIT. His work identified  and described

  • the regulation of a group of enzymes now known as error-prone polymerases,
  • the first members of which were the umuCD genes in E. coli.

It was then that he developed a new cloning tool. Elledge invented a technique that allowed him to approach future cloning problems of this type with great rapidity. With the new technique, “you could make large libraries in lambda that behave like plasmids. We called them `phasmid’ vectors, like plasmid and phage together”. The phasmid cloning method was an early cornerstone for molecular biology research.

Elledge began working on homologous recombination in postdoctoral fellowship at Stanford University, an important niche in the field of eukaryotic genetics. Working with the yeast genome, Elledge searched for rec A, a gene that allows DNA to recombine homologously. Although he never located rec A, he discovered a family of genes known as ribonucleotide reductases (RNRs), which are involved in DNA production. Rec A and RNRs share the same last 4 amino acids, which caused an antibody crossreaction in one of Elledge’s experiments. Initially disappointed with the false positives in his hunt for rec A, Elledge was later delighted with his luck. He found that

  • RNRs are turned  on by DNA damage, and
  • these genes are regulated by the cell cycle.

Prior to leaving Stanford, Elledge attended a talk at the University of California, San Francisco, by Paul Nurse, a leader in cell-cycle research who would later win the 2001 Nobel Prize in medicine. Nurse described his success in isolating the homolog of a key human cell-cycle kinase gene, Cdc2, by using a mutant strain of yeast (8). Although Nurse’s methods were primitive, Elledge was struck by the message he carried: that

  • cell-cycle regulation was functionally conserved, and
  • many human genes could be isolated by looking for complimentary genes in yeast.

Elledge then took advantage of his past successes in building phasmid vectors to build a versatile human cDNA library that could be expressed in yeast. After setting up a laboratory at Baylor, he introduced this library into yeast, screening for complimentary cell-cycle genes.  He quickly identified the same Cdc2 gene isolated by Nurse. However, Elledge also discovered a related gene known as Cdk2. Elledge subsequently found that

  • Cdk2 controlled the G1 to S cell-cycle transition, a step that often goes awry in cancer. These results were published in the EMBO Journal in 1991.

He then continued to use

  • RNRs to perform genetic screens to
  • identify genes involved in sensing and responding to DNA damage.

He subsequently worked out the

  • signal transduction pathways in both yeast and humans that recognize damaged DNA and replication problems.

These “checkpoint” pathways are central to the

  • prevention of genomic instability and a key to understanding tumorigenesis.

This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected on April 29, 2003.

Defective cardiovascular development and elevated cyclin E and Notch proteins in mice lacking the Fbw7 F-box protein.

Tetzlaff MT, Yu W, Li M, Zhang P, Finegold M, Mahon K , Harper JW, Schwartz RJ, and SJ Elledge. PNAS 2004; 101(10): 3338-3345. cgi doi 10.1073.  pnas.0307875101

The mammalian F-box protein Fbw7 and its Caenorhabditis elegans counterpart Sel-10 have been implicated in

  • the ubiquitin-mediated turnover of cyclin E
  • as well as the Notch Lin-12 family of transcriptional activators. Both unregulated
  1. Notch and cyclin E
  2. promote tumorigenesis, and
  3. inactivate mutations in human

Fbw7 studies suggest that it may be a tumor suppressor. To generate an in vivo system to assess the consequences of such unregulated signaling, we generated mice deficient for Fbw7.  Fbw7-null mice die around 10.5 days post coitus because of a combination of deficiencies in hematopoietic and vascular development and heart chamber mutations. The absence of Fbw7 results in elevated levels of cyclin E, concurrent with inappropriate DNA replication in placental giant trophoblast cells. Moreover, the levels of both Notch 1 and Notch 4 intracellular domains were elevated, leading to stimulation of downstream transcriptional pathways involving Hes1, Herp1, and Herp2. These data suggest essential functions for Fbw7 in controlling cyclin E and Notch signaling pathways in the mouse.

Science as an Adventure

Ubiquitins

Prof. Avram Hershko – Science as an Adventure
Prof. Avram Hershko shared the 2004 Nobel Prize in Chemistry with Aaron Ciechanover and Irwin Rose for “for the discovery of ubiquitin-mediated protein degradation.”

http://www.youtube.com/watch?v=lGJvsmG3mhw&feature=player_detailpage&list=EC8814C902ACB98559

Gene Switches

Nipam Patel is a professor in the Departments of Molecular and Cell Biology and Integrative Biology at UC Berkeley and runs a research laboratory that studies the role, during embryonic development, of homeotic genes (the genetic switches described in this feature). “Ghost in Your Genes” focuses on epigenetic “switches” that turn genes “on” or “off.” But not all switches are epigenetic; some are genetic. That is, other genes within the chromosome turn genes on or off. In an animal’s embryonic stage, these gene switches play a predominant role in laying out the animal’s basic body plan and perform other early functions;

  • the epigenome begins to take over during the later stages of embryogenesis.

Beginning as a fertilized single egg that egg becomes many different kinds of cells.  Altogether, multicellular organisms like humans have thousands of differentiated cells. Each is optimized for use in the brain, the liver, the skin, and so on. Remarkably, the DNA inside all these cells is exactly the same. What makes the cells differ from one another is that different genes in that DNA are either turned on or off in each type of cell.

Take a typical cell, such as a red blood cell. Each gene within that cell has a coding region that encodes the information used to make a particular protein. (Hemoglobin shuttles oxygen to the tissues and carbon dioxide back out to the lungs—or gills, if you’re a fish.) But another region of the gene, called “regulatory DNA,” determines whether and when the gene will be expressed, or turned on, in a particular kind of cell. This precise transcribing of genes is handled by proteins known as transcription factors, which bind to the regulatory DNA, thereby generating instructions for the coding region.

One important class of transcription factors is encoded by the so called homeotic, or Hox, genes. Found in all animals, Hox genes act to “regionalize” the body along the embryo’s anterior-to-posterior (head-to-tail) axis. In a fruit fly, for example, Hox genes lay out the various main body segments—the head, thorax, and abdomen. Amazingly, all animals, from fruit flies to mice to people, rely on the same basic Hox-gene complex. Using different-colored antibody stains, we can see exactly where and to what degree Hox genes are expressed. Each Hox gene is expressed in a specific region along the anterior-to-posterior axis of the embryo.

A fly’s body has three main divisions: head, thorax, and abdomen. We’ll focus on the thorax, which itself has three main segments. In a normal adult fly, the second thoracic segment features a pair of wings, while the third thoracic segment has a pair of small, balloon-shaped structures called halteres. A modified second wing, the haltere serves as a flight stabilizer. In order for the pair of wings and the pair of halteres (as well as all other parts of the fly) to develop properly, the fly’s suite of

  • Hox genes must be expressed in a precise way and at precise times.

During development, the fly’s two wings grow from a structure in the larva known as the wing imaginal disk. (An imago is an insect in its final, adult state.) The haltere grows from the larval haltere imaginal disk. Remember the Ubx Hox gene? Using staining again, we can detect the gene product of Ubx. This reveals that

  • the Ubx gene is naturally “off” in the wing disk—
  • and is “on” in the haltere disk.
  • Now you’ll see what happens when the Ubx gene—just one of a large number of Hox genes—is turned off in the haltere disk. What if a genetic mutation caused the Ubx gene to be turned off, during the larval stage, in the third thoracic segment, the segment that normally produces the haltere? Instead of a pair of halteres, the fly has a second set of wings. With the switch of that single Hox gene, Ubx, from on to off, the third thoracic segment becomes an additional second thoracic segment and the pair of halteres became a second pair of wings. This illustrates the remarkable ability of transcription factors like Ubx to control patterning as well as cell type during development.

ENCODE

A. Data Suggests “Gene” Redefinition

As part of a huge collaborative effort called ENCODE (Encyclopedia of DNA Elements), a research team led by Cold Spring Harbor Laboratory (CSHL) Professor Thomas Gingeras, PhD, publishes a genome-wide analysis of RNA messages, called transcripts, produced within human cells.
Their analysis—one component of a massive release of research results by ENCODE teams from 32 institutes in 5 countries, with 30 papers appearing in 3 different high-level scientific journals—shows that three-quarters of the genome is capable of being transcribed.  This indicates that nearly all of our genome is dynamic and active.  It stands in marked contrast to consensus views prior to ENCODE’s comprehensive research efforts, which suggested that

  • only the small protein-encoding fraction of the genome was transcribed.

The vast amount of data generated with advanced technologies by Gingeras’ group and others in the ENCODE project changes the prevailing understanding of what defines a gene. The current outstanding question concerns

  • the nature and range of those functions.  It is thought that these
  • “non-coding” RNA transcripts act something like components of a giant, complex switchboard, controlling a network of  many events in the cell by
  1. regulating the processes of
  2. replication,
  3. transcription
  4. and translation

– that is, the copying of DNA and the making of proteins is based on information carried by messenger RNAs.  With the understanding that so much of our DNA can be transcribed into RNA comes the realization that there is much less space between what we previously thought of as genes, Gingeras points out.

The full ENCODE Consortium data sets can be freely accessed through

  • the ENCODE project portal as well as at the University of California at Santa Cruz genome browser,
  • the National Center for Biotechnology Information, and
  • the European Bioinformatics Institute.

Topic threads that run through several different papers can be explored via the ENCODE microsite page at http://Nature.com/encode.    Date: September 5, 2012   Source: Cold Spring Harbor Laboratory

1000 Genomes Project Team Reports on Variation Patterns

(from Phase I Data) October 31, 2012 GenomeWeb

In a study appearing online today in Nature, members of the 1000 Genomes Project Consortium presented an integrated haplotype map representing the genomic variation present in more than 1,000 individuals from 14 human populations.  Using data on 1,092 individuals tested by

  • low-coverage whole-genome sequencing,
  • deep exome sequencing, and/or
  • dense genotyping,

the team looked at the nature and extent of the rare and common variation present in the genomes of individuals within these populations. In addition to population-specific differences in common variant profiles, for example, the researchers found distinct rare variant patterns within populations from different parts of the world — information that is expected to be important in interpreting future disease studies. They also encountered a surprising number of the variants that are expected to impact gene function, such as

  • non-synonymous changes,
  • loss-of-function variants, and, in some cases,
  • potentially damaging mutations.

ENCODE was designed to pick up where the Human Genome Project left off.
Although that massive effort revealed the blue­print of human biology, it quickly became clear that the instruction manual for reading the blueprint was sketchy at best. Researchers could identify in its 3 billion letters many of the regions that code for proteins, but they make up little more than 1% of the genome, contained in around 20,000 genes. ENCODE, which started in 2003, is a massive data-collection effort designed to catalogue the

  • ‘functional’ DNA sequences,
  • learn when and in which cells they are active and
  • trace their effects on how the genome is
  1. packaged,
  2. regulated and
  3. read.

After an initial pilot phase, ENCODE scientists started applying their methods to the entire genome in 2007. That phase came to a close with the publication of 30 papers, in Nature, Genome Research and Genome Biology. The consortium has assigned some sort of function to roughly 80% of the genome, including

  • more than 70,000 ‘promoter’ regions — the sites, just upstream of genes, where proteins bind to control gene expression —
  • and nearly 400,000 ‘enhancer’ regions that regulate expression of  distant genes (see page 57)1. But the job is far from done.

Junk DNA? What Junk DNA?

New data reveals that at least 80% of the human genome encodes elements that have some sort of biological function. [© Gernot Krautberger – Fotolia.com] Far from containing vast amounts of junk DNA between its protein-coding genes, at least 80% of the human genome encodes elements that have some sort of biological function, according to newly released data from the Encyclopedia of DNA Elements (Encode) project, a five-year initiative that aims to delineate all functional elements within human DNA. The massive international project, data from which are published in 30 different papers in Nature, Genome Research, Genome Biology, the Journal of Biological Chemistry, Science, and Cell, has identified four million gene switches, effectively

  • regulatory regions in the genome where
  • proteins interact with the DNA to control gene expression.

Overall, the Encode data define regulatory switches that are scattered all over the three billion nucleotides of the genome. In fact, the data suggests,

  • the regions that lie between gene-coding sequences contain a wealth of previously unrecognized functional elements,Including
  • nonprotein-coding RNA transcribed sequences,
  • transcription factor binding sites,
  • chromatin structural elements, and
  • DNA methylation sites.

The combined results suggest that 95% of the genome lies within 8 kb of a DNA-protein interaction, and 99% lies within 1.7 kb of at least one of the biochemical events, the researchers say. Importantly, given the complex three-dimensional nature of DNA, it’s also apparent that

  • a regulatory element for one gene may be located quite some ‘linear’ distance from the gene itself.

“The information processing and the intelligence of the genome reside in the regulatory elements,” explains Jim Kent, director of the University of California, Santa Cruz Genome Browser project and head of the Encode Data Coordination Center. “With this project, we probably went from understanding less than 5% to now around 75% of them.”
The ENCODE results also identified SNPs within regulatory regions that are associated with a range of diseases, providing new insights into the roles that

  • noncoding DNA plays in disease development.

“As much as nine out of 10 times, disease-linked genetic variants are not in protein-coding regions,” comments Mike Pazin, Encode program director at the National Human Genome Research Institute.  “Far from being junk DNA, this regulatory DNA clearly makes important contributions to human disease.”

Other Related Articles on this Open Access Online Scientific Journal, include the following: 

 

Big Data in Genomic Medicine LHB

http://pharmaceuticalintelligence.com/2012/12/17/big-data-in-genomic-medicine/

BRCA1 a tumour suppressor in breast and ovarian cancer – functions in transcription, ubiquitination and DNA repair S Saha
http://pharmaceuticalintelligence.com/2012/12/04/brca1-a-tumour-suppressor-in-breast-and-ovarian-cancer-functions-in-transcription-ubiquitination-and-dna-repair/

Computational Genomics Center: New Unification of Computational Technologies at Stanford A Lev-Ari
http://pharmaceuticalintelligence.com/2012/12/03/computational-genomics-center-new-unification-of-computational-technologies-at-stanford/

Personalized medicine gearing up to tackle cancer ritu saxena
http://pharmaceuticalintelligence.com/2013/01/07/personalized-medicine-gearing-up-to-tackle-cancer/

Differentiation Therapy – Epigenetics Tackles Solid Tumors sj Williams
http://pharmaceuticalintelligence.com/2013/01/03/differentiation-therapy-epigenetics-tackles-solid-tumors/

Mechanism involved in Breast Cancer Cell Growth: Function in Early Detection & Treatment A Lev-Ari
http://pharmaceuticalintelligence.com/2013/01/17/mechanism-involved-in-breast-cancer-cell-growth-function-in-early-detection-treatment/

The Molecular pathology of Breast Cancer Progression tilde barliya`
http://pharmaceuticalintelligence.com/2013/01/10/the-molecular-pathology-of-breast-cancer-progression/

Paradigm Shift in Human Genomics – Predictive Biomarkers and Personalized Medicine – Part 1 (pharmaceuticalintelligence.com) A Lev-Ari

http://pharmaceuticalintelligence.com/2013/01/13/paradigm-shift-in-human-genomics-predictive-biomarkers-and-personalized-medicine-part-1/

LEADERS in Genome Sequencing of Genetic Mutations for Therapeutic Drug Selection in Cancer Personalized Treatment: Part 2 A Lev-Ari
http://pharmaceuticalintelligence.com/2013/01/13/leaders-in-genome-sequencing-of-genetic-mutations-for-therapeutic-drug-selection-in-cancer-personalized-treatment-part-2/

Personalized Medicine: An Institute Profile – Coriell Institute for Medical Research: Part 3 A Lev-Ari
http://pharmaceuticalintelligence.com/2013/01/13/personalized-medicine-an-institute-profile-coriell-institute-for-medical-research-part-3/

Harnessing Personalized Medicine for Cancer Management, Prospects of Prevention and Cure: Opinions of Cancer Scientific Leaders @ http://pharmaceuticalintelligence.com ALA
http://pharmaceuticalintelligence.com/2013/01/13/7000/Harnessing Personalized Medicine for Cancer Management, Prospects of Prevention and Cure: Opinions of Cancer Scientific Leaders/

GSK for Personalized Medicine using Cancer Drugs needs Alacris systems biology model to determine the in silico effect of the inhibitor in its “virtual clinical trial” A Lev-Ari
http://pharmaceuticalintelligence.com/2012/11/14/gsk-for-personalized-medicine-using-cancer-drugs-needs-alacris-systems-biology-model-to-determine-the-in-silico-effect-of-the-inhibitor-in-its-virtual-clinical-trial/

Recurrent somatic mutations in chromatin-remodeling and ubiquitin ligase complex genes in serous endometrial tumors S Saha
http://pharmaceuticalintelligence.com/2012/11/19/recurrent-somatic-mutations-in-chromatin-remodeling-and-ubiquitin-ligase-complex-genes-in-serous-endometrial-tumors/

Personalized medicine-based cure for cancer might not be far away ritu saxena
http://pharmaceuticalintelligence.com/2012/11/20/personalized-medicine-based-cure-for-cancer-might-not-be-far-away/

Human Variome Project: encyclopedic catalog of sequence variants indexed to the human genome sequence A Lev-Ari
http://pharmaceuticalintelligence.com/2012/11/24/human-variome-project-encyclopedic-catalog-of-sequence-variants-indexed-to-the-human-genome-sequence/

Prostate Cancer Cells: Histone Deacetylase Inhibitors Induce Epithelial-to-Mesenchymal Transition sjwilliams
http://pharmaceuticalintelligence.com/2012/11/30/histone-deacetylase-inhibitors-induce-epithelial-to-mesenchymal-transition-in-prostate-cancer-cells/

Inspiration From Dr. Maureen Cronin’s Achievements in Applying Genomic Sequencing to Cancer Diagnostics A Lev-Ari
http://pharmaceuticalintelligence.com/2013/01/10/inspiration-from-dr-maureen-cronins-achievements-in-applying-genomic-sequencing-to-cancer-diagnostics/

The “Cancer establishments” examined by James Watson, co-discoverer of DNA w/Crick, 4/1953 A Lev-Ari
http://pharmaceuticalintelligence.com/2013/01/09/the-cancer-establishments-examined-by-james-watson-co-discover-of-dna-wcrick-41953/

Directions for genomics in personalized medicine lhb
http://pharmaceuticalintelligence.com/2013/01/27/directions-for-genomics-in-personalized-medicine/

How mobile elements in “Junk” DNA promote cancer. Part 1: Transposon-mediated tumorigenesis. SJwilliams
http://pharmaceuticalintelligence.com/2012/10/31/how-mobile-elements-in-junk-dna-prote-cancer-part1-transposon-mediated-tumorigenesis/

Mitochondria: More than just the “powerhouse of the cell” eritu saxena
http://pharmaceuticalintelligence.com/2012/07/09/mitochondria-more-than-just-the-powerhouse-of-the-cell/

Mitochondrial fission and fusion: potential therapeutic targets? Ritu saxena
http://pharmaceuticalintelligence.com/2012/10/31/mitochondrial-fission-and-fusion-potential-therapeutic-target/

Mitochondrial mutation analysis might be “1-step” away ritu saxena
http://pharmaceuticalintelligence.com/2012/08/14/mitochondrial-mutation-analysis-might-be-1-step-away/

mRNA interference with cancer expression lhb
http://pharmaceuticalintelligence.com/2012/10/26/mrna-interference-with-cancer-expression/

Expanding the Genetic Alphabet and linking the genome to the metabolome LHB
http://pharmaceuticalintelligence.com/2012/09/24/expanding-the-genetic-alphabet-and-linking-the-genome-to-the-metabolome/

Breast Cancer, drug resistance, and biopharmaceutical targets lhb
http://pharmaceuticalintelligence.com/2012/09/18/breast-cancer-drug-resistance-and-biopharmaceutical-targets/

Breast Cancer: Genomic profiling to predict Survival: Combination of Histopathology and Gene Expression Analysis A Lev-Ari
http://pharmaceuticalintelligence.com/2012/12/24/breast-cancer-genomic-profiling-to-predict-survival-combination-of-histopathology-and-gene-expression-analysis/

Gastric Cancer: Whole-genome reconstruction and mutational signatures A Lev-Ari
http://pharmaceuticalintelligence.com/2012/12/24/gastric-cancer-whole-genome-reconstruction-and-mutational-signatures-2/

Ubiquinin-Proteosome pathway, autophagy, the mitochondrion, proteolysis and cell apoptosis lhb
http://pharmaceuticalintelligence.com/2012/10/30/ubiquinin-proteosome-pathway-autophagy-the-mitochondrion-proteolysis-and-cell-apoptosis/

Genomic Analysis: FLUIDIGM Technology in the Life Science and Agricultural Biotechnology A Lev-Ari
http://pharmaceuticalintelligence.com/2012/08/22/genomic-analysis-fluidigm-technology-in-the-life-science-and-agricultural-biotechnology/

Reveals from ENCODE project will invite high synergistic collaborations to discover specific targets A. Sarkar

http://pharmaceuticalintelligence.com/2012/09/30/reveals-from-encode-project-will-lead-to-confusion-or-specific-target/

ENCODE: the key to unlocking the secrets of complex genetic diseases R. Saxena

http://pharmaceuticalintelligence.com/2012/09/26/encode-the-key-to-unlocking-the-secrets-of-complex-genetic-diseases/

Impact of evolutionary selection on functional regions: The imprint of evolutionary selection on ENCODE regulatory elements is manifested between species and within human populations s Saha

http://pharmaceuticalintelligence.com/2012/09/20/impact-of-evolutionary-selection-on-functional-regions-the-imprint-of-evolutionary-selection-on-encode-regulatory-elements-is-manifested-between-species-and-within-human-populations/

ENCODE Findings as Consortium A Lev-Ari

http://pharmaceuticalintelligence.com/2012/09/10/encode-findings-as-consortium/

Genomics Orientations for Personalized Medicine SJH, ALA, LHB

http://pharmaceuticalintelligence.com/biomed-e-books/genomics-orientations-for-personalized-medicine/

2013 Genomics: The Era Beyond the Sequencing of the Human Genome: Francis Collins, Craig Venter, Eric Lander, et al.

http://pharmaceuticalintelligence.com/2013/02/11/2013-genomics-the-era-beyond-the-sequencing-human-genome-francis-collins-craig-venter-eric-lander-et-al/

 Related Articles

Read Full Post »

« Newer Posts - Older Posts »