The Human Proteome Map Completed
Reporter and Curator: Larry H. Bernstein, MD, FCAP
Researchers Produce First Map of Human Proteome, and Reveal New
Significance in The Human Proteome
HAHNE, TECHNISCHE UNIVERSITÄT MÜNCHENTwo international teams have
independently produced the first drafts of the human proteome. These curated
catalogs of the proteins expressed in most non-diseased human tissues and
organs can be used as a baseline to better understand changes that occur in
disease states. Their findings were published today (May 29) in Nature.
Both teams uncovered new complexities of the human genome, identifying novel
proteins from regions of the genome previously thought to be non-coding.
“the real breakthrough with these two projects is the comprehensive coverage of
more than 80 percent of the expected human proteome” said Hanno Steen, director
of proteomics at Boston Children’s Hospital, who was not involved in the work.
The human proteome map provides a catalog of proteins expressed in nondiseased tissues and organs to use as baseline in understanding changes that occur in disease
Given the growing importance of proteins in medical laboratory testing,
- pathologists will want to know that drafts of the complete human proteome
- have been released to the public.
Experts are comparing this to the first complete map of the human genome
- and this information provides for rapid advances
- in understanding transcriptomics and metabolomics
Map of Human Proteome Expected to Advance Medical Science
“Housekeeping genes” that are expressed in all tissues and cell types
- have been thought to be involved in basic cellular functions.
Two teams developing a Human Proteome Map
- detected proteins encoded by 2,350 genes
- across all human cells and tissues.
The corresponding housekeeping proteins comprised
about 75% of total protein mass.
- histones,
- ribosomal proteins,
- metabolic enzymes, and
- cytoskeletal proteins
The two international teams produced
- the first drafts of the human protoeome,
- a catalog of proteins expressed in most
- nondiseased human issues and organs.
The evidence suggests there is translation from DNA regions
- that were not thought to be translated—including
- more than 400 translated long, intergenic non-coding RNAs (lincRNAs)—
found by the Küster team—and - 193 new proteins—uncovered by the Pandey team.
This proteome map can be used as a baseline to understand
- changes that occur in the disease state
These studies are part of the Human Proteome Project,
- an international effort by the Human Proteome Organization
- to revolutionize our understanding of the human proteome
- by coordinating research at laboratories around the world directed
- at mapping the entire human proteome.
This new information about the human proteome
- is expected to trigger rapid advances in medical science
- and a better understanding of the underlying causes of human diseases.
One Study Team Was at Johns Hopkins University
- In one study, which was headed by Ahilesh Pandey, M.D.,
at Johns Hopkins University in Baltimore, - and colleague Harsha Gowda, Ph.D.,
of the Institute of Bioinformatics in Bangalore, India, - the research team used an advanced form of mass spectrometry to analyze proteins
- to create the human proteome map,
according to a report published in NIH Research Matters.
The research team examined
- 30 normal human tissue and cell types:
- 17 adult tissues,
- 7 fetal tissue and
- 6 blood cell types.
Samples from three people per tissue type
- were processed through several steps.
The protein fragments, or peptides, were analyzed on
- high-resolution Fourier-transform mass spectrometers.
The amino acid sequences were
- then compared to known sequences.
Their results were published in the May 28, 2014, issue of Nature.
The resulting draft map of the human proteome map includes
- proteins encoded by more than 17,000 genes,
- noted the Research Matters article.
Among these are hundreds of proteins from regions
- previously thought to be non-coding.
This study also provided a new understanding of
- how genes are expressed.
For example, almost 200 genes begin in locations
- other than those predicted based on genetic sequence.
“The fact that 193 of the proteins came from DNA sequences
- predicted to be non-coding means that
- we don’t fully understand how cells read DNA,
- since the sequences code for proteins
This study also produced the Human Proteome Map,
- an interactive online portal.
This can be accessed at this link.
The study data will soon be accessible through
German’s ProteomicsDB Analyzed a Mix of Available and New Tissue Data
The other study was conducted by a team lead by Bernhard Küster
of the Technische Universität München in Germany.
Küster and his colleagues created a
- searchable,
- public database called
- ProteomicsDB.
This database contains 92% of the
- estimated 19,629 human proteins,
noted The Scientist article.
Küster’s team also used mass spectrometry
- to analyze human tissue samples.
This team’s approach differed from Johns Hopkins’ in that
- it compiled about 60% of the information
- in the ProteomicsDB database
- by using existing raw mass spec (MS) data
- from databases and colleagues’ contributions.
To fill data gaps, the Küster lab generated its own
MS data after analyzing
- 60 human tissues,
- 13 body fluids, and
- 147 cancer cell lines.
High-resolution public data
- was selected and computationally processed
- for strict quality
The database for ProteomicsDB is
- public and searchable.
It can be accessed at this link.
German Study Added New Insights to Transcription Process
Comparing the ratio of protein to mRNA levels for every protein globally,
- the Küster lab found that the translation rate
- is a constant feature of each mRNA transcript.
The proteomics community has viewed
- transcriptome and proteome data as two sides of a coin.
But this analysis shows that at least, at steady state,
- once the ratio for an mRNA/protein pair has been calculated,
- protein levels can be determined
- just from specific mRNA levels.
Proteomics researchers in Toronto maintaining ionic balance and in Boston commented on the
importance of the findings, even a “new paradigm” because of
- the fixed ratio of protein to mRNA
This is quite in keeping with what we have been learning
- with respect to homeostasis.
In 2003, the Human Genome Project created a
- draft map of the human genome—
- all the genes in the human body.
Genomics has since driven many advances in medical science.
This was a progress from the classic discovery of Watson and Crick –
- the classical dogma holds that
- DNA makes RNA makes protein.
- no constraints are place on this
But the cell is functioning in contact with other cells,
- immersed in interstitial fluid
- maintaining cationic and anionic balance
- and mitochondrial energy balance and ubiquitin systems interact
- and protein interacts with the chromatin and transcriptional RNA
So the restriction that has been discovered has credence,
- the classical diagram has to be redrawn
Deeper Knowledge of Proteome to Improve Diagnostics and Therapeutics
In the two projects is:
- the comprehensive coverage of more than 80% of
- the expected human proteome,
These studies indicate that to get to
- a deep level of proteome coverage,
- many different tissue types must be probed.
the studies are complimentary.
- The Hopkins group provided a survey of human proteins from a single source, which allows for easy comparisons within their data.
- The ProteomeDB effort connected new information with existing data
A deeper knowledge of the human proteome could help
- fill the gap between genomes and phenotypes.
As this occurs, it has the potential to transform
- the way diagnostics and therapeutics are developed,
- enhancing overall biomedical research and healthcare,
it was noted in a report presented to scientific leaders at a NIH workshop
- on advances in proteomics and its applications.
Having completed a draft map of the human proteome—
the set of all proteins in the human body
- It opens another window to cell function.
It has been ASSUMED –
- genes control the most basic functions of the cell,
- including what proteins to make and when.
- but we have assumed for too much in assigning
full control to the genome
Researchers have identified more than 20,000 protein- coding genes.
However, scientific understanding of the proteome has
- lagged behind that of the genome,
- partly because of the proteome’s complexities.
The relationship between genes and proteins isn’t a simple matter of
- one gene coding for one protein.
Stretches of DNA can be read and translated
- into proteins in different ways.
Proteins are also more difficult to sequence than genes.
The importance of these latest studies to pathologists and Ph.D.s working
- in molecular diagnostics laboratories is that
- this information will expedite further research into the human proteome.
Such research is expected to lead to
- novel methods of diagnosis and complex
- “multi-analyte” clinical laboratory tests that
- look for multiple proteins in a single assay.
“The prevalent view was that information transfer was from genome to transcriptome to proteome.
What these efforts show is that it’s a two-way road— proteomics can be used to annotate the genome.
The importance is that, using these datasets, we can improve the annotation of the genome and the
algorithms that predict transcription and translation,” said Steen. “The genomics field can now hugely
benefit from proteomics data.”
Wilhelm et al., “Mass-spectrometry- based draft of the human proteome,”
Nature, http://dx.doi.doi:/10.1038/nature13319, 2014
M.S. Kim et al. “A draft map of the human proteome,”
Nature, http://dx.doi.org:/10.1038/nature13302, 2014.
Tags
proteomics, noncoding RNA, human research, human proteome project, human genetics and genomics
http://www.the-scientist.com/?articles.view/articleNo/40083/title/Human-Proteome-Mapped/
__Patricia Kirk
__by Harrison Wein, Ph.D.
__by Anna Azvolinsky
Related Information:
- Finding Treasure in “Junk” DNA:
http://www.nih.gov/researchmatters/september2012/09242012junk.htm
- All About The Human Genome Project:
http://www.genome.gov/10001772 - What is Proteomics?:
http://proteomics.cancer.gov/whatisproteomics - Human Proteome Map:
http://www.humanproteomemap.org - ProteomicsDB:
https://www.proteomicsdb.org
Reference: A draft map of the human proteome.
Kim MS, Pinto SM, Getnet D, Nirujogi RS, Manda SS, Donahue CA, Gowda H, Pandey A.
Nature. 2014 May 29;509(7502):575-81. http://dx.doi.org:/10.1038/nature13302. PMID: 24870542
Funding: NIH’s National Institute of General Medical Sciences (NIGMS), National Cancer Institute (NCI),
and National Heart, Lung, and Blood Institute (NHLBI); the Sol Goldman Pancreatic Cancer Research Center;
India’s Council of Scientific and Industrial Research; and Wellcome Trust/DBT India Alliance.
http://nihprod.cit.nih.gov/researchmatters/june2014/06092014proteome.htm
Leave a Reply