Posts Tagged ‘clinical genomics’

Personalized Medicine, Omics, and Health Disparities in Cancer: Can Personalized Medicine Help Reduce the Disparity Problem?

Posted in and Bioethics, Big Data, Bio-Ethics, BioBanking, BioIT: BioInformatics, BioIT: BioInformatics, NGS, Clinical & Translational, Pharmaceutical R&D Informatics, Clinical Genomics, Cancer Informatics, Biomarkers & Medical Diagnostics, Breast Cancer - impalpable breast lesions, Cancer - General, Cancer and Current Therapeutics, CANCER BIOLOGY & Innovations in Cancer Therapy, Cancer Genomics, Cancer Informatics, Cancer Prevention: Research & Programs, Cancer Screening, Centers for Medicare & Medicaid Services, Funding Opportunities for Cancer Research, Health Care System by Country, Health Economics and Outcomes Research, Health Law & Patient Safety, HealthCare IT, Healthcare Reform, interventional oncology, TP53 - Germline mutations, Women Health, tagged #endcancer, African diaspora, breast cancer, Cancer, cancer disparities, clinical genomics, diagnostic genomics, health disparities, men's health, ovarian cancer, sub Saharan, womens' health on April 19, 2020


Curator: Stephen J. Williams, PhD

UPDATED: 5/24/2026

Europe’s Medicine Access Crisis: What the Latest Data Means for Lung Cancer

This week, EFPIA published its annual WAIT Report. It tracks how quickly medicines approved by the European Medicines Agency reach people across EU member states. The 2025 data, covering 168 medicines approved between 2021 and 2024, makes for difficult reading.

Nearly half of those medicines, 49%, are not available to people in Europe. That figure has risen from 46% in 2019. The share of medicines fully available on public reimbursement lists has fallen from 42% in 2019 to 28% in 2025. A further 17% are only available under restricted conditions, up from 6% six years ago.

The median time from European marketing authorisation to availability across the continent is 532 days. In Germany, the median wait is 56 days. In Romania, it is 1,201 days. That is a difference of more than three years, for the same medicines, approved by the same regulatory body, for people living on the same continent.
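To make the size of that gap concrete, the arithmetic can be sketched in a few lines of Python. The figures are the medians quoted above; the 365.25-day year and all names in the code are assumptions of this sketch, not part of the EFPIA report.

```python
# Median days from EU marketing authorisation to availability (figures quoted above).
MEDIAN_WAIT_DAYS = {"Germany": 56, "Romania": 1201, "Europe": 532}

DAYS_PER_YEAR = 365.25  # assumption: average length of a calendar year


def wait_gap_years(slow: str, fast: str) -> float:
    """Difference in median waiting time between two entries, in years."""
    gap_days = MEDIAN_WAIT_DAYS[slow] - MEDIAN_WAIT_DAYS[fast]
    return gap_days / DAYS_PER_YEAR


gap = wait_gap_years("Romania", "Germany")
print(f"Romania waits about {gap:.1f} years longer than Germany")
```

The difference is 1,145 days, which is where the "more than three years" in the text comes from.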

For oncology, the trend is moving in the wrong direction. The average time to availability for cancer medicines has increased year on year. EFPIA’s analysis also flags a declining trend in the number of FDA-approved medicines subsequently approved by the EMA, with a particularly steep drop since October 2025.

These are system-level statistics. But behind every number is a person whose treatment options are shaped not by what science has made possible, but by where in Europe they happen to live.

What this means for lung cancer

Lung cancer is the leading cause of cancer-related death in Europe. It is also one of the most rapidly evolving areas of oncology. New treatment options, particularly in molecularly defined subtypes, have transformed outcomes for some groups over the past decade. The pipeline continues to produce results that would have been unimaginable ten years ago.

Yet the treatments that reach people in countries with faster, better-resourced reimbursement systems are the same treatments that remain unavailable or restricted for years in others. EFPIA’s data shows that Germany had 156 of the 168 tracked medicines available. Malta had 22. The gap between the best and worst performing countries in terms of availability was 88%.

This is not a question of what medicine can offer. It is a question of whether health systems, reimbursement frameworks, and pricing negotiations are structured in a way that gets treatments to the people who need them. As Lung Cancer Europe President Debra Montague wrote in her 2025-2026 Annual Report, “it has become clearer that these advancements are not reaching everyone equally.”

What people impacted by lung cancer are telling us

Access to approved medicines is one dimension of a broader picture. Lung Cancer Europe’s own research, drawn from surveys of more than 2,000 people impacted by lung cancer across 34 European countries, consistently shows that the challenges people face extend well beyond whether a medicine has been reimbursed.

Our 9th Report, published in November 2024, found that 40% of respondents did not receive enough information about their diagnosis, treatment, and care. Half received no information about alternative treatment options at all. Nearly nine in ten sought information outside the healthcare system, and one in four either could not find what they needed or encountered information that was inaccurate or unproven. Complex information was identified as the single biggest barrier to meaningful participation in treatment decisions.

Our 11th Report, launched in March 2026, focused on mental health. An average of 89.3% of all respondents experienced significant emotional difficulties following diagnosis. 31% received no mental health support at any point during their care. The burden falls unevenly. People living with small cell lung cancer reported the highest levels of distress, with 66.2% reporting a negative impact on their mental health. 74.3% were never referred to a patient organisation by their healthcare provider. People who had not undergone biomarker testing reported the poorest mental health outcomes of any group surveyed.

There is also a clear geographic dimension to mental health outcomes, one that maps closely onto the geographic inequality in treatment access that the EFPIA data describes. Respondents in Italy, Switzerland, and Denmark reported higher mental health scores. Those in Slovenia, Ukraine, and Greece reported the greatest negative psychological impact.

The pattern is consistent. Where people live in Europe determines not only which medicines are available to them, but the quality of information they receive, the extent to which they are involved in decisions about their own care, and the likelihood that their psychological wellbeing will be supported throughout the course of their illness.

What needs to change

The EFPIA report identifies the causes of these delays as multifactorial: slow regulatory processes, misalignment on evidence requirements, insufficient budgets in member states, and commercial decisions about where and when to launch. There is no single lever to pull. But the trend lines are moving in a direction that demands urgent attention from policymakers, health systems, and all stakeholders with a role in how medicines reach people.

Lung Cancer Europe calls on policymakers and healthcare providers across Europe to treat equitable access to treatment as a priority, not an aspiration. This means faster and more consistent reimbursement processes across member states, a commitment to closing the information gap that leaves too many people without the knowledge to understand and advocate for their own care, and the integration of psychological support as a standard component of the lung cancer care pathway.

The pipeline itself is also under pressure. New data published alongside the WAIT report shows a declining trend in the number of medicines approved by the US Food and Drug Administration that subsequently receive approval from the European Medicines Agency, with a particularly steep drop recorded since October 2025. For people living with lung cancer, where so many treatment advances in molecularly defined subtypes have originated from the global pipeline, a narrowing of what reaches European regulators at all would compound the access inequalities this data already describes.

The medicines exist. The evidence base is growing. The responsibility now is to ensure that what is possible for some becomes available to all.

For PDF report see here: efpia-patients-wait-indicator-2025

*Sources: https://www.lungcancereuropenews.eu/news/europe-medicine-access-crisis-lung-cancer-efpia-2026 EFPIA Patients W.A.I.T. Indicator 2025 (published May 2026); 9th Lung Cancer Europe Report (November 2024); 11th Lung Cancer Europe Report (March 2026); Lung Cancer Europe President’s Annual Report 2024-2025.* ®IQVIA

Original Article

A Science Perspectives article by Timothy Rebbeck discusses health disparities, specifically cancer disparities in the sub-Saharan African (SSA) nations, highlighting how cancer incidence there differs from cancer incidence in high-income areas of the world [1]. The sub-Saharan African nations display a much higher incidence of prostate, breast, and cervical cancer, and the number of these cancers is predicted to more than double within the next twenty years, according to IARC [2]. Most importantly,

the histopathologic and demographic features of these tumors differ from those in high-income countries

meaning that the differences seen in incidence may reflect a true health disparity, as increased rates of these cancers are not seen in high-income countries (HICs).

The most frequent cancers in SSA men include prostate, lung, liver, leukemia, non-Hodgkin’s lymphoma, and Kaposi’s sarcoma (a cancer frequently seen in HIV-infected patients [3]). In SSA women, breast and cervical cancer are the most common, and these display higher rates than seen in high-income countries. In fact, liver cancer is seen in SSA females at twice the rate, and in SSA males at almost three times the rate, seen in high-income countries.

Reasons for cancer disparity in SSA

Patients with cancer are often diagnosed at a late stage in SSA countries. This contrasts with patients in high-income countries, whose cancers are usually diagnosed at an earlier stage. For many cancers, such as breast [4], ovarian [5, 6], and colon, detecting the tumor in the early stages is critical for a favorable outcome and prognosis [7-10]. Late diagnosis also limits the therapeutic options available to the patient, and disease at later stages is much harder to manage, especially with respect to unresponsiveness and/or resistance to many therapies. In addition, treatments in SSA have to be performed in low-resource settings, where the availability of clinical lab work and imaging technologies may be limited.

Molecular differences in SSA versus HIC cancers which may account for disparities

Emerging evidence suggests that SSA tumors carry distinct molecular signatures with respect to histotype and pathology. For example, Dr. Rebbeck mentions that Nigerian breast cancers were defined by increased mutational signatures associated with deficiency of the homologous recombination DNA repair pathway, pervasive mutations in the tumor suppressor gene TP53, mutations in GATA binding protein 3 (GATA3), and greater mutational burden, compared with breast tumors from African Americans or Caucasians [11]. However, more research will be required to understand the etiology and causal factors behind this molecular distinction in mutational spectra.

It is believed that there is a higher rate of hereditary cancers in SSA, and many SSA cancers exhibit a more aggressive phenotype than in other parts of the world. For example, breast tumors in SSA black cases are twice as likely as those in SSA Caucasian cases to be of the triple negative phenotype, which is generally more aggressive and tougher to detect and treat; triple negative cancers are HER2 negative and therefore are not candidates for Herceptin. BRCA1/2 mutations are also more frequent in black SSA cases than in Caucasian SSA cases [12, 13].

Initiatives to Combat Health Disparities in SSA

Multiple initiatives have been proposed or are under way to bring personalized medicine to the sub-Saharan African nations. These include:

The Human Heredity and Health in Africa (H3Africa) consortium

H3Africa empowers African researchers to be competitive in genomic sciences, establishes and nurtures effective collaborations among African researchers on the African continent, and generates unique data that could be used to improve both African and global health.

There is currently a global effort to apply genomic science and associated technologies to further the understanding of health and disease in diverse populations. These efforts work to identify individuals and populations who are at risk for developing specific diseases, and to better understand underlying genetic and environmental contributions to that risk. Given the large amount of genetic diversity on the African continent, there exists an enormous opportunity to utilize such approaches to benefit African populations and to inform global health.

The Human Heredity and Health in Africa (H3Africa) consortium facilitates fundamental research into diseases on the African continent while also developing infrastructure, resources, training, and ethical guidelines to support a sustainable African research enterprise – led by African scientists, for the African people. The initiative consists of 51 African projects that include population-based genomic studies of common, non-communicable disorders such as heart and renal disease, as well as communicable diseases such as tuberculosis. These studies are led by African scientists and use genetic, clinical, and epidemiologic methods to identify hereditary and environmental contributions to health and disease. To establish a foundation for African scientists to continue this essential work into the future, the consortium also supports many crucial capacity-building elements, such as: ethical, legal, and social implications research; training and capacity building for bioinformatics; capacity for biobanking; and coordination and networking.

The World Economic Forum’s Leapfrogging with Precision Medicine project

This project is part of the World Economic Forum’s Shaping the Future of Health and Healthcare Platform

The Challenge

Advancing precision medicine in a way that is equitable and beneficial to society means ensuring that healthcare systems can adopt the most scientifically and technologically appropriate approaches to a more targeted and personalized way of diagnosing and treating disease. In certain instances, countries or institutions may be able to bypass, or “leapfrog”, legacy systems or approaches that prevail in developed country contexts.

The World Economic Forum’s Leapfrogging with Precision Medicine project will develop a set of tools and case studies demonstrating how a precision medicine approach in countries with greenfield policy spaces can potentially transform their healthcare delivery and outcomes. Policies and governance mechanisms that enable leapfrogging will be iterated and scaled up to other projects.

Successes in personalized genomic research in SSA

As Dr. Rebbeck states:

Because of the underlying genetic and genomic relationships between Africans and members of the African diaspora (primarily in North America and Europe), knowledge gained from research in SSA can be used to address health disparities that are prevalent in members of the African diaspora.

For example, West African heritage and genomic ancestry have been reported to confer the highest genomic risk for prostate cancer of any worldwide population [14].

PERSPECTIVE | GLOBAL HEALTH

Cancer in sub-Saharan Africa

Timothy R. Rebbeck


Science 03 Jan 2020:
Vol. 367, Issue 6473, pp. 27-28
DOI: 10.1126/science.aay474

Summary/Abstract

Cancer is an increasing global public health burden. This is especially the case in sub-Saharan Africa (SSA); high rates of cancer—particularly of the prostate, breast, and cervix—characterize cancer in most countries in SSA. The number of these cancers in SSA is predicted to more than double in the next 20 years (1). Both the explanations for these increasing rates and the solutions to address this cancer epidemic require SSA-specific data and approaches. The histopathologic and demographic features of these tumors differ from those in high-income countries (HICs). Basic knowledge of the epidemiology, clinical features, and molecular characteristics of cancers in SSA is needed to build prevention and treatment tools that will address the future cancer burden. The distinct distribution and determinants of cancer in SSA provide an opportunity to generate knowledge about cancer risk factors, genomics, and opportunities for prevention and treatment globally, not only in Africa.

References

1. Rebbeck TR: Cancer in sub-Saharan Africa. Science 2020, 367(6473):27-28.
2. Parkin DM, Ferlay J, Jemal A, Borok M, Manraj S, N’Da G, Ogunbiyi F, Liu B, Bray F: Cancer in Sub-Saharan Africa: International Agency for Research on Cancer; 2018.
3. Chinula L, Moses A, Gopal S: HIV-associated malignancies in sub-Saharan Africa: progress, challenges, and opportunities. Current opinion in HIV and AIDS 2017, 12(1):89-95.
4. Colditz GA: Epidemiology of breast cancer. Findings from the nurses’ health study. Cancer 1993, 71(4 Suppl):1480-1489.
5. Hamilton TC, Penault-Llorca F, Dauplat J: [Natural history of ovarian adenocarcinomas: from epidemiology to experimentation]. Contracept Fertil Sex 1998, 26(11):800-804.
6. Garner EI: Advances in the early detection of ovarian carcinoma. J Reprod Med 2005, 50(6):447-453.
7. Brockbank EC, Harry V, Kolomainen D, Mukhopadhyay D, Sohaib A, Bridges JE, Nobbenhuis MA, Shepherd JH, Ind TE, Barton DP: Laparoscopic staging for apparent early stage ovarian or fallopian tube cancer. First case series from a UK cancer centre and systematic literature review. European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology 2013, 39(8):912-917.
8. Kolligs FT: Diagnostics and Epidemiology of Colorectal Cancer. Visceral medicine 2016, 32(3):158-164.
9. Rocken C, Neumann U, Ebert MP: [New approaches to early detection, estimation of prognosis and therapy for malignant tumours of the gastrointestinal tract]. Zeitschrift fur Gastroenterologie 2008, 46(2):216-222.
10. Srivastava S, Verma M, Henson DE: Biomarkers for early detection of colon cancer. Clinical cancer research : an official journal of the American Association for Cancer Research 2001, 7(5):1118-1126.
11. Pitt JJ, Riester M, Zheng Y, Yoshimatsu TF, Sanni A, Oluwasola O, Veloso A, Labrot E, Wang S, Odetunde A et al: Characterization of Nigerian breast cancer reveals prevalent homologous recombination deficiency and aggressive molecular features. Nature communications 2018, 9(1):4181.
12. Zheng Y, Walsh T, Gulsuner S, Casadei S, Lee MK, Ogundiran TO, Ademola A, Falusi AG, Adebamowo CA, Oluwasola AO et al: Inherited Breast Cancer in Nigerian Women. Journal of clinical oncology : official journal of the American Society of Clinical Oncology 2018, 36(28):2820-2825.
13. Rebbeck TR, Friebel TM, Friedman E, Hamann U, Huo D, Kwong A, Olah E, Olopade OI, Solano AR, Teo SH et al: Mutational spectrum in a worldwide study of 29,700 families with BRCA1 or BRCA2 mutations. Human mutation 2018, 39(5):593-620.
14. Lachance J, Berens AJ, Hansen MEB, Teng AK, Tishkoff SA, Rebbeck TR: Genetic Hitchhiking and Population Bottlenecks Contribute to Prostate Cancer Disparities in Men of African Descent. Cancer research 2018, 78(9):2432-2443.

Structure-guided Drug Discovery: (1) The Coronavirus 3CL hydrolase (Mpro) enzyme (main protease) essential for proteolytic maturation of the virus and (2) viral protease, the RNA polymerase, the viral spike protein, a viral RNA as promising two targets for discovery of cleavage inhibitors of the viral spike polyprotein preventing the Coronavirus Virion the spread of infection

Posted in Antibody Responses Predict Antigen Exposure, BioIT: BioInformatics, NGS, Clinical & Translational, Pharmaceutical R&D Informatics, Clinical Genomics, Cancer Informatics, COVID-19, Immunodiagnostics, Immunology, Innovation in Immunology Diagnostics, MHC Repertoires for Antigen Prediction, Population Health Management, Population Health Management, Genetics & Pharmaceutical, Viral diseases, Virology - Vector-borne DIsease, tagged BioIT: BioInformatics, Biological Engineering, Biological Networks, Biomarkers & Medical Diagnostics, Cancer Informatics, Clinical & Translational, clinical genomics, Commercialization, COVID-19, Gene Regulation and Evolution, Genetics & Pharmaceutical, Immunotherapy, Innovation in Immunology Diagnostics, Innovations, Intellectual property, Investment in Technological Breakthrough, NGS, Pharmaceutical R&D Informatics, Population Health Management, Proteomics, SARS-CoV, SARS-CoV-2, Universal Immune Cell Therapies (uICT), Virus Infective Acute Respiratory Syndrome: SARS-CoV | Tagged COVID-19 on March 12, 2020


Curators and Reporters: Stephen J. Williams, PhD and Aviva Lev-Ari, PhD, RN

Therapeutic options for coronavirus (2019-nCoV) include consideration of the following:

(a) Monoclonal and polyclonal antibodies

(b) Vaccines

(c) Small molecule treatments (e.g., chloroquine and derivatives), including compounds already approved for other indications

(d) Immuno-therapies derived from human or other sources

Structure of the nCoV trimeric spike

The World Health Organization has declared the outbreak of a novel coronavirus (2019-nCoV) to be a public health emergency of international concern. The virus binds to host cells through its trimeric spike glycoprotein, making this protein a key target for potential therapies and diagnostics. Wrapp et al. determined a 3.5-angstrom-resolution structure of the 2019-nCoV trimeric spike protein by cryo–electron microscopy. Using biophysical assays, the authors show that this protein binds at least 10 times more tightly than the corresponding spike protein of severe acute respiratory syndrome (SARS)–CoV to their common host cell receptor. They also tested three antibodies known to bind to the SARS-CoV spike protein but did not detect binding to the 2019-nCoV spike protein. These studies provide valuable information to guide the development of medical counter-measures for 2019-nCoV. [Bold Face Added by ALA]

Science, this issue p. 1260

Abstract

The outbreak of a novel coronavirus (2019-nCoV) represents a pandemic threat that has been declared a public health emergency of international concern. The CoV spike (S) glycoprotein is a key target for vaccines, therapeutic antibodies, and diagnostics. To facilitate medical countermeasure development, we determined a 3.5-angstrom-resolution cryo–electron microscopy structure of the 2019-nCoV S trimer in the prefusion conformation. The predominant state of the trimer has one of the three receptor-binding domains (RBDs) rotated up in a receptor-accessible conformation. We also provide biophysical and structural evidence that the 2019-nCoV S protein binds angiotensin-converting enzyme 2 (ACE2) with higher affinity than does severe acute respiratory syndrome (SARS)-CoV S. Additionally, we tested several published SARS-CoV RBD-specific monoclonal antibodies and found that they do not have appreciable binding to 2019-nCoV S, suggesting that antibody cross-reactivity may be limited between the two RBDs. The structure of 2019-nCoV S should enable the rapid development and evaluation of medical countermeasures to address the ongoing public health crisis.

Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation

Daniel Wrapp¹,*,
Nianshuang Wang¹,*,
Kizzmekia S. Corbett²,
Jory A. Goldsmith¹,
Ching-Lin Hsieh¹,
Olubukola Abiona²,
Barney S. Graham²,
Jason S. McLellan ¹,†

¹Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA.
²Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA.

†Corresponding author. Email: jmclellan@austin.utexas.edu

* These authors contributed equally to this work.

Science 13 Mar 2020:
Vol. 367, Issue 6483, pp. 1260-1263
DOI: 10.1126/science.abb2507

02/04/2020

New Coronavirus Protease Structure Available

PDB data provide a starting point for structure-guided drug discovery

A high-resolution crystal structure of COVID-19 (2019-nCoV) coronavirus 3CL hydrolase (Mpro) has been determined by Zihe Rao and Haitao Yang’s research team at ShanghaiTech University. Rapid public release of this structure of the main protease of the virus (PDB 6lu7) will enable research on this newly-recognized human pathogen.

Recent emergence of the COVID-19 coronavirus has resulted in a WHO-declared public health emergency of international concern. Research efforts around the world are working towards establishing a greater understanding of this particular virus and developing treatments and vaccines to prevent further spread.

https://doi.org/10.2210/pdb6lu7/pdb

While PDB entry 6lu7 is currently the only public-domain 3D structure from this specific coronavirus, the PDB contains structures of the corresponding enzyme from other coronaviruses. The 2003 outbreak of the closely-related Severe Acute Respiratory Syndrome-related coronavirus (SARS) led to the first 3D structures, and today there are more than 200 PDB structures of SARS proteins. Structural information from these related proteins could be vital in furthering our understanding of coronaviruses and in discovery and development of new treatments and vaccines to contain the current outbreak.

The coronavirus 3CL hydrolase (Mpro) enzyme, also known as the main protease, is essential for proteolytic maturation of the virus. It is thought to be a promising target for discovery of small-molecule drugs that would inhibit cleavage of the viral polyprotein and prevent spread of the infection.

Comparison of the protein sequence of the COVID-19 coronavirus 3CL hydrolase (Mpro) against the PDB archive identified 95 PDB proteins with at least 90% sequence identity. Furthermore, these related protein structures contain approximately 30 distinct small molecule inhibitors, which could guide discovery of new drugs. Of particular significance for drug discovery is the very high amino acid sequence identity (96%) between the COVID-19 coronavirus 3CL hydrolase (Mpro) and the SARS virus main protease (PDB 1q2w). Summary data about these closely-related PDB structures are available (CSV) to help researchers more easily find this information. In addition, the PDB houses 3D structure data for more than 20 unique SARS proteins represented in more than 200 PDB structures, including a second viral protease, the RNA polymerase, the viral spike protein, a viral RNA, and other proteins (CSV).
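As an illustration of what a percent sequence identity figure measures, here is a minimal Python sketch. It assumes the two sequences are already aligned to the same length, with `-` marking gaps, and it counts identity over gap-free columns only. Other choices of denominator exist, and the text above does not say which one the PDB comparison used. The function name and the toy sequences are hypothetical and are not real protease data.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of gap-free alignment columns where both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue  # skip columns with a gap in either sequence
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * matches / compared


# Made-up ten-residue sequences that differ only at the last position.
toy_a = "ACDEFGHIKL"
toy_b = "ACDEFGHIKV"
print(percent_identity(toy_a, toy_b))
```

Under this definition, a threshold such as "at least 90% sequence identity" would simply be a cut-off applied to the returned value.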

Public release of the COVID-19 coronavirus 3CL hydrolase (Mpro), at a time when this information can prove most vital and valuable, highlights the importance of open and timely availability of scientific data. The wwPDB strives to ensure that 3D biological structure data remain freely accessible for all, while maintaining as comprehensive and accurate an archive as possible. We hope that this new structure, and those from related viruses, will help researchers and clinicians address the COVID-19 coronavirus global public health emergency.

Update: Released COVID-19-related PDB structures include

PDB structure 6lu7 (X. Liu, B. Zhang, Z. Jin, H. Yang, Z. Rao Crystal structure of COVID-19 main protease in complex with an inhibitor N3 doi: 10.2210/pdb6lu7/pdb) Released 2020-02-05

PDB structure 6vsb (D. Wrapp, N. Wang, K.S. Corbett, J.A. Goldsmith, C.-L. Hsieh, O. Abiona, B.S. Graham, J.S. McLellan (2020) Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation Science doi: 10.1126/science.abb2507) Released 2020-02-26

PDB structure 6lxt (Y. Zhu, F. Sun Structure of post fusion core of 2019-nCoV S2 subunit doi: 10.2210/pdb6lxt/pdb) Released 2020-02-26

PDB structure 6lvn (Y. Zhu, F. Sun Structure of the 2019-nCoV HR2 Domain doi: 10.2210/pdb6lvn/pdb) Released 2020-02-26

PDB structure 6vw1
J. Shang, G. Ye, K. Shi, Y.S. Wan, H. Aihara, F. Li Structural basis for receptor recognition by the novel coronavirus from Wuhan doi: 10.2210/pdb6vw1/pdb
Released 2020-03-04

PDB structure 6vww
Y. Kim, R. Jedrzejczak, N. Maltseva, M. Endres, A. Godzik, K. Michalska, A. Joachimiak, Center for Structural Genomics of Infectious Diseases Crystal Structure of NSP15 Endoribonuclease from SARS CoV-2 doi: 10.2210/pdb6vww/pdb
Released 2020-03-04

PDB structure 6y2e
L. Zhang, X. Sun, R. Hilgenfeld Crystal structure of the free enzyme of the SARS-CoV-2 (2019-nCoV) main protease doi: 10.2210/pdb6y2e/pdb
Released 2020-03-04

PDB structure 6y2f
L. Zhang, X. Sun, R. Hilgenfeld Crystal structure (monoclinic form) of the complex resulting from the reaction between SARS-CoV-2 (2019-nCoV) main protease and tert-butyl (1-((S)-1-(((S)-4-(benzylamino)-3,4-dioxo-1-((S)-2-oxopyrrolidin-3-yl)butan-2-yl)amino)-3-cyclopropyl-1-oxopropan-2-yl)-2-oxo-1,2-dihydropyridin-3-yl)carbamate (alpha-ketoamide 13b) doi: 10.2210/pdb6y2f/pdb
Released 2020-03-04

PDB structure 6y2g
L. Zhang, X. Sun, R. Hilgenfeld Crystal structure (orthorhombic form) of the complex resulting from the reaction between SARS-CoV-2 (2019-nCoV) main protease and tert-butyl (1-((S)-1-(((S)-4-(benzylamino)-3,4-dioxo-1-((S)-2-oxopyrrolidin-3-yl)butan-2-yl)amino)-3-cyclopropyl-1-oxopropan-2-yl)-2-oxo-1,2-dihydropyridin-3-yl)carbamate (alpha-ketoamide 13b) doi: 10.2210/pdb6y2g/pdb
Released 2020-03-04
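The PDB DOIs in the list above all follow the same pattern, `10.2210/pdb<id>/pdb`, with the four-character entry ID in lowercase. A small Python sketch of that pattern follows. The helper names are hypothetical, and the check that an ID is a digit followed by three letters or digits reflects the classic PDB ID format, stated here as an assumption rather than something taken from the announcement.

```python
import re

# Assumption: classic four-character PDB IDs (one digit, then three
# letters or digits), like the entries listed above.
_PDB_ID = re.compile(r"[0-9][A-Za-z0-9]{3}")


def pdb_doi(entry_id: str) -> str:
    """Build the DOI for a PDB entry, following the pattern seen above."""
    if not _PDB_ID.fullmatch(entry_id):
        raise ValueError(f"not a four-character PDB ID: {entry_id!r}")
    return f"10.2210/pdb{entry_id.lower()}/pdb"


def pdb_doi_url(entry_id: str) -> str:
    """Resolvable form of the DOI via doi.org."""
    return f"https://doi.org/{pdb_doi(entry_id)}"


# Entries that appear with a PDB DOI in the list above.
for entry in ["6lu7", "6lxt", "6lvn", "6vw1", "6vww", "6y2e", "6y2f", "6y2g"]:
    print(entry, pdb_doi_url(entry))
```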

SOURCE – 02/04/2020

https://www.wwpdb.org/news/news?year=2020&fbclid=IwAR0mFKQD7KX29hkJcnV0MxeAe_1themx7KuBDo8txaMiYHJ08JvNN0cisug#5e39e03fa5007a04a313edc3

Coronavirus Disease 2019 (COVID-19) and Cardiovascular Disease

Kevin J. Clerkin, Justin A. Fried, Jayant Raikhelkar, Gabriel Sayer, Jan M. Griffin, Amirali Masoumi, Sneha S. Jain, Daniel Burkhoff, Deepa Kumaraiah, LeRoy Rabbani, Allan Schwartz and Nir Uriel

Originally published 21 Mar 2020. Circulation. https://doi.org/10.1161/CIRCULATIONAHA.120.046941

Abstract

Coronavirus disease 2019 (COVID-19) is a global pandemic impacting nearly 170 countries/regions and more than 285,000 patients worldwide. COVID-19 is caused by the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), which invades cells through the angiotensin converting enzyme 2 (ACE2) receptor. Among those with COVID-19, there is a higher prevalence of cardiovascular disease and more than 7% of patients suffer myocardial injury from the infection (22% of the critically ill). Despite ACE2 serving as the portal for infection, the role of ACE inhibitors or angiotensin receptor blockers requires further investigation. COVID-19 poses a challenge for heart transplantation, impacting donor selection, immunosuppression, and post-transplant management. Thankfully there are a number of promising therapies under active investigation to both treat and prevent COVID-19.

Key Words: COVID-19; myocardial injury; pandemic; heart transplant

SOURCE

https://www.ahajournals.org/doi/pdf/10.1161/CIRCULATIONAHA.120.046941

ACE2

Towler P, Staker B, Prasad SG, Menon S, Tang J, Parsons T, Ryan D, Fisher M, Williams D, Dales NA, Patane MA, Pantoliano MW (Apr 2004). “ACE2 X-ray structures reveal a large hinge-bending motion important for inhibitor binding and catalysis”. The Journal of Biological Chemistry. 279 (17): 17996–8007. doi:10.1074/jbc.M311191200. PMID 14754895.

Turner AJ, Tipnis SR, Guy JL, Rice G, Hooper NM (Apr 2002). “ACEH/ACE2 is a novel mammalian metallocarboxypeptidase and a homologue of angiotensin-converting enzyme insensitive to ACE inhibitors”. Canadian Journal of Physiology and Pharmacology. 80 (4): 346–53. doi:10.1139/y02-021. PMID 12025971.

Zhang, Haibo; Penninger, Josef M.; Li, Yimin; Zhong, Nanshan; Slutsky, Arthur S. (3 March 2020). “Angiotensin-converting enzyme 2 (ACE2) as a SARS-CoV-2 receptor: molecular mechanisms and potential therapeutic target”. Intensive Care Medicine. Springer Science and Business Media LLC. doi:10.1007/s00134-020-05985-9. ISSN 0342-4642. PMID 32125455.

Gurwitz, David (2020). “Angiotensin receptor blockers as tentative SARS‐CoV‐2 therapeutics”. Drug Development Research. doi:10.1002/ddr.21656. PMID 32129518.

Angiotensin converting enzyme 2 (ACE2) is an exopeptidase that catalyses the conversion of angiotensin I to the nonapeptide angiotensin[1-9]^[5] or the conversion of angiotensin II to angiotensin 1-7.^[6]^[7] ACE2 has direct effects on cardiac function and is expressed predominantly in vascular endothelial cells of the heart and the kidneys.^[8] ACE2 is not sensitive to the ACE inhibitor drugs used to treat hypertension.^[9]

ACE2 receptors have been shown to be the entry point into human cells for some coronaviruses, including the SARS virus.^[10] A number of studies have identified that the entry point is the same for SARS-CoV-2,^[11] the virus that causes COVID-19.^[12]^[13]^[14]^[15]

Some have suggested that a decrease in ACE2 could be protective against Covid-19 disease,^[16] but others have suggested the opposite: that angiotensin II receptor blocker drugs could be protective against Covid-19 disease by increasing ACE2, and that these hypotheses need to be tested by data mining of clinical patient records.^[17]

REFERENCES

https://en.wikipedia.org/wiki/Angiotensin-converting_enzyme_2

FOLDING@HOME TAKES UP THE FIGHT AGAINST COVID-19 / 2019-NCOV

February 27, 2020

by Greg Bowman

We need your help! Folding@home is joining researchers around the world working to better understand the 2019 Coronavirus (2019-nCoV) to accelerate the open science effort to develop new life-saving therapies. By downloading Folding@home, you can donate your unused computational resources to the Folding@home Consortium, where researchers are working to advance our understanding of the structures of potential drug targets for 2019-nCoV that could aid in the design of new therapies. The data you help us generate will be quickly and openly disseminated as part of an open science collaboration of multiple laboratories around the world, giving researchers new tools that may unlock new opportunities for developing lifesaving drugs.

2019-nCoV is a close cousin to SARS coronavirus (SARS-CoV), and acts in a similar way. For both coronaviruses, the first step of infection occurs in the lungs, when a protein on the surface of the virus binds to a receptor protein on a lung cell. This viral protein is called the spike protein, depicted in red in the image below, and the receptor is known as ACE2. A therapeutic antibody is a type of protein that can block the viral protein from binding to its receptor, therefore preventing the virus from infecting the lung cell. A therapeutic antibody has already been developed for SARS-CoV, but to develop therapeutic antibodies or small molecules for 2019-nCoV, scientists need to better understand the structure of the viral spike protein and how it binds to the human ACE2 receptor required for viral entry into human cells.

Proteins are not stagnant—they wiggle and fold and unfold to take on numerous shapes. We need to study not only one shape of the viral spike protein, but all the ways the protein wiggles and folds into alternative shapes in order to best understand how it interacts with the ACE2 receptor, so that an antibody can be designed. Low-resolution structures of the SARS-CoV spike protein exist and we know the mutations that differ between SARS-CoV and 2019-nCoV. Given this information, we are uniquely positioned to help model the structure of the 2019-nCoV spike protein and identify sites that can be targeted by a therapeutic antibody. We can build computational models that accomplish this goal, but it takes a lot of computing power.

This is where you come in! With many computers working towards the same goal, we aim to help develop a therapeutic remedy as quickly as possible. By downloading Folding@home and selecting to contribute to “Any Disease”, you can help provide us with the computational power required to tackle this problem. One protein from 2019-nCoV, a protease encoded by the viral RNA, has already been crystallized. Although the 2019-nCoV spike protein of interest has not yet been resolved bound to ACE2, our objective is to use the homologous structure of the SARS-CoV spike protein to identify therapeutic antibody targets.

This illustration, created at the Centers for Disease Control and Prevention (CDC), reveals ultrastructural morphology exhibited by coronaviruses. Note the spikes that adorn the outer surface of the virus, which impart the look of a corona surrounding the virion, when viewed electron microscopically. A novel coronavirus was identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China in 2019.

Image and Caption Credit: Alissa Eckert, MS; Dan Higgins, MAM available at https://phil.cdc.gov/Details.aspx?pid=23311

Structures of the closely related SARS-CoV spike protein bound by therapeutic antibodies may help rapidly design better therapies. The three monomers of the SARS-CoV spike protein are shown in different shades of red; the antibody is depicted in green. [PDB: 6NB7 https://www.rcsb.org/structure/6nb7]

(post authored by Ariana Brenner Clerkin)

References:

PDB 6lu7 structure summary ‹ Protein Data Bank in Europe (PDBe) ‹ EMBL-EBI https://www.ebi.ac.uk/pdbe/entry/pdb/6lu7 (accessed Feb 5, 2020).

Tian, X.; Li, C.; Huang, A.; Xia, S.; Lu, S.; Shi, Z.; Lu, L.; Jiang, S.; Yang, Z.; Wu, Y.; et al. Potent Binding of 2019 Novel Coronavirus Spike Protein by a SARS Coronavirus-Specific Human Monoclonal Antibody; preprint; Microbiology, 2020. https://doi.org/10.1101/2020.01.28.923011.

Walls, A. C.; Xiong, X.; Park, Y. J.; Tortorici, M. A.; Snijder, J.; Quispe, J.; Cameroni, E.; Gopal, R.; Dai, M.; Lanzavecchia, A.; et al. Unexpected Receptor Functional Mimicry Elucidates Activation of Coronavirus Fusion. Cell 2019, 176, 1026-1039.e15. https://doi.org/10.2210/pdb6nb7/pdb.

SOURCE

https://foldingathome.org/2020/02/27/foldinghome-takes-up-the-fight-against-covid-19-2019-ncov/

UPDATED 3/13/2020

I am reposting the following Science blog post from Derek Lowe as is, and I ask readers to browse through the comments on his Science blog, In the Pipeline. As Dr. Lowe states, in the current crisis it is important to disseminate good information as quickly as possible, so I wanted readers here to be able to read his excellent post on Covid-19. I would also like to direct readers to the journal Science opinion letter on how important it is to rebuild trust in good science and the scientific process. The full link for the following In the Pipeline post is: https://blogs.sciencemag.org/pipeline/archives/2020/03/06/covid-19-small-molecule-therapies-reviewed

A Summary of Current Potential Repurposed Therapeutics for COVID-19 Infection from In the Pipeline: A Science Blog from Derek Lowe

Covid-19 Small Molecule Therapies Reviewed

By Derek Lowe 6 March, 2020

Let’s take inventory on the therapies that are being developed for the coronavirus epidemic. Here is a very thorough list at Biocentury, and I should note that (like Stat and several other organizations) they’re making all their Covid-19 content free to all readers during this crisis. I’d like to zoom in today on the potential small-molecule therapies, since some of these have the most immediate prospects for use in the real world.

The ones at the front of the line are repurposed drugs that are already approved for human use, for a lot of obvious reasons. The Biocentury list doesn’t cover these, but here’s an article at Nature Biotechnology that goes into detail. Clinical trials are a huge time sink – they sort of have to be, in most cases, if they’re going to be any good – and if you’ve already done all that stuff it’s a huge leg up, even if the drug itself is not exactly a perfect fit for the disease. So what do we have? The compound that is most advanced is probably remdesivir from Gilead, at right. This has been in development for a few years as an RNA virus therapy – it was originally developed for Ebola, and has been tried out against a whole list of single-strand RNA viruses. That includes the related coronaviruses SARS and MERS, so Covid-19 was an obvious fit.

The compound is a prodrug – that phosphoramide gets cleaved off completely, leaving the active 5-OH compound GS-441524. Its mechanism of action is to get incorporated into viral RNA, since it’s taken up by RNA polymerase and it largely seems to evade proofreading. This causes RNA termination trouble later on, since that alpha-nitrile C-nucleoside is not exactly what the virus is expecting in its genome at that point, and thus viral replication is inhibited.

There are five clinical trials underway (here’s an overview at Biocentury). The NIH has an adaptive-design Phase II trial that has already started in Nebraska, with doses to be changed according to Bayesian readouts along the way. There are two Phase III trials underway at China-Japan Friendship Hospital in Hubei, double-blinded and placebo-controlled (since placebo is, as far as drug therapy goes, the current standard of care). And Gilead themselves are starting two open-label trials, one with no control arm and one with an (unblinded) standard-of-care comparison arm. Those might read out first, depending on when they get off the ground, but will be only rough readouts due to the fast-and-loose trial design. The two Hubei trials and the NIH one will add some rigor to the process, but I’m not sure when they’re going to report. My personal opinion is that I like the chances of this drug more than anything else on this list, but it’s still unlikely to be a game-changer.

There’s an RNA polymerase inhibitor (favipiravir) from Toyama, at right, that’s in a trial in China. It’s a thought – a broad-spectrum agent of this sort would be the sort of thing to try. But unfortunately, from what I can see, it has already turned up as ineffective in in vitro tests. The human trial that’s underway is honestly the sort of thing that would only happen under circumstances like the present: a developing epidemic with a new pathogen and no real standard of care. I hold out little hope for this one, but given that there’s nothing else at present, it probably should be tried. As you’ll see, this is far from the only situation like this.

One of the screens of known drugs in China that also flagged remdesivir noted that the old antimalarial drug chloroquine seemed to be effective in vitro. It had been reported some years back as a possible antiviral, working through more than one mechanism, probably both at viral entry and intracellularly thereafter. That part shouldn’t be surprising – chloroquine’s actual mode(s) of action against malaria parasites are still not completely worked out, either, and some of what people thought they knew about it has turned out to be wrong. There are several trials underway with it at Chinese facilities, some in combination with other agents like remdesivir. Chloroquine has of course been taken for many decades as an antimalarial, but it has a number of liabilities, including seizures, hearing damage, retinopathy and sudden effects on blood glucose. So it’s going to be important to establish just how effective it is and what doses will be needed. Just as with vaccine candidates, it’s possible to do more harm with a rushed treatment than the disease is doing itself.

Several other known antiviral drugs are being tried in China, but I don’t have too much hope for those, either. The neuraminidase inhibitors such as oseltamivir (better known as Tamiflu) were tried against SARS and were ineffective; there is no reason to expect anything versus Covid-19 although these drugs are a component of some drug cocktail trials. The HIV protease therapies such as darunavir and the combination therapy Kaletra are in trials, but that’s also a rather desperate long shot, since there’s no particular reason to think that they will have any such protease inhibition against what this new virus has to offer (and indeed, such agents weren’t much help against SARS in the end, either). The classic interferon/ribavirin combination seems to have had some activity against SARS and MERS, and is in two trials from what I can see. That’s not an awful idea by any means, but it’s not a great one, either: if your viral disease has interferon/ribavirin as a front line therapy, it generally means that there’s nothing really good available. No, unless we get really lucky none of these ideas are going to slow the disease down much.

There are a few other repurposed-protease-inhibitors ideas out there, such as this one. (Edit: I had seen this paper but couldn’t track it down, so thanks to those who sent it along). This paper suggests that the TMPRSS2 protease is important for viral entry on the human-cell-side of the process, a pathway that has been noted for other coronaviruses. And it points out that there is an approved inhibitor (in Japan) for this enzyme (camostat), so that would definitely seem to be worth a trial, probably in combination with remdesivir.

That’s about it for the existing small molecules, from what I can see. What about new ones? Don’t hold your breath, is all I can say. A drug discovery program from scratch against a new pathogen is, as many readers here well know, not a trivial exercise. As this Bloomberg article details, many such efforts in the past (small molecules and vaccines alike) have come to grief because by the time they had anything to deliver the epidemic itself had passed. Indeed, Gilead’s remdesivir had already been dropped as a potential Ebola therapy.

You will either need to have a target in mind up front or go phenotypic. For the former, what you’d see are better characterizations of the viral protease and more extensive screens against it. Two other big target areas are viral entry (which involves the “spike” proteins on the virus surface and the ACE2 protein on human cells) and viral replication. To the former, it’s worth quickly noting that ACE2 is so much unlike the more familiar ACE protein that none of the cardiovascular ACE inhibitors do anything to it at all. And targeting the latter mechanisms is how remdesivir was developed as a possible Ebola agent, but as you can see, that took time, too. Phenotypic screens are perfectly reasonable against viral pathogens as well, but you’ll need to put time and effort into that assay up front, just as with any phenotypic effort, because as anyone who does that sort of work will tell you, a bad phenotypic screen is a complete waste of everyone’s time.

One of the key steps for either route is identifying an animal model. While animal models of infectious disease can be extremely well translated to human therapy, that doesn’t happen by accident: you need to choose the right animal. Viruses in general (and coronaviruses are no exception) vary widely in their effects in different species, and not just across the gaps of bird/reptile/human and the like. No, you’ll run into things where even the usual set of small mammals are acting differently from each other, with some of them not even getting sick at all. This current virus may well have gone through a couple of other mammalian species before landing on us, but you’ll note that dogs (to pick one) don’t seem to have any problem with it.

All this means that any new-target new-chemical-matter effort against Covid-19 (or any new pathogen) is going to take years, and there is just no way around that. Update: see here for just such an effort to start finding fragment hits for the viral protease. This puts small molecules in a very bimodal distribution: you have the existing drugs that might be repurposed, and are presumably available right now. Nothing else is! At the other end, for completely new therapies you have the usual prospects of drug discovery: years from now, lots of money, low success rate, good luck to all of us. The gap between these two could in theory be filled by vaccines and antibody therapies (if everything goes really, really well) but those are very much their own area and will be dealt with in a separate post.

Either way, the odds are that we (and I mean “we as a species” here) are going to be fighting this epidemic without any particularly amazing pharmacological weapons. Eventually we’ll have some, but I would advise people, pundits, and politicians not to get all excited about the prospects for some new therapies to come riding up over the hill to help us out. The odds of that happening in time to do anything about the current outbreak are very small. We will be going for months, years, with the therapeutic options we have right now. Look around you: what we have today is what we have to work with.

The 3rd STAT4ONC Annual Symposium: April 25-27, 2019 Hilton Hartford Connecticut

Posted in Artificial Intelligence in Medicine - Applications in Therapeutics, Cancer - General, Cancer and Current Therapeutics, Cancer Genomics, Clinical Diagnostics, Clinical Genomics, Genomic Testing: Methodology for Diagnosis, Health Economics and Outcomes Research, interventional oncology, Math, Precision Cancer Medicine, Scientific & Biotech Conferences: Press Coverage, tagged analysis, cancer genomics and therapy, clinical genomics, conference, Genome-wide analysis of gene expression, genomics, oncology, personalized cancer therapy, Statistics on February 26, 2019

THE 3RD STAT4ONC ANNUAL SYMPOSIUM APRIL 25-27, 2019, HILTON, HARTFORD, CONNECTICUT, 315 Trumbull St, Hartford, CT 06103

Reporter: Stephen J. Williams, Ph.D.

3.3.8 The 3rd STAT4ONC Annual Symposium, April 25-27, 2019, Hilton Hartford, CT, 315 Trumbull St., Hartford, CT 06103, Volume 2 (Volume Two: Latest in Genomics Methodologies for Therapeutics: Gene Editing, NGS and BioInformatics, Simulations and the Genome Ontology), Part 2: CRISPR for Gene Editing and DNA Repair

SYMPOSIUM OBJECTIVES

The three-day symposium aims to bring oncologists and statisticians together to share new research, discuss novel ideas, ask questions and provide solutions for cancer clinical trials. In the era of big data, precision medicine, and genomics and immune-based oncology, it is crucial to provide a platform for interdisciplinary dialogues among clinical and quantitative scientists. The Stat4Onc Annual Symposium serves as a venue for oncologists and statisticians to communicate their views on trial design and conduct, drug development, and translations to patient care. Topics to be discussed include big data and genomics for oncology clinical trials, novel dose-finding designs, drug combinations, immune oncology clinical trials, and umbrella/basket oncology trials. An important aspect of Stat4Onc is the participation of researchers across academia, industry, and regulatory agencies.

The meeting agenda will be announced soon. For the updated agenda and program speakers, please CLICK HERE

The registration of the symposium is via NESS Society PayPal. Click here to register.

Can Blockchain Technology and Artificial Intelligence Cure What Ails Biomedical Research and Healthcare

Posted in Artificial Intelligence - Breakthroughs in Theories and Technologies, Artificial Intelligence - General, Big Data, BioBanking, BioIT: BioInformatics, BioIT: BioInformatics, NGS, Clinical & Translational, Pharmaceutical R&D Informatics, Clinical Genomics, Cancer Informatics, Biomarkers & Medical Diagnostics, Computational Biology/Systems and Bioinformatics, Health Economics and Outcomes Research, Intelligent Information Systems, Population Health Management, Genetics & Pharmaceutical, tagged #mobilehealth, Artificial intelligence, biomarker, blockchain, clinical genomics, computational algorithms, economics, healthcare, healthcare delivery, healthcare reimbursement, Innovation, medical information, medical IT, omic data, patient control, patient data on December 10, 2018

Curator: Stephen J. Williams, Ph.D.

Updated 12/18/2018

In an effort to reduce healthcare costs, increase accessibility of service for patients, and drive biomedical innovation, many healthcare and biotechnology professionals have looked to advances in digital technology to determine how IT can drive and extract greater value from the healthcare industry. Two areas of recent interest have focused on how best to use blockchain and artificial intelligence technologies to drive greater efficiencies in our healthcare and biotechnology industries.

More importantly, with the substantial increase in ‘omic data generated both in research and in the clinical setting, it has become imperative to develop ways to securely store the massive amounts of ‘omic data and disseminate them efficiently to the relevant parties (researchers or clinicians), while protecting personal privacy and adhering to international regulations. This is where blockchain technologies may play an important role.

A recent Oncotarget paper by Mamoshina et al. (1) discussed the possibility that next-generation artificial intelligence and blockchain technologies could synergize to accelerate biomedical research, give patients new tools to control and profit from their personal healthcare data, and assist patients with their healthcare monitoring needs. According to the abstract:

The authors introduce new concepts to appraise and evaluate personal records, including the combination-, time- and relationship value of the data. They also present a roadmap for a blockchain-enabled decentralized personal health data ecosystem to enable novel approaches for drug discovery, biomarker development, and preventative healthcare. In this system, blockchain and deep learning technologies would provide the secure and transparent distribution of personal data in a healthcare marketplace, and would also be useful to resolve challenges faced by the regulators and return control over personal data including medical records to the individual.

The review discusses:

- Recent achievements in next-generation artificial intelligence
- Basic concepts of highly distributed storage systems (HDSS) as a preferred method for medical data storage
- The open source blockchain framework Exonum and its application to a healthcare marketplace
- A blockchain-based platform allowing patients to have control of their data and manage access
- How advances in deep learning can improve data quality, especially in an era of big data

Advances in Artificial Intelligence

Integrative analysis of the vast amount of health-associated data from a multitude of large scale global projects has proven to be highly problematic (REF 27), as high quality biomedical data is highly complex and of a heterogeneous nature, which necessitates special preprocessing and analysis.
Increased computing processing power and algorithm advances have led to significant advances in machine learning, especially machine learning involving Deep Neural Networks (DNNs), which are able to capture high-level dependencies in healthcare data. Some examples of the uses of DNNs are:

- Prediction of drug properties (2, 3) and toxicities (4)
- Biomarker development (5)
- Cancer diagnosis (6)
- The first FDA-approved system based on deep learning, Arterys Cardio DL

Other promising systems of deep learning include:
- Generative Adversarial Networks (GAN; https://arxiv.org/abs/1406.2661): require good datasets for extensive training but have been used to determine the tumor growth inhibition capabilities of various molecules (7)
- Recurrent Neural Networks (RNN): originally made for sequence analysis, RNNs have proved useful in analyzing text and time-series data, and thus would be very useful for electronic record analysis. They have also been useful in predicting blood glucose levels of Type I diabetic patients using data obtained from continuous glucose monitoring devices (8)
- Transfer Learning: focuses on translating information learned on one domain or larger dataset to another, smaller domain. It is meant to reduce the dependence on the large training datasets that RNNs, GANs, and DNNs require. Biomedical imaging datasets are one example of the use of transfer learning.
- One- and Zero-Shot Learning: like transfer learning, these retain the ability to work with restricted datasets. One-shot learning aims to recognize new data points based on a few examples from the training set, while zero-shot learning aims to recognize new objects without seeing examples of those instances within the training set.
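
To make the one-shot idea concrete, here is a minimal sketch (not taken from the Mamoshina et al. review) that labels a new data point from a single labelled example per class, using plain Euclidean distance. The class names and feature values are made up for illustration; in practice the distance would usually be computed in a feature space learned by a deep network rather than on raw inputs.

```python
import math


def one_shot_classify(support, query):
    """Return the label whose single stored example is closest to `query`.

    `support` maps each class label to exactly one feature vector
    (the "one shot"); `query` is the new feature vector to label.
    """
    return min(support, key=lambda label: math.dist(support[label], query))


# Hypothetical two-feature examples, one per class.
support_set = {
    "class_a": (0.1, 0.2),
    "class_b": (0.9, 0.8),
}
predicted = one_shot_classify(support_set, (0.8, 0.7))
```

The design choice here is simply nearest-neighbour matching; the point is that no retraining is needed when a new class is added, only one more entry in the support set.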

Highly Distributed Storage Systems (HDSS)

The explosion in data generation has necessitated the development of better systems for data storage and handling. An HDSS needs to be reliable, accessible, scalable, and affordable. This involves storing data across different nodes and replicating the data stored in those nodes, which makes access rapid. However, data consistency and affordability are big challenges.
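
The trade-off between replication and consistency can be sketched in a few lines. The toy store below is an illustration only, with a hypothetical class name and made-up keys: it copies each write to every reachable node, and shows how a node that missed an update serves a stale copy unless the reader compares versions across replicas.

```python
class ReplicatedStore:
    """Toy key-value store that keeps a copy of each record on several nodes."""

    def __init__(self, n_nodes=3):
        self.nodes = [dict() for _ in range(n_nodes)]

    def write(self, key, value, version, reachable=None):
        """Store (version, value) on every reachable node (all nodes by default)."""
        targets = range(len(self.nodes)) if reachable is None else reachable
        for i in targets:
            self.nodes[i][key] = (version, value)

    def read_one(self, key, node):
        """Fast read from a single node; may return a stale copy."""
        return self.nodes[node].get(key)

    def read_latest(self, key):
        """Slower read that asks every node and keeps the highest version."""
        copies = [n[key] for n in self.nodes if key in n]
        return max(copies, key=lambda c: c[0]) if copies else None


store = ReplicatedStore()
store.write("patient-42", "record v1", version=1)
# Node 2 is unreachable during the update, so it keeps the old copy.
store.write("patient-42", "record v2", version=2, reachable=[0, 1])
```

Reading from node 2 alone returns the old record, while `read_latest` pays the cost of contacting all replicas to return the current one.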

Blockchain is a distributed database used to maintain a growing list of records, in which records are divided into blocks that are locked together by cryptographic algorithm(s) to maintain consistency of the data. Each block contains a timestamp and a link to the previous block in the chain. Blockchain is a distributed ledger of blocks, meaning it is owned by, shared among, and accessible to everyone. This allows a verifiable, secure, and consistent history of a record of events.
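
The linking described above can be illustrated with a minimal sketch, assuming SHA-256 as the hash function and a simple dictionary layout for each block. Both are choices made here for illustration, not details from the review, and the sketch omits consensus and networking entirely.

```python
import hashlib
import json
import time

GENESIS_PREV = "0" * 64  # placeholder link used by the first block


def block_hash(block):
    """Hash the block's contents (everything except its own stored hash)."""
    payload = {k: block[k] for k in ("index", "timestamp", "records", "prev_hash")}
    return hashlib.sha256(json.dumps(payload, sort_keys=True).encode("utf-8")).hexdigest()


def append_block(chain, records, timestamp=None):
    """Append a block carrying a timestamp and a link to the previous block."""
    block = {
        "index": len(chain),
        "timestamp": time.time() if timestamp is None else timestamp,
        "records": records,
        "prev_hash": chain[-1]["hash"] if chain else GENESIS_PREV,
    }
    block["hash"] = block_hash(block)
    chain.append(block)
    return chain


def is_consistent(chain):
    """Check every stored hash and every link to the preceding block."""
    for i, block in enumerate(chain):
        if block["hash"] != block_hash(block):
            return False
        expected_prev = chain[i - 1]["hash"] if i > 0 else GENESIS_PREV
        if block["prev_hash"] != expected_prev:
            return False
    return True


chain = []
append_block(chain, ["record A"], timestamp=1.0)
append_block(chain, ["record B"], timestamp=2.0)
```

Because each block's hash covers the previous block's hash, editing an old record changes that block's hash and breaks the check, which is what gives the ledger its verifiable history.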

Data Privacy and Regulatory Issues

The establishment of the Health Insurance Portability and Accountability Act (HIPAA) in 1996 has provided much-needed regulatory guidance and a framework for clinicians and all concerned parties within the healthcare and health data chain. HIPAA has already provided guidance for the latest technologies impacting healthcare, most notably the use of social media and mobile communications (discussed in this article: Can Mobile Health Apps Improve Oral-Chemotherapy Adherence? The Benefit of Gamification.). The advent of blockchain technology in healthcare poses its own unique challenges; however, HIPAA offers a basis for developing a regulatory framework in this regard. The special standards regarding electronic data transfer are explained in HIPAA’s Privacy Rule, which regulates how certain entities (covered entities) use and disclose individually identifiable health information (Protected Health Information, PHI), and protects the transfer of such information over any medium or electronic data format. However, some of the benefits of blockchain that may revolutionize the healthcare system may be in direct contradiction with HIPAA rules, as outlined below:

Privacy Issues Specific to the Use of Blockchain to Distribute Health Data

- Blockchain was designed as a distributed, decentralized database maintained by multiple independent parties
- Linkage timestamping: although useful for time-dependent data, proof that third parties have not been involved in the process would have to be established, including accountability measures
- Blockchain uses a consensus algorithm even though end users may have their own private key
- Applied cryptography measures and routines are used to decentralize authentication (publicly available)
- Blockchain users are divided into three main categories: 1) maintainers of blockchain infrastructure, 2) external auditors who store a replica of the blockchain, and 3) end users or clients, who may have access to only a relatively small portion of a blockchain but whose software may use cryptographic proofs to verify the authenticity of data.
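
One common form of such a cryptographic proof is a Merkle proof, which lets a client holding only one record and a short list of hashes check that the record belongs to a set summarized by a single root hash. The sketch below is an assumption about how this could look, not the scheme of any particular blockchain. It uses SHA-256 and duplicates the last hash on odd-sized levels, a simplification that production systems usually harden.

```python
import hashlib


def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_levels(leaves):
    """Return every level of the tree, from hashed leaves up to the root."""
    level = [h(leaf) for leaf in leaves]
    levels = [level]
    while len(level) > 1:
        if len(level) % 2 == 1:
            level = level + [level[-1]]  # pad odd-sized levels
        level = [h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
        levels.append(level)
    return levels


def merkle_proof(leaves, index):
    """Collect the sibling hashes needed to rebuild the root from one leaf."""
    proof = []
    for level in merkle_levels(leaves)[:-1]:
        if len(level) % 2 == 1:
            level = level + [level[-1]]
        sibling = index ^ 1
        proof.append((level[sibling], sibling < index))  # (hash, sibling_is_left)
        index //= 2
    return proof


def verify(leaf, proof, root):
    """Recompute the root from the leaf and its proof, then compare."""
    node = h(leaf)
    for sibling, sibling_is_left in proof:
        node = h(sibling + node) if sibling_is_left else h(node + sibling)
    return node == root


records = [b"rec-1", b"rec-2", b"rec-3"]
root = merkle_levels(records)[-1][0]
proof = merkle_proof(records, 2)
```

A client that trusts only `root` can call `verify(b"rec-3", proof, root)` without ever seeing the other records, which is how a user with access to a small portion of the chain can still confirm authenticity.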

YouTube video on How #Blockchain Will Transform Healthcare in 25 Years (please click below)

In Big Data for Better Outcomes, BigData@Heart, DO->IT, EHDN, the EU data Consortia, and yes, even concepts like pay for performance, Richard Bergström has had a hand in their creation. The former Director General of EFPIA, and now the head of health both at SICPA and their joint venture blockchain company Guardtime, Richard is always ahead of the curve. In fact, he’s usually the one who makes the curve in the first place.

Please click on the following link for a podcast on Big Data, Blockchain and Pharma/Healthcare by Richard Bergström:

https://soundcloud.com/vitalhealth/real-world-data-pay-for-performance-or-blockchain-richard-bergstrom-is-always-ahead-of-the-curve

References

1. Mamoshina, P., Ojomoko, L., Yanovich, Y., Ostrovski, A., Botezatu, A., Prikhodko, P., Izumchenko, E., Aliper, A., Romantsov, K., Zhebrak, A., Ogu, I. O., and Zhavoronkov, A. (2018) Converging blockchain and next-generation artificial intelligence technologies to decentralize and accelerate biomedical research and healthcare, Oncotarget 9, 5665-5690.
2. Aliper, A., Plis, S., Artemov, A., Ulloa, A., Mamoshina, P., and Zhavoronkov, A. (2016) Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data, Molecular Pharmaceutics 13, 2524-2530.
3. Wen, M., Zhang, Z., Niu, S., Sha, H., Yang, R., Yun, Y., and Lu, H. (2017) Deep-Learning-Based Drug-Target Interaction Prediction, Journal of Proteome Research 16, 1401-1409.
4. Gao, M., Igata, H., Takeuchi, A., Sato, K., and Ikegaya, Y. (2017) Machine learning-based prediction of adverse drug effects: An example of seizure-inducing compounds, Journal of Pharmacological Sciences 133, 70-78.
5. Putin, E., Mamoshina, P., Aliper, A., Korzinkin, M., Moskalev, A., Kolosov, A., Ostrovskiy, A., Cantor, C., Vijg, J., and Zhavoronkov, A. (2016) Deep biomarkers of human aging: Application of deep neural networks to biomarker development, Aging 8, 1021-1033.
6. Vandenberghe, M. E., Scott, M. L., Scorer, P. W., Soderberg, M., Balcerzak, D., and Barker, C. (2017) Relevance of deep learning to facilitate the diagnosis of HER2 status in breast cancer, Scientific Reports 7, 45938.
7. Kadurin, A., Nikolenko, S., Khrabrov, K., Aliper, A., and Zhavoronkov, A. (2017) druGAN: An Advanced Generative Adversarial Autoencoder Model for de Novo Generation of New Molecules with Desired Molecular Properties in Silico, Molecular Pharmaceutics 14, 3098-3104.
8. Ordonez, F. J., and Roggen, D. (2016) Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition, Sensors (Basel) 16.

Articles from clinicalinformaticsnews.com

Healthcare Organizations Form Synaptic Health Alliance, Explore Blockchain’s Impact On Data Quality

From http://www.clinicalinformaticsnews.com/2018/12/05/healthcare-organizations-form-synaptic-health-alliance-explore-blockchains-impact-on-data-quality.aspx

By Benjamin Ross

December 5, 2018 | The boom of blockchain and distributed ledger technologies has inspired healthcare organizations to test the capabilities of their data. Quest Diagnostics, in partnership with Humana, MultiPlan, and UnitedHealth Group’s Optum and UnitedHealthcare, has launched a pilot program that applies blockchain technology to improve data quality and reduce administrative costs associated with changes to healthcare provider demographic data.

The collective body, called Synaptic Health Alliance, explores how blockchain can keep only the most current healthcare provider information available in health plan provider directories. The alliance plans to share their progress in the first half of 2019.

Providing consumers looking for care with accurate information when they need it is essential to a high-functioning overall healthcare system, Jason O’Meara, Senior Director of Architecture at Quest Diagnostics, told Clinical Informatics News in an email interview.

“We were intentional about calling ourselves an alliance as it speaks to the shared interest in improving health care through better, collaborative use of an innovative technology,” O’Meara wrote. “Our large collective dataset and national footprints enable us to prove the value of data sharing across company lines, which has been limited in healthcare to date.”

O’Meara said Quest Diagnostics has been investing time and resources the past year or two in understanding blockchain, its ability to drive purpose within the healthcare industry, and how to leverage it for business value.

“Many health care and life science organizations have cast an eye toward blockchain’s potential to inform their digital strategies,” O’Meara said. “We recognize it takes time to learn how to leverage a new technology. We started exploring the technology in early 2017, but we quickly recognized the technology’s value is in its application to business to business use cases: to help transparently share information, automate mutually-beneficial processes and audit interactions.”

Quest began discussing the potential for an alliance with the four other companies a year ago, O’Meara said. Each company shared traits that would allow them to prove the value of data sharing across company lines.

“While we have different perspectives, each member has deep expertise in healthcare technology, a collaborative culture, and desire to continuously improve the patient/customer experience,” said O’Meara. “We also recognize the value of technology in driving efficiencies and quality.”

Following its initial launch in April, Synaptic Health Alliance is deploying a multi-company, multi-site, permissioned blockchain. According to a whitepaper published by Synaptic Health, the choice to use a permissioned blockchain rather than an anonymous one is crucial to the alliance’s success.

“This is a more effective approach, consistent with enterprise blockchains,” an alliance representative wrote. “Each Alliance member has the flexibility to deploy its nodes based on its enterprise requirements. Some members have elected to deploy their nodes within their own data centers, while others are using secured public cloud services such as AWS and Azure. This level of flexibility is key to growing the Alliance blockchain network.”

As the pilot moves forward, O’Meara says the Alliance plans to open participation to other organizations. Earlier this week Aetna and Ascension announced they had joined the project.

“I am personally excited by the amount of cross-company collaboration facilitated by this project,” O’Meara says. “We have already learned so much from each other and are using that knowledge to really move the needle on improving healthcare.”

US Health And Human Services Looks To Blockchain To Manage Unstructured Data

http://www.clinicalinformaticsnews.com/2018/11/29/us-health-and-human-services-looks-to-blockchain-to-manage-unstructured-data.aspx

By Benjamin Ross

November 29, 2018 | The US Department of Health and Human Services (HHS) is making waves in the blockchain space. The agency’s Division of Acquisition (DA) has developed a new system, called Accelerate, which gives acquisition teams detailed information on pricing, terms, and conditions across HHS in real-time. The department’s Associate Deputy Assistant Secretary for Acquisition, Jose Arrieta, gave a presentation and live demo of the blockchain-enabled system at the Distributed: Health event earlier this month in Nashville, Tennessee.

Accelerate is still in the prototype phase, Arrieta said, with hopes that the new system will be deployed at the end of the fiscal year.

HHS spends around $25 billion a year in contracts, Arrieta said. That’s 100,000 contracts a year with over one million pages of unstructured data managed through 45 different systems. Arrieta and his team wanted to modernize the system.

“But if you’re going to change the way a workforce of 20,000 people do business, you have to think your way through how you’re going to do that,” said Arrieta. “We didn’t disrupt the existing systems: we cannibalized them.”

The cannibalization process resulted in Accelerate. According to Arrieta, the system functions by creating a record of data rather than storing it, leveraging machine learning, artificial intelligence (AI), and robotic process automation (RPA), all through blockchain data.

“We’re using that data record as a mechanism to redesign the way we deliver services through micro-services strategies,” Arrieta said. “Why is that important? Because if you have a single application or data use that interfaces with 55 other applications in your business network, it becomes very expensive to make changes to one of the 55 applications.”

Accelerate distributes the data to the workforce, making it available to them one business process at a time.

“We’re building those business processes without disrupting the existing systems,” said Arrieta, and that’s key. “We’re not shutting off those systems. We’re using human-centered design sessions to rebuild value exchange off of that data.”

The first application for the system, Arrieta said, can be compared to department stores price-matching their online competitors.

It takes HHS close to a month to collect the amalgamation of data from existing systems, whether that be terms and conditions that drive certain price points, or software licenses.

“The micro-service we built actually analyzes that data, and provides that information to you within one second,” said Arrieta. “This is distributed to the workforce, to the 5,000 people that do the contracting, to the 15,000 people that actually run the programs at [HHS].”

This simple micro-service is replicated on every node related to HHS’s internal workforce. If somebody wants to change the algorithm to fit their needs, they can do that in a distributed manner.

Arrieta hopes to use Accelerate to save researchers money at the point of purchase. The program uses blockchain to simplify the process of acquisition.

“How many of you work with the federal government?” Arrieta asked the audience. “Do you get sick of reentering the same information over and over again? Every single business opportunity you apply for, you have to resubmit your financial information. You constantly have to check for validation and verification, constantly have to resubmit capabilities.”

Wouldn’t it be better, Arrieta asked, to have historical notes available for each transaction? This would allow clinical researchers to focus on “the things they’re really good at,” instead of red tape.

“If we had the top cancer researcher in the world, would you really want her spending her time learning about federal regulations as to how to spend money, or do you want her trying to solve cancer?” Arrieta said. “What we’re doing is providing that data to the individual in a distributed manner so they can read the information of historical purchases that support activity, and they can focus on the objectives and risks they see as it relates to their programming and their objectives.”

Blockchain also creates transparency among researchers, Arrieta said, which he says creates an “uncomfortable reality” in that they have to make a decision regarding data, fundamentally changing value exchange.

“The beauty of our business model is internal investment,” Arrieta said. For instance, the HHS could take all the sepsis data that exists in their system, put it into a distributed ledger, and share it with an external source.

“Maybe that could fuel partnership,” Arrieta said. “I can make data available to researchers in the field in real-time so they can actually test their hypothesis, test their intuition, and test their imagination as it relates to solving real-world problems.”

Shivom is creating a genomic data hub to elongate human life with AI

STEWART ROGERS @THEREALSJR AUGUST 3, 2018 8:30 AM

From VentureBeat.com

Blockchain-based genomic data hub platform Shivom recently reached its $35 million hard cap within 15 seconds of opening its main token sale. Shivom received funding from a number of crypto VC funds, including Collinstar, Lateral, and Ironside.

The goal is to create the world’s largest store of genomic data while offering an open web marketplace for patients, data donors, and providers — such as pharmaceutical companies, research organizations, governments, patient-support groups, and insurance companies.

“Disrupting the whole of the health care system as we know it has to be the most exciting use of such large DNA datasets,” Shivom CEO Henry Ines told me. “We’ll be able to stratify patients for better clinical trials, which will help to advance research in precision medicine. This means we will have the ability to make a specific drug for a specific patient based on their DNA markers. And what with the cost of DNA sequencing getting cheaper by the minute, we’ll also be able to sequence individuals sooner, so young children or even newborn babies could be sequenced from birth and treated right away.”

While there are many solutions examining DNA data to explain heritage, intellectual capabilities, health, and fitness, the potential of genomic data has largely yet to be unlocked. A few companies hold the monopoly on genomic data and make sizeable profits from selling it to third parties, usually without sharing the earnings with the data donor. Donors are also not informed if and when their information is shared, nor do they have any guarantee that their data is secure from hackers.

Shivom wants to change that by creating a decentralized platform that will break these monopolies, democratizing the processes of sharing and utilizing the data.

“Overall, large DNA datasets will have the potential to aid in the understanding, prevention, diagnosis, and treatment of every disease known to mankind, and could create a future where no diseases exist, or those that do can be cured very easily and quickly,” Ines said. “Imagine that, a world where people do not get sick or are already aware of what future diseases they could fall prey to and so can easily prevent them.”

Shivom’s use of blockchain technology and smart contracts ensures that all genomic data shared on the platform will remain anonymous and secure, while its OmiX token incentivizes users to share their data for monetary gain.

Rise in Population Genomics: Local Government in India Will Use Blockchain to Secure Genetic Data

Blockchain will secure the DNA database for 50 million citizens in the eighth-largest state in India. The government of Andhra Pradesh signed a Memorandum of Understanding with a German genomics and precision medicine start-up, Shivom, which announced that it will start the pilot project soon. The move falls in line with a trend of governments turning to population genomics while securing the sensitive data through blockchain.

Andhra Pradesh, DNA, and blockchain

Storing sensitive genetic information safely and securely is a big challenge. Shivom builds a genomic data-hub powered by blockchain technology. It aims to connect researchers with DNA data donors thus facilitating medical research and the healthcare industry.

With regard to Andhra Pradesh, the start-up will first launch a trial to determine the viability of its technology for moving from a proactive to a preventive approach in medicine, and towards precision health. “Our partnership with Shivom explores the possibilities of providing an efficient way of diagnostic services to patients of Andhra Pradesh by maintaining the privacy of the individual data through blockchain technologies,” said J A Chowdary, IT Advisor to Chief Minister, Government of Andhra Pradesh.

PrecisionFDA Consistency Challenge supports projects to validate and increase reproducibility of genomic testing methods

Posted in Clinical Genomics, tagged clinical genomics, genomics, next generation sequencing on March 15, 2016

PrecisionFDA Consistency Challenge supports projects to validate and increase reproducibility of genomic testing methods

Reporter: Stephen J. Williams, Ph.D.

PrecisionFDA
Consistency Challenge

Engage and improve DNA test results with our first community challenge


The Food and Drug Administration (FDA) calls on the genomics community to further assess, compare, and improve techniques used in DNA testing by launching the first precisionFDA challenge.

President Obama’s Precision Medicine Initiative envisions a day when an individual’s medical care will be tailored in part based on their unique characteristics and genetic make-up.

The goal of the FDA’s first precisionFDA challenge is to engage the genomics community in advancing the quality standards in order to achieve more consistent results in the context of genetic tests (related to whole human genome sequencing), advancing the goal of better personalized care.

PrecisionFDA invites all innovators to take the challenge and assess their software on the supplied reference human datasets. Participation is voluntary, but instrumental in helping the community prepare for the coming genomic data revolution.

Challenge Time Period

February 25, 2016 through April 25, 2016

AT A GLANCE

In the context of whole human genome sequencing, software pipelines typically rely on mapping sequencing reads to a reference genome and subsequently identifying variants (differences). One way of assessing the performance of such pipelines is by using well-characterized datasets such as Genome in a Bottle’s NA12878.

By supplying NA12878 whole-genome sequencing read datasets (FASTQ), and a framework for comparing variant call format (VCF) results, this challenge provides a common frame of reference for measuring some of the aspects of reproducibility and accuracy of participants’ pipelines.

The challenge begins with two precisionFDA-provided input datasets, corresponding to whole-genome sequencing of the NA12878 human sample at two different sequencing sites. Your mission is to process these FASTQ files through your mapping and variant calling pipeline and create VCF files. For one of the datasets, you are required to rerun your pipeline and obtain a rerun VCF as well. You can generate those results in your own environment and upload them to precisionFDA, or you can reconstruct your pipeline on precisionFDA and run it on the cloud.

Regardless of how you generate your VCF files, you will subsequently use the precisionFDA comparison framework to conduct several pairwise comparisons:

By comparing the rerun VCF to the original one, you will evaluate your pipeline’s reproducibility with respect to the same exact input file.
By comparing the VCF files of the two datasets, you will evaluate reproducibility on the same sample across different sites.
By comparing each of your three VCF files to the NIST (Genome in a Bottle) benchmark VCF, you will get estimates for accuracy.

The complete set of these five comparisons constitutes your submission entry to the challenge. Each comparison outputs several metrics (such as precision*, recall*, f-measure, or number of non-common variants). Selected participants and winners** will be recognized on the precisionFDA website. Therefore, we hope you are willing to share your experience with others to further enhance the community’s effort to ensure consistency of tests.
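As a rough illustration of what such a comparison measures, the sketch below computes precision, recall, f-measure, and the number of non-common variants for a pair of call sets, then assembles the five pairwise comparisons described above. It is not the precisionFDA comparison framework: the function names (`variant_keys`, `compare`, `challenge_comparisons`) are hypothetical, and it assumes that two variants match only when chromosome, position, reference allele, and alternate allele are identical. Real comparison tools also normalize variant representation, consider genotypes, and restrict scoring to confident regions, none of which is attempted here.

```python
def variant_keys(vcf_lines):
    """Collect (chrom, pos, ref, alt) keys from the data lines of a VCF file.

    Header lines (starting with '#') and blank lines are skipped.
    Multi-allelic records contribute one key per alternate allele.
    """
    keys = set()
    for line in vcf_lines:
        if not line.strip() or line.startswith("#"):
            continue
        chrom, pos, _id, ref, alt = line.rstrip("\n").split("\t")[:5]
        for allele in alt.split(","):
            keys.add((chrom, int(pos), ref, allele))
    return keys


def compare(test, truth):
    """Score a test call set against a reference call set by exact key match."""
    tp = len(test & truth)   # variants present in both sets
    fp = len(test - truth)   # variants only in the test set
    fn = len(truth - test)   # variants only in the reference set
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "non_common": fp + fn,
    }


def challenge_comparisons(site1, site1_rerun, site2, benchmark):
    """Build the five pairwise comparisons from three call sets and a benchmark."""
    pairs = {
        "rerun_vs_original": (site1_rerun, site1),
        "site2_vs_site1": (site2, site1),
        "site1_vs_benchmark": (site1, benchmark),
        "site1_rerun_vs_benchmark": (site1_rerun, benchmark),
        "site2_vs_benchmark": (site2, benchmark),
    }
    return {name: compare(a, b) for name, (a, b) in pairs.items()}
```

In this sketch a perfectly reproducible pipeline would score a precision and recall of 1.0 with zero non-common variants in the rerun comparison, while the three benchmark comparisons would give the accuracy estimates.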

The challenge runs until April 25, 2016.

CHALLENGE DETAILS

Last updated: March 2nd, 2016

Getting on the precisionFDA website

If you do not yet have a contributor account on precisionFDA, file an access request with your complete information, and indicate that you are entering the challenge. The FDA acts as steward of the precisionFDA service for the community and ensures proper use of its resources, so your request will initially be pending. In the meantime, you will receive an email with a link to access the precisionFDA website in browse (guest) mode. Once approved, you will receive another email with your contributor account information.

With your contributor account you can use the features required to participate in the challenge (such as transfer files or run comparisons). Everything you do on precisionFDA is initially private to you (not accessible to the FDA or the rest of the community) until you choose to publicize it. So you can immediately start working on the challenge in private, and whenever you are ready you can officially publish your results as your challenge entry.

Locating and understanding the input files

Running your pipeline

(Optional) Reconstructing your pipeline on precisionFDA

Conducting the comparisons

Submitting your entry

Determining the winners

Extra credit

Footnotes

* The terminology currently used in the precisionFDA comparison output (such as “precision” and “recall”) is not necessarily harmonized with definitions used by ISO, CLSI, or FDA, but consists of terms commonly used by NGS software developers.

** Winning a precisionFDA challenge is an acknowledgement by the precisionFDA community and does not imply FDA endorsement of any organization, tool, software, etc.


Cambridge Healthtech Institute’s Third Annual Clinical NGS Assays; August 23-24, 2016 Washington DC

Posted in Clinical Diagnostics, Clinical Genomics, Genome Biology, tagged clinical genomics, genetic testing, next generation sequencing, NGS, oncology on March 14, 2016

Cambridge Healthtech Institute’s Third Annual

Clinical NGS Assays

Addressing Validation, Standards, and Clinical Relevance for Improved Outcomes

August 23-24, 2016 | Grand Hyatt Hotel | Washington, DC

Reporter: Stephen J. Williams, PhD


Molecular diagnostics, particularly next-generation sequencing (NGS), have become an integral component of disease diagnosis. Still, there is work to be done to establish these tools as the standard of care. The Third Annual Clinical NGS Assays event will address NGS assay validation, establishing NGS standards, and determining clinical relevance. The pros and cons of various techniques such as gene panels, whole exome, and whole genome sequencing will also be debated with regards to depth of coverage, clinical utility, and reimbursement. Overall, this event will address the needs of both researchers and clinicians while exploring strategies to increase collaboration for improved patient outcomes.

Special Early Registration Savings Available
Register Now to Save up to $450

Preliminary Agenda

ASSAY VALIDATION AND ANALYSIS

Best Practices for Using Genome in a Bottle Reference Materials to Benchmark Variant Calls
Justin Zook, National Institute of Standards and Technology

NGS in Clinical Diagnosis: Aspects of Quality Management
Pinar Bayrak-Toydemir, M.D., Ph.D., FACMG, Associate Professor, Pathology, University of Utah; Medical Director, Molecular Genetics and Genomics, ARUP Laboratories

Thorough Validation and Implementation of Preimplantation Genetic Screening for Aneuploidy by NGS
Rebekah Zimmerman, Ph.D., Laboratory Director, Clinical Genetics, Foundation for Embryonic Competence

EXOME INTERPRETATION CHALLENGES

Are We There Yet? The Odyssey of Exome Analysis and Interpretation
Avni B. Santani, Ph.D., Director, Genomic Diagnostics, Pathology and Lab Medicine, The Children’s Hospital of Philadelphia

Challenges in Exome Interpretation: Intronic Variants
Rong Mao, M.D., Associate Professor, Pathology, University of Utah; Medical Director, Molecular Genetics and Genomics, ARUP Laboratories

Exome Sequencing: Case Studies of Diagnostic and Ethical Challenges
Lora J. H. Bean, Ph.D., Assistant Professor, Human Genetics, Emory University

ESTABLISHING STANDARDS

Implementing Analytical and Process Standards
Karl V. Voelkerding, M.D., Professor, Pathology, University of Utah; Medical Director for Genomics and Bioinformatics, ARUP Laboratories

Assuring the Quality of Next-Generation Sequencing in Clinical Laboratory Practice
Shashikant Kulkarni, M.S., Ph.D., Professor, Pathology and Immunology; Head of Clinical Genomics, Genomics and Pathology Services; Director, Cytogenomics and Molecular Pathology, Washington University at St. Louis

Sponsored Presentation to be Announced by Genection

PANEL DISCUSSION: GENE PANEL VS. WHOLE EXOME VS. WHOLE GENOME

Panelists:
John Chiang, Ph.D., Director, Casey Eye Institute, Oregon Health & Science University
Avni B. Santani, Ph.D., Director, Genomic Diagnostics, Pathology and Lab Medicine, The Children’s Hospital of Philadelphia
Additional Panelist to be Announced

DETERMINING CLINICAL SIGNIFICANCE AND RETURNING RESULTS

Utility of Implementing Clinical NGS Assays as Standard-of-Care in Oncology
Helen Fernandes, Ph.D., Pathology & Laboratory Medicine, Weill Cornell Medical College

An NGS Inter-Laboratory Study to Assess Performance and QC – Sponsored by Seracare
Andrea Ferreira-Gonzalez, Ph.D., Chair, Molecular Diagnostics Division, Pathology, Virginia Commonwealth University Medical School

This conference is part of the Eighth Annual Next-Generation Dx Summit.


For exhibit & sponsorship opportunities, please contact:
Joseph Vacca, M.Sc.
Associate Director, Business Development
Cambridge Healthtech Institute
T: (+1) 781-972-5431
E: jvacca@healthtech.com


Roche is developing a high-throughput low cost sequencer for NGS

Posted in Bio Instrumentation in Experimental Life Sciences Research, BioIT: BioInformatics, BioIT: BioInformatics, NGS, Clinical & Translational, Pharmaceutical R&D Informatics, Clinical Genomics, Cancer Informatics, Clinical Diagnostics, Clinical Genomics, Next Generation Sequencing (NGS), tagged clinical genomics, deep sequencing, genomics, high throughput, highthroughput, next generation sequencing, NGS, Roche, Roche Diagnostics, Roche Institute of Molecular Biology on December 16, 2015

Roche is developing a high-throughput low cost sequencer for NGS

Reporter: Stephen J. Williams, PhD

Reported from Diagnostic World News

Long-Read Sequencing in the Age of Genomic Medicine

By Aaron Krol

December 16, 2015 | This September, Pacific Biosciences announced the creation of the Sequel, a DNA sequencer half the cost and seven times as powerful as its previous RS II instrument. PacBio, with its unique long-read sequencing technology, had already secured a place in high-end research labs, producing finished, highly accurate genomes and helping to explore the genetic “dark matter” that other next-generation sequencing (NGS) instruments miss. Now, in partnership with Roche Diagnostics, PacBio is repositioning itself as a company that can serve hospitals as well.

“Pseudogenes, large structural variants, validation, repeat disorders, polymorphic regions of the genome―all those are categories where you practically need PacBio,” says Bobby Sebra, Director of Technology Development at the Icahn School of Medicine at Mount Sinai. “Those are gaps in the system right now for short-read NGS.”

Mount Sinai’s genetic testing lab owns three RS II sequencers, running almost around the clock, and was the first lab to announce it had bought a Sequel just weeks after the new instruments were launched. (It arrived earlier this month and has been successfully tested.) Sebra’s group uses these sequencers to read parts of the genome that, thanks to their structural complexity, can only be assembled from long, continuous DNA reads.

There are a surprising number of these blind spots in the human genome. “HLA is a huge one,” Sebra says, referring to a highly variable region of the genome involved in the immune system. “It impacts everything from immune response, to pharmacogenomics, to transplant medicine. It’s a pretty important and really hard-to-genotype locus.”

Nonetheless, few clinical organizations are studying PacBio or other long-read technologies. PacBio’s instruments, even the Sequel, come with a relatively high price tag, and research on their value in treating patients is still tentative. Mount Sinai’s confidence in the technology is surely at least partly due to the influence of Sebra―an employee of PacBio for five years before coming to New York―and Genetics Department Chair Eric Schadt, at one time PacBio’s Chief Scientific Officer.

Even here, the sequencers typically can’t be used to help treat patients, as the instruments are sold for research use only. Mount Sinai is still working on a limited number of tests to submit as diagnostics to New York State regulators.

Physician Use

Roche Diagnostics, which invested $75 million in the development of the Sequel, wants to change that. The company is planning to release its own, modified version of the instrument in the second half of 2016, specifically for diagnostic use. Roche will initially promote the device for clinical studies, and eventually seek FDA clearance to sell it for routine diagnosis of patients.

In an email to Diagnostics World, Paul Schaffer, Lifecycle Leader for Roche’s sequencing platforms division, wrote that the new device will feature an integrated software pipeline to interpret test results, in support of assays that Roche will design and validate for clinical indications. The instrument will also have at least minor hardware modifications, like near field communication designed to track Roche-branded reagents used during sequencing.

This new version of the Sequel will probably not be the first instrument clinical labs turn to when they decide to start running NGS. Short-read sequencers are sure to outcompete the Roche machine on price, and can offer a pretty useful range of assays, from co-diagnostics in cancer to carrier testing for rare genetic diseases. But Roche can clear away some of the biggest barriers to entry for hospitals that want to pursue long-read sequencing.

Today, institutions like Mount Sinai that use PacBio typically have to write a lot of their own software to interpret the data that comes off the machines. Off-the-shelf analysis, with readable diagnostic reports for doctors, will make it easier for hospitals with less research focus to get on board. To this end, Roche acquired Bina, an NGS analysis company that handles structural variants and other PacBio specialties, in late 2014.

The next question will be whether Roche can design a suite of tests that clinical labs will want to run. Long-read sequencing is beloved by researchers because it can capture nearly complete genomes, finding the correct order and orientation of DNA reads. “The long-read technologies like PacBio’s are going to be, in the future, the showcase that ties it all together,” Sebra says. “You need those long reads as scaffolds to bring it together.”

But that envisions a future in which doctors will want to sequence their patients’ entire genomes. When it comes to specific medical tests, targeting just a small part of the genome connected to disease, Roche will have to content itself with some niche applications where PacBio stands out.

Early Applications

“At this time we are not releasing details regarding the specific assays under development,” Schaffer told Diagnostics World in his email. “However, virology and genetics are a key focus, as they align with other high-priority Roche Diagnostics products.”

Genetic disease is the obvious place to go with any sequencing technology. Rare hereditary disorders are much easier to understand on a genetic level than conditions like diabetes or heart disease; typically, the pathology can be traced back to a single mutation, making it easy to interpret test results.

Some of these mutations are simply intractable for short-read sequencers. A whole class of diseases, the PolyQ disorders and other repeat disorders, develop when a patient has too many copies of a single, repetitive sequence in a gene region. The gene Huntingtin, for example, contains a long stretch of the DNA code CAG; people born with 40 or more CAG repeats in a row will develop Huntington’s disease as they reach early adulthood.

These disorders would be a prime target for Roche’s sequencer. The Sequel’s long reads, spanning thousands of DNA letters at a stretch, can capture the entire repeat region of Huntingtin in a single read, unlike short-read sequencers, which tend to produce a garbled mess of CAG reads that are impossible to count or put in order.
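To make the counting problem concrete, here is a minimal sketch of how a CAG repeat could be sized from a single read, and why a read must extend beyond both ends of the repeat to give a trustworthy count. The function names and the `min_flank` parameter are hypothetical, the sequences are synthetic, and the approach assumes an error-free read with perfectly uninterrupted repeats. It is not a clinical genotyping method, and real tools must tolerate sequencing errors and interrupted repeats.

```python
import re

CAG_RUN = re.compile(r"(?:CAG)+")


def longest_cag_run(read):
    """Return the largest number of consecutive CAG units found in a read."""
    runs = CAG_RUN.findall(read.upper())
    return max((len(run) // 3 for run in runs), default=0)


def repeat_is_spanned(read, min_flank=10):
    """Check that the longest CAG run has non-repeat sequence on both sides.

    A run that touches either end of the read may continue beyond it,
    so its length is only a lower bound on the true repeat count.
    """
    seq = read.upper()
    best = max(CAG_RUN.finditer(seq),
               key=lambda m: m.end() - m.start(),
               default=None)
    if best is None:
        return False
    return best.start() >= min_flank and len(seq) - best.end() >= min_flank


# Synthetic long read: flanking sequence on both sides of 42 CAG units.
long_read = "ACGT" * 5 + "CAG" * 42 + "TGCA" * 5
# Synthetic short read: lies entirely inside the repeat, so no flanks.
short_read = "CAG" * 33

for name, read in (("long", long_read), ("short", short_read)):
    print(name, longest_cag_run(read), repeat_is_spanned(read))
```

In this toy case the long read yields a count that can be compared with the 40-repeat figure cited above, whereas the short read shows only that the repeat is at least as long as the read itself.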

Nonetheless, the length of reads is not the only obstacle to understanding these very obstinate diseases. “The entire category of PolyQ disorders, and Fragile X and Huntington’s, is really important,” says Sebra. “But to be frank, they’re the most challenging even with PacBio.” He suggests that, even without venturing into the darkest realms of the genome, a long-read sequencer might actually be useful for diagnosing many of the same genetic diseases routinely covered by other instruments.

That’s because, even when the gene region involved in a disease is well known, there’s rarely only one way for it to go awry. “An example of that is Gaucher’s disease, in a gene called GBA,” Sebra says. “In that gene, there are hundreds of known mutations, some of which you can absolutely genotype using short reads. But others, you would need to phase the entire block to really understand.” Long-read sequencing, which is better at distinguishing maternal from paternal DNA and highlighting complex rearrangements within a gene, can offer a more thorough look at diseases with many genetic permutations, especially when tracking inheritance through a family.

“You can think of long-read sequencing as a really nice way to supplement some of the inherited panels or carrier screening panels,” Sebra says. “You can also use PacBio to verify variants that are called with short-read sequencing.”

Virology is, perhaps, a more surprising focus for Roche. Diagnosing a viral (or bacterial, or fungal) infection with NGS only requires finding a DNA read unique to a particular species or strain, something short-read sequencers are perfectly capable of.

But Mount Sinai, which has used PacBio in pathogen surveillance projects, has seen advantages to getting the full, completely assembled genomes of the organisms it’s tracking. With bacteria, for instance, key genes that confer resistance to antibiotics might be found either in the native genome, or inside plasmids, small packets of DNA that different species of bacteria freely pass between each other. If your sequencer can assemble these plasmids in one piece, it’s easier to tell when there’s a risk of antibiotic resistance spreading through the hospital, jumping from one infectious species to another.

Viruses don’t share their genetic material so freely, but a similar logic can still apply to viral infections, even in a single person. “A virus is really a mixture of different quasi-species,” says Sebra, so a patient with HIV or influenza likely has a whole constellation of subtly different viruses circulating in their body. A test that assembles whole viral genomes—which, given their tiny size, PacBio can often do in a single read—could give physicians a more comprehensive view of what they’re dealing with, and highlight any quasi-species that affect the course of treatment or how the virus is likely to spread.

The Broader View

These applications are well suited to the diagnostic instrument Roche is building. A test panel for rare genetic diseases can offer clear-cut answers, pointing physicians to any specific variants linked to a disorder, and offering follow-up information on the evidence that backs up that call.

That kind of report fits well into the workflows of smaller hospital labs, and is relatively painless to submit to the FDA for approval. It doesn’t require geneticists to puzzle over ambiguous results. As Schaffer says of his company’s overall NGS efforts, “In the past two years, Roche has been actively engaged in more than 25 partnerships, collaborations and acquisitions with the goal of enabling us to achieve our vision of sample in to results out.”

But some of the biggest ways medicine could benefit from long-read sequencing will continue to require the personal touch of labs like Mount Sinai’s.

Take cancer, for example, a field in which complex gene fusions and genetic rearrangements have been studied for decades. Tumors contain multitudes of cells with unique patchworks of mutations, and while long-read sequencing can pick up structural variants that may play a role in prognosis and treatment, many of these variants are rarely seen, little documented, and hard to boil down into a physician-friendly answer.

An ideal way to unravel a unique cancer case would be to sequence the RNA molecules produced in the tumor, creating an atlas of the “transcriptome” that shows which genes are hyperactive, which are being silenced, and which have been fused together. “When you run something like IsoSeq on PacBio and you can see truly the whole transcriptome, you’re going to figure out all possible fusions, all possible splicing events, and the true atlas of reads,” says Sebra. “Cancer is so diverse that it’s important to do that on an individual level.”

Occasionally, looking at the whole transcriptome, and seeing how a mutation in one gene affects an entire network of related genes, can reveal an unexpected treatment option―repurposing a drug usually reserved for other cancer types. But that takes a level of attention and expertise that is hard to condense into a mass-market assay.

And, Sebra suggests, there’s another reason for medical centers not to lean too heavily on off-the-shelf tests from vendors like Roche.

Devoted as he is to his onetime employer, Sebra is also a fan of other technologies now emerging to capture some of the same long-range, structural information on the genome. “You’ve now got 10X Genomics, BioNano, and Oxford Nanopore,” he says. “Often, any two or even three of those technologies, when you merge them together, can get you a much more comprehensive story, sometimes faster and sometimes cheaper.” At Mount Sinai, for example, combining BioNano and PacBio data has produced a whole human genome much more comprehensive than either platform can achieve on its own.

The same is almost certainly true of complex cases like cancer. Yet, while companies like Roche might succeed in bringing NGS diagnostics to a much larger number of patients, they have few incentives to make their assays work with competing technologies the way a research-heavy institute like Mount Sinai does.

“It actually drives the commercialization of software packages against the ability to integrate the data,” Sebra says.

Still, he’s hopeful that the Sequel can lead the industry to pay more attention to long-read sequencing in the clinic. “The RS II does a great job of long-read sequencing, but the throughput for the Sequel is so much higher that you can start to achieve large genomes faster,” he says. “It makes it more accessible for people who don’t own the RS II to get going.” And while the need for highly specialized genetics labs won’t be falling off anytime soon, most patients don’t have the luxury of being treated in a hospital with the resources of Mount Sinai. NGS companies increasingly see physicians as some of their most important customers, and as our doctors start checking into the health of our genomes, it would be a shame if ubiquitous short-read sequencing left them with blind spots.

Source: http://diagnosticsworldnews.com/2015/12/16/long-read-sequencing-age-genomic-medicine.aspx

How Will FDA’s new precisionFDA Science 2.0 Collaboration Platform Protect Data?

Posted in Clinical & Translational, Clinical Diagnostics, Clinical Genomics, Computational Biology/Systems and Bioinformatics, Curation, Curation methodology, FDA, FDA Regulatory Affairs, tagged #science2_0, @science 2_0, biocuration, clinical genomics, Curation, NGS, Open access, scientific networks, Social media, Social network, Social science on December 16, 2015

Reporter: Stephen J. Williams, Ph.D.

As reported in MassDevice.com

FDA launches precisionFDA to harness the power of scientific collaboration

December 16, 2015 By MassDevice

By: Taha A. Kass-Hout, M.D., M.S. and Elaine Johanson

Imagine a world where doctors have at their fingertips the information that allows them to individualize a diagnosis, treatment or even a cure for a person based on their genes. That’s what President Obama envisioned when he announced his Precision Medicine Initiative earlier this year. Today, with the launch of FDA’s precisionFDA web platform, we’re a step closer to achieving that vision.

PrecisionFDA is an online, cloud-based portal that will allow scientists from industry, academia, government and other partners to come together to foster innovation and develop the science behind a method of “reading” DNA known as next-generation sequencing (or NGS). Next-generation sequencing allows scientists to compile a vast amount of data on a person’s exact order or sequence of DNA. Recognizing that each person’s DNA is slightly different, scientists can look for meaningful differences in DNA that can be used to suggest a person’s risk of disease and possible response to treatment, and to assess their current state of health. Ultimately, what we learn about these differences could be used to design a treatment tailored to a specific individual.

The precisionFDA platform is a part of this larger effort and through its use we want to help scientists work toward the most accurate and meaningful discoveries. precisionFDA users will have access to a number of important tools to help them do this. These tools include reference genomes, such as “Genome in a Bottle,” a reference sample of DNA for validating human genome sequences developed by the National Institute of Standards and Technology. Users will also be able to compare their results to previously validated reference results as well as share their results with other users, track changes and obtain feedback.
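To make the idea of comparing results against validated reference results concrete, here is a minimal sketch in Python. It is not the precisionFDA interface: the function name `compare_to_truth`, the tuple representation of a variant as (chromosome, position, reference allele, alternate allele), and the example variants are all assumptions made for illustration. Real comparisons also need to reconcile different representations of the same variant, which this sketch ignores.

```python
def compare_to_truth(calls, truth):
    """Compare a set of variant calls against a validated truth set.

    Both arguments are iterables of hypothetical (chrom, pos, ref, alt) tuples.
    Variants are matched by exact equality only.
    """
    calls, truth = set(calls), set(truth)
    true_pos = len(calls & truth)    # called and present in the truth set
    false_pos = len(calls - truth)   # called but absent from the truth set
    false_neg = len(truth - calls)   # in the truth set but not called
    precision = true_pos / len(calls) if calls else 0.0
    recall = true_pos / len(truth) if truth else 0.0
    return {
        "true_pos": true_pos,
        "false_pos": false_pos,
        "false_neg": false_neg,
        "precision": precision,
        "recall": recall,
    }


# Illustrative, made-up variants (not taken from any real reference sample).
truth_set = [("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"), ("chr2", 50, "G", "A")]
call_set = [("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"), ("chr3", 10, "T", "C")]
report = compare_to_truth(call_set, truth_set)
```

Precision answers "how many of my calls are supported by the reference," and recall answers "how many reference variants did I find." A pipeline tuned for one often loses ground on the other, so both are worth reporting.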

Over the coming months we will engage users in improving the usability, openness and transparency of precisionFDA. One way we’ll achieve that is by placing the code for the precisionFDA portal on the world’s largest open source software repository, GitHub, so the community can further enhance precisionFDA’s features. Through such collaboration we hope to improve the quality and accuracy of genomic tests – work that will ultimately benefit patients.

precisionFDA leverages our experience establishing openFDA, an online community that provides easy access to our public datasets. Since its launch in 2014, openFDA has already resulted in many novel ways to use, integrate and analyze FDA safety information. We’re confident that employing such a collaborative approach to DNA data will yield important advances in our understanding of this fast-growing scientific field, information that will ultimately be used to develop new diagnostics, treatments and even cures for patients.

Taha A. Kass-Hout, M.D., M.S., is FDA’s Chief Health Informatics Officer and Director of FDA’s Office of Health Informatics. Elaine Johanson is the precisionFDA Project Manager.

The opinions expressed in this blog post are the author’s only and do not necessarily reflect those of MassDevice.com or its employees.

So What Are the Other Successes With Such Open Science 2.0 Collaborative Networks?

The following posts highlight examples of these Open Scientific Networks. As long as

transparency
equal contributions (lack of hierarchy)

exist, these networks can flourish and add interesting discourse. Scientists are already relying on these networks to collaborate and share; however, resistance by certain members of an “elite” can still exist. Social media platforms are now democratizing this new Science 2.0 effort. In addition, the efforts of multiple biocurators (who mainly work for love of science) have organized the plethora of data (genomic, proteomic, and literature) in order to provide ease of access and analysis.

Science and Curation: The New Practice of Web 2.0

Curation: an Essential Practice to Manage “Open Science”

The web 2.0 gave birth to new practices motivated by the will to have broader and faster cooperation in a more free and transparent environment. We have entered the era of an “open” movement: “open data”, “open software”, etc. In science, expressions like “open access” (to scientific publications and research results) and “open science” are used more and more often.

Curation and Scientific and Technical Culture: Creating Hybrid Networks

Another area, where there are most likely fewer barriers, is scientific and technical culture. This broad term involves different actors such as associations, companies, universities’ communication departments, CCSTI (French centers for scientific, technical and industrial culture), journalists, etc. A number of these actors do not limit their work to popularizing the scientific data; they also consider they have an authentic mission of “culturing” science. The curation practice thus offers a better organization and visibility to the information. The sought-after benefits will be different from one actor to the next.

Scientific Curation Fostering Expert Networks and Open Innovation: Lessons from Clive Thompson and others

Using Curation and Science 2.0 to build Trusted, Expert Networks of Scientists and Clinicians

Given the aforementioned problems of:

I. the complex and rapid deluge of scientific information

II. the need for a collaborative, open environment to produce transformative innovation

III. the need for alternative ways to disseminate scientific findings

CURATION MAY OFFER SOLUTIONS

I. Curation exists beyond the review: curation decreases time for assessment of current trends adding multiple insights, analyses WITH an underlying METHODOLOGY (discussed below) while NOT acting as mere reiteration, regurgitation

II. Curation provides insights from the WHOLE scientific community on multiple WEB 2.0 platforms

III. Curation makes use of new computational and Web-based tools to provide interoperability of data, reporting of findings (shown in Examples below)

Therefore a discussion is given on methodologies, definitions of best practices, and tools developed to assist the content curation community in this endeavor, which has created a need for more context-driven scientific search and discourse.

However, another issue would be individual bias if these networks are closed; protocols need to be devised to reduce bias from individual investigators and clinicians. This is where CONSENSUS built from OPEN ACCESS DISCOURSE would be beneficial, as discussed in the following post:

Risk of Bias in Translational Science

As per the article

Risk of bias in translational medicine may take one of three forms:

a systematic error of methodology as it pertains to measurement or sampling (e.g., selection bias),
a systematic defect of design that leads to estimates of experimental and control groups, and of effect sizes that substantially deviate from true values (e.g., information bias), and
a systematic distortion of the analytical process, which results in a misrepresentation of the data with consequential errors of inference (e.g., inferential bias).

This post highlights many important points related to bias, but in summary, methodologies and protocols can be devised to eliminate such bias. Risk of bias can seriously adulterate the internal and the external validity of a clinical study, and, unless it is identified and systematically evaluated, can seriously hamper the process of comparative effectiveness and efficacy research and analysis for practice. The Cochrane Group and the Agency for Healthcare Research and Quality have independently developed instruments for assessing the meta-construct of risk of bias. The present article begins to discuss this dialectic.

Information dissemination to all stakeholders is key to increase their health literacy in order to ensure their full participation
threats to internal and external validity represent specific aspects of systematic errors (i.e., bias) in design, methodology and analysis

So what about the safety and privacy of Data?

A while back I did a post and some interviews on how doctors in developing countries are using social networks to communicate with patients, either over established networks like Facebook or more private in-house networks. In addition, these doctor-patient relationships in developing countries are remote, using the smartphone to communicate with rural patients who don’t have ready access to their physicians.

These can be found in the post Can Mobile Health Apps Improve Oral-Chemotherapy Adherence? The Benefit of Gamification.

I discuss some of these problems in the following paragraph and associated posts below:

Mobile Health Applications on Rise in Developing World: Worldwide Opportunity

According to International Telecommunication Union (ITU) statistics, world-wide mobile phone use has expanded tremendously in the past 5 years, reaching almost 6 billion subscriptions. By the end of this year it is estimated that over 95% of the world’s population will have access to mobile phones/devices, including smartphones.

This presents a tremendous and cost-effective opportunity in developing countries, and especially rural areas, for physicians to reach patients using mHealth platforms.

How Social Media, Mobile Are Playing a Bigger Part in Healthcare

E-Medical Records Get A Mobile, Open-Sourced Overhaul By White House Health Design Challenge Winners

In summary, although there are restrictions here in the US governing what information can be disseminated over social media networks, developing countries appear to have defined their own regulations, as they are more dependent on these types of social networks given the difficulties in patient-physician access.

Therefore the question will be Who Will Protect The Data?

For some interesting discourse please see the following post

Atul Butte Talks on Big Data, Open Data and Clinical Trials

Bioinformatic Tools for Cancer Mutational Analysis: COSMIC and Beyond

Posted in BioIT: BioInformatics, Cancer - General, Clinical Diagnostics, Clinical Genomics, tagged Bioinformatics, breast cancer, Cancer Genome Atlas, Cancer Genomics, CGAP, clinical genomics, colon cancer, COSMIC, gene mutations, mutational analysis, mutational signatures, mutational spectrum, TCGA, Wellcome trust sanger institute on December 2, 2015

Curator: Stephen J. Williams, Ph.D.

Updated 7/26/2019

Updated 04/27/2019

Signatures of Mutational Processes in Human Cancer (from COSMIC)

From The COSMIC Database

The genomic landscape of cancer. The COSMIC database is a fully curated and annotated database of recurrent genetic mutations found in various cancers (data taken from cancer sequencing projects). For an interactive map, please go to the COSMIC database here: http://cancer.sanger.ac.uk/cosmic

Somatic mutations are present in all cells of the human body and occur throughout life. They are the consequence of multiple mutational processes, including the intrinsic slight infidelity of the DNA replication machinery, exogenous or endogenous mutagen exposures, enzymatic modification of DNA and defective DNA repair. Different mutational processes generate unique combinations of mutation types, termed “Mutational Signatures”.

In the past few years, large-scale analyses have revealed many mutational signatures across the spectrum of human cancer types [Nik-Zainal S. et al., Cell (2012);Alexandrov L.B. et al., Cell Reports (2013);Alexandrov L.B. et al., Nature (2013);Helleday T. et al., Nat Rev Genet (2014);Alexandrov L.B. and Stratton M.R., Curr Opin Genet Dev (2014)]. However, as the number of mutational signatures grows the need for a curated census of signatures has become apparent. Here, we deliver such a resource by providing the profiles of, and additional information about, known mutational signatures.

The current set of mutational signatures is based on an analysis of 10,952 exomes and 1,048 whole-genomes across 40 distinct types of human cancer. These analyses are based on curated data that were generated by The Cancer Genome Atlas (TCGA), the International Cancer Genome Consortium (ICGC), and a large set of freely available somatic mutations published in peer-reviewed journals. Complete details about the data sources will be provided in future releases of COSMIC.

The profile of each signature is displayed using the six substitution subtypes: C>A, C>G, C>T, T>A, T>C, and T>G (all substitutions are referred to by the pyrimidine of the mutated Watson–Crick base pair). Further, each of the substitutions is examined by incorporating information on the bases immediately 5’ and 3’ to each mutated base generating 96 possible mutation types (6 types of substitution ∗ 4 types of 5’ base ∗ 4 types of 3’ base). Mutational signatures are displayed and reported based on the observed trinucleotide frequency of the human genome, i.e., representing the relative proportions of mutations generated by each signature based on the actual trinucleotide frequencies of the reference human genome version GRCh37. Note that only validated mutational signatures have been included in the curated census of mutational signatures.
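The 96-type classification described above can be sketched in a few lines of Python. This is an illustrative sketch, not COSMIC's own code: the function names `mutation_types` and `classify`, and the label format `A[C>T]G`, are assumptions chosen for readability. The sketch enumerates the 6 × 4 × 4 = 96 types and shows how a substitution observed at a purine (A or G) is re-expressed on the opposite strand so that it is named by the pyrimidine of the mutated base pair.

```python
from itertools import product

BASES = "ACGT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")
# The six substitution subtypes, each named by the pyrimidine (C or T) reference base.
SUBSTITUTIONS = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def mutation_types():
    """Enumerate all 96 types as 5'-base[ref>alt]3'-base labels."""
    return [
        f"{five}[{sub}]{three}"
        for sub, five, three in product(SUBSTITUTIONS, BASES, BASES)
    ]


def classify(context, alt):
    """Map a trinucleotide context (mutated base in the middle) and the
    alternate base to one of the 96 pyrimidine-centred labels."""
    ref = context[1]
    if ref in "AG":
        # Purine reference: describe the same event on the opposite strand.
        context = reverse_complement(context)
        alt = reverse_complement(alt)
        ref = context[1]
    return f"{context[0]}[{ref}>{alt}]{context[2]}"


all_types = mutation_types()
```

For example, a G>A change in the context CGT is the same event as a C>T change in the context ACG on the other strand, so both `classify("CGT", "A")` and `classify("ACG", "T")` give the label `A[C>T]G`.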

Additional information is provided for each signature, including the cancer types in which the signature has been found, proposed aetiology for the mutational processes underlying the signature, other mutational features that are associated with each signature and information that may be relevant for better understanding of a particular mutational signature.

The set of signatures will be updated in the future. This will include incorporating additional mutation types (e.g., indels, structural rearrangements, and localized hypermutation such as kataegis) and cancer samples. With more cancer genome sequences and the additional statistical power this will bring, new signatures may be found, the profiles of current signatures may be further refined, signatures may split into component signatures and signatures may be found in cancer types in which they are currently not detected.

See their COSMIC tutorial page here for instructional videos

Updated News: COSMIC v75 – 24th November 2015

COSMIC v75 includes curations across GRIN2A, fusion pair TCF3-PBX1, and genomic data from 17 systematic screen publications. We are also beginning a reannotation of TCGA exome datasets using Sanger’s Cancer Genome Project analysis pipeline to ensure consistency; four studies are included in this release, to be expanded across the next few releases. The Cancer Gene Census now has a dedicated curator, Dr. Zbyslaw Sondka, who will be focused on expanding the Census, enhancing the evidence underpinning it, and developing improved expert-curated detail describing each gene’s impact in cancer. Finally, as we begin to streamline our ever-growing website, we have combined all information for each gene onto one page and simplified the layout and design to improve navigation.

Mutational signatures across human cancer

Patterns of mutational signatures [Download signatures]

COSMIC database identifies 30 mutational signatures in human cancer

Please go to the COSMIC site to see a larger .png of the mutational signatures

Signature 1

Cancer types:

Signature 1 has been found in all cancer types and in most cancer samples.

Proposed aetiology:

Signature 1 is the result of an endogenous mutational process initiated by spontaneous deamination of 5-methylcytosine.

Additional mutational features:

Signature 1 is associated with small numbers of small insertions and deletions in most tissue types.

Comments:

The number of Signature 1 mutations correlates with age of cancer diagnosis.

Signature 2

Cancer types:

Signature 2 has been found in 22 cancer types, but most commonly in cervical and bladder cancers. In most of these 22 cancer types, Signature 2 is present in at least 10% of samples.

Proposed aetiology:

Signature 2 has been attributed to activity of the AID/APOBEC family of cytidine deaminases. On the basis of similarities in the sequence context of cytosine mutations caused by APOBEC enzymes in experimental systems, a role for APOBEC1, APOBEC3A and/or APOBEC3B in human cancer appears more likely than for other members of the family.

Additional mutational features:

Transcriptional strand bias of mutations has been observed in exons, but is not present or is weaker in introns.

Comments:

Signature 2 is usually found in the same samples as Signature 13. It has been proposed that activation of AID/APOBEC cytidine deaminases is due to viral infection, retrotransposon jumping or to tissue inflammation. Currently, there is limited evidence to support these hypotheses. A germline deletion polymorphism involving APOBEC3A and APOBEC3B is associated with the presence of large numbers of Signature 2 and 13 mutations and with predisposition to breast cancer. Mutations of similar patterns to Signatures 2 and 13 are commonly found in the phenomenon of local hypermutation present in some cancers, known as kataegis, potentially implicating AID/APOBEC enzymes in this process as well.

Signature 3

Cancer types:

Signature 3 has been found in breast, ovarian, and pancreatic cancers.

Proposed aetiology:

Signature 3 is associated with failure of DNA double-strand break-repair by homologous recombination.

Additional mutational features:

Signature 3 associates strongly with elevated numbers of large (longer than 3bp) insertions and deletions with overlapping microhomology at breakpoint junctions.

Comments:

Signature 3 is strongly associated with germline and somatic BRCA1 and BRCA2 mutations in breast, pancreatic, and ovarian cancers. In pancreatic cancer, responders to platinum therapy usually exhibit Signature 3 mutations.

Signature 4

Cancer types:

Signature 4 has been found in head and neck cancer, liver cancer, lung adenocarcinoma, lung squamous carcinoma, small cell lung carcinoma, and oesophageal cancer.

Proposed aetiology:

Signature 4 is associated with smoking and its profile is similar to the mutational pattern observed in experimental systems exposed to tobacco carcinogens (e.g., benzo[a]pyrene). Signature 4 is likely due to tobacco mutagens.

Additional mutational features:

Signature 4 exhibits transcriptional strand bias for C>A mutations, compatible with the notion that damage to guanine is repaired by transcription-coupled nucleotide excision repair. Signature 4 is also associated with CC>AA dinucleotide substitutions.

Comments:

Signature 29 is found in cancers associated with tobacco chewing and appears different from Signature 4.

Signature 5

Cancer types:

Signature 5 has been found in all cancer types and most cancer samples.

Proposed aetiology:

The aetiology of Signature 5 is unknown.

Additional mutational features:

Signature 5 exhibits transcriptional strand bias for T>C substitutions at ApTpN context.

Comments:

Signature 6

Cancer types:

Signature 6 has been found in 17 cancer types and is most common in colorectal and uterine cancers. In most other cancer types, Signature 6 is found in less than 3% of examined samples.

Proposed aetiology:

Signature 6 is associated with defective DNA mismatch repair and is found in microsatellite unstable tumours.

Additional mutational features:

Signature 6 is associated with high numbers of small (shorter than 3bp) insertions and deletions at mono/polynucleotide repeats.

Comments:

Signature 6 is one of four mutational signatures associated with defective DNA mismatch repair and is often found in the same samples as Signatures 15, 20, and 26.

Signature 7

Cancer types:

Signature 7 has been found predominantly in skin cancers and in cancers of the lip categorized as head and neck or oral squamous cancers.

Proposed aetiology:

Based on its prevalence in ultraviolet-exposed areas and the similarity of the mutational pattern to that observed in experimental systems exposed to ultraviolet light, Signature 7 is likely due to ultraviolet light exposure.

Additional mutational features:

Signature 7 is associated with large numbers of CC>TT dinucleotide mutations at dipyrimidines. Additionally, Signature 7 exhibits a strong transcriptional strand-bias indicating that mutations occur at pyrimidines (viz., by formation of pyrimidine-pyrimidine photodimers) and these mutations are being repaired by transcription-coupled nucleotide excision repair.

Comments:

Signature 8

Cancer types:

Signature 8 has been found in breast cancer and medulloblastoma.

Proposed aetiology:

The aetiology of Signature 8 remains unknown.

Additional mutational features:

Signature 8 exhibits weak strand bias for C>A substitutions and is associated with double nucleotide substitutions, notably CC>AA.

Comments:

Signature 9

Cancer types:

Signature 9 has been found in chronic lymphocytic leukaemias and malignant B-cell lymphomas.

Proposed aetiology:

Signature 9 is characterized by a pattern of mutations that has been attributed to polymerase η, which is implicated with the activity of AID during somatic hypermutation.

Additional mutational features:

Comments:

Chronic lymphocytic leukaemias that possess immunoglobulin gene hypermutation (IGHV-mutated) have elevated numbers of mutations attributed to Signature 9 compared to those that do not have immunoglobulin gene hypermutation.

Signature 10

Cancer types:

Signature 10 has been found in six cancer types, notably colorectal and uterine cancer, usually generating huge numbers of mutations in small subsets of samples.

Proposed aetiology:

It has been proposed that the mutational process underlying this signature is altered activity of the error-prone polymerase POLE. The presence of large numbers of Signature 10 mutations is associated with recurrent POLE somatic mutations, viz., Pro286Arg and Val411Leu.

Additional mutational features:

Signature 10 exhibits strand bias for C>A mutations at TpCpT context and T>G mutations at TpTpT context.

Comments:

Signature 10 is associated with some of the most mutated cancer samples. Samples exhibiting this mutational signature have been termed ultra-hypermutators.

Signature 11

Cancer types:

Signature 11 has been found in melanoma and glioblastoma.

Proposed aetiology:

Signature 11 exhibits a mutational pattern resembling that of alkylating agents. Patient histories have revealed an association between treatments with the alkylating agent temozolomide and Signature 11 mutations.

Additional mutational features:

Signature 11 exhibits a strong transcriptional strand-bias for C>T substitutions indicating that mutations occur on guanine and that these mutations are effectively repaired by transcription-coupled nucleotide excision repair.

Comments:

Signature 12

Cancer types:

Signature 12 has been found in liver cancer.

Proposed aetiology:

The aetiology of Signature 12 remains unknown.

Additional mutational features:

Signature 12 exhibits a strong transcriptional strand-bias for T>C substitutions.

Comments:

Signature 12 usually contributes a small percentage (<20%) of the mutations observed in a liver cancer sample.

Signature 13

Cancer types:

Signature 13 has been found in 22 cancer types and seems to be commonest in cervical and bladder cancers. In most of these 22 cancer types, Signature 13 is present in at least 10% of samples.

Proposed aetiology:

Signature 13 has been attributed to activity of the AID/APOBEC family of cytidine deaminases converting cytosine to uracil. On the basis of similarities in the sequence context of cytosine mutations caused by APOBEC enzymes in experimental systems, a role for APOBEC1, APOBEC3A and/or APOBEC3B in human cancer appears more likely than for other members of the family. Signature 13 causes predominantly C>G mutations. This may be due to generation of abasic sites after removal of uracil by base excision repair and replication over these abasic sites by REV1.

Additional mutational features:

Transcriptional strand bias of mutations has been observed in exons, but is not present or is weaker in introns.

Comments:

Signature 14

Cancer types:

Signature 14 has been observed in four uterine cancers and a single adult low-grade glioma sample.

Proposed aetiology:

The aetiology of Signature 14 remains unknown.

Additional mutational features:

Comments:

Signature 14 generates very high numbers of somatic mutations (>200 mutations per MB) in all samples in which it has been observed.

Signature 15

Cancer types:

Signature 15 has been found in several stomach cancers and a single small cell lung carcinoma.

Proposed aetiology:

Signature 15 is associated with defective DNA mismatch repair.

Additional mutational features:

Signature 15 is associated with high numbers of small (shorter than 3bp) insertions and deletions at mono/polynucleotide repeats.

Comments:

Signature 15 is one of four mutational signatures associated with defective DNA mismatch repair and is often found in the same samples as Signatures 6, 20, and 26.

Signature 16

Cancer types:

Signature 16 has been found in liver cancer.

Proposed aetiology:

The aetiology of Signature 16 remains unknown.

Additional mutational features:

Signature 16 exhibits an extremely strong transcriptional strand bias for T>C mutations at ApTpN context, with T>C mutations occurring almost exclusively on the transcribed strand.

Comments:

Signature 17

Cancer types:

Signature 17 has been found in oesophagus cancer, breast cancer, liver cancer, lung adenocarcinoma, B-cell lymphoma, stomach cancer and melanoma.

Proposed aetiology:

The aetiology of Signature 17 remains unknown.

Additional mutational features:

Comments:

Signature 18

Cancer types:

Signature 18 has been found commonly in neuroblastoma. Additionally, Signature 18 has been also observed in breast and stomach carcinomas.

Proposed aetiology:

The aetiology of Signature 18 remains unknown.

Additional mutational features:

Comments:

Signature 19

Cancer types:

Signature 19 has been found only in pilocytic astrocytoma.

Proposed aetiology:

The aetiology of Signature 19 remains unknown.

Additional mutational features:

Comments:

Signature 20

Cancer types:

Signature 20 has been found in stomach and breast cancers.

Proposed aetiology:

Signature 20 is believed to be associated with defective DNA mismatch repair.

Additional mutational features:

Signature 20 is associated with high numbers of small (shorter than 3bp) insertions and deletions at mono/polynucleotide repeats.

Comments:

Signature 20 is one of four mutational signatures associated with defective DNA mismatch repair and is often found in the same samples as Signatures 6, 15, and 26.

Signature 21

Cancer types:

Signature 21 has been found only in stomach cancer.

Proposed aetiology:

The aetiology of Signature 21 remains unknown.

Additional mutational features:

Comments:

Signature 21 is found only in four samples all generated by the same sequencing centre. The mutational pattern of Signature 21 is somewhat similar to the one of Signature 26. Additionally, Signature 21 is found only in samples that also have Signatures 15 and 20. As such, Signature 21 is probably also related to microsatellite unstable tumours.

Signature 22

Cancer types:

Signature 22 has been found in urothelial (renal pelvis) carcinoma and liver cancers.

Proposed aetiology:

Signature 22 has been found in cancer samples with known exposures to aristolochic acid. Additionally, the pattern of mutations exhibited by the signature is consistent with the one previously observed in experimental systems exposed to aristolochic acid.

Additional mutational features:

Signature 22 exhibits a very strong transcriptional strand bias for T>A mutations indicating adenine damage that is being repaired by transcription-coupled nucleotide excision repair.

Comments:

Signature 22 has a very high mutational burden in urothelial carcinoma; however, its mutational burden is much lower in liver cancers.

Signature 23

Cancer types:

Signature 23 has been found only in a single liver cancer sample.

Proposed aetiology:

The aetiology of Signature 23 remains unknown.

Additional mutational features:

Signature 23 exhibits very strong transcriptional strand bias for C>T mutations.

Comments:

Signature 24

Cancer types:

Signature 24 has been observed in a subset of liver cancers.

Proposed aetiology:

Signature 24 has been found in cancer samples with known exposures to aflatoxin. Additionally, the pattern of mutations exhibited by the signature is consistent with that previously observed in experimental systems exposed to aflatoxin.

Additional mutational features:

Signature 24 exhibits a very strong transcriptional strand bias for C>A mutations indicating guanine damage that is being repaired by transcription-coupled nucleotide excision repair.

Comments:

Signature 25

Cancer types:

Signature 25 has been observed in Hodgkin lymphomas.

Proposed aetiology:

The aetiology of Signature 25 remains unknown.

Additional mutational features:

Signature 25 exhibits transcriptional strand bias for T>A mutations.

Comments:

This signature has only been identified in Hodgkin’s cell lines. Data is not available from primary Hodgkin lymphomas.

Signature 26

Cancer types:

Signature 26 has been found in breast cancer, cervical cancer, stomach cancer and uterine carcinoma.

Proposed aetiology:

Signature 26 is believed to be associated with defective DNA mismatch repair.

Additional mutational features:

Signature 26 is associated with high numbers of small (shorter than 3bp) insertions and deletions at mono/polynucleotide repeats.

Comments:

Signature 26 is one of four mutational signatures associated with defective DNA mismatch repair and is often found in the same samples as Signatures 6, 15 and 20.

Signature 27

Cancer types:

Signature 27 has been observed in a subset of kidney clear cell carcinomas.

Proposed aetiology:

The aetiology of Signature 27 remains unknown.

Additional mutational features:

Signature 27 exhibits very strong transcriptional strand bias for T>A mutations. Signature 27 is associated with high numbers of small (shorter than 3bp) insertions and deletions at mono/polynucleotide repeats.

Comments:

Signature 28

Cancer types:

Signature 28 has been observed in a subset of stomach cancers.

Proposed aetiology:

The aetiology of Signature 28 remains unknown.

Additional mutational features:

Comments:

Signature 29

Cancer types:

Signature 29 has been observed only in gingivo-buccal oral squamous cell carcinoma.

Proposed aetiology:

Signature 29 has been found in cancer samples from individuals with a tobacco chewing habit.

Additional mutational features:

Signature 29 exhibits transcriptional strand bias for C>A mutations indicating guanine damage that is most likely repaired by transcription-coupled nucleotide excision repair. Signature 29 is also associated with CC>AA dinucleotide substitutions.

Comments:

The Signature 29 pattern of C>A mutations due to tobacco chewing appears different from the pattern of mutations due to tobacco smoking reflected by Signature 4.

Signature 30

Cancer types:

Signature 30 has been observed in a small subset of breast cancers.

Proposed aetiology:

The aetiology of Signature 30 remains unknown.

Examples in the literature of deposits into or analysis from the COSMIC database

The Genomic Landscapes of Human Breast and Colorectal Cancers. Wood et al., Science 2007; 318(5853): 1108–1113.

“analysis of exons representing 20,857 transcripts from 18,191 genes, we conclude that the genomic landscapes of breast and colorectal cancers are composed of a handful of commonly mutated gene “mountains” and a much larger number of gene “hills” that are mutated at low frequency.”

found cellular pathways with multiple mutated genes
analyzed a highly curated database (Metacore, GeneGo, Inc.) that includes human protein-protein interactions, signal transduction and metabolic pathways
There were 108 pathways that were found to be preferentially mutated in breast tumors. Many of the pathways involved phosphatidylinositol 3-kinase (PI3K) signaling
the cancer genome landscape consists of relief features (mutated genes) with heterogeneous heights (determined by CaMP scores). There are a few “mountains” representing individual CAN-genes mutated at high frequency. However, the landscapes contain a much larger number of “hills” representing the CAN-genes that are mutated at relatively low frequency. It is notable that this general genomic landscape (few gene mountains and many gene hills) is a common feature of both breast and colorectal tumors.
developed software to analyze multiple mutations and mutation frequencies available from Harvard Bioinformatics at

http://bcb.dfci.harvard.edu/~gp/software/CancerMutationAnalysis/cma.htm

R Software for Cancer Mutation Analysis

CancerMutationAnalysis Version 1.0:

R package to reproduce the statistical analyses of the Sjoblom et al article and the associated Technical Comment. This package is built for reproducibility of the original results and not for flexibility. Future versions will be more general and define classes for the data types used. Further details are available in Working Paper 126.

CancerMutationAnalysis Version 2.0:

R package to reproduce the statistical analyses of the Wood et al article. Like its predecessor, this package is still built for reproducibility of the original results and not for flexibility. Further details are available in Working Paper 126.

Update 04/27/2019

Review 2018. The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Z. Sondka et al. Nature Reviews Cancer. 2018.

The Catalogue of Somatic Mutations in Cancer (COSMIC) Cancer Gene Census (CGC) periodically reevaluates the cancer genome landscape and curates the findings into a database of genetic changes occurring in various tumor types. The 2018 CGC describes in detail the effect of 719 cancer-driving genes. The recent expansion includes functional and mechanistic descriptions of how each gene contributes to disease etiology, framed in terms of the cancer hallmarks described by Hanahan and Weinberg. These functional characteristics show the complexity of the cancer mutational landscape and suggest “multiple cancer-related functions for many genes, which are often highly tissue-dependent or tumour stage-dependent.” The 2018 CGC also adds a second tier of genes, expanding the list of cancer-related genes.

Criteria for curation of genes into CGC (curation process)

candidate genes are selected from published literature, conference abstracts, large cancer genome screens deposited in databases, and analysis of the current COSMIC database
COSMIC data are analyzed to determine the presence of patterns of somatic mutations and the frequency of such mutations in cancer
literature review to determine the role of the gene in cancer
Minimum evidence

– at least two publications from different groups showing increased mutation frequency in at least one type of cancer (PubMed)

– at least two publications from different groups showing experimental evidence of functional involvement in at least one hallmark of cancer in order to classify the mutant gene as oncogene, tumor suppressor, or fusion partner (like BCR-ABL)

independent assessment by at least two postdoctoral fellows
gene must be classified as either a Tier 1 or Tier 2 CGC gene
inclusion in database
continued curation efforts

definitions:

Tier 1 gene: genes which have strong evidence from both mutational and functional analysis as being involved in cancer

Tier 2 gene: genes with mutational patterns typical of cancer drivers but not functionally characterized as well as genes with published mechanistic description of involvement in cancer but without proof of somatic mutations in cancer
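The minimum-evidence rule and the two tier definitions above can be read as a simple decision procedure. The sketch below is one possible reading, not the CGC curators' actual tooling: the class and function names are hypothetical, and applying the same two-publication threshold to both kinds of evidence for Tier 2 is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GeneEvidence:
    # Number of independent groups reporting increased mutation frequency
    # in at least one cancer type.
    mutation_papers: int
    # Number of independent groups reporting experimental evidence of
    # involvement in at least one hallmark of cancer.
    functional_papers: int


def cgc_tier(evidence: GeneEvidence, min_papers: int = 2) -> Optional[str]:
    """Assign a tier from the two kinds of evidence (illustrative only)."""
    has_mutational = evidence.mutation_papers >= min_papers
    has_functional = evidence.functional_papers >= min_papers
    if has_mutational and has_functional:
        return "Tier 1"  # strong mutational and functional evidence
    if has_mutational or has_functional:
        return "Tier 2"  # only one kind of evidence is established
    return None  # not enough evidence for inclusion


if __name__ == "__main__":
    print(cgc_tier(GeneEvidence(mutation_papers=3, functional_papers=2)))
    print(cgc_tier(GeneEvidence(mutation_papers=2, functional_papers=0)))
```

In practice the curation also involves independent assessment by curators, which a rule like this cannot capture.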

Current Status of Tier 1 and Tier 2 genes in CGC

Tier 1 genes (574 genes): include 79 oncogenes, 140 tumor suppressor genes, 93 fusion partners

Tier 2 genes (719 genes): include 103 oncogenes, 181 tumor suppressors, 134 fusion partners and 31 with unknown function

Updated 7/26/2019

The COSMIC database is undergoing an extensive update and reannotation, in order to ensure standardisation and modernisation across COSMIC data. This will substantially improve the identification of unique variants that may have been described at the genome, transcript and/or protein level. The introduction of a Genomic Identifier, along with complete annotation across multiple, high quality Ensembl transcripts and improved compliance with current HGVS syntax, will enable variant matching both within COSMIC and across other bioinformatic datasets.

As a result of these updates there will be significant changes in the upcoming releases as we work through this process. The first stage of this work was the introduction of improved HGVS syntax compliance in our May release. The majority of the changes will be reflected in COSMIC v90, which will be released in late August or early September, and the remaining changes will be introduced over the next few releases.

The significant changes in v90 include:

Updated genes, transcripts and proteins from Ensembl release 93 on both the GRCh37 and GRCh38 assemblies.
Full reannotation of COSMIC variants with known genomic coordinates using Ensembl’s Variant Effect Predictor (VEP). This provides accurate and standardised annotation uniformly across all relevant transcripts and genes that include the genomic location of the variant.
New stable genomic identifiers (COSV) that indicate the definitive position of the variant on the genome. These unique identifiers allow variants to be mapped between GRCh37 and GRCh38 assemblies and displayed on a selection of transcripts.
Updated cross-reference links between COSMIC genes and other widely-used databases such as HGNC, RefSeq, Uniprot and CCDS.
Complete standardised representation of COSMIC variants, following the most recent HGVS recommendations, where possible.
Remapping of gene fusions on the updated transcripts on both the GRCh37 and GRCh38 assemblies, along with the genomic coordinates for the breakpoint positions.
Reduced redundancy of mutations. Duplicate variants have been merged into one representative variant.

Key points for you

COSMIC variants have been annotated on all relevant Ensembl transcripts across both the GRCh37 and GRCh38 assemblies from Ensembl release 93. New genomic identifiers (e.g. COSV56056643) are used, which refer to the variant change at the genomic level rather than gene, transcript or protein level and can thus be used universally. Existing COSM IDs will continue to be supported and will now be referred to as legacy identifiers, e.g. COSM476. The legacy identifiers (COSM) are still searchable. In the case of mutations without genomic coordinates, and hence without a COSV identifier, COSM identifiers will continue to be used.
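For pipelines that receive a mix of old and new identifiers, a small helper can tell the two apart. This is a minimal sketch that assumes identifiers look like the two examples given above (a COSV or COSM prefix followed by digits); the function name is hypothetical and the pattern is not an official COSMIC specification.

```python
import re

# Patterns inferred from the examples COSV56056643 and COSM476 (assumption).
_GENOMIC = re.compile(r"^COSV\d+$")
_LEGACY = re.compile(r"^COSM\d+$")


def identifier_kind(identifier: str) -> str:
    """Classify an identifier as 'genomic' (COSV), 'legacy' (COSM) or 'unknown'."""
    ident = identifier.strip().upper()
    if _GENOMIC.match(ident):
        return "genomic"
    if _LEGACY.match(ident):
        return "legacy"
    return "unknown"


if __name__ == "__main__":
    for example in ("COSV56056643", "COSM476", "rs113488022"):
        print(example, "->", identifier_kind(example))
```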

All relevant Ensembl transcripts in COSMIC (which have been selected based on Ensembl canonical classification and on the quality of the dataset to include only GENCODE basic transcripts) will now have both accession and version numbers, so that the exact transcript is known, ensuring reproducibility. This also provides transparency and clarity as the data are updated.
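Because transcripts now carry both an accession and a version number, downstream code may want to split the two so that version changes can be tracked explicitly. The sketch below assumes the common Ensembl convention of "accession.version" (for example ENST00000288602.6, used here purely for illustration); the function name is hypothetical.

```python
def split_transcript_id(versioned_id: str) -> tuple:
    """Split 'ENST00000288602.6' into ('ENST00000288602', 6).

    Raises ValueError when no numeric version suffix is present, so that
    unversioned identifiers are not silently accepted.
    """
    accession, sep, version = versioned_id.strip().rpartition(".")
    if not sep or not accession or not version.isdigit():
        raise ValueError(f"no version number in transcript id: {versioned_id!r}")
    return accession, int(version)


if __name__ == "__main__":
    print(split_transcript_id("ENST00000288602.6"))
```

Keeping the version as an integer makes it straightforward to detect when two releases annotate the same accession at different versions.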

How these changes will be reflected in the download files

As we are now mapping all variants on all relevant Ensembl transcripts, the number of rows in the majority of variant download files has increased significantly. In the download files, additional columns are provided including the legacy identifier (COSM) and the new genomic identifier (COSV). An internal mutation identifier is also provided to uniquely represent each mutation, on a specific transcript, on a given assembly build. The accession and version number for each transcript are included. File descriptions for each of the download files will be available from the downloads page for clarity. We have included an example of the new columns below.

For example: COSMIC Complete Mutation Data (Targeted screens)

We will shortly have some sample data that can be downloaded in the new table structure, to give you real data to manipulate and integrate; this will be available on the variant updates page.
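Since each variant is now reported once per relevant transcript, a pipeline that expects one row per variant may need to group rows by the genomic identifier. The following is a minimal sketch under stated assumptions: the file is tab-delimited, and the column names GENOMIC_MUTATION_ID and TRANSCRIPT are placeholders that should be checked against the file descriptions on the downloads page.

```python
import csv
import io
from collections import defaultdict


def transcripts_per_variant(tsv_text: str,
                            id_col: str = "GENOMIC_MUTATION_ID",
                            transcript_col: str = "TRANSCRIPT") -> dict:
    """Group transcript IDs by genomic (COSV) identifier.

    Rows with an empty genomic identifier (mutations without genomic
    coordinates) are skipped; they would need to be handled through the
    legacy COSM identifier instead.
    """
    groups = defaultdict(set)
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        genomic_id = row[id_col]
        if genomic_id:
            groups[genomic_id].add(row[transcript_col])
    # Sort for stable, reproducible output.
    return {key: sorted(values) for key, values in groups.items()}


if __name__ == "__main__":
    # Made-up rows, not real COSMIC data.
    sample = (
        "GENOMIC_MUTATION_ID\tTRANSCRIPT\n"
        "COSV1\tENST1.1\n"
        "COSV1\tENST2.3\n"
        "\tENST3.1\n"
        "COSV2\tENST1.1\n"
    )
    print(transcripts_per_variant(sample))
```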

How this affects you

We are aware that many of the changes we are making will affect integration into your pipelines and analytical platforms. By giving you advance notice of the changes, we hope much of this can be mitigated, and the end result of having clean, standardised data will be well worth any disruption. The variant updates page on the COSMIC website will provide a central point for this information and further technical details of the changes that we are making to COSMIC.

Kind Regards,

The COSMIC Team

cosmic@sanger.ac.uk

Wellcome Sanger Institute

Wellcome Genome Campus,

Hinxton CB10 1SA
