Skip NavigationSkip to Content

A network-based deep learning methodology for stratification of tumor mutations

  1. Author:
    Liu, Chuang
    Han, Zhen
    Zhang, Zi-Ke
    Nussinov,Ruth
    Cheng, Feixiong
  2. Author Address

    Hangzhou Normal Univ, Alibaba Res Ctr Complex Sci, Hangzhou 311121, Peoples R China.Zhejiang Univ, Coll Media & Int Culture, Hangzhou 310028, Peoples R China.NCI, Canc & Inflammat Program, Leidos Biomed Res Inc, Frederick Natl Lab Canc Res, Frederick, MD 21702 USA.Tel Aviv Univ, Sackler Sch Med, Dept Human Mol Genet & Biochem, IL-69978 Tel Aviv, Israel.Cleveland Clin, Lerner Res Inst, Genom Med Inst, Cleveland, OH 44106 USA.Case Western Reserve Univ, Cleveland Clin, Dept Mol Med, Lerner Coll Med, Cleveland, OH 44195 USA.Case Western Reserve Univ, Case Comprehens Canc Ctr, Sch Med, Cleveland, OH 44106 USA.
    1. Year: 2021
    2. Date: Jan 08
  1. Journal: Bioinformatics
  2. OXFORD UNIV PRESS,
    1. 37
    2. 1
    3. Pages: 82-88
  3. Type of Article: Article
  4. ISSN: 1367-4803
  1. Abstract:

    Motivation: Tumor stratification has a wide range of biomedical and clinical applications, including diagnosis, prognosis and personalized treatment. However, cancer is always driven by the combination of mutated genes, which are highly heterogeneous across patients. Accurately subdividing the tumors into subtypes is challenging. Results: We developed a network-embedding based stratification (NES) methodology to identify clinically relevant patient subtypes from large-scale patients' somatic mutation profiles. The central hypothesis of NES is that two tumors would be classified into the same subtypes if their somatic mutated genes located in the similar network regions of the human interactome. We encoded the genes on the human protein-protein interactome with a network embedding approach and constructed the patients' vectors by integrating the somatic mutation profiles of 7,344 tumor exomes across 15 cancer types. We firstly adopted the lightGBM classification algorithm to train the patients' vectors. The AUC value is around 0.89 in the prediction of the patient's cancer type and around 0.78 in the prediction of the tumor stage within a specific cancer type. The high classification accuracy suggests that network embedding-based patients' features are reliable for dividing the patients. We conclude that we can cluster patients with a specific cancer type into several subtypes by using an unsupervised clustering algorithm to learn the patients' vectors. Among the 15 cancer types, the new patient clusters (subtypes) identified by the NES are significantly correlated with patient survival across 12 cancer types. In summary, this study offers a powerful network-based deep learning methodology for personalized cancer medicine. Availability and implementation: Source code and data can be downloaded from https://github.com/ChengF-Lab/NES. Supplementary information: Supplementary data are available at Bioinformatics online.

    See More

External Sources

  1. DOI: 10.1093/bioinformatics/btaa1099
  2. PMID: 33416857
  3. WOS: 000649437800012

Library Notes

  1. Fiscal Year: FY2020-2021
NCI at Frederick

You are leaving a government website.

This external link provides additional information that is consistent with the intended purpose of this site. The government cannot attest to the accuracy of a non-federal site.

Linking to a non-federal site does not constitute an endorsement by this institution or any of its employees of the sponsors or the information and products presented on the site. You will be subject to the destination site's privacy policy when you follow the link.

ContinueCancel