Identifying driver genes in cancer by triangulating gene expression, gene location, and survival data sigrid rouam, 1 lance d miller, 2 and r krishna murthy karuturi 3 1 procter and gamble international operations sa singapore branch, statistics asia, singapore. Overlap with the cgc, mutdriver and hiconf gene lists is a benchmark for cancer driver genes, similar to the descriptions of tokheim et al. In section 3, we will evaluate the new method using lung tumor genome sequences. We report a pancancer and pansoftware analysis spanning 9,423 tumor exomes comprising all 33 of the cancer genome atlas projects. Gene mutation patterns are driven by negative epistasis g 1,2, tumor subtypes g 3,4, and tumor mutation load tml. A novel method for identifying the potential cancer driver. Identifying driver genes whose mutations cause cancer could help us decipher the mechanism of cancer, which is beneficial to the. Highthroughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging.
In section 2, we will define pvalues for testing whether a gene is a driver gene. Identifying cancer driver genes in tumor genome sequencing. Here we present a novel approach combining both statistical and evolutionary thinking to identify driver mutations in cancer genomes using crosssectional mutation data. We compare the results of these three methods with regard to their specific coverage and fmeasure of pathway enrichment. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. We applied our method to wholeexome sequencing data from 11,873 tumor normal pairs and identified 460 driver genes that clustered into 21. Identifying cancer driver genes from functional genomics screens togar t rupti ab, desai sanket ab, mishra rohit a, terwadkar prachi a, ramteke manoj a. The field is also moving towards cancerspecific driver identification, because different cancer types are characterized by different driver mutations. Cancers free fulltext identifying cancer driver genes. Identification of cancer driver genes through a genebased. The current version of the intogen pipeline uses seven methods to identify cancer driver genes from. For breast cancer in table 3, our method successfully achieved a high precision in identifying the top 10 cancer driver genes with 8 out of 10 accuracy rates. Major tumor sequencing projects have been conducted in the past few years to identify genes that contain driver somatic mutations in tumor.
Here we present oncodrivefml, a method designed to analyze the pattern of somatic mutations across tumors in both coding and noncoding genomic regions to identify signals of positive selection. In this article, we propose and evaluate a new method for identifying driver genes. Identifying mutually exclusive gene sets with prognostic. Numerous methods have been developed to identify driver genes, but evaluation of the performance of these methods is hindered by the lack of a gold standard, that is, bona fide driver gene mutations.
This paper establishes novel ways to judge the techniques. Intriguingly, maxmif is able to identify potential cancer driver genes, with strong experimental data support. However, they attempt to identify cancer driver modules consisting of a number of genes rather than individual genes crucial to cancer development. Moreover, cancer genes may or may not actually be drivers in the cancer type with the cna of interest. Integration of multiple networks and pathways identifies. Overlap with the cgc, mut driver and hiconf gene lists is a benchmark for cancer driver genes, similar to the descriptions of tokheim et al. Comparison of different functional prediction scores using a. Modern highthroughput genomic technologies represent a comprehensive hallmark of molecular changes in pancancer studies.
Jul 23, 2018 the observation that mutations in a cancer genome tend to converge on a few biological pathways, 15 has prompted the development of pathway. Pdf identifying cancer driver genes from functional. Identifying cancerdriving gene mutations cancer network. Furthermore, the ratio of predicted tumor suppressor genes to oncogenes widely varies by tissue figure s4 b. Most existing methods are focused on individual cancer drivers which regulate biological processes leading to cancer. Insertional mutagenesis screens using replicationincompetent retroviral vectors have emerged as a. Because cancer cells have a large variety of relatively rare mutations, genomewide studies for identifying cancer driver mutations require sequencing numerous patients. Not having mutations with high scores does not preclude a gene being a potential driver gene, but these cases tend to be infrequently mutated genes that occur in samples with large mutation rates as was the case for cdh11 and. Integration of multiomics data of cancer can help people to explore cancers comprehensively. The identification of mutated driver pathways for the treatment of cancer is a significant problem in bioinformatics.
In this paper, we present a gene length based network method, named driverfinder, to identify driver genes by integrating. The total number of driver genes is unknown, but we assume that is considerably less than 19,000. While most of the alterations are passenger alterations with no significant effect on cellular phenotype, cancer cells are dependent on a few driver genes for the constitutive activation of the. Cancer cells are dependent on a few driver genes for the constitutive activation of the signalling pathways which aid cellular proliferation. Of course, if a driver gene is mutated in a very high percentage of samples more than 20%, for example, even an inaccurate estimate of the bmr is sufficient to correctly identify such a gene as recurrently mutated. Many statistical models to address this question have been developed. However, with a large volume of different omics and functional data being generated, there is a major challenge to distinguish functional driver genes from a sea of inconsequential passenger genes that accrue stochastically but do not contribute to cancer development. Getting to know intogen for the identification of cancer. Among these mutated genes, driver genes are defined as being. Ovarian cancer is the deadliest of all the gynecologic cancers 8 and, according to the data, the mortality rates for ovarian cancer have not improved over the past 20 years.
The results on multiple gene interaction networks indicate that a proper reference gene interaction network is a key factor for pnc on identifying cancer driver genes and more complete and higher quality gene interaction information would improve pncs prediction power. One key aspect of studying tumorigenesis is to distinguish the driver mutations providing a fitness advantage to cancer cells against neutral passenger or hitchhiking mutations. Insertional mutagenesis screens using replicationincompetent retroviral vectors have emerged as a powerful tool to identify cancer. Among these mutated genes, driver genes are defined as being causally linked to oncogenesis, while passenger genes are thought to be irrelevant for cancer development. Introduction cancer is a disease defined by several genetic alterations, such as mutations, gene expression changes and copy number changes, in addition to epigenomic alterations. Interpreting pathways to discover cancer driver genes with. Other forms of genomic aberrations, such as copy number variations cnvs and epigenetic changes, may also reflect cancer progression. However, the effect of a single gene may not be sufficient to drive cancer progression. Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. A much smaller fraction of driver mutations are important for cancer development, with current estimates ranging from 10 to 20 driver mutations per tumor. We applied the method to the challenging problem of identifying driver genes associated with drug response in ovarian cancer. Identification of cancer driver genes through a gene. However, driver gene discovery is a very challenging task because we are not only dealing with huge amount of data. Tumors are ranked by subtype and tumor mutation load.
Identifying driver genes involving gene dysregulated. Jun 16, 2016 distinguishing the driver mutations from somatic mutations in a tumor genome is one of the major challenges of cancer research. The genomic alteration profile and clinical information were derived from the cancer genome atlas, and the megsa method was used to identify the megs. Especially for the met, some researchers found that high met gene copy number leads to shorter survival in patients with nonsmall cell lung cancer. Although rapid progress has been made in computational approaches for prioritizing cancer driver genes, research is far from achieving the ultimate goal of discovering a complete catalog of genes truly associated with cancer. Identifying which genes affected by cnas are drivers without relying on cancer gene lists is thus important for both developing comprehensive cancer gene lists and understanding cnadominated cancer types. The mutation frequency of most driver genes is in the middle 220% or even lower range, which makes it difficult to find the driver genes with lowfrequency mutations. While most of the alterations are passenger alterations with no significant effect on cellular phenotype, cancer cells are dependent on a few driver genes for the constitutive activation of the signalling pathways. Identifying cancer specific driver modules using a network.
Oct 07, 2019 using median expression level as the cutoff point, the 14 driver genes were categorized into highexpression group and lowexpression group. Accurate identification of driver genes and driver mutations is critical for advancing cancer research and personalizing treatment based on accurate stratification of patients. Our methodology is general and can be applied to dierent cancer subtypes to identify speci. Mutual exclusivity analysis can distinguish driver genes and pathways from passenger ones. All cancers depend upon mutations in critical genes, which confer a selective advantage to the tumour cell. Comprehensive assessment of computational algorithms in. Identifying driver mutations in sequenced cancer genomes. Nevertheless, despite present understanding of the term driver mutation in cancer, defining what is meant by a driver of metastasis is somewhat looser, especially since gene mutations, it would appear, constitute only a small part of the spectrum of possible driver molecular events in. Check which methods for cancer driver gene identification are used. New bioinformatics tool tests methods for finding mutant. Multiple gene pairs show mutual exclusivity, whereas this pattern reflects epistasis only in the case of gene pair g.
Identifying driver mutations in cancers semantic scholar. Here, we hypothesise that there are driver gene groups that work in concert to regulate cancer and we develop. A cancer driver gene is activated by driver mutations, but may also contain. Therefore, maxmif can be very useful for identifying or prioritizing cancer driver genes in the increasing number of available cancer genomic data. Over the decade, many computational algorithms have been developed to predict the effects of. Identifying hepatocellular carcinoma driver genes by. Using a multifaceted, automated, highthroughput approach to detect driver gene fusion events chromosomal rearrangements, insertions or deletions in patient rnasequencing data, researchers in the steve and cindy rasmussen institute for genomic medicine have identified 20 clinically meaningful fusions in 73 pediatric cancer cases so far. Dec 19, 2016 identifying the genes that cause cancer when altered is often challenging, but is critical for directing research along the most fruitful course, said bert vogelstein, a member of the ludwig center at the johns hopkins kimmel cancer center and one of the journal articles coauthors. Distinguishing the driver mutations from somatic mutations in a tumor genome is one of the major challenges of cancer research. An evolutionary approach for identifying driver mutations in. We observed a significant positive correlation pearsons r 0. The wellstudied breast cancer driver genes including tp53, pik3ca, map 3 k1, cdh1, erbb2 and pten were also put in the top list of our method. The purpose of this study was to identify mutually exclusive gene sets megss that have prognostic value and to detect novel driver genes in gbm.
Most of those genes are wellknown cancer driver genes, and incorporating mutation scores helps in their identification. As shown in table 1, the significant association was represented between driver genes expression and cancernormal p 1. We have applied the new pipeline to identify cancer driver genes and their tumorigenic mode of action to more than 28,000 human tumor samples across 221 cohorts of 66 different tumor types, including the pan cancer analysis of whole genomes. Chaos3 mice look normal, but eventually they all get cancer. The genomic alteration profile and clinical information. New bioinformatics tool tests methods for finding mutant genes that drive cancer. Accumulation of large amounts of cancer sequencing data led to the rise of computational and statistical techniques as primary tools in identifying cancer driver genes and mutations. Publicly available cancer databases have been combined by a team of researchers to identify new genes associated with cancer.
In this study, we use somatic mutation data generated via dna. The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, socalled driver mutations. Identifying cancer driver genes cdg is a crucial step in cancer genomic toward the advancement of precision medicine. Identifying cancer driver genes in tumor genome sequencing studies. Swiss medical weekly identifying cancer driver genes from. Cancer is a genomic disease associated with a plethora of gene mutations resulting in a loss of control over vital cellular functions. A comprehensive list of cancer driver genes published in nature. Introduction cancer is a complex disease driven by dierent genetic, genomic or epigenetic mech. Comparison of different functional prediction scores using.
Using median expression level as the cutoff point, the 14 driver genes were categorized into highexpression group and lowexpression group. The observation that mutations in a cancer genome tend to converge on a few biological pathways, 15 has prompted the development of pathway. Chapters guide readers through a brief history of cancer gene discovery, in silico. The number of detected cancer driver genes varies among cancer types, with kidney chromophobe kich having the fewest 2 genes and ucec having the most 55 genes. An evolutionary approach for identifying driver mutations. Targeting oncogenicdependent genes has met with success, as demonstrated in several cancer types.
Swiss medical weekly identifying cancer driver genes. Here, we hypothesise that there are driver gene groups that work in. Multiple advanced algorithms to identify drivers now exist, but systematic attempts to combine and optimize them on large datasets are few. Current approaches either identify driver genes on the basis of mutational recurrence. Other forms of genomic aberrations, such as copy number variations cnvs and epigenetic changes, may also reflect. A novel network control model for identifying personalized. A patients therapeutic response to drugs targeting a specific gene and optimal assignment to a clinical trial is increasingly understood to depend on both the specific mutation in the gene of. The key to understanding the contribution of a diseaseassociated mutation to the development and progression of cancer comes from an understanding of the consequences of that mutation on the function of the affected protein, and the impact on the pathways in which. Identifying cancer type specific oncogenes and tumor. Author summary evolutionary dynamic models have been intensively studied to elucidate the process of tumorigenesis. While many frequentlymutated cancer driver genes have already been identified and are being utilized for diagnostic, prognostic. Identifying potential cancer driver genes by genomic data.
After establishing the baseline parameters for each cancer type, we identify new driver genes for each cancer type, and the molecular pathways that are highly affected by them. Cancer driver genes methods and protocols tim starr springer. The experimental results on multiple cancer datasets highlight that pnc is effective for identifying personalized driver genes in cancer. Identification of cancer driver genes based on nucleotide. This gene is a critical epigenetic cancer driver gene hypomethylated in six cancer types, including headandneck and lung cancer, and is associated with multiple other cancers. Identifying cancer driver genes from functional genomics screens togar t rupti ab, desai sanket ab, mishra rohit a, terwadkar prachi a, ramteke manoj a, ranjan malika a, kawle dhananjay a. Recently, the methods of exploiting the structurebased control principles of complex networks provide new clues for identifying minimum number of driver nodes to drive the state transition of largescale. This challenge is more acute and far from solved for noncoding mutations. Here we present oncodrivefml, a method designed to analyze the pattern of somatic mutations across tumors in both coding and noncoding genomic regions to. Thus, wellknown cancer genes such as tp53 are readily identified as recurrently mutated genes by all computational methods. Cancer driver genes are genes that give cells a growth advantage when they are mutated, helping tumours proliferate. People have lists of what they consider to be cancer driver genes, but theres no official reference guide, no gold. Which methods for cancer driver gene identification are used. With increasing numbers of largescale genomic datasets available.
Identifying cancer driver genes is a key task in cancer informatics. Identifying mutated driver pathways in cancer by integrating. Identifying molecular cancer drivers is critical for precision oncology. To overcome this problem, some methods prioritize the candidate. Tokheim said one of the challenges the team faced was the lack of a widely accepted consensus on what qualifies as a cancer driver gene. May 21, 2019 the identification of the cancer driver genes is essential for personalized therapy. Chaos3 mouse models are prone to cancers of all types, but in a certain strain, the females end up with mammary tumors, the equivalent of human breast cancer.
The pathogenesis and prognosis of glioblastoma gbm remain poorly understood. Nevertheless, despite present understanding of the term driver mutation in cancer, defining what is meant by a driver of metastasis is somewhat looser, especially since gene mutations, it would appear, constitute only a small part of the spectrum of possible driver molecular events in metastasis. The second benchmark is the ability to identify a core set of driver genes that are also predicted by several other methods. As shown in table 1, the significant association was represented between driver genes expression and cancer normal p 1. We have applied the new pipeline to identify cancer driver genes and their tumorigenic mode of action to more than 28,000 human tumor samples across 221 cohorts of 66 different tumor types, including the pancancer analysis of whole genomes. Pdf identifying potential cancer driver genes by genomic. Identifying driver genes in cancer by triangulating gene. The gene arid1a, a potential driver of breast cancer. Identifying driver mutations in a patients tumor cells is a central task in the era of precision cancer medicine. This book presents protocols for identification of genetic drivers of cancer.
Comprehensive characterization of cancer driver genes and. Comprehensive characterization of cancer driver genes. The identification of the cancer driver genes is essential for personalized therapy. Evolutionary models, however, add another layer of complexity by taking into account the process of mutation accumulation and selection within the tissue. The study identified more than 100 novel cancer driver genes and helps. Pathways and networks are important tools to explain the role of genes in functional genomic. Our methodology is general and can be applied to different cancer subtypes to identify specific driver genes and improve personalized therapy. Identification of cancer driver genes based on nucleotide context. In this paper, an improved maximum weight submatrix problem model is formulated to integrate somatic mutations, copy number variations, and gene expressions data to detect mutate gene sets in cancer. At present, the only way to assess the evidence for a gene being a driver gene in vivo. Csdm detects the specific driver modules for a certain cancer type to other cancer types. Cancer genomes contain large numbers of somatic mutations but few of these mutations drive tumor development.
889 998 1427 121 725 1419 60 1467 1407 781 1268 1167 1162 260 320 891 592 905 895 758 795 843 263 676 911 1270 1273 768 662 941 669 947