基因组学paper

肿瘤驱动基因和显著突变基因识别的生信方法进展

2020-12-06  本文已影响0人  evolisgreat

概括:

  1. Cancer is often driven by the accumulation of genetic alterations, including single nucleotide variants, small insertions or deletions, gene fusions, copy-number variations, and large chromosomal rearrangements.

  2. Recent advances in next generation sequencing technologies have helped investigators generate massive amounts of cancer genomic data and catalog somatic mutations in both common and rare cancer types. So far, the somatic mutation landscapes and signatures of >10 major cancer types have been reported; however, pinpointing driver mutations and cancer genes from millions of available cancer somatic mutations remains a monumental challenge.

  3. To tackle this important task, many methods and computational tools have been developed during the past several years and, thus, a review of its advances is urgently needed.

  4. Here, we first summarize the main features of these methods and tools for whole-exome, whole-genome and whole transcriptome sequencing data.

  5. Then, we discuss major challenges like tumor intra-heterogeneity, tumor sample saturation and functionality of synonymous mutations in cancer, all of which may result in false-positive discoveries.

  6. Finally, we highlight new directions in studying regulatory roles of noncoding somatic mutations and quantitatively measuring circulating tumor DNA in cancer.

This review may help investigators find an appropriate tool for detecting potential driver or actionable mutations in rapidly emerging precision cancer medicine


文章思路:

In this review, we focus on the description of computational approaches and tools in identifying driver mutations and SMGs in cancer using NGS data.

In this review article:

  1. we first summarized the major biological resources that are commonly used for the development of these tools.

  2. Then, we described the main features of the tools in these five types.

  3. Next, we discussed some major challenges on identification of driver mutations or SMGs from large number of somatic mutations in cancer NGS data.

  4. Finally, we highlight several new directions, such as the study of noncoding regulatory mutations through integrated pan-cancer analyses of somatic mutations using functional genomics and whole-genome
    sequencing (WGS) data.


1. Data resources for method and tool development and evaluation

NGS data resources

Network and pathway data resources

Table 1.PNG

2. Method and computational tools

识别肿瘤驱动基因的方法学分类:作者将用于识别肿瘤驱动基因和SMGs的算法和工具分为了以下五类:

Figure 1..PNG Table 2-2.PNG

2.1 Mutation frequency based

这种类型的算法大多基因突变频率:

2.2 Functional impact based

这类算法大多与基因功能相关,预测异常对基因及其蛋白质功能的影响。

2.3 Structural genomics based

基于结构的分析算法大多基于SNV,比较少的考虑到其他类型的变异(如融合基因),该类方法的限制在于不是所有的蛋白质都有其明确的结构域信息。

2.4 Network or pathway based

基于网络和通路分析的算法可以很好的对肿瘤中突变产生的突变效应有一个很好的评估。

不过,由于目前技术的限制,仍然只能覆盖潜在PPI中的20-30%,而且其中很多分析出来的网络和通路与样本有密切关系(组织类型、细胞组成、生理状态)。

2.5 Data integration based

整合体细胞突变、结构变异、基因表达、甲基化谱来构建网络分析方法是一个重要的研究方向。人的15%基因组都有CNV变化,至关重要。

3. Challenges on current approaches

3.1 Tumor heterogeneity and sample saturation

3.2 The accuracy of somatic mutation calling

3.3 Functional synonymous mutations in cancer

参考资料

  1. Cheng, F., Zhao, J., & Zhao, Z. (2016). Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes. Briefings in bioinformatics, 17(4), 642–656. https://doi.org/10.1093/bib/bbv068
上一篇下一篇

猜你喜欢

热点阅读