Giter VIP home page Giter VIP logo

apobec_expression's Introduction

apobec_expression

Background

Somatic mutations is a commonly acceptable cause of human cancer. Known sources of mutations can have both exogenous and endogenous nature. Examples of the endogenous mutagens are ultraviolet light, ionizing radiation and tobacco smoke, whereas known endogenous mutagens are DNA repair deficiency, free radical species and deamination, which is mainly caused, as was recently discovered, by the activity of components of the human immune system - APOBEC (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like) family enzymes. Recent advances in sequencing of cancer genomes allows to track activity of particular mutagens based on so-called mutational signature, which describe a nucleotide context of mutations being unique for many mutational processes. This make possible to investigate the heterogeneity of the rate of mutations induced by various mutagen along the genome and recognize the possible like of mutation rate with the various epigenetic features such as transcription, replication timing and chromatin organization. The understanding of mutational heterogeneity along the genome is important for implication in statistical methods aiming the identification of the exhaustive catalog of cancer-driving genes. In this project we suggest exploration of the influence of epigenetic features on mutational heterogeneity along the genome for APOBEC enzymes and other endogenous mutagens based on new data from sequenced cancer genomes.

Data acquisition

All executed methods are from DataLoad class.

  1. Load human genome: LoadHumanGenome()
  2. Load cancer mutations: LoadMutations()
  3. Load replication timing data: LoadReplicationTiming()
  4. Load gene expression data: LoadExpression()
  5. Load gene information: LoadGenes()

Data dictionaries are from DataDictionary class.

  1. Chromosome names dicrionary: SetChromosomeNames() SetChromosomeLowerName()
  2. Cancer types dictionary: SetCancerTypes()
  3. Cell lines dictionary: SetCellLinesReplication()
  4. Cancer types - cell lines correspondence: SetCancer2RTCellLines()
  5. Gene expression bins: SetExpressionBins()

APOBEC mutations and replication timing

All methods are from class ReplicationTime

  1. Create replication timing bins: CreateRTBins()
  2. Set replication timing bins to replication timing intervals: SetRTBins()
  3. Calculate number of TCW motifs in replication timing intervals: GetRTNumberOfTCW()
  4. Calculate number of bases in replication timing intervals: GetRTNumberOfAll()
  5. Calculate distribution of APOBEC mutations over replication timing: MutationRTAnalysis()

APOBEC mutations and gene expression

All methods are from Expression class.

  1. Set the maximum end of genes among intersecting genes SetGeneIntervals()
  2. Divide DNA positions covering by genes into intervals covered by only single gene and intervals covered by multiple genes: DivideGenesIntoIntervals()
  3. Calculate number of TCW motifs in expression bins: GetExpTCW()
  4. Calculate number of bases in expression bins: GetExpAll()
  5. Calculate number of APOBEC and other mutations relative to expression bins: GetExpMutation()

APOBEC mutations and interaction of replication timing and gene expression

All methods are from ExpressionRT class.

  1. Calculate number of TCW motifs in expression & replication timing bins: GetExpRTTCW()
  2. Calculate number of bases in expression & replication timing bins: GetExpRTAll()
  3. Calculate number of APOBEC and other mutations relative to expression & replication timing bins: GetRTExpMutation()

Export results

All methods are from ExportData class.

  1. Prepare results of the APOBEC mutations density relative to replication timing for exporting: PlotRT()
  2. Prepare results of the APOBEC mutations density relative to gene expression for exporting: PlotExp()
  3. Prepare results of the APOBEC mutations density relative to replication timing & gene expression for exporting: PlotExpRT()
  4. Export results of replication timing analysis: ExportPlotRT()
  5. Export results of gene expression analysis ExportPlotExp()
  6. Export results of replication timing & gene epression analysis: ExportPlotExpRT()

apobec_expression's People

Contributors

evshvarov avatar mkazanov avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.