triocnv's Introduction

#TrioCNV

#Introduction

TrioCNV is a tool for jointly detecting copy number variants (CNVs) from whole-genome sequencing (WGS) data in parent-offspring trios. It models the read depth signal with negative binomial regression, which accommodates over-dispersion and accounts for GC content and mappability bias. It leverages the parent-offspring relationship to apply a Mendelian inheritance constraint while allowing for rare de novo events. It combines these two models in a hidden Markov model (HMM) to perform CNV segmentation jointly across the trio.

#Installation

If you want to run TrioCNV, you'll need:

  1. Java 1.7+

  2. Apache Maven (if you want to build the source)

  3. R (the Rscript executable must be on the PATH environment variable)

  4. The Runiversal package (http://cran.r-project.org/web/packages/Runiversal/index.html) installed in the R environment

The easiest way to get TrioCNV is to download the binary distribution from the TrioCNV GitHub release page. Alternatively, you can build TrioCNV from source with Maven:

    git clone --recursive https://github.com/yongzhuang/TrioCNV.git

    cd TrioCNV/

    mvn clean install -Dmaven.compiler.source=1.7 -Dmaven.compiler.target=1.7
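
Before building or running, it can help to confirm that the prerequisites listed above are visible on the PATH. The following is a minimal sketch, not part of TrioCNV: the `check_tool` helper is a name made up for this example, and the script only tests for presence, not for versions (`java -version` can be used to confirm Java 1.7+).

```shell
# Report whether a command is visible on PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

missing=0
# mvn is only needed when building from source.
for tool in java Rscript mvn; do
    check_tool "$tool" || missing=$((missing + 1))
done

# The Runiversal check needs Rscript itself, so skip it when R is absent.
if command -v Rscript >/dev/null 2>&1; then
    Rscript -e 'library(Runiversal)' >/dev/null 2>&1 \
        || echo "missing: R package Runiversal"
fi

echo "$missing command(s) missing"
```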

#Running

usage: java -jar TrioCNV.jar [OPTIONS]

  1. preprocess

    This command extracts the read count information from the BAM files listed in the bam list file.

     usage: java -jar TrioCNV.jar preprocess [OPTIONS] 
    
     -R,--reference  <FILE>  reference genome file (required)
    
     -B,--bams       <FILE>  bam list file (required)
    
     -P,--pedigree   <FILE>  pedigree file (required)
    
     -M,--mappability        <FILE>  mappability file (required)
    
     -O,--output     <FILE>  prefix of output file (required)
    
        --window     <INT>   window size (optional, default 200)
        
        --min_mapping_quality        <INT>   minimum mapping quality (optional, default 0)
    
  2. call

    This command jointly calls CNVs from a parent-offspring trio.

     usage: java -jar TrioCNV.jar call [OPTIONS] 
    
     -I,--input      <FILE>  read count file produced by the preprocess step (required)
    
     -P,--pedigree   <FILE>  pedigree file (required)
    
     -M,--mappability        <FILE>  mappability file (required)
    
     -O,--output     <FILE>  output structural variation file (required)
    
        --min_mappability    <FLOAT> minimum mappability (optional, default 0)

        --mutation_rate      <FLOAT> de novo mutation rate (optional, default 0.0001)

        --transition_prob    <FLOAT> probability of transition between two different copy number states (optional, default 0.00001)

        --outlier            <FLOAT> the predefined percentage of outliers (optional, default 0.025)

        --min_distance       <INT>   minimum distance to merge two adjacent CNVs (optional, default 10K)

        --nt                 <INT>   number of threads (optional, default 1)

        --gc_bin_size        <INT>   size of GC bin by percent (optional, default 1)
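
The two steps above can be chained as in the following sketch, which uses only the options listed in this section. Every path is a placeholder, and the `run` helper with its `DRY_RUN` switch is a convention of this example rather than a TrioCNV feature. The name of the read count file written by `preprocess` is not stated here, so `READ_COUNTS` is an assumption that must be set to the file that step actually produces.

```shell
# Dry-run helper for this sketch (not part of TrioCNV): prints each command
# unless DRY_RUN=0 is set, in which case the command is executed.
DRY_RUN="${DRY_RUN:-1}"
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Placeholder paths; replace all of them with real files.
REFERENCE=/path/reference.fa
BAM_LIST=/path/trio.bams.list
PEDIGREE=/path/trio.ped
MAPPABILITY=/path/mappability_file
PREFIX=/path/output/trio
# Assumption: point this at the read count file that preprocess produces.
READ_COUNTS=/path/output/read_count_file
CALLS=/path/output/trio.cnv

# Step 1: extract read counts (window size and mapping quality at their defaults).
run java -jar TrioCNV.jar preprocess \
    -R "$REFERENCE" -B "$BAM_LIST" -P "$PEDIGREE" -M "$MAPPABILITY" \
    -O "$PREFIX" --window 200 --min_mapping_quality 0

# Step 2: jointly call CNVs for the trio, using 4 threads.
run java -jar TrioCNV.jar call \
    -I "$READ_COUNTS" -P "$PEDIGREE" -M "$MAPPABILITY" \
    -O "$CALLS" --nt 4
```

As written, the script only prints the two command lines so they can be reviewed; running it with `DRY_RUN=0` in the environment executes them.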
    

#File Instruction

  1. bam list file (one column)

    Column 1: path of .bam file

    Example:

     /path/Sample1.bam
    
     /path/Sample2.bam
    
     /path/Sample3.bam
     ...
    
  2. pedigree file

    See the PLINK PED file format (http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml)

    Note: The individual ID must be the same as the @RG SM tag of the corresponding BAM file.
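
Both input files can be written by hand or generated, as in the sketch below. The sample IDs, family ID, and BAM paths are hypothetical, the tab delimiter is an assumption (PLINK itself accepts any whitespace), and `sm_tags` is a helper made up for this example to list the SM values in a SAM header so they can be compared with the pedigree IDs.

```shell
# Work in a scratch directory so nothing existing is overwritten.
WORKDIR=$(mktemp -d)

# BAM list: one path per line (placeholder paths).
printf '%s\n' /path/father.bam /path/mother.bam /path/child.bam \
    > "$WORKDIR/trio.bams.list"

# Pedigree in PLINK PED column order: family ID, individual ID, paternal ID,
# maternal ID, sex (1=male, 2=female), phenotype (0 is PLINK's missing code).
# Founders use 0 for both parent IDs.
{
    printf 'FAM1\tfather\t0\t0\t1\t0\n'
    printf 'FAM1\tmother\t0\t0\t2\t0\n'
    printf 'FAM1\tchild\tfather\tmother\t1\t0\n'
} > "$WORKDIR/trio.ped"

# Print the distinct SM values found in @RG lines of SAM header text on stdin.
sm_tags() {
    grep '^@RG' | tr '\t' '\n' | sed -n 's/^SM://p' | sort -u
}

echo "files written to $WORKDIR"
```

With samtools installed, `samtools view -H /path/child.bam | sm_tags` prints the sample names recorded in that BAM, which should match the individual ID column of the pedigree file.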

#Contact

[email protected]

triocnv's Issues

Build Failure

Hi,

I tried following the installation instructions but when I try running

'mvn clean install -Dmaven.compiler.source=1.7 -Dmaven.compiler.target=1.7'

I receive an error message.

"Failed to execute goal on project TrioCNV: Could not resolve dependencies for project cn.edu.hit:TrioCNV:jar:0.1.2: The following artifacts could not be resolved: rcaller:RCaller:jar:2.5, org.broad.igv:igv:jar:2.3.23: Failure to find rcaller:RCaller:jar:2.5 in file:///Users/brandenlau/TrioCNV/repo was cached in the local repository, resolution will not be reattempted until the update interval of repo has elapsed or updates are forced -> [Help 1]"

I'm trying to build TrioCNV using Java 1.8 and Apache Maven 3.3.9.

Thanks,

Branden
