GCDSS

GCDSS: Distributed Gene Clinical Decision Support System Based on Cloud Computing

1 Building GCDSS

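1.1 Preparing

Before running GCDSS, prepare the environment that the commands below rely on: Maven for the build, a Spark cluster, Hadoop, and the ADAM jar.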

2 Build

    mvn -DskipTests clean package

3 Run

    spark-submit --class com.github.xubo245.gcdss.adam.postProcessing.ReadPostProcessing \
      --master spark://MasterIP:7077 \
      --conf "spark.executor.extraJavaOptions=-Djava.library.path=/home/hadoop/disk2/xubo/lib/" \
      --jars /home/hadoop/cloud/adam/lib/adam_2.10-0.23.0-SNAPSHOT.jar \
      --executor-memory 20G \
      GCDSS.jar \
      $1 $2 $3 $4

Parameters:

$1: operation type, such as markDuplicate, sort, BQSR, realignIndel
$2: reads file location
$3: output path
$4: variant file location, passed as $vcf in the example below
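
The parameters above can be checked before a job is submitted. The following is a minimal sketch, assuming the four operation types listed are the only ones accepted (the list says "such as", so the real set may be larger). `is_valid_operation` and `check_args` are hypothetical helper names and are not part of GCDSS.

```shell
#!/bin/sh
# Hypothetical helper (not part of GCDSS): succeeds if the operation type
# is one of the four named in the parameter list. The real set of
# accepted operations may be larger; this is an assumption.
is_valid_operation() {
    case "$1" in
        markDuplicate|sort|BQSR|realignIndel) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: checks the arguments before spark-submit is launched.
# Returns 2 if fewer than three arguments are given, 1 if the operation
# type is not recognized, and 0 otherwise.
check_args() {
    if [ "$#" -lt 3 ]; then
        echo "usage: <operation> <reads> <output> [variants]" >&2
        return 2
    fi
    if ! is_valid_operation "$1"; then
        echo "unknown operation type: $1" >&2
        return 1
    fi
    return 0
}
```

In a wrapper script, calling `check_args "$@" || exit $?` before the `spark-submit` line would stop a mistyped operation before any cluster resources are requested.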

For example:

    for num in 2000000 10000000 20000000
    do
        for i in {1..5}
        do
            # Input reads, output path for this run, and variant file
            fq="/xubo/project/alignment/CloudBWA/g38/time/cloudBWAnewg38L50c${num}Nhs20Paired12time10num16k1.adam"
            out="/xubo/project/alignment/CloudBWA/g38/time/cloudBWAnewg38L50c${num}Nhs20Paired12time10num16k1.markDuplicateI${i}.adam"
            vcf='/xubo/callVariant/vcf/vcfSelectAddSequenceDictionaryWithChr.adam'

            # Remove any previous output, run markDuplicate, then clean up
            hadoop fs -rm -R -f "$out"
            sh testReadPostProcessing.sh "markDuplicate" "$fq" "$out" "$vcf"
            hadoop fs -rm -R -f "$out"
        done
    done
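
The path naming used in the loop above can be factored out and previewed without touching Hadoop or Spark. This is a sketch only: `input_path`, `output_path`, and `dry_run` are hypothetical names introduced here, and the base path and suffix are copied from the example. The inner loop spells out `1 2 3 4 5` because `{1..5}` is a bash extension that a plain POSIX `sh` may not expand.

```shell
#!/bin/sh
# Base path and file-name suffix, copied from the example above.
base='/xubo/project/alignment/CloudBWA/g38/time/cloudBWAnewg38L50c'
suffix='Nhs20Paired12time10num16k1'

# Hypothetical helper: input reads path for a given read count ($1).
input_path() {
    printf '%s%s%s.adam\n' "$base" "$1" "$suffix"
}

# Hypothetical helper: output path for read count ($1), operation ($2),
# and iteration number ($3).
output_path() {
    printf '%s%s%s.%sI%s.adam\n' "$base" "$1" "$suffix" "$2" "$3"
}

# Dry run: print the commands the loop would execute instead of running them.
dry_run() {
    for num in 2000000 10000000 20000000; do
        for i in 1 2 3 4 5; do
            out=$(output_path "$num" markDuplicate "$i")
            echo "hadoop fs -rm -R -f $out"
            echo "sh testReadPostProcessing.sh markDuplicate $(input_path "$num") $out"
        done
    done
}
```

Running `dry_run` prints two commands per iteration, which makes it easy to review the generated paths before starting a long benchmark.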

More shell examples are in the project's sh files.

4 Appendix

CloudBWA is a distributed read mapping algorithm in GCDSS.

The code of CloudBWA is in a separate GitHub project: CloudBWA

Tutorial

The tutorial and docs are being written.

Help

If you have any questions or suggestions, please open an issue in this project or send an e-mail to me: [email protected]
