Giter VIP home page Giter VIP logo

angus's People

Contributors

acharbonneau avatar adamjorr avatar astrobiomike avatar bluegenes avatar brooksph avatar charlesreid1 avatar crivaldi avatar ctb avatar elijahlowe avatar fpsom avatar joiry avatar karenword avatar katffowler avatar luizirber avatar rachellombardi avatar raquelpinho avatar raynamharris avatar rociotmartinez avatar sateeshperi avatar shannonekj avatar sichongp avatar taylorreiter avatar upendrak avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

angus's Issues

Plink download fails

Need to replace code for downloading PLINK in the Genome wide association study tutorial

running large scale blasts - what should we do?

https://angus.readthedocs.io/en/2017/running-blast-large-scale.html doesn't exist yet.

my thought was to run something that took a little while, output some data that we could use in https://angus.readthedocs.io/en/2017/visualizing-blast-scores-with-RStudio.html, and let us show off "starting something big and walking away for a bit."

my first naive thought was to calculate BLAST score distributions for some large reciprocal BLAST (e.g. ecoli vs shewanella or something), visualize it, and then ask students to pick a cutoff based on the graph, and maybe do a little bit of analysis of how many putative orthologs there are.

or we could drag shmlast into this.

@camillescott what do you think?

table in Bibliography part of R markdown tutorial not rendering

See below

## Bibliography  

It's also possible to include a bibliography file in the YAML header.  Bibliography formats that are readable by Pandoc include the following:  

| Format | 	File extension   |
|-----+-------|
| MODS  | 	.mods |
| BibLaTeX  | .bib  |
| BibTeX  | .bibtex  |
| RIS  | .ris  |
| EndNote  | .enl  |
| EndNote XML  | .xml  |
| ISI  | .wos  |
| MEDLINE  | .medline  |
| Copac  | 	.copac  |
| JSON citeproc  | 	.json  |

second line should be | --- | --- |

Syntax error in GWAS lesson $

All_SKAT_Data_reduced$ <- All_SKAT_Data_reduced[order(All_SKAT_Data_reduced$P.value),]
Has one too many $.

Perhaps progressive taxation will help?

theme improvements

  • Activate syntax highlighting for code blocks
  • Make the code blocks more visually distinct than other text
  • Number the code blocks (so people can reference back to it)

Notes for the Intro to automation lesson

Notes to be included in the lesson

-Use set -x: stops the execution of a script if a command or pipeline has an error - which is the opposite of the default shell behaviour, which is to ignore errors in scripts

-Use set -e: prints out each command before executing it - extra useful for debug

Both options are great to include in the beginning of the file

  • Include #! /bin/bash in the beginning of the script to define the script interpreter

  • Include bash <scriptfile> as a means of running the script file (instead of going directly to ./<scriptfile>)

  • Include information on batch scripts

  • Check for flag in curl for not re-downloading the file every single time

  • Add the way to rename a screen; at creation screen -S <name>, after creation Ctrl+a, :sessionname <name>

  • Add commands to detach a screen without the shortcut: screen -d <ID>

  • Another way to terminate a screen without needing to reattach: screen -X -S [session # you want to kill] quit

docker as a topic

we should at least mention this, and maybe demo it (if we have a group of people that are prepared for complexity in 2nd week)

File not found error in sourmash tutorial

In the "what's in my metagenome" section, the command
sourmash gather -k 31 ecoli-genome.sig genbank-k31.sbt.json
fails with a file not found error. Need to add in ../ before the genbank file

exports are not disconnection friendly

In the nanopore tutorial (and probably others) there are quite a few export that get forgotten when people get disconnected from their favourite cloud provider

export PATH=/home/linuxbrew/.linuxbrew/bin:$PATH

export PATH=$PWD/prokka/bin:$PATH

TODO: add to .bashrc or .bash_profile and source it

Overall thoughts on current status of materials

Some high level thoughts from a quick review --

We don't have diagrams or explanations for a lot of what we are doing in week 1.

We should sprinkle in topic guides to help pace things through week 1.

Many of the files downloaded and used are coming in from other repos, e.g. the install-edgeR.R script here. We should move them into angus/2018.

We don't have any formative assessment stuff sprinkled in (questions, mechanisms, etc.)

We may want to add in instructor guides for extra material in case people end too quickly.

We should add hackmds for each major topic shift, as per day1.

DESeq2 install

On a new instance, following http://angus.readthedocs.io/en/2017/deseq2-asthma.html, install-deseq2.R doesn't install GenomicFeatures (nor DESeq2)

R version 3.4.1 (2017-06-30) -- "Single Candle"
> library(GenomicFeatures)
Error in library(GenomicFeatures) :
  there is no package called ‘GenomicFeatures’

when installing with install-deseq2.R:

* removing ‘/usr/local/lib/R/site-library/GenomicAlignments’
ERROR: dependency ‘annotate’ is not available for package ‘genefilter’
* removing ‘/usr/local/lib/R/site-library/genefilter’
ERROR: dependency ‘annotate’ is not available for package ‘geneplotter’
* removing ‘/usr/local/lib/R/site-library/geneplotter’
ERROR: dependencies ‘GenomicRanges’, ‘XML’, ‘GenomeInfoDb’, ‘RCurl’, ‘Rsamtools’, ‘GenomicAlignments’ are not available for package ‘rtracklayer’
* removing ‘/usr/local/lib/R/site-library/rtracklayer’
ERROR: dependencies ‘GenomicRanges’, ‘SummarizedExperiment’, ‘genefilter’, ‘geneplotter’ are not available for package ‘DESeq2’
* removing ‘/usr/local/lib/R/site-library/DESeq2’
ERROR: dependencies ‘GenomeInfoDb’, ‘GenomicRanges’, ‘RCurl’, ‘rtracklayer’, ‘biomaRt’ are not available for package ‘GenomicFeatures’
* removing ‘/usr/local/lib/R/site-library/GenomicFeatures’```

discuss: file naming schemes

points to touch on:

  • file names don't intrinsically "mean" anything in UNIX
  • but it's good to pick something systematic
  • separate raw data and worked-on data
  • think about naming for tab-completion
  • e.g. Titus likes "sample.step1.step2.step3." for his stuff
  • think about future you - how will you know/guess what a file is in 6 months?

software install discussion

put in an optional discussion/info on apt-get vs pip vs 'git clone' vs perl vs ruby (if we have any ruby)

section/session feedback form

do we want to link in session feedback forms? if so how do we avoid unconscious bias issues?

@lexnederbragt has a nice formula "a wish and a star" - we could ask attendees who are giving feedback on a session to say at least one nice thing (a star) as well as one "wish" for change.

I would also say that only a few of us could/should read the feedback and then summarize to the instructors. this can be a lot of work, though, and in the past I've never gotten to actually doing it.

@karenword ?

variant calling lesson probz: EOF error, installs, etc

Problems I see from variant calling lesson:
(https://github.com/ngs-docs/angus/blob/2017/variant-calling.md)

  • EOF error on command to sort bam file by position in genome samtools sort SRR2584857.bam SRR2584857.sorted
  • no installation for vcftools
  • extracting reads command at the end of lesson has same position in e. coli genome samtools view SRR2584857.sorted.bam 'ecoli:920514-920514' > out.bam
  • also for previous line, should be out.sam to be human readable and compatible with wc -l

I'll try and fix some of these tomorrow!

introductions

for tonight, maybe -

  • lead instructors introduce selves
  • code of conduct, sticky notes, index cards (what else?)
  • web site
  • booting jetstream
  • quick slack intro while computers booting
  • BLAST
  • done & sleep!

Wrong filename in GWAS lesson

The line defining All_SKAT_Data_reduced in NGS_GWAS_via_SKAT.md should be

All_SKAT_Data_reduced <- read.table(file = "data/SKAT_all_reduced-pvals.results", header =TRUE)

Comments on Introduction to Docker

Comment about the Intro to Docker tutorial.

  • If the students are already familiar with Git, then it might make sense to make some parallel between container ID's and git commits, and container "name" and git tags. Docker is essentially building a tree based on the base container.
  • When building the Dockerfile, it would be better to put the apt-get update and apt-get install on the same line, the same way it was done interactively. It has a few benefits: it builds only one layer, and the install runs only if the update is successful. Arguably, the git clone and the make call could be together too, but that might obscure things too much. It's probably better to just stick to the sequence of actions done interactively, i.e. the dockerfile is essentially the content of history with a few special commands prepended to each line.

RNAseq lesson: add reference

The link to the 48-replicate experiment lacks the reference, which will survive longer than the link.
Add author, publication information for this paper to hands-on.Rmd

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.