

title: R workshops
author: Tim Knutsen
date: 12.1.2016
output: html_document

Repository for the CIGENE R workshops.

We used to have an R-lovers club, so this is a great opportunity to revive it.

Table of Contents

  1. dplyr example session Monday 01.02.16
  2. Preparation
  3. Built in Datasets
  4. Learning
  5. Version control with git and github

dplyr example session Monday

  1. Download this repository to your computer by clicking the Download Zip button. Unzip it and change your working directory to the unzipped folder, and you will have the data ready to copy and paste from the file below.
  2. Navigate to https://github.com/timknut/r_workshops/blob/master/dplyr_parsing.Rmd and run the commands.
  3. Alternatively, you can open the dplyr_parsing.Rmd file directly and run the commands from that script.
  4. Go through salmon_res.R, which shows a simple analysis with dplyr and Bioconductor.
  5. The two websites mentioned during the workshop for GO analysis: http://www.geneontology.org and http://revigo.irb.hr/

Preparation

To prepare for this, I suggest that you find yourself some data and try to solve some problems, for example:

  • Read the data into R.
  • Select columns into a new data frame.
  • Filter the new data frame based on a column.
  • Make a new column by adding two other columns together.
  • Summarize the data, e.g. by grouping by a column and showing the mean for each group.
  • Plot your results using ggplot2. See http://www.cookbook-r.com/Graphs/index.html
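
The steps above could look roughly like this with dplyr and ggplot2. This is only a sketch: it assumes both packages are installed, and it writes the built-in mtcars dataset to a temporary CSV file as a stand-in for your own data. The cut-off of 100 hp and the derived column `hp_plus_wt` are chosen purely for illustration.

```r
library(dplyr)
library(ggplot2)

# Stand-in for your own file: write mtcars to a temporary CSV
csv_file <- tempfile(fileext = ".csv")
write.csv(mtcars, csv_file, row.names = FALSE)

# 1. Read the data into R
cars <- read.csv(csv_file)

# 2. Select columns into a new data frame
cars_small <- select(cars, mpg, cyl, hp, wt)

# 3. Filter the new data frame based on a column
cars_strong <- filter(cars_small, hp > 100)

# 4. Make a new column by adding two other columns together
cars_strong <- mutate(cars_strong, hp_plus_wt = hp + wt)

# 5. Summarise by group: mean mpg for each number of cylinders
mpg_by_cyl <- cars_strong %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg), n = n())

# 6. Plot the result as a bar chart
p <- ggplot(mpg_by_cyl, aes(factor(cyl), mean_mpg)) +
  geom_col() +
  labs(x = "Cylinders", y = "Mean miles per gallon")
print(p)
```

With your own data, replace the temporary file with the path to your file and adjust the column names.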

Built in Datasets:

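  • Use the built-in datasets in R:
# Install ggplot2 if it is missing, then load it so that aes() and geom_point() are available
if (!requireNamespace("ggplot2", quietly = TRUE)) install.packages("ggplot2")
library(ggplot2)

cars <- mtcars
flowers <- iris
diamonds <- ggplot2::diamonds

# Horsepower against miles per gallon, coloured by number of cylinders
ggplot(cars, aes(hp, mpg)) +
  geom_point(aes(colour = as.factor(cyl)))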

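
Before working with a built-in dataset it helps to look at what is available and how each one is structured. A minimal sketch using only base R functions (the variable name `available` is just for illustration):

```r
# Names of the datasets shipped with the 'datasets' package
available <- data(package = "datasets")$results[, "Item"]
head(available)

# Structure, first rows and a quick numeric summary
str(iris)
head(mtcars)
summary(iris$Sepal.Length)

# Number of rows and columns
dim(mtcars)
dim(iris)
```

Typing `?mtcars` or `?iris` opens the help page that describes each column.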

Learning:

There are many, many books and resources, and it is easy to get lost. Find a good source and stick to it. Also, Google will solve almost everything. For the basics, I really like http://www.statmethods.net/ . It has examples for doing 90 % of the most basic stuff you need to do.

Version control with git and github.

If you are going to write a lot of code, I really, REALLY recommend reading about version control with git and how it can help you organize, back up, collaborate on and share your code. This article gives an overview. There are many other tutorials out there. Just google around if you feel like it. If you master git, you will be ahead of your techno-conservative project leader.

Hadley

For this workshop, we can go through how to do a basic RNA-seq analysis using Hadley Wickham's package dplyr, which I find immensely useful: https://github.com/hadley/dplyr He has created a small intro tutorial using a built-in dataset: https://cran.r-project.org/web/packages/dplyr/vignettes/introduction.html
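
To give a flavour of what dplyr can do with expression data, here is a small sketch. The gene names, sample names and read counts are invented for illustration, and the counts-per-million scaling and fold change shown here are a toy calculation, not a proper differential expression analysis and not necessarily what the workshop scripts do.

```r
library(dplyr)

# Hypothetical toy data: raw read counts for three genes in two samples
counts <- data.frame(
  gene   = rep(c("geneA", "geneB", "geneC"), times = 2),
  sample = rep(c("control", "treated"), each = 3),
  count  = c(100, 300, 600, 400, 400, 1200)
)

# Scale counts to counts per million (CPM) within each sample
cpm <- counts %>%
  group_by(sample) %>%
  mutate(cpm = count / sum(count) * 1e6) %>%
  ungroup()

# Log2 fold change (treated relative to control) for each gene
fold_change <- cpm %>%
  group_by(gene) %>%
  summarise(log2_fc = log2(cpm[sample == "treated"] / cpm[sample == "control"])) %>%
  arrange(desc(log2_fc))

fold_change
```

The same group_by, mutate and summarise pattern carries over to larger tables with many genes and samples.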

Videos

If you like videos, I think this course is easy to understand and explained well. http://varianceexplained.org/RData/

If you REALLY like videos, go for one of these full-blown online courses: https://www.coursera.org/learn/r-programming and https://www.coursera.org/learn/data-cleaning , and see the related courses.
