rstudio-education / primers Goto Github PK
View Code? Open in Web Editor NEWThe learnr tutorials in RStudio Cloud's primers
Home Page: https://rstudio.cloud/learn/primers
License: Other
The learnr tutorials in RStudio Cloud's primers
Home Page: https://rstudio.cloud/learn/primers
License: Other
After I previewed a tutorial locally (open .Rmd
in RStudio 1.3, run tutorial), Git reported a new file tidy-data/01-Reshape-Data/rsconnect/shinyapps.io/gvwilson/01-reshape-data.dcf
. I presume this shouldn't be checked into the repo - what change is needed to .gitignore
? (There are lots of .dcf
files in the repo, so I'm guessing *.dcf
isn't the right answer.)
Hey!
As I was reading through the primers on rstudio.cloud today, I noticed a typo. It seems that the typo was fixed in commit 712baa0. However this isn't reflected in the deployment on cloud (https://rstudio.cloud/learn/primers/2.1)
in 03-join-datasets.Rmd, "The easiest way to do this is to save the input data sets as a named list." seems a bit of a jump - the primer has introduced named vectors, but not named lists (or lists at all, really).
The text in visualize-data/01-bar-charts/www/images/positions.png is slightly blurry for me - is it possible to create an SVG version of this image?
In 01-tibbles.Rmd, image transform-data/01-tibbles/www/images/tibble_display.png, why does the readr logo show up?
I load libraries using library(tidyverse)
rather than library("tidyverse")
; this tutorial tells me to do the latter and doesn't mention the former. Should it? Which is more common?
In 03-deriving.Rmd, "which take a vector of input" => "which take vectors as inputs" maybe?
In introduction-to-iteration.Rmd, "R’s lapply family" -> lapply
hasn't been introduced, so readers won't know what this refers to.
in 01-Reshape-Data.Rmd: "Here the type column lists the keys" - what's a "type column"? The term doesn't seem to appear elsewhere in the primers.
The primer home page (https://rstudio.cloud/learn/primers) lists three primers as "Coming Soon." They are Report Reproducibly, Build Interactive Web Apps, and Organize Your Work. They look really good! Is there a schedule for when they might become available?
Why is the section titled "Isolating Data"? Doesn't seem to connect with the content.
I'm in the process of developing materials for a data science class I will teach in fall. I would like to include parts of some of the primers in my exercises for students. Is such re-use and adaptation acceptable? I can't find the licensing conditions for the primers specified anywhere. Thanks!
Add a section to the primer on testing that discusses error handling.
just starting w R and RStudio Cloud! Also new to github. So feel free to let me know how to do all this...
I found a small typo on the Isolating Data with dplyr primer.
The one_of example in the table is missing a closing parentheses. 😊
one_of() Columns whose name appear in the given set select(babynames, one_of(c("sex", "gender"))
visualize-data/
has both 01-Exploratory-Data-Analysis
and 01-bar-charts
- should the former not be 00
?
In multiple-vectors.Rmd:
usa_mod1 <- model1 %>% pluck("United States")
usa_mod2 <- model2 %>% pluck("United States")
What are model1 and model2? I'm guessing from context that they are models based on year and (year + GDP per capita) but it would be helpful to re-create them here (since I made tea between doing the previous tutorial and this one and have forgotten).
Dotplots are introduced, and then their shortcomings enumerated - given that this is a primer, not a reference, why bother introducing them at all?
in 01-Reshape-Data.rmd, "cases, table4a, and table4b are all two by two tables" - those tables don't look 2x2 to me. Is this a statistical term I don't know?
In list-columns.Rmd, the exercise that's introduced with "Add a new column to your data frame named models
" has this as a solution:
nested_gapminder %>% mutate(models = map(data, ~lm(lifeExp ~ year, data = .x)))
It may be worth pointing out that the word data
is being used in two different ways here.
I finish "tidyverse", and there's a button to go back to the previous topic, but how do I get to the next one in "Work with Data"? I can click on "Work with Data" to get the welcome page and then select "Isolating Data with dplyr", or click the hamburger menu and select it, but neither is immediately obvious.
As I work through the tutorials in a primer, it's sometimes hard to tell which one I'm in and which one is next. For example, this screenshot shows the menu when I've just started the tutorial on how to write a function - could that line in the drop-down be highlighted more clearly (I have old eyes), and/or could the title of the current tutorial show up in the menu bar?
This package provides a shortcut for installing and loading the entire suite of packages in “the tidyverse”, e.g.
Dangling sentence in 01-tibbles.Rmd.
Might be worth pointing out that sep = 1
in separate()
splits after location 1, not from location 1 and forward (i.e., the indices are the top ends of ranges, not the starts).
in 01-tibbles.Rmd, "However, babynames
is already a tibble" - should explain how I can tell.
If I submit a wrong answer for an MCQ, I have to click "Try Again" before I can change my answer. It seems like I ought to be able to immediately select another radio button option and submit that without the extra step.
The first example of a named vector in 03-join-datasets.Rmd is c(uno = 1, dos = 2, tres = 3)
without quotes around the names, but later examples doing joins quote the names. Quote the names here for consistency?
In 03-join-datasets.Rmd, the discussion of naming the source of data in bind_rows
says, "bind_rows() will use the character string as the name of a new column that displays the name of the data set that each row comes from (as determined by the names in the list)." However, when I run:
bands <- list(df1 = band,
df2 = band2)
band %>% bind_rows(band2, .id = "origin")
the values in the column "origin" are "1"
and "2"
, not (as I expected) "df1"
and "df2"
.
"functions are like Las Vegas" -> readers outside North America may not catch this (particularly if English isn't their first language)
some directories and files have all lowercase names (e.g., ./visualize-data/04-boxplots/04-boxplots.Rmd
) while others have mixed case (e.g., ./tidy-data/01-Reshape-Data/01-Reshape-Data.Rmd
). Is there a reason?
The discussion of pmap
described the use of named arguments; the first example (ANOVA across three models) doesn't use them. Introduce another example that does to make the ramp a little gentler?
When working through the questions under any particular heading, I don't have any feeling for how much is left in the section. Some sort of progress bar showing how many sections or questions have been done/are left would be comforting.
In 03-deriving.Rmd, "with the exception of ties, there was only one 1 in the entire data set, only one 2, and so on" - once ties are excluded, isn't there always one of each?
babynames %>%
group_by(year, sex) %>%
summarise(total = sum(n)) %>%
ggplot() +
geom_line(aes(x= year, y= total, color = sex)) #total = sum(n)
In 04-boxplots.Rmd, "Here each group functions like a vertical histogram." I would say that the output was a set of horizontal histograms.
In list-columns.Rmd, I got to this:
Within mutate(), use a function that takes a list as input (models) and returns a double vector as output to place in coefficient. Don’t forget to actually provide models as an argument when you call the function.
Use the map expression ~coef(.x) %>% pluck("year") to populate coefficient with the year coefficients of each model.
and was lost - there's a lot going on in this all at once.
Why does the "Your turn" example use a data.frame
instead of a tibble?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.