brad-cannell / r4epi

Repository for the R for Epidemiology book

Home Page: http://www.r4epi.com/

License: Other

CSS 0.15% TeX 1.34% HTML 98.05% R 0.45%

epidemiology r data-analysis data-visualization data-management

r4epi's Introduction

R4Epi Electronic Textbook

This repository is for the R for Epidemiology electronic textbook. This electronic book was originally created to accompany my Introduction to R Programming for Epidemiologic Research course at the University of Texas Health Science Center School of Public Health. However, I hope it will be useful to anyone who is interested in R and epidemiology.

Useful sites:

Tasks are located at: https://github.com/orgs/brad-cannell/projects/3
Bookdown help: https://bookdown.org/yihui/bookdown/

Textbook version Notes:

Major: physical copy editions
Minor: new chapters, deletion of chapters, chapter reordering.
3rd level: significant edits to existing chapters
Version number doesn’t change with typo (i.e., spelling and grammar) corrections.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

r4epi's Issues

Move over front matter to Quarto

Overview

Move over the front matter from index.Rmd to this repository.

Although it isn't directly related to moving over index, there are a number of issues in the R4Epi project related to the functionality of Quarto. We should probably get those knocked out before

Update structure page in Wiki.
Add formatting guidance to the formatting page in the wiki.
Add font awesome graphics to wiki
Add TOC to the top of wiki pages
Add "Top" throughout
Clean up the wiki
Add instructions for adding hyperlinked keywords to the glossary (#97)

Tasks

Create a glossary, add hyperlinked keywords, and maybe a note to NOTES (#97)
Add reference page
Finish moving over the rest of the content for the contributing chapter
- Revise the text
- Republish so that we can get accurate screenshots
- Update screenshots as needed
- Move over the Issues section
Move over the License information section
Move over the About the authors page

Add skip patterns chapters

The PowerPoints should be pretty easy to adapt. I just didn't have time in Summer 2020. I assigned them the videos on YouTube instead.

Review and improve the Populations chapter

Overview

In the Fall of 2023, I moved over a bunch of stuff from PowerPoint slides (nearly) verbatim. I was in a rush, so I told myself to just move it over and improve it later.

Go back, reread, and improve. PowerPoint doesn't always translate perfectly to book format.

Population plots

In the future, I may actually want to show readers how to make population plots. That may be useful information in a book about using R to do applied epidemiology. Perhaps just add the functions to the appendix?

Clean up the wiki

Overview

I'm trying to create a wiki that will help me (and any other potential coauthors) create and revise content for the book in a more efficient and consistent way.

On 2022-12-14, I moved over a lot of the content from two Google Docs that were sort of serving the same purpose (R for Epidemiology Textbook Notes and 📚Textbook). However, it needs a lot of cleaning up. Some of the content is outdated, some notes were just quickly jotted down, and the organization isn't good.

Task list

Figure out the nuts and bolts of building the wiki (#82, #78, #79)
Complete first draft of formatting page
Complete first draft of ideas page
Complete first draft of references page

Left off at...

I was working on this and then jumped into #82

Change to freqtables in the Describing the relationship between a categorical outcome and a categorical predictor chapter

Right now, it uses all gmodels::CrossTable().

Add version number conventions to README

Textbook versions:

Major: physical copy editions
Minor: new chapters, deletion of chapters, chapter reordering.
3rd level: significant edits to existing chapters
Version number doesn’t change with typo (i.e., spelling and grammar) corrections.

Revise to intro to epi chapter

Overview

I want to add all the modules from the Fall 2022 Epidemiology III class to the epidemiology half of the book.

The plan for right now is just to get the slides into Rmd format wholesale. I can make them better later.

Left off

Reading through the Intro to epi chapter.
I got sidetracked creating the wiki. Come back to this when the wiki is done.
I turned the entire PowerPoint into images that can be added. I need to trim out the slides I don't actually need and give the slides that I will use more informative names.

Tasks

Revise the intro to epi page. There was some stuff on the Google Doc that made me feel like the intro section needs more development.

Expand discussion of vector types in section 5.2

Overview

From CRC Press review: One suggestion I have is to maybe expand a bit the vector types in section 5.2 and introduce vectors of characters and factors. Factor vectors are introduced in section 19, which seems rather late, since the importing part often involves messing with character and/or factor variables/columns.
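A minimal sketch of what an expanded section 5.2 example might cover, using base R only. The object names and values below are made up for illustration and are not taken from the book.

```r
# Three common vector types; all names and values here are hypothetical.
heights <- c(64.5, 70, 68.2)                  # numeric (double) vector
first_names <- c("Ana", "Ben", "Cruz")        # character vector
education <- factor(
  c("college", "high school", "college"),
  levels = c("high school", "college")        # levels set the category order
)

class(heights)      # "numeric"
class(first_names)  # "character"
class(education)    # "factor"

# A factor stores integer codes that point to its levels.
levels(education)
as.integer(education)
```

Showing the integer codes behind a factor may help readers later, when imported columns arrive as character or factor data.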

Show them how to make a project in RStudio

Show them how to make projects. You don’t have to give a ton more explanation than you do now, but add screenshots showing them how to create a project.

Add relative file paths to projects chapter and file paths chapter. Instead of directions from home, we are getting directions from somewhere else that is already on the way.

https://r4ds.had.co.nz/workflow-projects.html
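A rough sketch of the absolute versus relative path idea, assuming a project folder that contains a data subfolder. The folder and file names are hypothetical.

```r
# Absolute path: directions from "home". This path is hypothetical and
# would only work on one particular computer.
absolute_path <- file.path("C:", "Users", "student", "my_project", "data", "survey.csv")

# Relative path: directions from where we already are. Inside an RStudio
# project, the working directory is the project folder by default.
relative_path <- file.path("data", "survey.csv")

# R resolves a relative path against the current working directory.
full_path <- file.path(getwd(), relative_path)
full_path
```

Because the relative path never mentions a user name or drive letter, the same code should work for anyone who opens the project.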

Add logo to the homepage

Make the epi hex logo the logo for the R4Epi book.
Use this code as an example: https://geocompr.robinlovelace.net

Test issue

Add vocabulary to Using R for Epidemiology chapter

We may want to introduce some basic vocabulary very early in the book. This is not a complete appendix, it's just a short review chapter that will get us up to speed. Here is a running list of potential words to start with:

Turn downloading and installing R and RStudio into an appendix

Overview

I was reading some samples of books on Power BI and SharePoint on my Kindle last night. It was annoying how they all started with chapters on installing the software. Then, I realized that R4Epi does the same thing. Let's keep the downloading and installing material for people who want it, but let's make it an appendix. That way, people who don't need that content can jump directly into something meatier.

Write first draft of Random Error chapter

Overview

Fall 2023

This content needs to come before we start calculating confidence intervals for anything. Given that the source of random error we focus on the most is sampling variability, perhaps it should come right after our discussion of populations. We can then immediately start calculating confidence intervals in the measures of occurrence chapter. This will also mean that we won't have learned any measures that we can use for examples in the random error chapter. So, we would just have to say something like, "don't worry about the measure for now. Just interpret the p-value and/or confidence interval." I don't think I like that. So, let's keep random error between measures of occurrence and measures of association.

I didn't have time to write this chapter. I assigned Modern Epidemiology chapter 15 instead.
My intent is to create a lab warm-up in PowerPoint that can serve as a foundation for the random error chapter in R4Epi.
If I get that PowerPoint made this week, then I can start the chapter by moving that content over.
This task is related to brad-cannell/epi_3_public#26

Terms/concepts to include

For each of the measures covered in measures of occurrence, show them how to calculate the confidence interval, p-value, and p-value curve.
What is random error? Chance vs. deterministic
P-value curves (https://paperpile.com/app/p/9c5734e3-eace-4919-9300-f8c046bcdd5d)
Sample size effect on p-values, confidence intervals, and p-value curves
Simulate differential and non-differential misclassification using the methods in Rudolph and Fox, example 2.
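One possible starting point for the sample size item above is sketched below. It assumes a single proportion as the measure and uses a simple Wald (normal approximation) interval and z-test. That choice, the function names, and the counts are illustrative assumptions rather than the chapter's planned method.

```r
# Wald confidence interval for a proportion (normal approximation).
wald_ci <- function(events, n, conf_level = 0.95) {
  p_hat <- events / n
  z <- qnorm(1 - (1 - conf_level) / 2)
  se <- sqrt(p_hat * (1 - p_hat) / n)
  c(estimate = p_hat, lower = p_hat - z * se, upper = p_hat + z * se)
}

# Two-sided p-values across a grid of hypothesized proportions.
# Plotting these against null_values gives a p-value curve.
p_value_curve <- function(events, n, null_values) {
  p_hat <- events / n
  se <- sqrt(p_hat * (1 - p_hat) / n)
  z <- (p_hat - null_values) / se
  2 * pnorm(-abs(z))
}

# Same observed proportion (0.2), two hypothetical sample sizes.
small_sample <- wald_ci(events = 20, n = 100)
large_sample <- wald_ci(events = 200, n = 1000)

null_values <- seq(0.05, 0.35, by = 0.01)
curve_small <- p_value_curve(20, 100, null_values)
curve_large <- p_value_curve(200, 1000, null_values)

# The larger sample gives a narrower interval and a narrower curve.
small_sample["upper"] - small_sample["lower"]
large_sample["upper"] - large_sample["lower"]
```

With the point estimate held fixed, the only thing that changes between the two calls is n, which isolates the effect of sampling variability.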

Tasks

Currently, the lab warm-up R code uses a regression model to demonstrate p-values, but we haven't yet covered regression. I should probably come back and change this to some measure from measures of occurrence, which we have already covered.

Add executable embedded R code chunks throughout the chapters

Overview

I want to embed interactive coding practice blocks and quiz questions in the chapters.

Best solution for embedding R coding exercises into a bookdown book: https://rstudio.github.io/learnr/. It doesn’t look like you can currently add interactive quiz questions directly into bookdown books. I think the best you can currently do is build a learnr app with shiny, post it to shinyapps.io, and then add links to shinyapps.io into your bookdown book. Alternatively, you could create an R4Epi package that includes data, interactive tutorials, and automatically downloads freqtables and meantables as dependencies. Eventually, this may be the sort of thing you want to charge for.

See if Quarto changes this.

2023-01-23: Brian Law suggested I try WebR. He just warns that it is "very beta" right now.

Add sensitivity and specificity

Add a chapter (or at least a section) on sensitivity and specificity. Possibly to the descriptive analysis part.
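As a possible seed for that section, here is a minimal sketch of the two calculations from the cells of a 2x2 table. The function name and the counts are hypothetical.

```r
# Sensitivity and specificity from the four cells of a 2x2 table that
# compares a test result against true disease status.
diagnostic_accuracy <- function(true_pos, false_neg, false_pos, true_neg) {
  c(
    # Proportion of people with the disease who test positive.
    sensitivity = true_pos / (true_pos + false_neg),
    # Proportion of people without the disease who test negative.
    specificity = true_neg / (true_neg + false_pos)
  )
}

# Hypothetical counts: 100 people with the disease, 200 without.
accuracy <- diagnostic_accuracy(
  true_pos = 90, false_neg = 10,
  false_pos = 20, true_neg = 180
)
accuracy
```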

First draft of chapter on asking questions

Chapter 35.2 Across with filter

Hi,

"Chapter 35.2 Across with filter" needs to be revised. The reason is that usage of across() in filter() is deprecated.
For instance, this code in the book

df_xyz %>% 
  filter(
    across(
      .cols = everything(),
      .fns  = ~ !is.na(.x)
    )
  )

will generate the following message:
Using across() in filter() is deprecated, use if_any() or if_all().

Kind regards,
Leyla
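A minimal sketch of the replacement that the deprecation message points to, using if_all() to keep rows where every column is non-missing. The small df_xyz below is a hypothetical stand-in, not the data frame from the book.

```r
library(dplyr)

# Hypothetical stand-in for the book's df_xyz data frame.
df_xyz <- tibble(
  x = c(1, NA, 3),
  y = c("a", "b", NA),
  z = c(TRUE, FALSE, TRUE)
)

# if_all() keeps a row only when the condition is TRUE for every
# selected column, which is what across() did inside filter().
complete_rows <- df_xyz %>%
  filter(
    if_all(
      .cols = everything(),
      .fns  = ~ !is.na(.x)
    )
  )

complete_rows
```

Swapping if_all() for if_any() would instead keep rows where at least one column is non-missing.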

Switch from magrittr pipe to base R pipe

Overview

From CRC Press review: The authors also use the magrittr pipe, %>%, rather than the base R pipe, |>. It might be worth mentioning both in chapter 11 and briefly comparing them (or pointing the reader to an external source for further details).
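A short side-by-side comparison along the lines the reviewer suggests might look like the sketch below. It assumes R 4.2 or later (for the base pipe's `_` placeholder) and that magrittr is installed. The example values are made up.

```r
library(magrittr)

values <- c(1, 4, 9)

# Both pipes pass the left-hand side as the first argument of the next call.
magrittr_result <- values %>% sqrt() %>% sum()
base_result <- values |> sqrt() |> sum()

# The placeholders differ: magrittr uses `.`, while the base pipe uses `_`,
# which must be supplied to a named argument.
magrittr_fit <- mtcars %>% lm(mpg ~ wt, data = .)
base_fit <- mtcars |> lm(mpg ~ wt, data = _)

magrittr_result
base_result
```

For simple chains the two pipes behave the same, so the comparison in chapter 11 could probably stay brief.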

Write first draft of chapter on Git and GitHub

This is now going to be multiple separate chapters

Introduction to git and GitHub
Using git and GitHub
Common Git and GitHub workflows
In the introduction chapter, link back to the section on making contributions to the book and vice versa.
Replace contributing to R4Epi portion of the book's welcome page in the Example 1: Contribute to R4Epi section with a link to the actual contributing to r4epi section. See if it works after build book.

Add a section on using RStudio's Find and Replace Tool

Overview

I actually use the find and replace tool quite a bit. In the Intro to R class, I teach students how to use the Find and Replace Tool to make it easier to copy and paste data into RStudio. I think I should also add this to the textbook.

Scenarios:

Add commas between values in a vector.
Add spaces and commas to a data frame -- use a baby example data frame.
Changing the name of a variable or data frame.
Regular expressions?
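For the first scenario, a code-based counterpart to the Find and Replace Tool could be sketched as follows. This is only an illustration of the same idea with a regular expression in base R. The pasted values are made up.

```r
# Values pasted from a spreadsheet, separated by spaces (hypothetical).
pasted_values <- "12 15 9 22"

# Replace each run of whitespace with a comma and a space, which is the
# same substitution we would type into the Find and Replace Tool.
with_commas <- gsub("\\s+", ", ", pasted_values)

# Wrap the result in c() to get code we could paste into a script.
vector_code <- paste0("c(", with_commas, ")")
vector_code

# Alternatively, scan() can read the pasted text directly into a vector.
parsed_values <- scan(text = pasted_values, quiet = TRUE)
parsed_values
```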

Add font awesome graphics to wiki

Overview

Link to wiki

Play around with including font awesome graphics instead of, or in addition to, emojis. Font awesome should have a more consistent look across operating systems.

See here: https://github.com/Netflix/Hystrix/wiki
And here: https://github.com/d3/d3/wiki

Also, add font awesome to headers and TOC

Terms to add to the measures of occurrence chapter

Overview

In the Fall of 2023, I was adding the content from PowerPoint to the book. There were some hidden slides with terms I wanted to add to the chapter, but hadn't gotten around to yet. I'm writing them below in hopes that I will get time to add them sometime soon.

Terms

In looking through this list, some of these are not appropriate for the measures of occurrence chapter. I need to move them to a different list at some point.

Put instructions for making feedback on R4Epi

Copy Hadley's page on making contributions.
And also on the README?

Figure out how to automatically check for broken links

Overview

I'd like to implement some kind of automatic checks for broken links in the book. There is an open issue requesting an automatic URL checker on Quarto's GitHub. That GitHub thread also recommends this website for doing manual checks.

Currently, I'm using the Test Quarto Book to experiment.

Left off at

Still looking for an automated solution. Try continuous integration?

Tasks

Figure out why the "Edit page on Github" links aren't working. I ran the 2023-07-12 version of Test Quarto Book through the link checker and none of those links were working.
Find a more automated, R-like way of checking for broken links than manually checking in dead link checker.
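One R-based direction to explore is sketched below, using only base R. The helper names extract_urls() and url_is_reachable() are hypothetical, the URL pattern is deliberately simple, and the reachability check needs network access, so it is defined here but not called.

```r
# Pull URLs out of a block of text with a simple regular expression,
# then strip any trailing sentence punctuation.
extract_urls <- function(text) {
  matches <- regmatches(text, gregexpr("https?://[^\\s)\"'>]+", text, perl = TRUE))
  urls <- unlist(matches)
  unique(sub("[.,;:]+$", "", urls))
}

# Ask the server for headers only and treat any status below 400 as OK.
# Requires network access; connection errors are reported as FALSE.
url_is_reachable <- function(url) {
  status <- tryCatch(
    attr(curlGetHeaders(url), "status"),
    error = function(e) NA_integer_
  )
  !is.na(status) && status < 400
}

sample_text <- "See [bookdown](https://bookdown.org/yihui/bookdown/) and https://www.r4epi.com/."
found_urls <- extract_urls(sample_text)
found_urls

# To check every link: vapply(found_urls, url_is_reachable, logical(1))
```

Running something like this over the rendered chapter files in a continuous integration job might be one way to automate the check.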

Edit instructions for changing preferences on Mac

Overview

Section 3.5 discusses how to change global options in RStudio.

Clicking on the Apple menu and then Preferences no longer works. Now, Mac users need to click on Global options... in the Tools menu just like Windows users.

Tasks

Update the text
Add a new screenshot

Incorporate a WebR practice exercise into the book

Overview

WebR allows users to run R in the browser. Can I use it to add practice exercises, with feedback, directly into R4Epi?

I have a working example in Test Quarto Book

Useful websites/resources

Tasks

Figure out what to do about PDF format
Figure out if you can pass data to webR code chunks
Figure out if you can make multiple choice questions with webR chunks

Review and improve the measures of occurrence chapter

Overview

In the Fall of 2023, I moved over a bunch of stuff from PowerPoint slides (nearly) verbatim. I was in a rush, so I told myself to just move it over and improve it later.

Go back, reread, and improve. PowerPoint doesn't always translate perfectly to book format.

Tasks

Remove sentence about across() being new from column wise chapter

Next year or so, delete this sentence:
"As of this writing, the across function is a relatively new addition to dplyr."

Add sidebar to Wiki

See this page for an example: https://github.com/Netflix/Hystrix/wiki/Configuration

Review and improve the Using R for Epidemiology Chapter

Overview

In the Fall of 2023, I moved over a bunch of stuff from PowerPoint slides (nearly) verbatim. I was in a rush, so I told myself to just move it over and improve it later.

Go back, reread, and improve. PowerPoint doesn't always translate perfectly to book format.

Tasks

Move the probability and conditional probability stuff out of measures of association and into intro to epi. We need to be able to discuss probabilities early on. Especially in the random error module.

Test using a Qmd file

Overview

We can create the README markdown file using an Rmd file with the output set to github_document. I wonder if we can use a qmd document to create the markdown pages used in the wiki?

Potential advantages

We can add R code/output to the wiki
We can add font awesome icons to the wiki

Footer and sidebar

I explored generating the footer and sidebar markdown documents from a qmd document, but at this point, I don't see any advantage to doing so. I'm going to continue writing those in pure markdown for now.

Tasks

Convert Home
Convert Formatting
Convert Ideas

Left off

When I left off, I had just finished converting formatting and ideas. I want to double-check everything before I close out this issue. If it all looks good, then move on to cleaning up the text.

Review and improve the measures of association chapter

Overview

In the Fall of 2023, I moved over a bunch of stuff from PowerPoint slides (nearly) verbatim. I was in a rush, so I told myself to just move it over and improve it later.

Go back, reread, and improve. PowerPoint doesn't always translate perfectly to book format.

Left off

2023-09-25

Finished the first draft of the chapter. There is lots of room for improvement.

Tasks

Add a section on writing commit messages

Do this after you complete the chapter on git and GitHub.
Then, go back and link the text in intro to git and github that says "That way, it will be easy to find that version in the future if we ever need to refer to it (assuming we give it an informative name)." to the newly created section on writing good commit messages.

Add social network connections to the welcome page

Add a lot more general information about ggplot2

Overview

We really don't have much explanation about how to use ggplot2 in the book. There are just a couple of plots with minimal explanation. I don't think we want to cover all of the basics of ggplot2, instead we should just refer them to Hadley's book (https://ggplot2-book.org/index.html). However, learners have repeatedly asked for more information than we currently give them.

In this part:

Cover the very basics of the grammar of graphics and how ggplot2 works.
Numeric descriptions of variables

In the presenting results part:

Formatting ggplots and making them pretty
Other types of plots

Change Rmd chapter over to Quarto

Overview

Change the R4Epi chapter about R markdown to be about Quarto. This is the direction RStudio is moving in.

Here is a link to the R4DS chapter on Quarto.

Try making a new repository for a quarto version of R4Epi

Overview

I was reading about Quarto Books last night. I think we may want to try making a version of R4Epi that is created with Quarto. However, I think it's best if we use a totally new project/repository for this.

Additionally, you may first want to experiment with R Notes Bookdown.

Why?

It looks like Quarto is the future for RStudio and that is where the bulk of development will be.
Here's an article that discusses using Quarto even if you only use R: https://www.jumpingrivers.com/blog/quarto-rmarkdown-comparison/
Yihui's blog: https://yihui.org/en/2022/04/quarto-r-markdown/

New Repositories

Fix search functionality

Currently, the search function is only working for the currently displayed page. For example, if I click the search button from the R4Epi homepage, and search for the term "Welcome", it works as expected. However, if I search for the term "Goals", which appears on the very next page, no results are returned.

I think this page might be helpful: https://community.rstudio.com/t/search-in-bookdown-not-working/91942

Add TOC to the top of wiki pages

See this page for an example: https://github.com/Netflix/Hystrix/wiki/Configuration

Add Doug's info to about the authors

https://sph.emory.edu/faculty/profile/index.php?FID=melvin-livingston-8970

Move names above photos
Round photo corners (https://ianrmedia.unl.edu/resources/rounded-corners-images)
Insert social icons (rstudio/rmarkdown#813)
Add font awesome stuff to notes
Add anchor tag stuff -- including open in new tab -- to notes (https://stackoverflow.com/questions/4425198/can-i-create-links-with-target-blank-in-markdown)
Add image tag stuff to notes (https://bookdown.org/yihui/bookdown/figures.html)

Add check on learning to the end of each chapter (learnr)

Overview

I think it would be great to add COL questions to the end of each section or chapter like Hadley does in R4DS. To begin with, the questions can be the same as the questions we use for COL quizzes in Canvas.

How?

Use learnr

Using the learnr package seems like a good place to start. However, I'm pretty sure there are big limitations as to what we can add directly to the book. One potential option is to create an accompanying package that only contains quiz questions and add links to those quiz questions into R4Epi.

- Name the package exerciser4epi

Use webr

Brian Law from Posit suggested looking into the webr package as well, although he warns that it is very beta right now.

Make an R style guide appendix

Overview

Style, and misuses of it, are one of the biggest issues I see with student code. I think it would be helpful to create a style guide appendix. I imagine it would be similar to the Tidyverse style guide.

I don't think this should replace the chapter on style best practices. It should augment it. Tell them to use this appendix to look things up.

You may want to include something like this snippet you wrote for your Power Automate wiki:

This page will serve as a style guide for authoring this wiki and for authoring Power Automate flows. The ultimate goal of a style guide is to reduce cognitive load, and as a result, make it easier to write - text or code. How does a style guide do this? First, it reduces the number of choices you have to make as you are writing. For example, "should I write this variable name in snake case or camel case?" Second, having the predetermined choices written down for references reduces the amount of information you need to store in your intentional memory (i.e., "OK, remember to always use snake case"); although, it may eventually bleed over into your incidental memory. Third, having uniformly styled text/code makes it easier for others -- including future you -- to read. You can focus on the content instead of the style and/or organization.

Add open review to the book

I came across this when I was trying to figure out how to add the Google Analytics tag to the book. It looks super cool. I think I can use this in place of -- or alongside -- making edits in GitHub.

https://benmarwick.github.io/bookdown-ort/mods.html

Add box stating that the chapter is under development

Something similar to this: https://bookdown.org/yihui/blogdown/
For the Epidemiology chapters. We may also want to update the introduction to talk about who the intended audience is and the way the book is roughly divided into two halves.

Turn measures of occurrence module into a chapter

Overview

I want to add all the modules from the Fall 2022 Epidemiology III class to the epidemiology half of the book.

The plan for right now is just to get the slides into Rmd format wholesale. I can make them better later.

Left off

2023-09-05
Working on moving over the PP slides.

Left off at slide 2 and line 127 of 03_measures_of_occurrence.
Just get the PowerPoint slides moved over. Improve later.

Tasks

Move usable parts of PowerPoint slides over to Rmd.
Create a first draft of this chapter.
Add to reading list.
Post announcement

Clone wiki to computer

Link text in the typos section of the Welcome chapter to the GitHub chapter of the book.

In Contributing to R4Epi > Typos > second paragraph, there is some bracketed text "[using Git and Github]" that needs to be linked once the chapter on using Git and GitHub is written.

Update the language in the section on Tidy Evaluation

The language about non-standard evaluation in the Tidy Evaluation section of the introduction to repeated operations chapter isn't wrong, but I think the language and examples used in the rlang data-masking article are probably more helpful. Let's update that language.

From Advanced R Second Edition:

Closely related to metaprogramming is non-standard evaluation, NSE for short. This term, which is commonly used to describe the behaviour of R functions, is problematic in two ways. Firstly, NSE is actually a property of the argument (or arguments) of a function, so talking about NSE functions is a little sloppy. Secondly, it’s confusing to define something by what it’s not (standard), so in this book I’ll introduce more precise vocabulary.
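To make the data-masking framing concrete, an example along these lines might help. It is a minimal sketch that assumes dplyr is installed. The function name mean_by() is hypothetical and is not taken from the book or the rlang article.

```r
library(dplyr)

# Data masking lets us refer to columns as if they were variables.
# Inside a function, {{ }} forwards a column the caller named to a
# data-masking argument such as those of group_by() and summarise().
mean_by <- function(data, group_var, value_var) {
  data |>
    group_by({{ group_var }}) |>
    summarise(mean_value = mean({{ value_var }}), .groups = "drop")
}

# cyl and mpg are columns of mtcars, not objects in the global environment.
result <- mean_by(mtcars, cyl, mpg)
result
```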

Create a cross-referenced glossary

Overview

I was working on moving the front matter from the original bookdown Rmd documents to the new qmd documents (#96).
As I was doing so, I decided it would be a good time to double-check and enforce our conventions for emphasizing text.
That led me to notice that a bolded word in contributing.qmd should really be hyperlinked to the glossary.
That led me to try to figure out how to make that happen.

I don't think we can link random words in the glossary. However, I think we may be able to make the glossary words headers, style them with CSS, then link words in the chapters to those headers in the glossary.

Useful links

Left off at

2023-07-19: I think this is working now, thanks to this SO post.

Tasks

Get cross-references to work in cross_references.qmd of test quarto book.
Link at least one word to the glossary in the test quarto book.
Figure out how to link to words in the glossary without them showing up in the PDF table of contents.
