Comments (2)
This is great added context. May seem superfluous given the usage of the data within R, but I really like the flow this adds to the paper.
from performance.
Here are the revised sections (updated on the JOSE branch), in which we now describe the dataset used as well as the results of the analyses:
Below we provide example code using the
mtcars
dataset, which was extracted from the 1974 Motor Trend US magazine. The dataset contains fuel consumption and 10 characteristics of automobile design and performance for 32 different car models (see?mtcars
for details). We chose this dataset because it is accessible from base R and familiar to many R users. We might want to conduct specific statistical analyses on this data set, say, t tests or structural equation modelling, but first, we want to check for outliers that may influence those test results.Because the automobile names are stored as column names in
mtcars
, we first have to convert them to an ID column to benefit from thecheck_outliers()
ID argument. Furthermore, we only really need a couple columns for this demonstration, so we choose the first four (mpg
= Miles/(US) gallon;cyl
= Number of cylinders;disp
= Displacement;hp
= Gross horsepower). Finally, because there are no outliers in this dataset, we add two artificial outliers before running our function.library(performance) outliers <- check_outliers(data, method = "zscore_robust", ID = "car") outliers
What we see is that
check_outliers()
with the robust z score method detected two outliers: cases 33 and 34, which were the observations we added ourselves. They were flagged for two variables specifically:mpg
(Miles/(US) gallon) andcyl
(Number of cylinders), and the output provides their exact z score for those variables.We describe how to deal with those cases in more details later in the paper, but should we want to exclude these detected outliers from the main dataset, we can extract row numbers using
which()
on the output object, which can then be used for indexing: [...]
from performance.
Related Issues (20)
- Revising `check_model()` HOT 1
- check_model failing on logistic regression HOT 2
- Check_model in version 0.11.0 no longer produces qq plot residuals HOT 19
- r2_nakagawa and glmmTMB with beta_family HOT 4
- Outlier detection in Linear mixed models failed? HOT 5
- cannot apply check_model title with patchwork::plot_annotation HOT 4
- check_model error suggestions are not complete HOT 5
- Error and Incomplete Output Using performance::check_collinearity with Cox Models HOT 1
- Normality of Residuals of check_model is abnormal. HOT 2
- Revise compare_models() for Bayesian models HOT 5
- R-squared for glmmTMB (binomial) HOT 9
- check_model() bugged for lmer models *only* when run as part of an RMD chunk HOT 3
- check_predictions() fails when outcome is log-transformed and named like a valid function HOT 2
- Error in `check_model(<glmer>)` HOT 3
- Problems using `r2_nakagawa()` HOT 3
- check_model fails if dependent variable is labelled HOT 5
- Remove unnecessary `tryCatch()` statements targeting `insight::download_model()` HOT 2
- check_collinearity() does not work with orthogonal polynomials HOT 10
- Should check_overdispersion give warning when applied to quasipoisson? HOT 1
- Check for influential observations of GLM w/o numeric variables
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from performance.