Rfer

Team

Group Member | GitHub
Gabriel Bogo | @GabrielBogo
Yuwei Liu | @liuyuwei169
Weifeng (Davy) Guo | @DavyGuo
Mohamad Makkaoui | @makka3

rfer is a reimplementation of the infer R package, which offers a tidy approach to statistical inference built on top of the tidyverse.

The infer package streamlines reshuffling and bootstrapping samples, calculating summary statistics and confidence intervals, and performing hypothesis tests for statistical inference. It does this through a set of functions built with an emphasis on clear, expressive code and on correct statistical grammar that makes explicit how values are calculated and how tests are evaluated.

With this package as the inspiration, rfer will have four main functions (specify, generate, calculate, get_ci) in its first iteration. Given a data frame and a specified response variable, these functions calculate summary statistics and confidence intervals for the response variable. Further details follow in the function descriptions below.

Where does rfer fit into the R ecosystem?

Currently, infer already does a great job of implementing the functionality we have specified here for the R ecosystem. Nevertheless, we will begin by developing similar functions in the initial iterations, with the expectation that we will later add functions that extend what the infer package offers.

Functions

specify

Function Description: Choose the specific columns to feed into the subsequent pipeline.

Inputs:

  • data: a Dataframe
  • response: String. The name of the column in your dataframe to use as the response variable.

Output:

  • Dataframe containing one column for response variable and zero or more columns for the explanatory variables. The first column is always the response.
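The column selection described above can be sketched in base R as follows. This is a minimal illustration, not the package's implementation: the name specify_sketch and the explanatory argument are assumptions, since the inputs listed above include only data and response.

```r
# Hypothetical sketch of the specify step; not the actual rfer implementation.
# `explanatory` is an assumed argument: the README lists only `data` and `response`.
specify_sketch <- function(data, response, explanatory = character(0)) {
  stopifnot(is.data.frame(data), is.character(response), length(response) == 1)

  # The response always comes first, followed by any explanatory columns
  cols <- c(response, explanatory)
  missing_cols <- setdiff(cols, names(data))
  if (length(missing_cols) > 0) {
    stop("Column(s) not found in data: ", paste(missing_cols, collapse = ", "))
  }

  # drop = FALSE keeps a data frame even when a single column is selected
  data[, cols, drop = FALSE]
}

# Usage with the built-in mtcars data set
spec <- specify_sketch(mtcars, response = "mpg", explanatory = "cyl")
head(spec)
```

Putting the response in the first column lets later steps locate it by position rather than by name.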

generate

Function Description: Generate bootstrap resamples and permutations.

Inputs:

  • data: a Dataframe generated by the specify function.
  • n_samples: Integer. Number of resamples.
  • type: "Bootstrap" (default), or "Permutation".

Output:

  • Dataframe containing all resamples stacked vertically. It keeps all columns from the input data and adds a sample_id column that identifies each individual resample.
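One way the resampling step could look in base R is sketched below. The name generate_sketch is hypothetical. The sketch also assumes that "Bootstrap" means drawing whole rows with replacement and that "Permutation" means shuffling the response (taken to be the first column) while leaving other columns in place. The README does not spell out either behaviour.

```r
# Hypothetical sketch of the generate step; not the actual rfer implementation.
generate_sketch <- function(data, n_samples, type = "Bootstrap") {
  stopifnot(is.data.frame(data), n_samples >= 1)
  type <- match.arg(type, c("Bootstrap", "Permutation"))
  n <- nrow(data)

  resamples <- lapply(seq_len(n_samples), function(i) {
    if (type == "Bootstrap") {
      # Assumption: draw n rows with replacement
      out <- data[sample.int(n, size = n, replace = TRUE), , drop = FALSE]
    } else {
      # Assumption: shuffle the response (first column), keep other columns fixed
      out <- data
      out[[1]] <- out[[1]][sample.int(n)]
    }
    out$sample_id <- i
    out
  })

  # Stack all resamples vertically and reset the row names
  stacked <- do.call(rbind, resamples)
  rownames(stacked) <- NULL
  stacked
}

# Usage on a small illustrative data frame
set.seed(123)
df <- data.frame(response = c(2.1, 3.4, 1.9, 5.0, 4.2))
boot <- generate_sketch(df, n_samples = 3, type = "Bootstrap")
boot
```

With 5 input rows and 3 resamples, the stacked result has 15 rows, 5 for each sample_id.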

calculate

Function Description: Calculate a summary statistic for each resample.

Inputs:

  • data: Dataframe generated by the generate function.
  • stat: String. Summary statistic to compute: "mean" (default) or "median".

Output:

  • Dataframe of summarized data. Each row contains the summary statistic for a given resample.
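The per-resample summary can be sketched in base R as follows. The name calculate_sketch and the output column name stat are hypothetical. The sketch assumes the response sits in the first column, as described for the specify output, and that sample_id holds integer-like values.

```r
# Hypothetical sketch of the calculate step; not the actual rfer implementation.
calculate_sketch <- function(data, stat = "mean") {
  stopifnot(is.data.frame(data), "sample_id" %in% names(data))
  stat <- match.arg(stat, c("mean", "median"))
  stat_fun <- switch(stat, mean = mean, median = median)

  # Assumption: the response is the first column of the input
  response <- data[[1]]

  # Apply the chosen statistic within each resample
  values <- tapply(response, data$sample_id, stat_fun)

  # Assumption: sample_id values are integer-like, so the group labels convert cleanly
  data.frame(
    sample_id = as.integer(names(values)),
    stat = as.numeric(values)
  )
}

# Usage on a small hand-built set of two resamples
resamples <- data.frame(
  response  = c(1, 2, 3, 10, 20, 60),
  sample_id = c(1, 1, 1, 2, 2, 2)
)
means <- calculate_sketch(resamples, stat = "mean")
medians <- calculate_sketch(resamples, stat = "median")
```

The result has one row per resample, which is the shape the confidence interval step expects.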

get_ci

Function Description: Return the bootstrap confidence interval for a point estimate.

Inputs:

  • data: Dataframe generated from calculate function.
  • interval: Float. Significance level as a percentage (0-100).

Output:

  • Dataframe containing 1 row and columns for Statistic (Point Estimate), significance level, Lower Bound and Upper Bound.
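A possible shape for this step is sketched below in base R. The README does not say how the interval is computed, so the sketch makes several assumptions: it uses the percentile method, it takes the point estimate to be the mean of the resampled statistics, it reads interval as the percentage covered by the interval (90 gives the 5th and 95th percentiles), and it expects a numeric input column named stat. The function name and output column names are hypothetical.

```r
# Hypothetical sketch of the get_ci step; not the actual rfer implementation.
# Assumptions: percentile method, input column named `stat`,
# point estimate taken as the mean of the resampled statistics.
get_ci_sketch <- function(data, interval) {
  stopifnot(
    is.data.frame(data), "stat" %in% names(data),
    is.numeric(interval), interval > 0, interval < 100
  )

  # Split the remaining probability equally between the two tails
  alpha <- 1 - interval / 100
  bounds <- quantile(data$stat, probs = c(alpha / 2, 1 - alpha / 2), names = FALSE)

  data.frame(
    point_estimate = mean(data$stat),
    interval = interval,
    lower_bound = bounds[1],
    upper_bound = bounds[2]
  )
}

# Usage on an illustrative set of 101 resampled statistics
stats <- data.frame(sample_id = 1:101, stat = 0:100)
ci <- get_ci_sketch(stats, interval = 90)
ci
```

The percentile method is used here only because it needs nothing beyond the resampled statistics. Other bootstrap intervals would require additional inputs, such as the statistic computed on the original sample.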
