Giter VIP home page Giter VIP logo

outreachy-datascience-2018's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

outreachy-datascience-2018's Issues

Device Failure: Attributes and Failure Mode

A better understanding is needed on how the attributes relate to device failure.

  • How do the attributes relate to device failure?
  • Is there a single attribute, or a combination of the attributes, that follows device failure?

Device Failure: Seasonality Effects

It is important to determine if there are seasonality effects regarding the telemetry data.

  • What seasonality effects are observed in the data?
  • Is there seasonality influences in device failure?

Device Failure: Relationship between attributes needed

The relationship between the attributes is poorly understood. A better understanding is needed in this regard for the data we are collecting.

  • What is the correlation between the attributes?
  • How do the scales differ or are similar between attributes?

Device Failure: Integrity of Telemetry Data

A big concern is the overall integrity of the telemetry data, which needs to be addressed through investigation.

  • Is there missing data?
  • Is the data consistent?
  • Are there bugs in the data?

Device Failue: Modeling Dataset - Feature Generation

The desire is to try to utilize the telemetry attributes to predict device failure in the field. This requires generating a feature set for use in modeling.

  • What are good features for modeling device failure?
  • How would you choose the best features?

School-YRBS: Friends

The administration wants to better understand the social dynamic of the high school. The dataset contains a relevant column called friends, which is the number of friends of the student.

  • What is the distribution of the number of friends?
  • What are the gender breakdown of friends?
  • What is the breakdown by race?

Device Failure: Reduce Number of Telemetry Attributes

The current telemetry payload is too large. We need a good representation that approximates the dataset, with a reduced number of columns.

  • What are some methods for reducing the size of the dataset?
  • What is the trade-off between dataset size and fidelity to original dataset

School-YRBS: Parents column

The current dataset stores parent information in the parents field. One value is my mother and my father. It is desirable to break out this information into single caregivers. e.g., my mother and my father becomes mother and father, whereas my mother only is just mother.

  • What is the distribution of each single caregiver type? e.g., What is the distribution of mother caregiver?

School-YRBS: Integrity of data

A big concern is the overall integrity of the school dataset, which needs to be addressed through investigation.

  • Is there missing data?
  • Is the data consistent?
  • Are there bugs in the data?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.