deepiv-gwas's People
deepiv-gwas's Issues
Download data
- download all files from GTex
- download all files from TCGA (breast cancer only)
Tune first stage
- normalize the p matrix
(p - np.mean(p, axis=0)) / np.std(p, axis=0)
- try large/original network on AWS
- vary learning rate alpha
- add dropout and/or L2 regularization
Go over BibTex workflow
I added BibTex compatibility to deepiv_milestone.tex
with commit e0751bf6
. It's a Tex feature used by researchers to streamline the addition of citations and bibliographies in LaTeX documents, and it saves headaches when it comes to keeping track of citations.
Clean Data
- match TCGA data
- write R scripts to clean data
- put results in derived data
- match GTEx data
- write R scripts to clean GTEx data
- put results in derived data
- match TCGA and GTEx data
- put results in derived data
Get permissions from some prof
- get in touch with Quake
- Contact Bologna
Tune second stage
Same as in #10
Structure code corresponding to guides
The 230 development pipeline will ultimately (I believe) lead to cleaner code and a nicer development process as we move forward with this project. I'll silo myself in a branch until I'm sure everything is working.
Set up AWS
Determine health of GTex sample
See https://www.gtexportal.org/home/tissueSummaryPage#cause as a start.
How do we identify important genes?
It's not like we have coefficients on our inputs that neatly describe their effect on the outcome; do we have some script that fixes all other genes at a baseline, then perturbs a single gene at a time to look at its effect on y?
Add background info to deepiv_milestone .tex
Go to milestone/writeup/deepiv_milestone.tex
and start filling out information.
-
Add problem statement, biology background to Introduction (pull from proposal)
-
Talk about our candidate dataset
-
Talk about how we will be getting our dataset extremely soon
Also, a note on git commit etiquette: please add the issue number relevant to your commit in your commit message, e.g. git commit -m "#1 adding a beautiful and perfect introduction"
Pull together poster
Talk to @billyferguson for sketch of what we want; also see project examples on the 230 website, other CS project websites (e.g. CS229, 224W...)
Look at causal effects
- sample from second stage DGP
- use decision tree or Lasso regression
Are CNNs useful for our data?
Some preliminary reading:
https://www.quora.com/How-can-convolutional-neural-networks-be-used-for-non-image-data
My best guess based on skimming those links: yes, if our input data is something like a full genome where entries that are close together in our vector are also physically close to one another on the chromosome.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.