Comments (4)
Hi Jens,
go.env, before it goes into the list2env function, is a list of lists: the GO term names form the primary list, and each GO term holds a list of the associated gene names, which correspond to the row names of your counts matrix. list2env then just turns it into an R environment, which was convenient for us to toggle between different gene sets during development.
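As a minimal sketch of that structure (the gene sets below are toy values for illustration, not the real contents of go.env, and GENE_A/GENE_B are hypothetical names; each set is written as a character vector, which is what get() returns in the session further down):

```r
# Toy gene sets: names are set identifiers, values are character vectors of
# gene names that must match the row names of the counts matrix.
go.list <- list(
  "GO:0000002 mitochondrial genome maintenance" = c("SLC25A4", "DNA2", "TYMP"),
  "GO:0000003 reproduction" = c("GENE_A", "GENE_B")  # hypothetical gene names
)

# Convert the named list into a new environment, one variable per gene set.
go.env.toy <- list2env(go.list)

ls(go.env.toy)                               # names of the gene sets
get("GO:0000003 reproduction", go.env.toy)   # genes in one set
```

Swapping in a different collection of gene sets then only means building a different named list and calling list2env on it.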
If you want to browse the go.env object in its environment state, you will need to do the following:
> data(go.env)
> gos <- ls(go.env) # list the gene sets
> head(gos)
[1] "GO:0000002 mitochondrial genome maintenance" "GO:0000003 reproduction"
[3] "GO:0000012 single strand break repair" "GO:0000014 single-stranded DNA endodeoxyribonuclease activity"
[5] "GO:0000018 regulation of DNA recombination" "GO:0000028 ribosomal small subunit assembly"
> get(gos[1], go.env) # get the genes in the first gene set
[1] "SLC25A4" "DNA2" "TYMP" "LIG3" "MEF2A" "MPV17" "MDP1" "DNAJA3" "LONP1" "LONP1" "AKT3" "PPARGC1A"
[13] "STOML2" "RRM2B" "PID1" "C10orf2" "C10orf2" "PIF1" "SESN2" "MGME1" "MGME1" "CCDC111" "RNASEH1"
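Note that the listing above contains repeated symbols (for example LONP1 and MGME1). If that matters for your analysis, something like the following sketch converts an environment back to a list, drops duplicates, and reports set sizes. It runs on a toy environment standing in for go.env; the set names are made up for illustration.

```r
# Toy stand-in for go.env; in practice you would load it with data(go.env).
toy.env <- list2env(list(
  "SET:1 example" = c("LONP1", "LONP1", "MGME1", "DNA2"),
  "SET:2 example" = c("AKT3", "PIF1")
))

# Environment -> named list (ls() returns the set names in sorted order).
sets <- mget(ls(toy.env), envir = toy.env)

# Remove repeated gene names within each set.
sets.unique <- lapply(sets, unique)

# Number of distinct genes per set.
set.sizes <- sapply(sets.unique, length)
print(set.sizes)
```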
Here is also a tutorial on how to go from a gmt file to one of these environments, along with some other common gene sets in the same format as go.env: https://github.com/JEFworks/genesets
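For a rough idea of what such a conversion involves, here is a base-R sketch. It assumes the usual tab-delimited GMT layout (set name, description, then gene names on each line); the file contents are toy values, and the linked tutorial may well do this differently.

```r
# Write a tiny GMT file so the example is self-contained (toy content).
gmt.file <- tempfile(fileext = ".gmt")
writeLines(c(
  "SET_A\tdescription of set A\tGENE1\tGENE2\tGENE3",
  "SET_B\tdescription of set B\tGENE2\tGENE4"
), gmt.file)

# Split each line on tabs: field 1 = set name, field 2 = description,
# remaining fields = gene names.
fields <- strsplit(readLines(gmt.file), "\t", fixed = TRUE)
gene.sets <- lapply(fields, function(f) f[-(1:2)])
names(gene.sets) <- sapply(fields, function(f) f[1])

# Same final step as for go.env: named list -> environment.
gmt.env <- list2env(gene.sets)
get("SET_B", gmt.env)
```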
Hope that helps! Let me know if you need anything else.
Best,
Jean
from scde.
Hi Jens,
Hopefully Jean’s reply clarifies how the environment is constructed. Can you please let us know what errors you encountered in trying to reproduce the current tutorial code, so that we can fix them?
Thanks,
-peter.
On Aug 6, 2015, at 4:43 AM, Jens Preußner wrote:
Hi all,
I'm trying to use the biomaRt package together with GO.db to construct a proper go.env environment for evaluation of overdispersed gene sets (like in http://hms-dbmi.github.io/scde/pagoda.html). Can someone clarify what the structure of go.env is before it goes into the list2env function in the example code on the website mentioned above? I failed to reproduce the code given there, and a head() on go.env would be enough for me, I guess.
Thanks a lot,
Jens
Great! Your comments helped me a lot. Thanks, Jean, for pointing me to the gene sets you created from MSigDB. This is really great! So, if I got it right, two things hold:
- If, for example, the row names are Ensembl IDs, the go.env does not need to contain gene symbols, but those Ensembl IDs.
- The names of the list items are just for identification, they also could be something else.
Peter, I was able to run the current tutorial code, but I wanted to use the biomaRt library so as not to depend on org.Hs.eg.db. If you're interested, I can create a pull request with the alternative code to the gh-pages branch. Just let me know :)
Hi Jens,
- Yes, if the rownames are Ensembl IDs, then the go.env should contain lists of lists of Ensembl IDs. We use go.env to grab relevant rows from your count matrix, so they need to match.
- Yes, the names in the first list are just identifiers. They are later used in the PAGODA app so it helps if they're descriptive for browsing purposes (as opposed to just 'GO:XYZ', I made them 'GO:XYZ description')
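To check that the IDs line up before running anything, a sketch along these lines may help. The counts matrix, the Ensembl-style IDs, and the set names are all placeholders made up for illustration, not real data.

```r
# Toy counts matrix with placeholder Ensembl-style row names (not real data).
counts <- matrix(1:12, nrow = 4,
                 dimnames = list(
                   c("ENSG00000000001", "ENSG00000000002",
                     "ENSG00000000003", "ENSG00000000004"),
                   c("cell1", "cell2", "cell3")))

# Gene sets keyed by a descriptive name, using the same ID type as the rows.
sets.env <- list2env(list(
  "GO:XYZ description" = c("ENSG00000000001", "ENSG00000000003"),
  "GO:ABC other set"   = c("ENSG00000000002", "ENSG00000009999")  # last ID absent
))

# Fraction of each set's genes that are present as rows of the matrix.
coverage <- sapply(ls(sets.env), function(s) {
  mean(get(s, sets.env) %in% rownames(counts))
})
print(coverage)

# Rows selected by one set (only IDs that exist in the matrix).
ids <- intersect(get("GO:XYZ description", sets.env), rownames(counts))
counts[ids, , drop = FALSE]
```

A coverage near zero for most sets is a quick hint that the gene sets and the matrix use different identifier types.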
Yes, please do make a pull request! I'd be happy to integrate your code to improve the tutorials. Thanks!
Best,
Jean