Comments (8)
Hi Damien,
Thanks for your question. Regarding the rownames/colnames of the sparse GRM, please add the following script before fitting the null model:
sample_id <- unlist(lapply(strsplit(colnames(sGRM),"_"),`[[`,1))
colnames(sGRM) <- sample_id
rownames(sGRM) <- sample_id
Regarding the fitNullModel()
function, please refer to the GENESIS package manual for detailed specifications. Alternatively, you may follow the STAARpipeline_Null_Model.r
scripts to fit the null model for STAARpipeline.
Best,
Xihao
from staarpipeline-tutorial.
Hi Damien,
Thanks for your question. Regarding the rownames/colnames of the sparse GRM, please add the following script before fitting the null model:
sample_id <- unlist(lapply(strsplit(colnames(sGRM),"_"),`[[`,1)) colnames(sGRM) <- sample_id rownames(sGRM) <- sample_id
Regarding the
fitNullModel()
function, please refer to the GENESIS package manual for detailed specifications. Alternatively, you may follow theSTAARpipeline_Null_Model.r
scripts to fit the null model for STAARpipeline.Best, Xihao
Very appreciative of your help. I found that some of sample ids in my data are different, such as DT2007037898-1_DT2007037898-1
. They could not be split by using gsub("_D[0-9]+", "", sGRM@Dimnames[[1]])
. But it works by using your scripts. And I have done this fitnullmodel
step already.
Following the STAARpipeline-tutorial, I ran the individual analysis. When I ran the command Rscript STAARpipeline_Individual_Analysis.r 14
, the output file size is very small. That was strange and maybe there was something wrong.
4.0K -rw-rw-r--. 1 cxm cxm 90 Dec 9 20:09 plt_results_individual_analysis_14.Rdata
BTW, I still could NOT understand the arrayid
#7. I ran analysis in DELL PowerEdge T640 tower server
, without any cluster. What I should do to adapt the script?
Hope to have your generous help and thanks again for your previous help.
from staarpipeline-tutorial.
Hi Damien,
Glad that your previous question has been resolved. For your follow-up question, could you please paste the header and dimension of plt_results_individual_analysis_14.Rdata
? I can provide more details by looking at the output. Thanks.
Best,
Xihao
from staarpipeline-tutorial.
Hi Damien,
Glad that your previous question has been resolved. For your follow-up question, could you please paste the header and dimension of
plt_results_individual_analysis_14.Rdata
? I can provide more details by looking at the output. Thanks.Best, Xihao
It is NULL in plt_results_individual_analysis_14.Rdata
. Here are the other output files using Rscript STAARpipeline_Individual_Analysis.r 13
and Rscript STAARpipeline_Individual_Analysis.r 15
:
I think the problem maybe is the incorrect use of the Rscript STAARpipeline_Individual_Analysis.r <int number>
due to my misunderstanding. In the screenshots above, I found that the second column contained only chromesome 1. But to my mind, it should be contained chromsome 13 or 15 which was consistent with the command line argument.
from staarpipeline-tutorial.
Hi Damian,
Thanks for sharing and now I get it. For individual variant analysis, the number of output files is the summation of the column "individual_analysis_num" for the object in jobs_num.Rdata
. In our example, we submitted a total of 293 jobs in the SLURM cluster. For your case, the <int number>
in your command Rscript STAARpipeline_Individual_Analysis.r <int number>
does not represent the chromosome number but the job number.
Therefore, you would need to run the scripts for a total of sum(jobs_num$individual_analysis_num)
times instead of 22 times by specifying Rscript STAARpipeline_Individual_Analysis.r 1
, Rscript STAARpipeline_Individual_Analysis.r 2
, etc. Finally, it is possible for your command Rscript STAARpipeline_Individual_Analysis.r 14
to give a NULL
result since we are splitting the job based on physical positions, and it is possible for a particular region in chromosome 1 that does not contain any variant.
Hope this is clear.
Best,
Xihao
from staarpipeline-tutorial.
Gotcha, I ran sum(jobs_num$individual_analysis_num) = 294
just now. That means I should run commands from Rscript STAARpipeline_Individual_Analysis.r 1
to Rscript STAARpipeline_Individual_Analysis.r 294
. Maybe I could write a for
loop in a shell script to run this step.
from staarpipeline-tutorial.
Yes, that is correct. Glad that you've got the point.
from staarpipeline-tutorial.
Yes, that is correct. Glad that you've got the point.
Thanks again. I will try to finish this step, and there may be other problems in the next steps. Hope to ask you for help again.
from staarpipeline-tutorial.
Related Issues (20)
- fit_nullmodel Output is mostly Null and 0 HOT 16
- Fitting NULL model for binary outcomes HOT 5
- Error in Gene Centric Analysis HOT 1
- Error in results_plof_genome[, "cMAC"] : subscript out of bounds HOT 2
- Followup Question to Issue #28 HOT 2
- STAARpipeline_Gene_Centric_Noncoding HOT 2
- Dynamic Window dim(X) error HOT 3
- Can't annotate individual variant results HOT 2
- [Suggestion-Implementation] Add information to summary and annotations of results HOT 1
- Conditional analysis - Summary Gene Centric Noncoding not running to completion HOT 6
- Ukbiobank Agds files generation HOT 16
- Plots for gene centric ncRNA regions HOT 5
- FATAL ERROR - Too many first alleles as the major allele (~21.5%). HOT 1
- warning messages in generating the annotated GDS (aGDS) file. HOT 3
- Controls / cases counts inverted when using binary model HOT 7
- kinship matrix HOT 2
- variant set in gene-centric coding/noncoding analysis HOT 2
- in the Step 2: Individual (single-variant) analysis, Error in if (chr == 1) { : argument is of length zero HOT 3
- Error : Mat::operator(): index out of bounds & Error in apply(emthr_SCANG_O, 2, max) : HOT 1
- What is the difference between the "results_temp" and "results_m" results? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from staarpipeline-tutorial.