Dear Dr. Li, I am facing another problem when I ran fitn

fitnullmodel error about staarpipeline-tutorial HOT 8 CLOSED

xihaoli commented on September 25, 2024

fitnullmodel error

from staarpipeline-tutorial.

Comments (8)

xihaoli commented on September 25, 2024

Hi Damien,

Thanks for your question. Regarding the rownames/colnames of the sparse GRM, please add the following script before fitting the null model:

sample_id <- unlist(lapply(strsplit(colnames(sGRM),"_"),`[[`,1))
colnames(sGRM) <- sample_id
rownames(sGRM) <- sample_id

Regarding the fitNullModel() function, please refer to the GENESIS package manual for detailed specifications. Alternatively, you may follow the STAARpipeline_Null_Model.r scripts to fit the null model for STAARpipeline.

Best,
Xihao

from staarpipeline-tutorial.

DamienTan commented on September 25, 2024

Hi Damien,

Thanks for your question. Regarding the rownames/colnames of the sparse GRM, please add the following script before fitting the null model:
sample_id <- unlist(lapply(strsplit(colnames(sGRM),"_"),`[[`,1))
colnames(sGRM) <- sample_id
rownames(sGRM) <- sample_id
Regarding the fitNullModel() function, please refer to the GENESIS package manual for detailed specifications. Alternatively, you may follow the STAARpipeline_Null_Model.r scripts to fit the null model for STAARpipeline.

Best, Xihao

Very appreciative of your help. I found that some of sample ids in my data are different, such as DT2007037898-1_DT2007037898-1. They could not be split by using gsub("_D[0-9]+", "", sGRM@Dimnames[[1]]). But it works by using your scripts. And I have done this fitnullmodel step already.
Following the STAARpipeline-tutorial, I ran the individual analysis. When I ran the command Rscript STAARpipeline_Individual_Analysis.r 14, the output file size is very small. That was strange and maybe there was something wrong.

4.0K -rw-rw-r--. 1 cxm cxm 90 Dec 9 20:09 plt_results_individual_analysis_14.Rdata

BTW, I still could NOT understand the arrayid #7. I ran analysis in DELL PowerEdge T640 tower server, without any cluster. What I should do to adapt the script?
Hope to have your generous help and thanks again for your previous help.

from staarpipeline-tutorial.

xihaoli commented on September 25, 2024

Hi Damien,

Glad that your previous question has been resolved. For your follow-up question, could you please paste the header and dimension of plt_results_individual_analysis_14.Rdata? I can provide more details by looking at the output. Thanks.

Best,
Xihao

from staarpipeline-tutorial.

DamienTan commented on September 25, 2024

Hi Damien,

Glad that your previous question has been resolved. For your follow-up question, could you please paste the header and dimension of plt_results_individual_analysis_14.Rdata? I can provide more details by looking at the output. Thanks.

Best, Xihao

It is NULL in plt_results_individual_analysis_14.Rdata. Here are the other output files using Rscript STAARpipeline_Individual_Analysis.r 13 and Rscript STAARpipeline_Individual_Analysis.r 15:

I think the problem maybe is the incorrect use of the Rscript STAARpipeline_Individual_Analysis.r <int number> due to my misunderstanding. In the screenshots above, I found that the second column contained only chromesome 1. But to my mind, it should be contained chromsome 13 or 15 which was consistent with the command line argument.

from staarpipeline-tutorial.

xihaoli commented on September 25, 2024

Hi Damian,

Thanks for sharing and now I get it. For individual variant analysis, the number of output files is the summation of the column "individual_analysis_num" for the object in jobs_num.Rdata. In our example, we submitted a total of 293 jobs in the SLURM cluster. For your case, the <int number> in your command Rscript STAARpipeline_Individual_Analysis.r <int number> does not represent the chromosome number but the job number.

Therefore, you would need to run the scripts for a total of sum(jobs_num$individual_analysis_num) times instead of 22 times by specifying Rscript STAARpipeline_Individual_Analysis.r 1, Rscript STAARpipeline_Individual_Analysis.r 2, etc. Finally, it is possible for your command Rscript STAARpipeline_Individual_Analysis.r 14 to give a NULL result since we are splitting the job based on physical positions, and it is possible for a particular region in chromosome 1 that does not contain any variant.

Hope this is clear.

Best,
Xihao

from staarpipeline-tutorial.

DamienTan commented on September 25, 2024

Gotcha, I ran sum(jobs_num$individual_analysis_num) = 294 just now. That means I should run commands from Rscript STAARpipeline_Individual_Analysis.r 1 to Rscript STAARpipeline_Individual_Analysis.r 294. Maybe I could write a for loop in a shell script to run this step.

from staarpipeline-tutorial.

xihaoli commented on September 25, 2024

Yes, that is correct. Glad that you've got the point.

from staarpipeline-tutorial.

DamienTan commented on September 25, 2024

Yes, that is correct. Glad that you've got the point.

Thanks again. I will try to finish this step, and there may be other problems in the next steps. Hope to ask you for help again.

from staarpipeline-tutorial.

fitnullmodel error about staarpipeline-tutorial HOT 8 CLOSED

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent