Comments (4)
Dear Alex,
Thank you for letting me know. We have yet to encounter this type of error. After a brief investigation, I can confirm that (1) all of the results that you currently have (the p-values in your Manhattan plot) are valid; (2) this is more of a numerical issue, which relatively small sample sizes may cause.
We will take a further investigation and see if anything can be amended to avoid this error.
Best,
Xihao
from staarpipeline-tutorial.
Dear Xihao,
Thank you for replying. I'm working with ~400 cases and ~700 controls, so indeed my sample size is considerably smaller than the example in your paper (~20,000).
The default setting for the sliding window analysis is a window size of 2 kb and a "rare_MAF_cutoff" of 0.01. To try to circumvent my small sample size issue, I increased this to a window size of 100 kb and a "rare_MAF_cutoff" of 0.05. However, it still doesn't work.
With 800 alleles in cases, I should be able to detect variants down to a frequency of 0.125%. The rare MAF cutoff is above this, at 5%. In addition, I am accumulating alleles across a window of 100 kb, which is a reasonably large distance.
So, somehow, I am not able to detect at least two variants less than 5% MAF in the majority of 100 kb windows across the genome, using 1100 individuals.
Also, I'm not sure a low sample size explains why the sliding window works properly only at the start of chromosomes?
Do you have any suggestions? For instance, what parameters I should choose for the sliding window analysis? Perhaps I should try the dynamic window analysis?
Many thanks for your help,
Alex
from staarpipeline-tutorial.
Dear Alex,
Thank you for providing the additional information. Your justifications for using a "rare_MAF_cutoff" of 0.05 and longer sliding windows are reasonable. Yes, please try the dynamic window analysis, and feel free to keep us posted via email if you have any follow-up questions.
Hope this helps.
Best,
Xihao
from staarpipeline-tutorial.
Thank you, Xihao, for your feedback.
from staarpipeline-tutorial.
Related Issues (20)
- fit_nullmodel Output is mostly Null and 0 HOT 16
- Fitting NULL model for binary outcomes HOT 5
- Error in Gene Centric Analysis HOT 1
- Error in results_plof_genome[, "cMAC"] : subscript out of bounds HOT 2
- Followup Question to Issue #28 HOT 2
- STAARpipeline_Gene_Centric_Noncoding HOT 2
- Dynamic Window dim(X) error HOT 3
- Can't annotate individual variant results HOT 2
- [Suggestion-Implementation] Add information to summary and annotations of results HOT 1
- Conditional analysis - Summary Gene Centric Noncoding not running to completion HOT 6
- Ukbiobank Agds files generation HOT 16
- Plots for gene centric ncRNA regions HOT 5
- FATAL ERROR - Too many first alleles as the major allele (~21.5%). HOT 1
- warning messages in generating the annotated GDS (aGDS) file. HOT 3
- Controls / cases counts inverted when using binary model HOT 7
- kinship matrix HOT 2
- variant set in gene-centric coding/noncoding analysis HOT 2
- in the Step 2: Individual (single-variant) analysis, Error in if (chr == 1) { : argument is of length zero HOT 3
- Error : Mat::operator(): index out of bounds & Error in apply(emthr_SCANG_O, 2, max) : HOT 1
- What is the difference between the "results_temp" and "results_m" results? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from staarpipeline-tutorial.