tanjimin / c.origami Goto Github PK
View Code? Open in Web Editor NEWC.Origami, a prediction and screening framework for cell type-specific 3D chromatin structure.
C.Origami, a prediction and screening framework for cell type-specific 3D chromatin structure.
Hi! Dear author.
I encountered problems when I tried to train the model of hg19. Neither using nf-core/hic(https://nf-co.re/hic) nor converting the .mcool file on 4DN into npz could I train the exact model, and the prediction result was empty. In addition, I found that the train loss were very low.
Could you please tell me more about how to process the Hi-C data?
Hello,
I have CTCF chip-exo data and unlike chip-seq it doesnt have an input sample. I have generated the bigwig file but cannot run ctcf_norm.sh to normalize it as I dont have an input sample. Can I use the bigwig file directly without normalizing it?
I do have a chip-seq input sample from the same cell line but I am not sure if I should use that for normalizing the chip-exo data.
Thanks
I am interested in the performance comparison with deepC. Could you please provide more details about training deepC with IMR-90 data? For example:
I'm looking forward to your response.
I saw in the paper it states that "we converted the predicted matrix back to valid pairs by merging predictions to chromosomes and counting the discretized intensity value." I also saw that someone raised the issue and it was closed stating it takes a lot of effort. I am wondering if you can specify what method was used in the paper so that it can be replicated by others and FitHiC can be used for loop calling?
The files corigami_data.tar.gz and corigami_data_gm12878_add_on.tar.gz have the same download link.
Dear all,
I followed the example you give to predict GM12878 HiC contact map in a certain region of 2MB
To measure the correlation between this prediction and the original HiC matrix of the same region, I understand that I have to perform (log(ICE normalized counts + 1))
on the original 2MB size HiC matrix, and then measure the correlation directly with the predictions.
Is that true?
Thanks
hello!
I am currently engaged in replicating the C.Origami model and have encountered some difficulties.
I have diligently replicated the training process detailed on GitHub, employing the same datasets and code across two distinct servers, and have done multiple times. I've noticed considerable randomness in the training process, and the weights saved automatically by PyTorchlightning do not seem to effectively predict the Hi-C matrices in the training, validation, and test datasets. Using the checkpoints provided on GitHub, I found that there were no issues with the predictions.
I am curious to know if there have been previous reports of such variability, and if not, I plan to continue repeating my experiments to understand this phenomenon better.
thanks!
Hello,
I noticed there was code for evaluation in issue 29, but I want to ask the specific code used for calculating the insulation score, ''import insulation as insu''.
Additionally, I am intrigued by the visualization presented in Figure 2.d of the ATAC_seq and CTCF. Are the values within each window_size region averaged to generate these plots?
Thanks!
Dear Jimin, Your research has been very helpful in my work!
I am trying to reproduce the best results of your method as our baseline to compare.
Does Origami provide pre trained models for other cell types? (excluding IMR90)
thanks!
Dear developers,
Thanks for your work to develop the state-of-art tool.I have tried the example and my own data for intra chromosomal contacts, looking OK. However, how to predict intra chromosomal like you paper Fig. 4. If there are some example or tutorial to make the inter chromosomal contacts?
A lot of request was received on chromosome-wide predictions. I will add this feature soon.
Hello,
Thank you for sharing your work. It is very helpful and inspiring.
Would it be possible for you to share the evaluation code (e.g. observed versus expected Hi-C matrices correlation, distance-stratified correlation, insulation score correlation) or provide more detailed information on how to calculate these metrics?
Thank you for your time and help.
I have the cut&tag data for CTCF, but no corresponding input data. Can I just use ip data to predict the architecture?
Hi @tanjimin,
First of all, thanks for sharing your work in this repo.
I was trying to run the example provided in examples/prediction.sh
, but it did not give the expected result. To be more specific, I modified the chromosome to --chr "chr15"
, and the rest is consistent with the documentation. The output Hi-C image is blank, while the numpy array looks fine. The output image remains blank for other start positions as well.
Can you please look into this problem? Thank you.
Hi @tanjimin ,
I really enjoyed using C.Origami, but I got a naive question that has been bothering me for a while. I was wondering how to downscale the 10-kb experimental Hi-C matrix to 8192 bp resolution in order to match the model outputs.
Best,
Jiankun
Hi
I was curious to know if the C.Origami model can be used on single-cell ATACseq data?
regards
Arch
Hi,jimin
How do you perform balance processing on mcool files? When I use hic-bench's track to generate an mcool file using juicertools, I encounter an error because the -k parameter in addnorm is not available in the new version. Then I use Cooler to balance the mcool file, but the generated values are extremely small, mostly ranging from 0 to 0.01. However, when I downloaded your data and viewed it, I found that the values were distributed from 0 to 40.
Thanks!
Hi,
Dear author, the C.Origami is a beautiful job!
Could I replace the HiC experiment data with HiChIP or ChIA-PET data during training my model? As we all know, they are techniques that incorporate chromatin immunoprecipitation(ChIP)-based enrichment and chromatin proximity ligation, thus reflecting three-dimensional conformation data that capture specific chromatin modification. I wonder if your model can compatible with similar data.
Best wishes
Gemma
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.