Comments (4)
HISAT2-build used about 160 GB of RAM when building indexes using the human reference genome and 12.3 million common SNPs, so you may need a machine with 256 GB of RAM (or something like that) to build indexes for the hamster genome.
from hisat2.
So I actually did build the index on a much large machine.
However when I ran hisat2 with the above code for alignment, nothing seemed to happen. No output, constant use of 1 core and a few gigs of memory.
from hisat2.
I'm trying to build an index for C. elegans genome using the command below:
$HISAT2_HOME/hisat2-build Caenorhabditis_elegans.WBcel235.dna.toplevel.fa -ss splice_sites.txt and --exon exon.txt C_elegans
It builds an index in under 2 minutes but when I use hisat2 inspect --ss --exon C_elegans, I see that splice sites and exon info is not included in the index. Should I add other options to the script for this to work? Do you think the reason can be the difference between chromosome names between human and C. elegans?
Would it be possible for you to provide hisat2 indexes for C. elegans in your website? These are the ensembl files for latest fasta and gtf for C. elegans.
ftp://ftp.ensembl.org/pub/release-83/fasta/caenorhabditis_elegans/dna/
ftp://ftp.ensembl.org/pub/release-83/gtf/caenorhabditis_elegans/Caenorhabditis_elegans.WBcel235.83.gtf.gz
head splice_sites.txt
I 4357 5194 -
I 5295 6036 -
I 6326 9726 -
I 9845 10094 -
I 10584 11617 +
I 11556 11617 +
I 11560 11617 +
I 11560 11622 +
I 11685 14949 +
I 11688 14950 +
head exons.txt
I 3746 3908 -
I 4118 4357 -
I 5194 5295 -
I 6036 6326 -
I 9726 9845 -
I 10094 10229 -
I 10412 10584 +
I 11494 11560 +
I 11617 11688 +
I 14949 15159 +
from hisat2.
I ran into the same issue while building index for human hg38 using a gff annotation file downloaded from Ensemble. hisat built kept adjusting the memory (I had 64Gb) and finally finished. However, the subsequent use of hisat2 for alignment seems to be stuck with steady use of 1 core.
Remark: Some of the index file seem to be missing. I get 1,2,3,4,7,8 but not 5 and 6.
from hisat2.
Related Issues (20)
- hisat2 hangs aligning axolotl reads HOT 1
- Output files(.snp, .haplotype) of hisat2_extract_snps_haplotypes_*.py are empty
- Please add the pbat option of hisat-3n
- A question about methylation information extraction
- Any plans to support Apple Silicon architecture? HOT 1
- Installation Issue Error 1 - make HOT 1
- -np argument seemingly not working
- ERR): "fastq file.fastq" does not exist. Exiting now ...
- [Bug Report] hisat2-align exited with value 137, space complexity of hisat2
- hisat2 location does not exist
- Hisat-3N mapping quality
- hisat2-build index for circRNA-seq
- hisat2-build failed for Segmentation fault
- [Future request] hisat-3n table option to report conversions summarized to genomic feature or reads counts
- Issue with hisatgenotype HOT 1
- Mapping using different parameters --very-sensitive and default
- (ERR): "ref.genome" does not exist Exiting now ...
- --directional-mapping-reverse vs. --rna-strandness on HISAT3N
- Question about calculation of base counts in hisat3Ntable
- mkfifo failed error and change $temp_dir HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hisat2.