Comments (2)
Hi Fran,
Since you mention that the reads usually are processed, I think --isoseq
parameter is fine. However, please skip the --ignore_rc
flag. This was an experimental parameter used in development. Looking through the codebase I'm actually unsure it has any effect.
Another parameter is the number of cores/threads --t
. Default is 3 and uLTRA's memory footprint grows with the number of cores, it may be good to set this to what SQANTI3 uses.
I'm not sure about the most common use case you see with SQANTI3 (e.g., isoseq or ONT, human/non-human, and the number of reads in the datasets), but that is what I could think about if you have to choose a fixed parameter setting for uLTRA.
I recently implemented an option --use_NAM_seeds
there are some tradeoffs with this option - faster and smaller intermediate files, but can be more memory consuming/slightly less accurate. If you're interested, see here.
Glad that you want to try it out and happy to receive feedback if you get any!
Best,
Kristoffer
from ultra.
Thanks for your quick response and your advise! I will remove the --ignore_rc
flag and set the --t
parameter as you said. Regarding the --use_NAM_seeds
, I read about this option, but I will keep it "off". The idea is to map transcript-models, so it shouldn't be a problem the amount of sequences. The reference transcriptome from mouse has ~140k transcripts, it would be crazy to have >5M transcripts...
I'll let you know any insight that we get of this. But I won't open an issue, but I'll contact you :)
Bests,
Fran
from ultra.
Related Issues (20)
- KeyError when running test pipeline HOT 3
- How to control minimap 2 parameters during uLTRA alignment HOT 6
- Non-absolute paths don't resolve HOT 5
- Python bindings? HOT 1
- ultra installation and run error HOT 7
- CIGAR string starts/ends with N HOT 7
- Mapping with uLTRA without GTF? HOT 3
- Cigar is None HOT 2
- BUG -4294967296 HOT 8
- Controlling (high) uLTRA RAM usage HOT 1
- a bug of `--alignment_threshold` HOT 1
- Out of bound reads HOT 13
- Genomes FASTA/GTF files needed to run the evaluation HOT 4
- Can not access local variable 'read_mems_tmp' when using --use_NAM_seeds HOT 2
- Error: invalid feature coordinates (end<start!) at line: HOT 1
- Bug with uLTRA align : TypeError: bad argument type for built-in operation HOT 3
- UnboundLocalError: local variable 'i' referenced before assignment HOT 4
- Can I use ultra to align est to references HOT 1
- Failed in building index HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ultra.